Latest in Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’
Sort by
584 items
-
Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations
Based on a new benchmark, Google DeepMind found Gemini 2.0 Flash to be the most factual LLM, with a score of 83.6%.VentureBeat - 11h -
Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’
AgentRefine gives AI agents and models the ability to recognize errors and self-correct to work better for general tasks.VentureBeat - 11h -
Self-invoking code benchmarks help you decide which LLMs to use for your programming tasks
LLMs are good at coding simple functions. But how good are they at calling their own functions to solve complex problems?VentureBeat - 19h -
How Narvar is using AI and data to enhance post-purchase customer experiences
How Narvar's new tech is using AI to harness the power of 42 billion consumer interactions a year to improve retail operations.VentureBeat - 1d -
Story uses Web3 to enable creators to capture the value they contribute to the AI ecosystem
Story, an intellectual property blockchain, believes that creators, developers and artists should be able to be rewarded for what they contribute to AI.VentureBeat - 2d -
Agent's Take: Player performance bonuses that can be earned in 2025 NFL playoffs
A look at what these players stand to earn in bonuses this postseasonCBS Sports - 15h -
Constellation’s stock soared on AI hopes. Now the energy company is using those gains to buy Calpine in $16 billion deal.
Constellation Energy Corp., an electricity provider whose stock has doubled over the last year on hopes it will help meet the voracious energy demand created by artificial-intelligence ...MarketWatch - 16h -
Musk and Ramaswamy sending agents across US government to seek cuts
So-called department of government efficiency charged by Trump to help effect radical government shake-up. Elon Musk and Vivek Ramaswamy have already dispatched emissaries across the US ...The Guardian - 17h -
Zuckerberg approved Meta’s use of ‘pirated’ books to train AI models, authors claim
Sarah Silverman and others file court case claiming CEO approved use of dataset despite warnings. Business live – latest updates Mark Zuckerberg approved Meta’s use of “pirated” versions of ...The Guardian - 20h -
Delta Just Announced Its Plan to Use AI to Solve the Worst Thing About Traveling
A new in-app concierge relies on AI to give customers a personalized experience.Inc. - 2d -
Europe can still win in AI despite US dominance, says Skype co-founder
Niklas Zennström believes continent can thrive by developing applications on top of artificial intelligence modelsFinancial Times - 3d -
-
How to use AI to be more productive and successful at work
Smarter by CNBC Make It's online course will help you understand new AI tools and how you can use them to save time at work, in your business, and in life.CNBC - 3d -
Agentic AI can help you to get a new software engineering job in 2025
Thanks to a widening skills shortage, software engineers don’t generally have to look too far for a new role on better money.VentureBeat - 3d -
More breast cancer cases found when AI used in screenings, study finds
First real-world test finds approach has higher detection rate without having a higher rate of false positives. The use of artificial intelligence in breast cancer screening increases the chance of ...The Guardian - 3d -
‘You’re gonna find this creepy’: my AI-cloned voice was used by the far right. Could I stop it? | Georgina Findlay
It was chilling to hear ‘my voice’ repeating lies – and to discover that deepfake audio is a growing threat to democracy. Georgina Findlay is a writer and presenter at the YouTube channel TLDR ...The Guardian - 4d -
Dungeons & Dragons shows that modish guff doesn’t serve diversity and inclusion
Tweaks to the cult game’s rule book are an object lesson in how not to promote changeFinancial Times - 4d -
Nvidia’s AI agent play is here with new models, orchestration blueprints
Nvidia enters the AI agent space with a family of language models for agentic instruction and orchestrarion frameworks it calls Blueprints.VentureBeat - 4d -
Nvidia launches blueprint for AI agents that can analyze video
Today as part of its CES 2025 opening keynote by CEO Jensen Huang, Nvidia launched a blueprint for AI agents that can analyze video.VentureBeat - 4d -
Nvidia unveils Project Digits personal AI supercomputer for researchers and students
Nvidia today unveiled Nvidia Project Digits, a personal AI supercomputer that serves AI researchers, data scientists and students worldwide.VentureBeat - 4d -
Nvidia using GenAI to integrate Omniverse virtual creations into physical AI apps
Nvidia unveiled generative AI models and blueprints that expand Nvidia Omniverse integration further into physical AI applications.VentureBeat - 4d -
Nvidia launches agentic AI blueprints to automate work for enterprises
Nvidia and its partners launched Agentic AI Blueprints to automate work for every enterprise.VentureBeat - 4d -
Nvidia’s Nemotron Model Families will advance AI agents
Nvidia announced Nemotron Model Families to advance agentic AI as part of its bevy of AI announcements at CES 2025 today.VentureBeat - 4d -
Sam Altman Says AI Agents Will Transform the Workforce in 2025
In a new blog post, the famous OpenAI CEO reflected on his firing, what the company could do better, and a pursuit of ‘superintelligence.’Inc. - 4d -
Google maps the future of AI agents: Five lessons for businesses
Google's groundbreaking white paper reveals how AI agents leverage advanced reasoning, real-time data access and autonomous decision-making.VentureBeat - 4d -
2025 playbook for enterprise AI success, from agents to evals
From scaling AI agents to evals, to inference reasoning, optimizing costs, and personalization, here are the five critical areas enterprises should prioritize for their AI strategy this year.VentureBeat - 4d -
LG rolls out new AI services to help consumers with daily tasks
LG kicked off the AI bandwagon today with a new set of AI services to help consumers in their daily tasks at home, in the car and in the office.VentureBeat - 4d -
Fewer than 1 in 1,000 US adolescents receive gender-affirming medications, researchers find
Fewer than 1 in 1,000 U.S. adolescents with commercial insurance received gender-affirming medications during a recent five-year periodABC News - 4d -
Intel unveils new Core Ultra processors with 2X to 3X performance on AI apps
Intel unveiled new Intel Core Ultra 9 processors today at CES 2025 with as much as two or three times the edge performance on AI apps.VentureBeat - 4d -
Paxlovid Improved Long Covid Symptoms in Some Patients, Researchers Report
But the report, on the experiences of 13 patients, found that the drug had no benefit for some people and that some who benefited said the improvement didn’t last.The New York Times - 4d -
Why context-aware AI agents will give us superpowers in 2025
In 2025, augmented mentality will emerge from the convergence of AI agents, conversational computing and augmented reality.VentureBeat - 5d -
Using Artificial Intelligence for Speech Writing? Here’s How to Overcome AI’s Biggest Defect
Artificial intelligence can’t hack authenticity. Here’s how to make sure you are connecting with people even if you’re getting help from GenAI.Inc. - 5d -
A Book App Used AI to ‘Roast’ Its Users. It Went Anti-Woke Instead
One year-end summary from Fable, a social app where people share what books they read, told the user, “Don’t forget to surface for the occasional white author, OK?”Wired - Jan. 3 -
How Meta’s latest research proves you can use generative AI to understand user intent
By thinking about recommendation as a generative problem, you can tackle it from new angles and use LLMs to better understand user intent.VentureBeat - Jan. 3 -
Editors at Science Journal Resign En Masse Over Bad Use of AI, High Fees
Members of the Elsevier-published Journal of Human Evolution quit, citing AI production processes introducing errors, high author fees, and concerns over editorial independence.Wired - Jan. 2 -
Taco Bell using AI voice to take orders at drive-thru
Fast food chain Taco Bell is turning to artificial intelligence to meet the demand of its drive-thru customers. The technology is said to cut the wait for an order by 29 seconds and improve ...CBS News - Jan. 2 -
Inside the AI agent revolution: How data-driven automation transformed the enterprise in 2024
In a recent survey, 82% of tech executives said they intend to integrate AI agents across their stacks within the next three years.VentureBeat - Dec. 31