Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’

Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations

Based on a new benchmark, Google DeepMind found Gemini 2.0 Flash to be the most factual LLM, with a score of 83.6%.

VentureBeat - 11h

Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’

AgentRefine gives AI agents and models the ability to recognize errors and self-correct to work better for general tasks.

VentureBeat - 11h

Self-invoking code benchmarks help you decide which LLMs to use for your programming tasks

LLMs are good at coding simple functions. But how good are they at calling their own functions to solve complex problems?

VentureBeat - 19h

How Narvar is using AI and data to enhance post-purchase customer experiences

How Narvar's new tech is using AI to harness the power of 42 billion consumer interactions a year to improve retail operations.

VentureBeat - 1d

Story uses Web3 to enable creators to capture the value they contribute to the AI ecosystem

Story, an intellectual property blockchain, believes that creators, developers and artists should be able to be rewarded for what they contribute to AI.

VentureBeat - 2d

Agent's Take: Player performance bonuses that can be earned in 2025 NFL playoffs

A look at what these players stand to earn in bonuses this postseason

CBS Sports - 15h

Constellation’s stock soared on AI hopes. Now the energy company is using those gains to buy Calpine in $16 billion deal.

Constellation Energy Corp., an electricity provider whose stock has doubled over the last year on hopes it will help meet the voracious energy demand created by artificial-intelligence ...

MarketWatch - 16h

Musk and Ramaswamy sending agents across US government to seek cuts

So-called department of government efficiency charged by Trump to help effect radical government shake-up. Elon Musk and Vivek Ramaswamy have already dispatched emissaries across the US ...

The Guardian - 17h

Zuckerberg approved Meta’s use of ‘pirated’ books to train AI models, authors claim

Sarah Silverman and others file court case claiming CEO approved use of dataset despite warnings. Business live – latest updates Mark Zuckerberg approved Meta’s use of “pirated” versions of ...

The Guardian - 20h

Delta Just Announced Its Plan to Use AI to Solve the Worst Thing About Traveling

A new in-app concierge relies on AI to give customers a personalized experience.

Inc. - 2d

Europe can still win in AI despite US dominance, says Skype co-founder

Niklas Zennström believes continent can thrive by developing applications on top of artificial intelligence models

Financial Times - 3d

Countries are tracking Russia's shadow fleet using AI after suspected attacks on undersea cables

Yahoo News - 3d

How to use AI to be more productive and successful at work

Smarter by CNBC Make It's online course will help you understand new AI tools and how you can use them to save time at work, in your business, and in life.

CNBC - 3d

Agentic AI can help you to get a new software engineering job in 2025

Thanks to a widening skills shortage, software engineers don’t generally have to look too far for a new role on better money.

VentureBeat - 3d

More breast cancer cases found when AI used in screenings, study finds

First real-world test finds approach has higher detection rate without having a higher rate of false positives. The use of artificial intelligence in breast cancer screening increases the chance of ...

The Guardian - 3d

‘You’re gonna find this creepy’: my AI-cloned voice was used by the far right. Could I stop it? | Georgina Findlay

It was chilling to hear ‘my voice’ repeating lies – and to discover that deepfake audio is a growing threat to democracy. Georgina Findlay is a writer and presenter at the YouTube channel TLDR ...

The Guardian - 4d

Dungeons & Dragons shows that modish guff doesn’t serve diversity and inclusion

Tweaks to the cult game’s rule book are an object lesson in how not to promote change

Financial Times - 4d

Nvidia’s AI agent play is here with new models, orchestration blueprints

Nvidia enters the AI agent space with a family of language models for agentic instruction and orchestrarion frameworks it calls Blueprints.

VentureBeat - 4d

Nvidia launches blueprint for AI agents that can analyze video

Today as part of its CES 2025 opening keynote by CEO Jensen Huang, Nvidia launched a blueprint for AI agents that can analyze video.

VentureBeat - 4d

Nvidia unveils Project Digits personal AI supercomputer for researchers and students

Nvidia today unveiled Nvidia Project Digits, a personal AI supercomputer that serves AI researchers, data scientists and students worldwide.

VentureBeat - 4d

Nvidia using GenAI to integrate Omniverse virtual creations into physical AI apps

Nvidia unveiled generative AI models and blueprints that expand Nvidia Omniverse integration further into physical AI applications.

VentureBeat - 4d

Nvidia launches agentic AI blueprints to automate work for enterprises

Nvidia and its partners launched Agentic AI Blueprints to automate work for every enterprise.

VentureBeat - 4d

Nvidia’s Nemotron Model Families will advance AI agents

Nvidia announced Nemotron Model Families to advance agentic AI as part of its bevy of AI announcements at CES 2025 today.

VentureBeat - 4d

Sam Altman Says AI Agents Will Transform the Workforce in 2025

In a new blog post, the famous OpenAI CEO reflected on his firing, what the company could do better, and a pursuit of ‘superintelligence.’

Inc. - 4d

Google maps the future of AI agents: Five lessons for businesses

Google's groundbreaking white paper reveals how AI agents leverage advanced reasoning, real-time data access and autonomous decision-making.

VentureBeat - 4d

2025 playbook for enterprise AI success, from agents to evals

From scaling AI agents to evals, to inference reasoning, optimizing costs, and personalization, here are the five critical areas enterprises should prioritize for their AI strategy this year.

VentureBeat - 4d

LG rolls out new AI services to help consumers with daily tasks

LG kicked off the AI bandwagon today with a new set of AI services to help consumers in their daily tasks at home, in the car and in the office.

VentureBeat - 4d

Fewer than 1 in 1,000 US adolescents receive gender-affirming medications, researchers find

Fewer than 1 in 1,000 U.S. adolescents with commercial insurance received gender-affirming medications during a recent five-year period

ABC News - 4d

Intel unveils new Core Ultra processors with 2X to 3X performance on AI apps

Intel unveiled new Intel Core Ultra 9 processors today at CES 2025 with as much as two or three times the edge performance on AI apps.

VentureBeat - 4d

Paxlovid Improved Long Covid Symptoms in Some Patients, Researchers Report

But the report, on the experiences of 13 patients, found that the drug had no benefit for some people and that some who benefited said the improvement didn’t last.

The New York Times - 4d

Why context-aware AI agents will give us superpowers in 2025

In 2025, augmented mentality will emerge from the convergence of AI agents, conversational computing and augmented reality.

VentureBeat - 5d

Using Artificial Intelligence for Speech Writing? Here’s How to Overcome AI’s Biggest Defect

Artificial intelligence can’t hack authenticity. Here’s how to make sure you are connecting with people even if you’re getting help from GenAI.

Inc. - 5d

A Book App Used AI to ‘Roast’ Its Users. It Went Anti-Woke Instead

One year-end summary from Fable, a social app where people share what books they read, told the user, “Don’t forget to surface for the occasional white author, OK?”

Wired - Jan. 3

How Meta’s latest research proves you can use generative AI to understand user intent

By thinking about recommendation as a generative problem, you can tackle it from new angles and use LLMs to better understand user intent.

VentureBeat - Jan. 3

Editors at Science Journal Resign En Masse Over Bad Use of AI, High Fees

Members of the Elsevier-published Journal of Human Evolution quit, citing AI production processes introducing errors, high author fees, and concerns over editorial independence.

Wired - Jan. 2

Taco Bell using AI voice to take orders at drive-thru

Fast food chain Taco Bell is turning to artificial intelligence to meet the demand of its drive-thru customers. The technology is said to cut the wait for an order by 29 seconds and improve ...

CBS News - Jan. 2

Inside the AI agent revolution: How data-driven automation transformed the enterprise in 2024

In a recent survey, 82% of tech executives said they intend to integrate AI agents across their stacks within the next three years.

VentureBeat - Dec. 31

Latest in Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’

Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations

Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’

Self-invoking code benchmarks help you decide which LLMs to use for your programming tasks

How Narvar is using AI and data to enhance post-purchase customer experiences

Story uses Web3 to enable creators to capture the value they contribute to the AI ecosystem

Agent's Take: Player performance bonuses that can be earned in 2025 NFL playoffs

Constellation’s stock soared on AI hopes. Now the energy company is using those gains to buy Calpine in $16 billion deal.

Musk and Ramaswamy sending agents across US government to seek cuts

Zuckerberg approved Meta’s use of ‘pirated’ books to train AI models, authors claim

Delta Just Announced Its Plan to Use AI to Solve the Worst Thing About Traveling

Europe can still win in AI despite US dominance, says Skype co-founder

Countries are tracking Russia's shadow fleet using AI after suspected attacks on undersea cables

How to use AI to be more productive and successful at work

Agentic AI can help you to get a new software engineering job in 2025

More breast cancer cases found when AI used in screenings, study finds

‘You’re gonna find this creepy’: my AI-cloned voice was used by the far right. Could I stop it? | Georgina Findlay

Dungeons & Dragons shows that modish guff doesn’t serve diversity and inclusion

Nvidia’s AI agent play is here with new models, orchestration blueprints

Nvidia launches blueprint for AI agents that can analyze video

Nvidia unveils Project Digits personal AI supercomputer for researchers and students

Nvidia using GenAI to integrate Omniverse virtual creations into physical AI apps

Nvidia launches agentic AI blueprints to automate work for enterprises

Nvidia’s Nemotron Model Families will advance AI agents

Sam Altman Says AI Agents Will Transform the Workforce in 2025

Google maps the future of AI agents: Five lessons for businesses

2025 playbook for enterprise AI success, from agents to evals

LG rolls out new AI services to help consumers with daily tasks

Fewer than 1 in 1,000 US adolescents receive gender-affirming medications, researchers find

Intel unveils new Core Ultra processors with 2X to 3X performance on AI apps

Paxlovid Improved Long Covid Symptoms in Some Patients, Researchers Report

Why context-aware AI agents will give us superpowers in 2025

Using Artificial Intelligence for Speech Writing? Here’s How to Overcome AI’s Biggest Defect

A Book App Used AI to ‘Roast’ Its Users. It Went Anti-Woke Instead

How Meta’s latest research proves you can use generative AI to understand user intent

Editors at Science Journal Resign En Masse Over Bad Use of AI, High Fees

Taco Bell using AI voice to take orders at drive-thru

Inside the AI agent revolution: How data-driven automation transformed the enterprise in 2024

Topics