Anthropic researchers forced Claude to become deceptive — what they discovered could save us from rogue AI

Anthropic researchers reveal groundbreaking techniques to detect hidden objectives in AI systems, training Claude to conceal its true goals before successfully uncovering them through innovative auditing methods that could transform AI safety standards.Read More
Read more at VentureBeat
-
Anthropic’s stealth enterprise coup: How Claude 3.7 is becoming the coding agent of choice
Anthropic is positioning Claude as the LLM that matters most for enterprise companies. Claude 3.7 Sonnet, released just two weeks ago, set new benchmark records for coding performance.VentureBeat - 5d -
Anthropic just launched a new platform that lets everyone in your company collaborate on AI — not just the tech team
Anthropic launches upgraded Console with team prompt collaboration tools and Claude 3.7 Sonnet's extended thinking controls.VentureBeat - Mar. 6 -
Animal poo can be used to save endangered species from extinction, research finds
Some cells are still alive within the dung, and could be used to boost genetic diversity in certain species. Turning animal poo into offspring sounds like a zoo keeper’s conjuring trick, but it ...The Guardian - 1d -
Nous Research just launched an API that gives developers access to AI models that OpenAI and Anthropic won’t build
Nous Research's new API for "unrestricted" Hermes 3 and DeepHermes-3 AI models features toggle-on reasoning and a developer-first approach.VentureBeat - 4d -
Inching towards AGI: How reasoning and deep research are expanding AI from statistical prediction to structured problem-solving
GUEST: AI has evolved at an astonishing pace. What seemed like science fiction just a few years ago is now an undeniable reality. Back in 2017, my firm launched an AI Center of Excellence. AI was ...VentureBeat - 20h -
Is daylight saving time bad for your health? A neurologist explains.
Researchers are discovering that "springing ahead" each March for daylight saving time is connected with serious negative health effects.CBS News - Mar. 8 -
Anthropic Just Reached a New Revenue Milestone
Anthropic’s Claude 3.7 Sonnet is a game-changer for the company, which is partly owned by Google.Inc. - 4d -
Forcing women back into the office will cost us millions
New research from the University of Toronto's Rotman School of Management highlights how in-person work environments expose women to higher levels of workplace bias and mistreatment compared to ...The Hill - 6d -
What would change if daylight saving time became permanent?
For the next eight months, most of us will be observing daylight saving time. But what if this became our permanent time?The Hill - Mar. 9 -
Democrats Demand Answers on DOGE’s Use of AI
Members of the House Oversight Committee sent dozens of requests to federal agencies on Wednesday about their use of AI software—and how Elon Musk could benefit.Wired - 4d -
Over half of American adults have used an AI chatbot, survey finds
Artificial intelligence technology is becoming increasingly integral to everyday life. 52% of U.S. adults have used AI large language models like ChatGPT.NBC News - 4d -
3 Ways AI Can Create More Tailored Marketing Strategies for Retail Brands
Personalized marketing at scale used to be impossible. With AI, it’s become a reality.Inc. - 6d -
Out of the lab and into the streets, researchers and doctors rally for science against Trump cuts
Researchers, doctors, their patients and supporters are venturing out of labs, hospitals and offices across the country to stand up to what they call an attack on life-saving science by the Trump ...ABC News - Mar. 7 -
This retirement strategy is a 'game changer' for single-income, married couples, advisor says
If you're a single-income, married couple, you could use a spousal IRA to save more for retirement. Here's what to know.CNBC - Mar. 7 -
What Products Could Europe Levy in Retaliation to Trump’s Tariffs?
The European Union wants to force the United States to the negotiating table with retaliatory tariffs on a range of American products, including some from Republican strongholds.The New York Times - 5d -
Will AI save the UK government £45bn a year?
Ministers believe that increased use of artificial intelligence can help fix the public finances but experts have their doubtsFinancial Times - 2d -
Meet the 21-year-old helping coders use AI to cheat in Google and other tech job interviews
As artificial intelligence becomes more advanced, employers are trying to build workarounds to prevent candidates from cheating in virtual job interviews.CNBC - 6d -
Champagne and Parmigiano under threat from Trump’s tariffs
Producers warn Europe’s finest foods and wine could become unaffordable for ordinary US consumersFinancial Times - Mar. 10 -
Researchers Propose a Better Way to Report Dangerous AI Flaws
After identifying major flaws in popular AI models, researchers are pushing for a new system to identify and report bugs.Wired - 4d -
Jobs lost and lifesaving cures not discovered: Possible impacts of research cuts
Ripple effects of the Trump administration's crackdown on U.S. medical research promise to reach every corner of AmericaABC News - Mar. 6 -
Grab co-founder Anthony Tan says ‘humans who don’t embrace AI will be replaced by AI’
Grab co-founder and CEO Anthony Tan discussed how workers can use AI to become more productive, at Converge Live in Singapore.CNBC - 4d -
What are smartphones stealing from us? When mine was taken away, I found out | Alexander Hurst
As a Paris film extra, I surrendered my device and discovered the extraordinary connections I miss while staring at my screen. A few Thursdays ago was a wrap. For my brief acting career, that is. ...The Guardian - Mar. 10 -
Majority of Americans have used AI models like ChatGPT: Survey
A majority of Americans have used ChatGPT-like artificial intelligence (AI) models according to a new survey. In the survey from Elon University’s Imagining the Digital Future Center, 52 percent ...The Hill - 4d -
What you need to know about Manus, the new AI agentic system from China hailed as a second ‘DeepSeek moment’
Manus AI is designed as a multi-agent system, meaning it combines several AI models to handle tasks independently.VentureBeat - 6d -
A Groundbreaking Ship That Sank in Lake Superior in 1892 Is Discovered
After searching for two years, researchers discovered the shipwreck of the Western Reserve, an early all-steel ship that broke apart in a gale in 1892 with a sole survivor.The New York Times - 3d -
Stanford professor who co-founded 4 startups: How to use AI as a 'force multiplier' to start a business
"You don't want to compete with someone who has an AI at their shoulder," says Steve Blank. He has co-founded four startups, one of which sold for $329 million.CNBC - 3h -
Cerebras just announced 6 new AI datacenters that process 40M tokens per second — and it could be bad news for Nvidia
Cerebras Systems is challenging Nvidia with six new AI data centers across North America, promising 10x faster inference speeds and 7x cost reduction for companies using advanced AI models like ...VentureBeat - 6d -
Hugging Face co-founder Thomas Wolf just challenged Anthropic CEO’s vision for AI’s future — and the $130 billion industry is taking notice
Hugging Face co-founder Thomas Wolf challenges Anthropic CEO Dario Amodei's "compressed 21st century" vision, arguing AI systems are building "yes-men on servers" rather than the revolutionary ...VentureBeat - Mar. 6 -
Science Amid Chaos: What Worked During the Pandemic? What Failed?
As the coronavirus spread, researchers worldwide scrambled to find ways to keep people safe. Some efforts were misguided. Others saved millions of lives.The New York Times - 2d -
Could Angela Prichard Have Been Saved?
A woman repeatedly called police for help when she was attacked, stalked, and intimidated by her husband. It didn't stop him from killing her. "48 Hours" contributor Jonathan Vigliotti reports.CBS News - 1d -
How South Korea’s shipyards could save the US Navy
Asian shipbuilders see opportunity to fill gap as Washington tries to keep pace with China’s naval expansionFinancial Times - 3d -
What kind of jobs will be impacted by AI?
Last week, online furniture retailer Wayfair announced it would increase its use of generative artificial intelligence and cut 340 tech jobs. It reflects an increase in businesses and companies ...CBS News - 5d -
France has a nuclear umbrella. Could its European allies fit under it?
President Macron has aired the idea that France's deterrence force could be used to defend of other European countries.BBC News - Mar. 6 -
The US Army Is Using ‘CamoGPT’ to Purge DEI From Training Materials
Developed to boost productivity and operational readiness, the AI is now being used to “review” diversity, equity, inclusion, and accessibility policies to align them with President Trump’s orders.Wired - Mar. 6 -
It Isn’t Just Trump. America’s Whole Reputation Is Shot.
What happens when a superpower goes rogue.The New York Times - 3d -
Sony Uses Horizon's Aloy To Demonstrate New AI Tech, And It's About As Impressive As You'd Expect
Sony is testing out new AI software, and it's using a beloved character--Aloy from the Horizon series--to show it off. A YouTube video narrated by Sharwin Raghoebardajal--a software engineering ...GameSpot - 6d -
China could literally rewrite history using AI — and control the future
This isn’t just another policy debate — it’s a defining moment that will shape global discourse and the balance of power for decades to come. The technology we allow to thrive will either spread ...The Hill - 5d -
Can Matchmaking Platforms Save Us From Dating App Fatigue?
Big Dating got singles hooked on convenience culture. But finding a partner is work—and a batch of matchmaking services think they’ve cracked the code for partnership.Wired - Mar. 7 -
Navarro: US in 'difficult transition from Bidenomics to Trumpnomics'
Peter Navarro, a senior trade adviser to President Trump, said Wednesday that the U.S. is “in a difficult transition from Bidenomics to Trumpnomics.” “Help us understand, what is the bigger picture ...The Hill - 4d
More from VentureBeat
-
Digital Bandidos wants to be an outlaw publisher that fights for devs | Steve Escalante
Digital Bandidos formed last year as an indie game publishing company just as many publishers were going out of business.VentureBeat - 1h -
GGP announces Collegiate Games Competition with $90K in prizes
Gay Gaming Professionals, a nonprofit that cultivates emerging top performers in the video games industry, has created the Collegiate Games Competition to surface new gaming talent. In partnership ...VentureBeat - 1h -
Roblox unveils Roblox Cube GenAI and other game dev tools for GDC
Roblox introduced its Roblox Cube AI tools, the core generative AI system for building 3D objects and scenes.VentureBeat - 1h -
GGWP adds voice moderation for Unity’s Vivox voice chat and Gorilla Tag
GGWP, a brand safety platform for online communities, said its voice moderation tools are being used by Unity's Vivox and Another Axiom.VentureBeat - 1h -
The Discord Social SDK enables developers to tap into Discord’s social infrastructure at no cost
Discord, the communications for games company, has launched the Discord Social SDK, a toolkit that enables developers of all sizes to tap into Discord’s social infrastructure to drive their game’s ...VentureBeat - 2h
More in Tech
-
Private lunar lander Blue Ghost falls silent on the moon after a 2-week mission
It's lights out for the first private lunar lander to pull off a fully successful moon missionABC News - 25m -
Undertale Sees New Peak For Steam Concurrent Players Nearly A Decade After Launching
It's hard to comprehend that Undertale will be 10 years old later this year. But new audiences apparently keep finding the RPG, as the game broke its own Steam record for concurrent players on ...GameSpot - 40m -
Undertale Sees New Peak For Steam Concurrent Players Nearly A Decade After Launching
It's hard to comprehend that Undertale will be 10 years old later this year. But new audiences apparently keep finding the RPG, as the game broke its own Steam record for concurrent players on ...GameSpot - 40m -
Death Stranding 2 Collector's Edition Preorders Are Available Now
Death Stranding 2: On the Beach preorders are starting to go live at major retailers, but big fans of Hideo Kojima will want to move fast to secure the Collector's Edition. Exclusive to PlayStation ...GameSpot - 59m -
How to Shop for Vinyl Records Online (2025): Discogs, Ebay
Don't just load up on Amazon! Here are the best ways to find your favorite records on wax.Wired - 1h