Anthropic researchers forced Claude to become deceptive — what they discovered could save us from rogue AI

Anthropic researchers reveal groundbreaking techniques to detect hidden objectives in AI systems, training Claude to conceal its true goals before successfully uncovering them through innovative auditing methods that could transform AI safety standards.Read More
Read more at VentureBeat
-
Anthropic’s stealth enterprise coup: How Claude 3.7 is becoming the coding agent of choice
Anthropic is positioning Claude as the LLM that matters most for enterprise companies. Claude 3.7 Sonnet, released just two weeks ago, set new benchmark records for coding performance.VentureBeat - 6d -
Anthropic just launched a new platform that lets everyone in your company collaborate on AI — not just the tech team
Anthropic launches upgraded Console with team prompt collaboration tools and Claude 3.7 Sonnet's extended thinking controls.VentureBeat - Mar. 6 -
Animal poo can be used to save endangered species from extinction, research finds
Some cells are still alive within the dung, and could be used to boost genetic diversity in certain species. Turning animal poo into offspring sounds like a zoo keeper’s conjuring trick, but it ...The Guardian - 1d -
Nous Research just launched an API that gives developers access to AI models that OpenAI and Anthropic won’t build
Nous Research's new API for "unrestricted" Hermes 3 and DeepHermes-3 AI models features toggle-on reasoning and a developer-first approach.VentureBeat - 4d -
Is daylight saving time bad for your health? A neurologist explains.
Researchers are discovering that "springing ahead" each March for daylight saving time is connected with serious negative health effects.CBS News - Mar. 8 -
Anthropic Just Reached a New Revenue Milestone
Anthropic’s Claude 3.7 Sonnet is a game-changer for the company, which is partly owned by Google.Inc. - 4d -
Forcing women back into the office will cost us millions
New research from the University of Toronto's Rotman School of Management highlights how in-person work environments expose women to higher levels of workplace bias and mistreatment compared to ...The Hill - 6d -
What would change if daylight saving time became permanent?
For the next eight months, most of us will be observing daylight saving time. But what if this became our permanent time?The Hill - Mar. 9 -
Democrats Demand Answers on DOGE’s Use of AI
Members of the House Oversight Committee sent dozens of requests to federal agencies on Wednesday about their use of AI software—and how Elon Musk could benefit.Wired - 5d -
Over half of American adults have used an AI chatbot, survey finds
Artificial intelligence technology is becoming increasingly integral to everyday life. 52% of U.S. adults have used AI large language models like ChatGPT.NBC News - 4d -
3 Ways AI Can Create More Tailored Marketing Strategies for Retail Brands
Personalized marketing at scale used to be impossible. With AI, it’s become a reality.Inc. - 6d -
Out of the lab and into the streets, researchers and doctors rally for science against Trump cuts
Researchers, doctors, their patients and supporters are venturing out of labs, hospitals and offices across the country to stand up to what they call an attack on life-saving science by the Trump ...ABC News - Mar. 7 -
This retirement strategy is a 'game changer' for single-income, married couples, advisor says
If you're a single-income, married couple, you could use a spousal IRA to save more for retirement. Here's what to know.CNBC - Mar. 7 -
What Products Could Europe Levy in Retaliation to Trump’s Tariffs?
The European Union wants to force the United States to the negotiating table with retaliatory tariffs on a range of American products, including some from Republican strongholds.The New York Times - 5d -
Will AI save the UK government £45bn a year?
Ministers believe that increased use of artificial intelligence can help fix the public finances but experts have their doubtsFinancial Times - 3d -
Meet the 21-year-old helping coders use AI to cheat in Google and other tech job interviews
As artificial intelligence becomes more advanced, employers are trying to build workarounds to prevent candidates from cheating in virtual job interviews.CNBC - 6d -
Champagne and Parmigiano under threat from Trump’s tariffs
Producers warn Europe’s finest foods and wine could become unaffordable for ordinary US consumersFinancial Times - Mar. 10 -
Researchers Propose a Better Way to Report Dangerous AI Flaws
After identifying major flaws in popular AI models, researchers are pushing for a new system to identify and report bugs.Wired - 4d -
Jobs lost and lifesaving cures not discovered: Possible impacts of research cuts
Ripple effects of the Trump administration's crackdown on U.S. medical research promise to reach every corner of AmericaABC News - Mar. 6 -
Grab co-founder Anthony Tan says ‘humans who don’t embrace AI will be replaced by AI’
Grab co-founder and CEO Anthony Tan discussed how workers can use AI to become more productive, at Converge Live in Singapore.CNBC - 4d -
Majority of Americans have used AI models like ChatGPT: Survey
A majority of Americans have used ChatGPT-like artificial intelligence (AI) models according to a new survey. In the survey from Elon University’s Imagining the Digital Future Center, 52 percent ...The Hill - 4d -
What are smartphones stealing from us? When mine was taken away, I found out | Alexander Hurst
As a Paris film extra, I surrendered my device and discovered the extraordinary connections I miss while staring at my screen. A few Thursdays ago was a wrap. For my brief acting career, that is. ...The Guardian - Mar. 10 -
What you need to know about Manus, the new AI agentic system from China hailed as a second ‘DeepSeek moment’
Manus AI is designed as a multi-agent system, meaning it combines several AI models to handle tasks independently.VentureBeat - Mar. 10 -
A Groundbreaking Ship That Sank in Lake Superior in 1892 Is Discovered
After searching for two years, researchers discovered the shipwreck of the Western Reserve, an early all-steel ship that broke apart in a gale in 1892 with a sole survivor.The New York Times - 3d -
Stanford professor who co-founded 4 startups: How to use AI as a 'force multiplier' to start a business
"You don't want to compete with someone who has an AI at their shoulder," says Steve Blank. He has co-founded four startups, one of which sold for $329 million.CNBC - 6h -
Cerebras just announced 6 new AI datacenters that process 40M tokens per second — and it could be bad news for Nvidia
Cerebras Systems is challenging Nvidia with six new AI data centers across North America, promising 10x faster inference speeds and 7x cost reduction for companies using advanced AI models like ...VentureBeat - 6d -
Hugging Face co-founder Thomas Wolf just challenged Anthropic CEO’s vision for AI’s future — and the $130 billion industry is taking notice
Hugging Face co-founder Thomas Wolf challenges Anthropic CEO Dario Amodei's "compressed 21st century" vision, arguing AI systems are building "yes-men on servers" rather than the revolutionary ...VentureBeat - Mar. 6 -
Could Angela Prichard Have Been Saved?
A woman repeatedly called police for help when she was attacked, stalked, and intimidated by her husband. It didn't stop him from killing her. "48 Hours" contributor Jonathan Vigliotti reports.CBS News - 1d -
How South Korea’s shipyards could save the US Navy
Asian shipbuilders see opportunity to fill gap as Washington tries to keep pace with China’s naval expansionFinancial Times - 3d -
Science Amid Chaos: What Worked During the Pandemic? What Failed?
As the coronavirus spread, researchers worldwide scrambled to find ways to keep people safe. Some efforts were misguided. Others saved millions of lives.The New York Times - 2d -
What kind of jobs will be impacted by AI?
Last week, online furniture retailer Wayfair announced it would increase its use of generative artificial intelligence and cut 340 tech jobs. It reflects an increase in businesses and companies ...CBS News - 5d -
France has a nuclear umbrella. Could its European allies fit under it?
President Macron has aired the idea that France's deterrence force could be used to defend of other European countries.BBC News - Mar. 6 -
The US Army Is Using ‘CamoGPT’ to Purge DEI From Training Materials
Developed to boost productivity and operational readiness, the AI is now being used to “review” diversity, equity, inclusion, and accessibility policies to align them with President Trump’s orders.Wired - Mar. 6 -
It Isn’t Just Trump. America’s Whole Reputation Is Shot.
What happens when a superpower goes rogue.The New York Times - 3d -
Sony Uses Horizon's Aloy To Demonstrate New AI Tech, And It's About As Impressive As You'd Expect
Sony is testing out new AI software, and it's using a beloved character--Aloy from the Horizon series--to show it off. A YouTube video narrated by Sharwin Raghoebardajal--a software engineering ...GameSpot - 6d -
Inching towards AGI: How reasoning and deep research are expanding AI from statistical prediction to structured problem-solving
In March 2023, OpenAI released GPT-4, which promised "sparks" of AGI. Two years on, the flame is beginning to appear.VentureBeat - 23h -
China could literally rewrite history using AI — and control the future
This isn’t just another policy debate — it’s a defining moment that will shape global discourse and the balance of power for decades to come. The technology we allow to thrive will either spread ...The Hill - 5d -
Can Matchmaking Platforms Save Us From Dating App Fatigue?
Big Dating got singles hooked on convenience culture. But finding a partner is work—and a batch of matchmaking services think they’ve cracked the code for partnership.Wired - Mar. 7 -
Navarro: US in 'difficult transition from Bidenomics to Trumpnomics'
Peter Navarro, a senior trade adviser to President Trump, said Wednesday that the U.S. is “in a difficult transition from Bidenomics to Trumpnomics.” “Help us understand, what is the bigger picture ...The Hill - 4d
More from VentureBeat
-
Inworld AI showcases AI case studies as they move to production
While impressive in controlled demos, today's AI technologies expose critical limitations when transitioning to production-ready games.VentureBeat - 1h -
Gaming luminaries explore industry’s struggles and how its best days are still ahead
Matthew Ball started a big conversation about the state of the game industry when he launched a 224-slide PowerPoint deck. We continued it.VentureBeat - 1h -
Game studios deal with uncertainty by doing more with less | Unity
Unity said in its annual report that game developers faced with uncertainty are doing more with less resources.VentureBeat - 2h -
Visa’s AI edge: How RAG-as-a-service and deep learning are strengthening security and speeding up data retrieval
Visa has reduced data retrieval from hours to mere minutes and blocked $40 billion in fraud thanks to gen AI tools.VentureBeat - 2h -
Zynga teams with Fast & Furious for year-long event series in CSR2
Zynga, in collaboration with Universal Products & Experiences, is speeding into a year of events bringing Fast & Furious film content into CSR2.VentureBeat - 3h
More in Tech
-
Meta Tries to Stop Sarah Wynn-Williams From Further Selling Scathing Memoir
An arbitrator has prevented the employee from promoting her book and disparaging the company until private arbitration concludes.The New York Times - 47m -
Inworld AI showcases AI case studies as they move to production
While impressive in controlled demos, today's AI technologies expose critical limitations when transitioning to production-ready games.VentureBeat - 1h -
Gaming luminaries explore industry’s struggles and how its best days are still ahead
Matthew Ball started a big conversation about the state of the game industry when he launched a 224-slide PowerPoint deck. We continued it.VentureBeat - 1h -
Who are the NASA astronauts who have been stuck in space for 9 months?
NASA's stuck astronauts are heading home now that a replacement crew has arrived at the International Space StationABC News - 1h -
Game studios deal with uncertainty by doing more with less | Unity
Unity said in its annual report that game developers faced with uncertainty are doing more with less resources.VentureBeat - 2h