Small model, big impact: Patronus AI’s Glider outperforms GPT-4 in key AI benchmarks
Patronus AI launches Glider, a breakthrough 3.8B parameter language model that rivals GPT-4's evaluation capabilities while running on-device, offering transparent AI assessment with detailed explanations for developers and enterprises.Read More
Read more at VentureBeat
Topics
-
UAE’s Falcon 3 challenges open-source leaders amid surging demand for small AI models
The UAE-backed institute has released Falcon 3 in four different sizes with the goal of democratizing access to advanced AI capabilities.VentureBeat - 1d -
Nvidia Introduces Device Aimed at Small Firms, Hobbyists for AI Use
The $249 version of its Jetson computer for artificial-intelligence applications is half the price of its predecessor.The Wall Street Journal - 2d -
4 Rules for Going From Small to Big
Thinking big isn’t enough. You need to plan and act big as wellInc. - 5d -
Microsoft’s smaller AI model beats the big guys: Meet Phi-4, the efficiency king
Microsoft’s new AI model, Phi-4, outperforms larger competitors like Google’s Gemini Pro with superior mathematical reasoning while using fewer resources.VentureBeat - 6d -
How AI could impact medicine
Chief medical correspondent Dr. Jon LaPook joins "CBS Mornings Plus" to discuss the impact of artificial intelligence in medicine. AI is being used for things like interpreting imaging, flagging ...CBS News - Dec. 10 -
A new benchmark for AI investment: Swift Ventures unveils system to separate talk from action
Swift Ventures launches an index of public companies truly investing in AI, showing 37% annual growth and outperforming the Nasdaq and S&P.VentureBeat - Dec. 9 -
Sakana AI’s CycleQD outperforms traditional fine-tuning methods for multi-skill language models
CycleQD merges skills of experts models in clever ways to create many new models with multiple skills, no fine-tuning required.VentureBeat - Dec. 6 -
Google says its AI agent outperformed the world's best weather predictions
Google said Wednesday its artificial intelligence (AI) agent outperformed the world’s best weather predictions. In a blog post, Ilan Price and Matthew Willson, researchers with Google’s DeepMind, ...The Hill - Dec. 5 -
A New Benchmark for the Risks of AI
MLCommons provides benchmarks that test the abilities of AI systems. It wants to measure the bad side of AI next.Wired - Dec. 4
More from VentureBeat
-
Women leaders are creating opportunities at the crossroad of gaming and media
VentureBeat - 2h -
Samsung and Netflix partner on Squid Game Season 2 and Squid Game Unleashed
VentureBeat - 3h -
Zentry brings Ragnarok Landverse to Ronin Web3 network
VentureBeat - 4h -
Nazara’s Nodwin Gaming acquires AFK Gaming
VentureBeat - 9h -
Active investors in game devs could fall in 2025 | Pitchbook
VentureBeat - 13h