Small model, big impact: Patronus AI’s Glider outperforms GPT-4 in key AI evaluation tasks
Patronus AI launches Glider, a breakthrough 3.8B-parameter language model that rivals GPT-4's evaluation capabilities while running on-device, offering transparent AI assessment with detailed explanations for developers and enterprises.
Read more at VentureBeat
UAE’s Falcon 3 challenges open-source leaders amid surging demand for small AI models
The UAE-backed institute has released Falcon 3 in four different sizes with the goal of democratizing access to advanced AI capabilities.
VentureBeat - 2d
House task force releases sweeping end-of-year report on AI
The House Task Force on Artificial Intelligence (AI) released its sweeping end-of-year report Tuesday, laying out a roadmap for Congress as it crafts policy surrounding the advancing technology. ...
The Hill - 2d
4 Rules for Going From Small to Big
Thinking big isn’t enough. You need to plan and act big as well.
Inc. - 5d
Microsoft’s smaller AI model beats the big guys: Meet Phi-4, the efficiency king
Microsoft’s new AI model, Phi-4, outperforms larger competitors like Google’s Gemini Pro with superior mathematical reasoning while using fewer resources.
VentureBeat - Dec. 13
How RapidCanvas automates 70% of data tasks for gen AI projects
With its context-aware AI agents, RapidCanvas is taking on leading players such as DataRobot, Dataiku, Palantir and Alteryx.
VentureBeat - Dec. 11
How AI could impact medicine
Chief medical correspondent Dr. Jon LaPook joins "CBS Mornings Plus" to discuss the impact of artificial intelligence in medicine. AI is being used for things like interpreting imaging, flagging ...
CBS News - Dec. 10
How Databricks is using synthetic data to simplify evaluation of AI agents
Multiple enterprises are using Databricks' synthetic data API and seeing a significant reduction in the time needed to improve agent quality and deployment.
VentureBeat - Dec. 9
Sakana AI’s CycleQD outperforms traditional fine-tuning methods for multi-skill language models
CycleQD merges the skills of expert models in clever ways to create many new models with multiple skills, no fine-tuning required.
VentureBeat - Dec. 6
Google says its AI agent outperformed the world's best weather predictions
Google said Wednesday its artificial intelligence (AI) agent outperformed the world’s best weather predictions. In a blog post, Ilan Price and Matthew Willson, researchers with Google’s DeepMind, ...
The Hill - Dec. 5
More from VentureBeat
ChatGPT adds more PC and Mac app integrations, getting closer to piloting your computer
VentureBeat - 10h
SpongeBob SquarePants is the latest icon to join the UEFN platform
VentureBeat - 11h
Gamefam closes year of growth with 5 of top 15 branded games on Roblox
VentureBeat - 13h
Google unveils new reasoning model Gemini 2.0 Flash Thinking to rival OpenAI o1
VentureBeat - 14h
Stable Diffusion 3.5 hits Amazon Bedrock: What it means for enterprise AI workflows
VentureBeat - 14h