‘Insane’: OpenAI introduces GPT-4o native image generation and it’s already wowing users

As AI-generated images become more precise and accessible, GPT-4o represents a significant step forward in the space.Read More
Read more at VentureBeat
Topics
-
OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds
Three, all new proprietary voice models called gpt-4o-transcribe, gpt-4o-mini-transcribe and gpt-4o-mini-tts.VentureBeat - 5d -
OpenAI Unveils New Image Generator for ChatGPT
The company’s chatbot can now create elaborate and unusual images.The New York Times - 14h -
OpenAI Unveils New Image Generator for ChatGPT
The company’s chatbot can now create elaborate and unusual images.The New York Times - 14h -
Google’s native multimodal AI image generation in Gemini 2.0 Flash impresses with fast edits, style transfers
Google Gemini 2.0 Flash enables developers to create illustrations, refine images through conversation, and generate detailed visualsVentureBeat - Mar. 12 -
OpenAI’s Sora Is Plagued by Sexist, Racist, and Ableist Biases
WIRED tested the popular AI video generator from OpenAI and found that it amplifies sexist stereotypes and ableist tropes, perpetuating the same biases already present in AI image tools.Wired - 3d -
Mistral AI drops new open-source model that outperforms GPT-4o Mini with fraction of parameters
France's Mistral AI launches efficient open-source model that outperforms Google and OpenAI offerings with just 24 billion parameters, challenging U.S. tech giants' dominance in artificial ...VentureBeat - Mar. 17 -
How Yelp reviewed competing LLMs for correctness, relevance and tone to develop its user-friendly AI assistant
Yelp found that when it first launched a GPT-4o-powered AI chatbot, usage rates dropped. But training it to sound like a human changed that.VentureBeat - Mar. 7 -
The new best AI image generation model is here: say hello to Reve Image 1.0!
One of the model’s standout capabilities is its strong text rendering performance, addressing a common challenge in AI-generated imagery.VentureBeat - 19h -
Adobe previews AI generated PowerPoints from raw customer data with ‘Project Slide Wow’
The fate of Project Slide Wow depends on user interest and engagement, as Adobe monitors social media conversations.VentureBeat - 6d
More from VentureBeat
-
Microsoft infuses enterprise agents with deep reasoning, unveils data Analyst agent that outsmarts competitors
Microsoft announced Tuesday two significant additions to its Copilot Studio platform: deep reasoning capabilities that enable agents to tackle complex problems through careful, methodical thinking, ...VentureBeat - 8h -
Beyond transformers: Nvidia’s MambaVision aims to unlock faster, cheaper enterprise computer vision
Nvidia is updating its computer vision models with new versions of MambaVision that combine the best of Mamba and transformers to improve efficiency.VentureBeat - 12h -
Gunzilla Games acquires, resurrects Game Informer
Game Informer returns as Gunzilla Games has acquired and relaunched Game Informer, bringing back the staff and the website.VentureBeat - 12h -
METASCALE improves LLM reasoning with adaptive strategies
METASCALE uses a three-stage approach to dynamically choose the right reasoning technique for each promblem.VentureBeat - 13h -
Google releases ‘most intelligent model to date,’ Gemini 2.5 Pro
Gemini 2.5 Pro is now available for Gemini Advanced users and is Google's most capable model with a 1 million token context window.VentureBeat - 14h
More in Tech
-
I Went Undercover in Crypto’s Answer to ‘Squid Game.’ It Nearly Broke Me
I spent 10 days competing in Crypto: The Game, a winner-takes-all contest where hundreds of players try to finesse and backstab their way to claiming a $140,000 cryptocurrency prize.Wired - 46m -
I Went Undercover in Crypto’s Answer to ‘Squid Game.’ It Nearly Broke Me
I spent 10 days competing in Crypto: The Game, a winner-takes-all contest where hundreds of players try to finesse and backstab their way to claiming a $140,000 cryptocurrency prize.Wired - 46m -
The Worm That No Computer Scientist Can Crack
One of the simplest, most over-studied organisms in the world is the C. elegans nematode. For 13 years, a project called OpenWorm has tried—and utterly failed—to simulate it.Wired - 1h -
The Best Programming Language for the End of the World
Once the grid goes down, an old programming language called Forth—and a new operating system called Collapse OS—may be our only salvation.Wired - 1h -
101 Best Amazon Spring Sale Deals (2025)
Now’s your chance to save on our favorite WIRED-tested home and tech gadgets.Wired - 2h