Google's Veo 3 AI model can generate videos with sound

As part of this year's announcements at its I/O developer conference, Google has revealed its latest media generation models. Most notable, perhaps, is Veo 3, the first iteration of the model that can generate videos with sound. It can, for instance, create a video of birds accompanied by audio of their singing, or a city street with the sounds of traffic in the background. Google says Veo 3 also excels at real-world physics and lip syncing. At the moment, the model is only available to Gemini Ultra subscribers in the US within the Gemini app and to enterprise users on Vertex AI. It's also available in Flow, Google's new AI filmmaking tool.

Flow brings Veo, Imagen and Gemini together to create cinematic clips and scenes. Users can describe the final output they want in natural language, and Flow will go to work making it for them. The new tool will only be available to Google AI Pro and Ultra subscribers in the US for now, but Google says it will roll out to more countries soon. 

While the company has released a brand new video-generating model, it hasn't abandoned Veo 2 just yet. In Flow, users will be able to give Veo 2 images of people, scenes, styles and objects to use as references for their desired output. They'll also have access to camera controls that let them rotate scenes and zoom in on specific objects. Plus, they'll be able to broaden their frames from portrait to landscape and add or remove objects from their videos.

Google also introduced its latest image-generating model, Imagen 4, at the event. The company said Imagen 4 renders fine details like intricate fabrics and animal fur with "remarkable clarity" and excels at generating both photorealistic and abstract images. It's also significantly better at rendering typography than its predecessors and can create images in various aspect ratios at resolutions of up to 2K. Imagen 4 is now available via the Gemini app, Vertex AI and Workspace apps, including Docs and Slides. Google said it's also releasing a version of Imagen 4 that's 10 times faster than Imagen 3 "soon."

Finally, to help people identify AI-generated content, which is becoming more and more difficult these days, Google has launched SynthID Detector. It's a portal where users can upload a piece of media they think could be AI-generated, and Google will determine whether it contains a watermark from SynthID, its watermarking and identification tool for AI art. Google has open-sourced its watermarking tool, but not all image generators use it, so the portal won't be able to identify all AI-generated images.

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-veo-3-ai-model-can-generate-videos-with-sound-174541183.html?src=rss
Created 1mo | May 20, 2025, 6:50:15 PM


