Google's Veo 3 AI model can generate videos with sound

As part of this year's announcements at its I/O developer conference, Google has revealed its latest media generation models. Most notable, perhaps, is the Veo 3, which is the first iteration of the model that can generate videos with sounds. It can, for instance, create a video of birds with an audio of their singing, or a city street with the sounds of traffic in the background. Google says Veo 3 also excels in real-world physics and in lip syncing. At the moment, the model is only available for Gemini Ultra subscribers in the US within the Gemini app and for enterprise users on Vertex AI. It's also available in Flow, Google's new AI filmmaking tool. 

Flow brings Veo, Imagen and Gemini together to create cinematic clips and scenes. Users can describe the final output they want in natural language, and Flow will go to work making it for them. The new tool will only be available to Google AI Pro and Ultra subscribers in the US for now, but Google says it will roll out to more countries soon. 

While the company has released a brand new video-generating model, it hasn't abandoned Veo 2 just yet. Users will be able to give Veo 2 images of people, scenes, styles and objects to use as reference for their desired output in Flow. They'll have access to camera controls that will allow them to rotate scenes and zoom into specific objects for Flow, as well. Plus, they'll be able to broaden their frames from portrait to landscape if they want to and add or remove objects from their videos. 

Google has also introduced its latest image-generating model, Imagen 4, at the event. The company said Imagen 4 does fine details like intricate fabrics and animal fur with "remarkable clarity" and excels at generating both photorealistic and abstract images. It's also significantly better at rendering typography than its predecessors and can create images in various aspect ratios with resolutions of up to 2K. Imagen 4 is now available via the Gemini app, Vertex AI and in Workspace apps, including Docs and Slides. Google said it's also releasing a version of Imagen 4 that's 10 times faster than Imagen 3 "soon." 

Finally, to help people identify AI-generated content, which is becoming more and more difficult these days, Google has launched SynthID Detector. It's a portal where users can upload a piece of media they think could be AI-generated, and Google will determine if it contains SynthID, its watermarking and identification tool for AI art. Google had open sourced its watermarking tool, but not all image generators use it, so the portal still won't be able to identify all AI-generated images. 

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-veo-3-ai-model-can-generate-videos-with-sound-174541183.html?src=rss https://www.engadget.com/ai/googles-veo-3-ai-model-can-generate-videos-with-sound-174541183.html?src=rss
Creato 1mo | 20 mag 2025, 18:50:15


Accedi per aggiungere un commento

Altri post in questo gruppo

Engadget review recap: Switch 2, Playdate games and a Framework laptop

The Nintendo Switch 2 has been all the rage around the Engadget HQ for the last few weeks. Even the editors who didn't write the official review have had their hands glued to their new toys. Of cou

21 giu 2025, 13:30:09 | Engadget
Silky soccer, romancing everything and other new indie games worth checking out

Summer is finally here — at least for those of us north of the equator — and you might be planning to spend more time outdoors. Thanks to a swathe of great handheld devices, it's never been easier

21 giu 2025, 11:10:14 | Engadget
This Amazon bundle includes the Sony WH-1000XM6 headphones and a free $30 gift card

There are a few undeniable truths in this world: the sky is blue, Mario Kart is always a good idea and

21 giu 2025, 11:10:13 | Engadget
NYC proposes 5 percent raise for rideshare drivers in a bid to appease Uber and Lyft

New York City's Taxi and Limousine Commission (TLC) have settled on new minimum-wage rules for rideshare drivers,

20 giu 2025, 23:30:13 | Engadget
Windows parental controls are blocking Chrome

Stop me if you've heard this one before: Microsoft is making it harder to use Chrome on Windows. The culprit? This time, it's

20 giu 2025, 18:50:15 | Engadget