Google's Veo 3 AI model can generate videos with sound

As part of this year's announcements at its I/O developer conference, Google has revealed its latest media generation models. Most notable, perhaps, is Veo 3, the first iteration of the model that can generate videos with sound. It can, for instance, create a video of birds accompanied by audio of their singing, or a city street with the sounds of traffic in the background. Google says Veo 3 also excels at real-world physics and lip syncing. At the moment, the model is only available to Gemini Ultra subscribers in the US within the Gemini app and to enterprise users on Vertex AI. It's also available in Flow, Google's new AI filmmaking tool.

Flow brings Veo, Imagen and Gemini together to create cinematic clips and scenes. Users can describe the final output they want in natural language, and Flow will go to work making it for them. The new tool will only be available to Google AI Pro and Ultra subscribers in the US for now, but Google says it will roll out to more countries soon. 

While the company has released a brand-new video-generating model, it hasn't abandoned Veo 2 just yet. In Flow, users will be able to give Veo 2 images of people, scenes, styles and objects to use as references for their desired output. They'll also have access to camera controls that let them rotate scenes and zoom in on specific objects. Plus, they'll be able to broaden their frames from portrait to landscape and add or remove objects from their videos.

Google also introduced its latest image-generating model, Imagen 4, at the event. The company said Imagen 4 renders fine details like intricate fabrics and animal fur with "remarkable clarity" and excels at generating both photorealistic and abstract images. It's also significantly better at rendering typography than its predecessors and can create images in various aspect ratios at resolutions of up to 2K. Imagen 4 is now available via the Gemini app, Vertex AI and in Workspace apps, including Docs and Slides. Google said it's also releasing a version of Imagen 4 that's 10 times faster than Imagen 3 "soon."

Finally, to help people identify AI-generated content, which is becoming increasingly difficult, Google has launched SynthID Detector. It's a portal where users can upload a piece of media they think could be AI-generated, and Google will determine whether it contains a watermark from SynthID, its watermarking and identification tool for AI art. Google has open-sourced its watermarking tool, but not all image generators use it, so the portal still won't be able to identify all AI-generated images.

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-veo-3-ai-model-can-generate-videos-with-sound-174541183.html?src=rss
Created 20 May 2025, 18:50:15


