Google's Veo 3 AI model can generate videos with sound

As part of this year's announcements at its I/O developer conference, Google has revealed its latest media generation models. Most notable, perhaps, is the Veo 3, which is the first iteration of the model that can generate videos with sounds. It can, for instance, create a video of birds with an audio of their singing, or a city street with the sounds of traffic in the background. Google says Veo 3 also excels in real-world physics and in lip syncing. At the moment, the model is only available for Gemini Ultra subscribers in the US within the Gemini app and for enterprise users on Vertex AI. It's also available in Flow, Google's new AI filmmaking tool. 

Flow brings Veo, Imagen and Gemini together to create cinematic clips and scenes. Users can describe the final output they want in natural language, and Flow will go to work making it for them. The new tool will only be available to Google AI Pro and Ultra subscribers in the US for now, but Google says it will roll out to more countries soon. 

While the company has released a brand new video-generating model, it hasn't abandoned Veo 2 just yet. Users will be able to give Veo 2 images of people, scenes, styles and objects to use as reference for their desired output in Flow. They'll have access to camera controls that will allow them to rotate scenes and zoom into specific objects for Flow, as well. Plus, they'll be able to broaden their frames from portrait to landscape if they want to and add or remove objects from their videos. 

Google has also introduced its latest image-generating model, Imagen 4, at the event. The company said Imagen 4 does fine details like intricate fabrics and animal fur with "remarkable clarity" and excels at generating both photorealistic and abstract images. It's also significantly better at rendering typography than its predecessors and can create images in various aspect ratios with resolutions of up to 2K. Imagen 4 is now available via the Gemini app, Vertex AI and in Workspace apps, including Docs and Slides. Google said it's also releasing a version of Imagen 4 that's 10 times faster than Imagen 3 "soon." 

Finally, to help people identify AI-generated content, which is becoming more and more difficult these days, Google has launched SynthID Detector. It's a portal where users can upload a piece of media they think could be AI-generated, and Google will determine if it contains SynthID, its watermarking and identification tool for AI art. Google had open sourced its watermarking tool, but not all image generators use it, so the portal still won't be able to identify all AI-generated images. 

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-veo-3-ai-model-can-generate-videos-with-sound-174541183.html?src=rss https://www.engadget.com/ai/googles-veo-3-ai-model-can-generate-videos-with-sound-174541183.html?src=rss
Created 2mo | May 20, 2025, 6:50:15 PM


Login to add comment

Other posts in this group

Never fear, reaction videos are still allowed under YouTube's new 'inauthentic content' policy

YouTube has clarified its rules about repetitious content and your favorite reaction video channel won't be impacted. Earlier this month, the platform said it would be changing its rules for moneti

Jul 15, 2025, 12:40:12 AM | Engadget
Claude AI now integrates with Canva

Anthropic's Claude can now create and edit designs with visual studio Canva from within an AI chat. This integration is powered by a Canva server that uses Anthropic's Model Context Protocol, or MC

Jul 14, 2025, 10:20:21 PM | Engadget
Meta says it's cracking down on Facebook creators who steal content

Meta is going after creators who rip off other users' content as part of a broader effort to fix Facebook's feed. In its

Jul 14, 2025, 10:20:20 PM | Engadget
TikTok owner ByteDance is reportedly building its own mixed reality goggles

ByteDance, the parent company of TikTok, is reportedly working on mixed reality goggles,

Jul 14, 2025, 10:20:19 PM | Engadget
Meta announces huge new data centers, but they could gobble up millions of gallons of water per day

Meta is building several gigawatt-sized data centers to power AI,

Jul 14, 2025, 8:10:13 PM | Engadget
Best Buy is restocking the Nintendo Switch 2 on July 17

If you've been hunting high and low for a Nintendo Swi

Jul 14, 2025, 8:10:12 PM | Engadget