Google's Veo AI Model Turns Images And Text Into High-Quality HD Videos
![hero Veo](https://images.hothardware.com/contentimages/newsitem/66201/content/hero-Veo.jpg)
![Google Veo AI example%20(2)](https://images.hothardware.com/contentimages/newsitem/66201/content/Google_Veo_AI_example%20(2).jpg)
Ad created using static images fed into Veo (Credit: Agoda)
At present, Veo is able to produce 1080p resolution videos from static pictures, of which users can set different cinematic and visual elements through text prompts. Google's announcement doesn't specify how long videos can be, but at Google I/O, the company said that it would be "beyond a minute," whatever that means exactly.
If users so choose, they can feed Veo with images created by Google's latest Imagen 3 text-to-image generator. Google calls the tool the first hyperscaler to offer an image-to-video model, allowing companies to not only edit images via textual prompts, but also infuse said images with brand assets, style, logos, etc. The tool will be open to all Google Cloud subscribers beginning next week.
In either case, Google assures users that steps have been taken to prevent the tools from creating questionable content or that infringes on copyrights. Moreover, Google will embed all content with digital watermarks using its SynthID tool.
Based on the samples provided by Google, video and image quality are high enough that it could fool most viewers. A dead giveaway is that all the created videos are in slow motion, but in terms of execution, Veo and Imagen produce content on par with some of the best we've seen so far, such as Sora. If only Coca-Cola had their hands on these tools before it made this monstrosity.