Google launched its new video model Veo 3.1 with improved audio output, granular editing controls, and better output for image to video. It said that Veo 3.1 builds on May’s Veo 3 release and generates more realistic clips and adheres to prompts better.
The model allows users to add an object to the video and have it blend into the clip’s style, Google said. Soon, users will be able to remove an existing object from the video in Flow, too.
Veo 3 already has edit features such as adding reference images to drive a character, providing the first and last frame to generate a clip using AI, and the ability to extend an existing video based on the last few frames. With Veo 3.1, Google is adding audio to all these features to make the clips more lively.
The company is rolling out the model to its video editor Flow, the Gemini App, along with Vertex and Gemini APIs. It said that since Flow’s launch in May, users have created more than 275 million videos on the app.