Gemini Omni: Google's New AI Video Creation & Editing Tool

May 21 · Source: blog.google

Summary

VocoLife News. Gemini Omni is a new model that can create anything from any input, starting with video. Users can combine images, audio, video, and text as input to generate high-quality videos, then edit the results through conversation. Gemini Omni Flash, the first model in the Omni family, is rolling out to the Gemini app, Google Flow, and YouTube Shorts.

Editing works in natural language, and each instruction builds on the last: characters and physics stay consistent, and the scene retains previous edits. Users can change a specific element or transform the whole video, reimagine the action, add new characters or objects, or alter a single moment. Across multiple turns, they can also change the environment, angle, style, or specific details without losing the original scene's context.

Gemini Omni does more than build realistic scenes; it reasons about what should happen next, combining an understanding of physics with Gemini's knowledge of history, science, and cultural context to create meaningful storytelling. This could change how people create and interact with video content.

Read the full article on blog.google

This is an AI-generated audio summary. Always check the original source for complete reporting.
