The AI Video Race Is Moving Beyond Pretty Clips
✨ AI Summary
🔊 جاري الاستماع
InnovationAIThe AI Video Race Is Moving Beyond Pretty ClipsByRon Schmelzer,Contributor.Forbes contributors publish independent expert analyses and insights. Ron Schmelzer covers AI and data best practices at Forbes since 2018Follow AuthorMay 22, 2026, 12:01pm EDT--:-- / --:--This voice experience is generated by AI. Learn more.This voice experience is generated by AI. Learn more.Google CEO Sundar Pichai speaks during the 2026 Google I/O technology developer conference in Mountain View, California, on May 19, 2026. (Photo by Karl Mondon / AFP via Getty Images)AFP via Getty ImagesGoogle used its latest I/O event this week to introduce Gemini Omni Flash, a new AI model that can take text, photos, video, and audio as inputs, then produce short video clips with audio. It is launching through the Gemini app, Google Flow, and YouTube Shorts, with current clips up to 10 seconds and longer formats planned. Google’s latest video announcements show that the industry is focusing on more than just another text-to-video demo. AI is working its way more into the process of video creation.Early AI video tools worked like most other prompt-to-output generators. Type a prompt and get a clip, and if you don’t like it then just try again. Gemini Omni Flash moves closer to a video assistant. You can give it existing media, ask it to change that media and use conversation to guide the result. Google says the Omni family is designed around creating “anything from any input,” with video as the first major format. Reports from Google I/O 2026 say Gemini Omni Flash is launching through the Gemini app, Google Flow, and YouTube Shorts. The Verge also reported that current clips are up to ten seconds, with longer formats planned. A Broad Range of Google AI Video OptionsGoogle is adding more to their already somewhat overwhelming line of video-oriented AI models. Google already has Veo, its dedicated AI video model. Veo 3.1 is built for high fidelity video generation, with native audio, stronger...





