Live now · Replacing Veo in the Gemini app

Gemini Omni
Speak it. See it. Share it.

Announced on the Google I/O 2026 main stage, Gemini Omni unifies Gemini's world understanding with native multimodal generation — text, image, video and synced audio in a single architecture. It now replaces Veo 3.1 inside the Gemini app and ships with image-to-video, video-to-video editing and a personal AI avatar.

Open the prompt builder See the capabilities Articles

Unified model Synced audio In-chat editing

Omni

Text

Image

Video

Audio

5–10s Clip length

1080p Max output

16:9 · 9:16 · 1:1 Aspect ratios

I/O 2026 Expected reveal

Official demos

See what Gemini Omni actually outputs

Every clip below is embedded straight from Google's official Gemini Omni product page: text-to-video, image-to-video, style transfer, chat editing, video-to-video and the AI avatar — the full capability surface.

All demo videos are © Google, used here for informational aggregation; streamed directly from storage.googleapis.com/gweb-gemini-cdn.

Speak it. See it. Share it.

Gemini Omni's main hero reel: create, remix and edit videos through conversation.

View the official page

Text → video

Step into the story

A single text prompt produces a multi-shot clip with cohesive environment and camera language.

Image → video

Bring photos to life

Upload reference images and Omni drives the motion, filling in the timeline automatically.

Style · template

Keep the soul of the shot

Swap backgrounds, change the wardrobe or transfer styles — your subject keeps its details.

Video → video

Remix an existing clip

Re-cast an existing piece of footage in a new style — lighting, lens or even material rewritten by prompt.

Chat editing

Easy editing

Re-cast characters, adjust lighting, stabilise shots — all by chatting, no regeneration needed.

AI avatar

Be the star of your own show

Set up an AI avatar once, then star in every future video without re-uploading photos.

Capabilities

The whole pipeline collapses into one model

Unlike specialised video models such as Veo, Sora 2, Seedance 2.0 or Kling, Gemini Omni keeps language reasoning, image generation, video generation and audio synthesis under one architecture.

Native multimodal output

A single prompt produces matching text, keyframes and video, with consistent characters, style and lighting carrying across formats.

One unified Gemini stack

No more chaining of specialised models. Text, image, video and audio share the same weights and the same long context.

Synced native audio

Ambient sound, score and dialogue are aligned with the picture in the same forward pass — footsteps land on the beat, lips match speech on first export.

Direct in-chat editing

Swap an object, change the lighting, adjust a camera move in natural language — no full regeneration, echoing the Nano Banana editing playbook.

Remix and steer

Upload an existing clip and redirect it with prompts. Reference images, videos and audio can be combined in a single instruction.

Templates & styles

Built-in templates for product ads, Reels, music videos and cinematic shorts lower the floor for first-time users while keeping camera language consistent.

Specs

What can be pieced together before the keynote

Numbers below are aggregated from Reddit/X leaks and reporting by TestingCatalog, Programming Insider and OfficeChai.

Dimension	Known signal
Model family	Google Gemini — successor branding for the Veo line
Model ID	bard_eac_video_generation_omni / v3smm-lora-prod
Clip length	5 / 8 / 10 seconds per generation, chainable in-app
Resolution	480p / 720p / 1080p
Aspect ratios	16:9, 9:16, 1:1
Audio	Natively synthesized, synced in a single pass
Inputs	Text / image / video / audio references
Access	Live inside the Gemini app for 18+ Google AI Plus / Pro / Ultra subscribers
Quota signal	Reports say two Omni generations burn ~86% of an AI Pro daily quota

Architecture

Three product lines collapse into one Omni

Google's generative stack used to be split across Veo for video, Nano Banana / Imagen for image and Gemini for text. Omni rolls those into a single architecture.

Before

Veo 3.1

Video + native audio

Nano Banana / Imagen

Image generation & editing

Gemini 2.5 / 3.x

Reasoning · long context

Now · Omni

Gemini Omni

Text · image · video · audio, one model, one prompt

Text Image Video Audio

Use cases

From a single brief to publishable content

A unified model with long context and synced audio means teams can write one coherent brief and walk away with a finished cut.

Product ads

Hero shots, packaging reveals and lifestyle cuts shipped with ambient audio already locked.

Reels & Shorts

Vertical 9:16 clips with on-mic dialogue and beat-synced motion, built for scroll-stopping social.

Music videos

Reference a track and Omni cuts visuals to the beat, keeping a consistent character across shots.

Cinematic shorts

Chain multiple 10-second omni-clips into multi-shot sequences with continuous lighting and audio bed.

Landing-page hero loops

Loopable 16:9 atmospheric clips for SaaS, fashion and DTC sites — branded and silent-friendly.

Explainers & tutorials

Turn a script into a narrated sequence with lip-synced dialogue and matching ambient sound.

Compare

Where Omni sits in the 2026 video stack

Aggregated from Artificial Analysis, Looksy AI, Oimi AI and the official keynotes — for orientation, not benchmark scores.

Model	Maker	Architecture	Native audio	Clip length
Gemini Omni Omni	Google	Unified omni (video + image + audio)	Synced in one pass	5 / 8 / 10s
Veo 3.1	Google	Specialised video model	Yes	~8s
Seedance 2.0	ByteDance	Specialised multi-modal video	Yes	up to 15s / shot
Sora 2	OpenAI	Specialised video model	Yes	~20s
Kling V3.0	Kuaishou	Specialised video model	Limited	~10s

Free access

Is Gemini Omni free? How to use it for free in 2026

Gemini Omni Flash is free on Google Flow's free tier, YouTube Shorts and the YouTube Create app. The standalone Gemini app needs Google AI Plus, Pro or Ultra. Open the official surfaces below.

labs.google Free

Google Flow · Free tier + plans

Google's AI filmmaking studio. The free tier includes Gemini Omni Flash with usage limits; upgrade to Plus / Pro / Ultra for higher limits and pro tools.

Open

youtube.com Free

YouTube Shorts · Free Gemini Omni

Generate Gemini Omni Flash clips inside Shorts at no cost. The cheapest official way to try Omni for free.

Open

youtube.com Free

YouTube Create App · Free mobile editor

Mobile-first editor with Gemini Omni Flash built in. No AI subscription required.

Open

gemini.google.com Paid plan

Gemini app · Plus / Pro / Ultra

Use Omni inside the official Gemini app. Requires a Google AI Plus, Pro or Ultra plan.

Open

How to generate Gemini Omni videos for free

Fastest free path: sign in to YouTube Shorts or the YouTube Create app, pick a template and prompt with the same multi-shot hooks the Gemini app uses.

Draft for free in YouTube Shorts to lock camera language and pacing.
Move to a Google AI Plus or Pro plan only when you need brand-grade output.
Use in-chat editing instead of re-running to stretch every paid credit further.

Read the free-access guide Compare paid plans Read the full pricing breakdown Open the free prompt builder

Free quotas and prices change by region and account. Always confirm on the official surfaces linked above.

Timeline

From the first leak to launch — and what ships next

Ordered by public report date. Updated for the May 19, 2026 launch with what is live now and what is still on the way.

2026 · 05 · 02
First "Powered by Omni" string

X user @Thomas16937378 spotted "Start with an idea or try a template. Powered by Omni." inside the Gemini video tab.
2026 · 05 · 11
Full preview card inside Gemini mobile

TestingCatalog and Chetaslua surfaced the "Meet our new video model" card, the full model ID and the 10-second clip cap.
2026 · 05 · 12 – 18
Demos circulate in the wild

A "professor solving trig on a chalkboard" clip showcased text coherence and physical fidelity, sparking heavy comparison with Veo 3.1.
2026 · 05 · 19
Official launch at Google I/O 2026

Gemini Omni Flash goes live globally inside the Gemini app, Google Flow, YouTube Shorts Remix and YouTube Create — 10‑second clips, paid surfaces from $7.99/mo AI Plus and free on YouTube.
2026 · 05 · 19 onward
Avatars, character consistency and conversational editing

Launch ships with a personal AI Avatar, persistent character identity across scenes, physics‑aware rendering and chat‑style multi‑turn editing — every clip carries an imperceptible SynthID watermark.
Mid-2026 · API still pending
Developer & enterprise API via Gemini API and Vertex AI

As of mid-June 2026 the developer API is still not live. Google maintains it is "coming in the coming weeks" via the Gemini API and Vertex AI, with no official pricing yet — watch the Gemini API changelog for the drop.
On the roadmap
Gemini Omni Pro + image & audio outputs

Google has announced a more capable Gemini Omni Pro with no release date ("when it sees a step change above Flash"), plus image and audio output modalities beyond the current video-first launch — the full "any input → any output" promise.

FAQ

The questions people ask most about Gemini Omni

What exactly is Gemini Omni?

It's Google's upcoming unified multimodal model that natively generates text, image, video and synced audio inside one architecture — effectively merging Veo, Imagen and Gemini.

Is Gemini Omni free?

Partly. Gemini Omni Flash is free on Google Flow's free tier, YouTube Shorts and the YouTube Create app. Using Omni inside the standalone Gemini app requires a paid Google AI Plus, Pro or Ultra plan.

How much does Gemini Omni cost?

Google AI Plus starts around US$7.99 per month, AI Pro is the most common creator tier, and AI Ultra is roughly US$100 per month. Two Omni Flash generations consume about 86% of the AI Pro daily quota, so budget retries carefully. The developer API will arrive with its own pricing.

When will it ship?

It already shipped. Google announced Gemini Omni on the Google I/O 2026 main stage on May 19, 2026, simultaneously publishing the official product page and demo videos.

How does it relate to Veo 3.1?

Gemini Omni is the successor to Veo inside the Gemini app — Google explicitly says Omni "will replace Veo in the Gemini app". The video stack is now folded into the same architecture as Gemini text and image.

Does it really generate sound?

Yes. Ambient sound, score and dialogue are produced in the same pass as the video — that's the whole reason for the 'omni' name.

What is the current clip-length limit?

The official product page states up to 10-second clips, with native audio, up to 5 photo references and multi-turn editing.

How will pricing work?

Gemini Omni requires a Google AI Plus, Pro or Ultra plan and you must be 18+. Some features (avatars, video-to-video editing) may be restricted in certain countries.

What is the Gemini Omni AI avatar?

An optional digital version of you that lets Gemini generate videos which look and sound like you, with no need to re-upload photos each time — and only you can use your own avatar.

Sources

Primary reports and public links

Everything on this page is aggregated from the public sources below. Cross-reading is recommended.

blog.google Read source

Gemini Omni Speak it. See it. Share it.

Quick stats

See what Gemini Omni actually outputs

Speak it. See it. Share it.

Step into the story

Bring photos to life

Keep the soul of the shot

Remix an existing clip

Easy editing

Be the star of your own show

The whole pipeline collapses into one model

Native multimodal output

One unified Gemini stack

Synced native audio

Direct in-chat editing

Remix and steer

Templates & styles

What can be pieced together before the keynote

Three product lines collapse into one Omni

From a single brief to publishable content

Product ads

Reels & Shorts

Music videos

Cinematic shorts

Landing-page hero loops

Explainers & tutorials

Where Omni sits in the 2026 video stack

Is Gemini Omni free? How to use it for free in 2026

Google Flow · Free tier + plans

YouTube Shorts · Free Gemini Omni

YouTube Create App · Free mobile editor

Gemini app · Plus / Pro / Ultra

How to generate Gemini Omni videos for free

From the first leak to launch — and what ships next

First "Powered by Omni" string

Full preview card inside Gemini mobile

Demos circulate in the wild

Official launch at Google I/O 2026

Avatars, character consistency and conversational editing

Developer & enterprise API via Gemini API and Vertex AI

Gemini Omni Pro + image & audio outputs

The questions people ask most about Gemini Omni

Primary reports and public links

Google Blog · Introducing Gemini Omni

Google Blog · 100 things from Google I/O 2026

Google · Official Gemini Omni page

DataCamp · Google I/O 2026 deep dive

TestingCatalog · Programming Insider report

OfficeChai · Gemini Omni Spotted

Looksy AI · Gemini Omni product page

Gemini 2.5 technical report

Gemini Omni
Speak it. See it. Share it.