Google model

Veo 3.1

Name: Veo 3.1
Brand: Google DeepMind
Availability: InStock

Directable, cinematic clips from text or images — with sound that matches the scene.

Built for brand hero shots, polished product reveals, and repeatable campaign variants.

Text→VideoImage→Video1080p8s16:9 / 9:16Audio

Start generating with Veo 3.1 Compare Veo 3.1 vs Veo 3.1 Fast →

Pay-as-you-go · Price shown before you generate

Google Veo 3 Fast AI video example: Cinematic 8-second TV commercial in 16:9 with sound. From a tiny FPV-style camera flying indoors, we... — Price$0.52/s
Duration8s
Format16:9
View render →

Best use cases

Brand hero shotsProduct revealsCampaign variantsSocial ads & format variantsCinematic B-rollPre-viz & concept tests

Why Veo 3.1 is powerful

Directable framing (Strong composition and camera language for repeatable shots.)
Sound in the same pass (Dialogue, ambience, and SFX when you want it (or keep it silent for post).)
Better prompt follow-through (Clearer intent transfer from “director notes” to the final clip.)
Built for continuity workflows (References + extension tools help you carry a look across multiple beats.)

Real Specs – Veo 3.1 in MaxVideoAI (720p/1080p, 4–8s)

The limits that shape your renders.

Price / secondAudio on $0.52/s · Audio off $0.26/s

Text-to-Video

Image-to-Video

Video-to-Video

Reference image / style reference

Reference video

Max resolution1080p

Max duration8s

Aspect ratios16:9 / 9:16

FPS options24 fps

Output formatMP4

Audio output

Native audio generation

Lip sync

Camera / motion controlsAdvanced

WatermarkNo (MaxVideoAI)

Release dateOct 2025

Directable framingDetails

Strong at director notes, framing, and camera language for repeatable shots. Great when you need consistent brand composition.

Use wide/medium/close and camera verbs.
Anchor composition with a hero subject.
Describe movement before styling.
Reuse the same shot recipe for variants.

Sound & continuityDetails

Sound can be generated in the same pass, and references help carry a look across beats. Keep the visual recipe stable for continuity.

Add light SFX or ambience cues.
Lock palette and lighting between shots.
Make small, controlled prompt deltas.
Use reference frames when possible.

Example Gallery: Real Veo 3.1 Outputs

See live Veo 3.1 renders powered by the same settings you have in MaxVideoAI.

Google Veo 3.1 AI video example: Shot 1 (0 - 3 s): macro close-up of one earbud rotating slowly on a wooden desk, shallow depth of field...

Google Veo 3.1 · 8s

Google Veo 3.1 · 6s

Google Veo 3.1 · 8s

Google Veo 3.1 · 8s

Google Veo 3.1 · 8s

Google Veo 3 Fast · 8s

Recreate this shot →

View all Veo 3.1 examples →

How to Write a Great Veo 3.1 Prompt

Google DeepMind

Veo works best when you specify subject, action, context, camera, and style in plain language.

Tip: duration + aspect ratio are set in the UI - your prompt controls subject, action, camera, lighting, style, and sound cues.

Quick prompt (fast iteration)

Use 1–2 sentences when you want variations.

Quick = variations. Use for fast iteration.

Template (copy/paste)

[Subject] [action] in [context], [camera shot + movement], [lighting], [style], [ambience/sound cue].

Example

Handheld smartphone UGC clip of a woman unboxing a new skincare bottle at a kitchen table. She peels the seal, smiles, and turns the bottle toward camera. Soft window daylight, natural colors, subtle room tone + packaging crinkle.

Structured prompt (best for reliable results)

Separate information so the model can follow it consistently.

Structured = consistency. Use when you need reliable results.

Template (copy/paste)

Subject:
[Who/what + 2-3 defining details.]

Action:
[One clear physical action.]

Context:
[Location + time of day + environment.]

Camera:
- Shot + angle
- Movement: [pan / dolly / handheld / static]
- Lens feel: [e.g., 35mm, shallow DOF]

Style + lighting:
[Look / palette / key light.]

Ambience + sound (optional):
[Atmosphere + 1-2 SFX cues.]

Constraints:
State the desired result (e.g., "clean background") instead of "no/don't".

Pro prompt (ultra-specific "studio brief")

Use this when you need a very specific look or continuity across shots.

Pro = continuity. Use for precise, repeatable looks.

Template (copy/paste)

Director brief:
[Goal for the shot + intended vibe.]

Subject + wardrobe/props:
[Key traits + textures.]

Set + continuity:
[Location, time, weather, 3 visual anchors.]

Cinematography:
- Framing + angle
- Movement (one move)
- Lens + depth of field
- Composition rule (centered / thirds / negative space)

Lighting + grade:
[Key light + contrast + palette anchors.]

Sound (optional):
[Ambience + 1-2 SFX cues.]

Constraints:
Describe what to include, not what to exclude.

Storyboard prompt (shot list)

Use for short multi-beat sequences in one clip.

Storyboard = beat timing. Use for mini-stories in one clip.

Template (copy/paste)

Shot list (same subject + same setting)
Duration: [4/6/8s]

Shot 1 (0-2s):
[Framing + action + camera move]

Shot 2 (2-5s):
[Framing + action + camera move]

Shot 3 (5-8s):
[Final action/reveal + camera settle]

Look + mood:
[Lighting + palette + atmosphere]

Sound (optional):
[Ambience + 1-2 cues]

Demo: a sequenced prompt (with native audio)

Tips & Limitations

Veo 3.1 is easiest to control when you write like a shot brief: framing, one camera move, and clear lighting cues.

What works best

Director notes win: shot size + angle + one camera move (dolly / pan / handheld) before you describe style.
Keep one hero subject per shot; make the action physical and easy to read.
For continuity, reuse the same “shot recipe” (palette, lighting, lens feel) and change only one variable at a time.
Audio works best with minimal cues: ambience + 1 key sound, or one short VO line.

Common problems → fast fixes

Prompt drift / ignores details → cut extra actions, move camera + framing to the first lines, and keep constraints positive (“clean background”, “centered subject”).
Motion feels messy → one move only, slower action, simpler background.
Off-brand look → lock palette + lighting, reuse the same wording across takes, use a reference frame when possible.
Text/signage breaks → keep readable text off-screen; plan to overlay critical copy in post.
VO / lip sync feels off → shorten lines and avoid long monologues.

Hard limits to keep in mind

Up to 8 seconds per render; go longer by chaining clips (or Extend).
1080p max in this routing.
24 fps only.
Tiny UI text and small lettering are unreliable — add in post.

Veo 3.1 vs Veo 3.1 Fast

View Veo 3.1 Fast details →

Use Veo 3.1 when you need:

Higher-fidelity frames and polish
Sound in the same pass when you want it
More reliable follow-through on prompts

Use Veo 3.1 Fast when you want:

Rapid concept testing and volume drafts
Cheaper A/B ad variants and social loops
Quick iteration before upgrading winners

Compare Veo 3.1 vs other AI video models

Not sure if Veo 3.1 is the best fit for your shot? These side-by-side comparisons break down the tradeoffs — price per second, resolution, audio, speed, and motion style — so you can pick the right engine fast.

Each page includes real outputs and practical best-use cases.

kling

Veo 3.1 vs Kling 3 Pro

Direct Kling 3 Pro renders with multi-prompt sequencing, element references, and voice controls. Generate cinematic 3–15s clips in 1080p.

Compare Veo 3.1 vs Kling 3 Pro →

google-veo

Veo 3.1 vs Google Veo 3.1 Fast

Use Veo 3.1 Fast for affordable, fast AI video generation. Up to 8-second clips with optional native audio—ideal for social formats and iterative testing.

Compare Veo 3.1 vs Google Veo 3.1 Fast →

openai

Veo 3.1 vs OpenAI Sora 2

Create rich AI-generated videos from text or image prompts using Sora 2. Native voice-over, ambient effects, and motion sync via MaxVideoAI.

Compare Veo 3.1 vs OpenAI Sora 2 →

Safety & people / likeness

No sexual content, and nothing involving minors.
No hateful, harassing, or graphic-violence content.
Don’t impersonate real people or public figures; use consent for any likeness/voice.
No non-consensual intimate imagery; don’t include private personal data (addresses, phone numbers, documents).
Keep outputs brand-safe — some prompts or reference images may be blocked by provider safety filters.

FAQ – Veo 3.1 in MaxVideoAI

Is Veo 3.1 available in Europe or the UK?

Yes. MaxVideoAI routes Veo jobs through licensed DeepMind endpoints, so you can render from Europe, the UK and most supported regions without separate Veo contracts.

Can Veo 3.1 generate vertical videos?

Yes. Veo 3.1 supports 16:9, 9:16 and 1:1. Choose 9:16 for Reels/TikTok/Shorts and keep key action centered.

Does Veo 3.1 support image-to-video?

Yes. Start from a single still (Image→Video) or use 1–4 reference images to guide a Text→Video clip.

Can I go beyond 8 seconds?

Base clips are 4/6/8 s. Use Extend and chain clips in timelines; treat each extension as another 4–8 s block.

How do I keep Veo 3.1 on-brand?

Use reference stills (Nano Banana or your brand library), keep character/setting descriptions consistent, and call out palette/lighting.