Veo 3.1 First/Last – First & Last Frame to Video in MaxVideoAI (720p/1080p, up to 8s)

Veo 3.1 First/Last – Seamless transitions between two frames, with native audio

Best use cases

  • Before/after transitions: product upgrades, glow-ups, environmental changes
  • Scene bridges in short films, ads or explainers
  • Logo or brand transitions using designed first and last keyframes
  • Stylized morphs between two character designs or environments
  • Camera moves from one layout to another without cutting

Compare Veo 3.1 First/Last vs Veo 3.1 / Veo 3.1 Fast →

Overview

Brand: Google Veo
Engine ID: veo-3-1-first-last
Slug: /models/veo-3-1-first-last
Logo policy: Text-only (wordmark)
Live pricing updates inside the Generate workspace.

Prompt ideas

Office → Rooftop bridge (6s, 16:9)

Animate a 6s 16:9 cinematic transition from a bright office desk still to a night rooftop still with the same person. Slow dolly forward that gently arcs as the rooftop appears; daylight fades into cool blue/neon; audio shifts from office ambience to distant traffic and wind, no dialogue.

FAQ

Do I have to provide both a first and a last frame?

Yes. First/Last is built to transition between two images, so both are required. Use standard Veo 3.1 for text-to-video or single-image-to-video runs.

How long can these clips be?

About 8 seconds at most in the underlying APIs. MaxVideoAI exposes 4s, 6s, and 8s presets to keep prompts predictable.

Does it always generate audio?

You choose audio on/off per render. Audio on gives native ambience/SFX/VO; audio off is cheaper and ready for your own soundtrack.

Can I use First/Last for logo or UI transitions?

Yes. It’s ideal for animating logos or UI layouts from an initial design to a final one. Add small, critical text as graphics in post-production rather than relying on the generated frames.

When should I use normal Veo instead?

Use standard Veo 3.1 or Veo 3.1 Fast when you don’t have fixed start and end frames, or when you need multi-beat clips from text or image input.
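
The constraints in this FAQ (both frames required, 4/6/8s presets, audio on or off per render) can be checked before a render is submitted. The sketch below is illustrative only: this page does not document a MaxVideoAI API, so the `FirstLastRequest` class, its field names, and the `validate` helper are all hypothetical. Only the engine ID, the duration presets, and the 720p/1080p options come from the page itself.

```python
from dataclasses import dataclass

# Values taken from this page.
ENGINE_ID = "veo-3-1-first-last"
ALLOWED_DURATIONS_S = (4, 6, 8)
ALLOWED_RESOLUTIONS = ("720p", "1080p")


@dataclass(frozen=True)
class FirstLastRequest:
    """Hypothetical render settings; field names are assumptions, not a real API."""
    first_frame: str          # path or URL of the starting still
    last_frame: str           # path or URL of the ending still
    prompt: str
    duration_s: int = 8
    resolution: str = "720p"
    audio: bool = True        # audio is chosen per render
    engine: str = ENGINE_ID


def validate(req: FirstLastRequest) -> list:
    """Return a list of human-readable problems; an empty list means the settings look valid."""
    problems = []
    if not req.first_frame or not req.last_frame:
        problems.append("both a first and a last frame are required")
    if req.duration_s not in ALLOWED_DURATIONS_S:
        problems.append(f"duration must be one of {ALLOWED_DURATIONS_S} seconds")
    if req.resolution not in ALLOWED_RESOLUTIONS:
        problems.append(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    return problems


if __name__ == "__main__":
    request = FirstLastRequest(
        first_frame="office.png",
        last_frame="rooftop.png",
        prompt="Slow dolly forward from office desk to night rooftop, no dialogue.",
        duration_s=6,
    )
    print(validate(request))
```

Returning a list of problems rather than raising on the first one lets a UI show every issue at once, which suits a settings form with several independent fields.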

Explore other engines

Compare price tiers, latency, and prompt presets across the rest of the catalog.

google-veo

Google Veo 3.1

Generate cinematic 8-second videos with native audio using Veo 3.1 by Google DeepMind on MaxVideoAI. Reference-to-video guidance, multi-image fidelity, pay-as-you-go pricing from $0.52/s.

Try Google Veo 3.1

google-veo

Google Veo 3.1 Fast

Use Veo 3.1 Fast for affordable, fast AI video generation. Up to 8-second clips with optional native audio—ideal for social formats and iterative testing.

Try Google Veo 3.1 Fast

kling

Kling 2.5 Turbo

Route cinematic Kling 2.5 Turbo shots through MaxVideoAI with instant switching between Pro text, Pro image, and Standard budget tiers.

Try Kling 2.5 Turbo