AI Video Models (Specs, Limits, and Pricing)

Check input support, limits, and pricing per model before you generate.

  • Open any model for full specs, limits, and pricing details.
  • Filter by input type and constraints to shortlist valid models.

AI video and image models with specs, limits, and pricing on MaxVideoAI

Sora 2

OpenAI
7.2

Strengths: Human Fidelity, Visual Quality

From
$0.13/s
Max dur.
12s
Max res.
720p
T2VText-to-videoI2VImage-to-videoAudio available

Best for cinematic scenes and character continuity with strong Human Fidelity and Visual Quality in text-to-video, imag…

Sora 2 Pro

OpenAI
7.4

Strengths: Human Fidelity, Visual Quality

From
$0.39/s
Max dur.
12s
Max res.
1792×1024
T2VText-to-videoI2VImage-to-videoV2VVideo-to-videoAudio available

Best for studio-grade cinematic shots and hero scenes with strong Human Fidelity and Visual Quality in text-to-video, i…

Veo 3.1

Google
7.0

Strengths: Audio & Lip Sync, Human Fidelity

From
$0.52/s
Max dur.
8s
Max res.
1080p
T2VText-to-videoI2VImage-to-videoV2VVideo-to-videoExtendExtend / continueAudio available

Best for ad-ready shots and precise framing control with strong Audio & Lip Sync and Human Fidelity in text-to-video, i…

Seedance 2.0

ByteDance
8.5

Strengths: Audio & Lip Sync, Visual Quality

From
TBD at launch
Max dur.
15s
Max res.
1080p
T2VText-to-videoI2VImage-to-videoV2VVideo-to-videoExtendExtend / continueAudio available

Best for Text + Image to Video with strong Audio & Lip Sync and Visual Quality in text-to-video, image-to-video, video-…

Final pricing will be published at launch (official date TBA).

Seedance 1.5 Pro

ByteDance
8.2

Strengths: Audio & Lip Sync, Visual Quality

From
$0.03/s
Max dur.
12s
Max res.
1080p
T2VText-to-videoI2VImage-to-videoFirst/LastFirst frame / last frameAudio available

Best for cinematic motion with camera lock with strong Audio & Lip Sync and Visual Quality in text-to-video, image-to-v…

Veo 3.1 Fast

Google
5.7

Strengths: Speed & Stability, Audio & Lip Sync

From
$0.20/s
Max dur.
8s
Max res.
1080p
T2VText-to-videoI2VImage-to-videoV2VVideo-to-videoExtendExtend / continueAudio available

Best for fast ad cuts and rapid iteration with strong Speed & Stability and Audio & Lip Sync in text-to-video, image-to…

Use model cards to review specs, limits, and pricing before selecting a model for production.

Model checks by common scenarios

Open side-by-side checks only when you need a decision view after reviewing specs.

Choose by input type and constraints

Start from supported inputs, limits, and pricing constraints.

Text-to-video models

Shortlist models that support prompt-only generation.

Image-to-video models

Check which models support references and image-led workflows.

Video-to-video and extension support

Identify models that support continuation or edit-style workflows.

Limits and formats

Duration, max resolution, audio, and format constraints by model.

Pricing per model and mode

Use per-second pricing and mode support to estimate cost accurately.

Examples and prompt references

Open real outputs per model before selecting your production preset.

Model specs and constraints that matter

Use these checks to validate feasibility, output constraints, and price exposure before generation.

Input type support

See text-to-video, image-to-video, and edit capabilities by model.

Limits and formats

Check max duration, resolution, audio, and format constraints.

Pricing signals

Review model-level price ranges before running full batches.

Which models support image-to-video?

Use the model cards and filters to see which engines support image-to-video inputs and reference-based modes.

Which models support video-to-video workflows?

Video-to-video and continuation support differ by model and mode. Check each card for the exact capabilities before running production jobs.

What are the typical limits by model?

Duration, resolution, audio support, and available formats vary by provider and mode. The catalog shows the latest known limits per model.

How is pricing calculated per model and mode?

Pricing is based on model, mode, duration, resolution, and optional add-ons. Use model-level pricing signals as planning inputs before launch.

Where can I see real outputs per model?

Use the examples gallery to inspect outputs, prompts, and settings tied to each model before choosing presets for production.

How often are limits and prices updated?

Limits and pricing references are refreshed as providers update their capabilities and as new model versions are validated in production.

Start generating in seconds

Pick a model above, then generate — or browse proven prompts and outputs before you commit.

Live pricingModel limitsPrompt references

Model specs • Live pricing • Real output references

Next steps