LTX PRODUCTION WORKFLOW ROUTE

LTX 2.3 Pro

Audio-led workflows, Extend and Retake controls, and 4K generate passes for production video.

Use LTX 2.3 Pro when a job needs more than a draft. It combines text or image generation, audio-to-video, source-video extension and selective retake controls in one production route.


Text and image video

Generate new shots from prompts or start images with optional end frames.

Audio-to-video

Use an uploaded audio file as timing input when sound drives the motion.

Extend

Continue a source video at the start or end with context controls.

Retake

Replace a selected section instead of regenerating the full clip.

Up to 4K generate

Use higher resolutions for approved generate passes.

Pay-as-you-go

See exact live price before you generate.

LTX 2.3 Pro pricing at a glance

Preset production totals - see the exact live price in the app before you generate.


Pro workflow

$0.47

6s · 1080p

Audio-led workflow

$1.25

4K reference

$3.12

10s · 4K

Max duration

20s

Up to 20s on Audio-to-Video, Extend and Retake; generate presets run 6–10s.

All prices are MaxVideoAI display prices in USD credits for preset scenarios.

LTX 2.3 Pro examples

Representative LTX 2.3 Pro examples for reviewing audio-led motion, extension, retake and higher-resolution production workflows.


16:9 · cinematic · Use the provided image as the exact first frame and prese...
16:9 · portrait · Use the provided image as the exact first frame and prese...
16:9 · cinematic · Use the provided image as the exact first frame
16:9 · cinematic · A stylized 1970s office scene frozen at the exact moment...

Real community renders

See what's possible with LTX 2.3 Pro, the current LTX model for audio, extend and retake workflows.

Recreate any shot

Jump into the app with one click and reuse the setup.

Native audio

Dialogue, ambience and SFX generated in sync.

Multi-shot continuity

Keep characters, style and scene consistency across sequences.

Production-aware

Built-in guardrails and safety filters for responsible review.

Pro or Fast?

Use Fast for simple draft loops. Use Pro when the job needs audio input, extension, retake or higher-resolution production control.


Fixing a partial clip?

Use Extend when you need more footage, or Retake when one section needs replacement.


Comparing production engines?

Compare LTX 2.3 Pro with Veo 3.1 when deciding between editorial controls and premium short-video polish.


How to Prompt LTX 2.3 Pro by Workflow

LTX 2.3 Pro changes shape depending on the job. Prompt fresh generation, audio-led animation, extend, and retake differently instead of using one universal template.

Tip: duration, resolution, fps, and workflow controls already live in the UI. Use the prompt to steer the active route: new shot, soundtrack-led motion, continuation, or surgical fix.

How LTX 2.3 Pro uses Generate, Audio, Extend and Retake

Generate video

Use text or a start image for the base shot, with optional end-frame guidance.

Audio-to-video

Upload audio when rhythm, dialogue or music should drive the visual timing.

Extend

Continue a source clip before or after the current footage.

Retake

Target a broken time window and replace audio, video or both.

Final pass

Use higher-resolution generate settings after the route and prompt are approved.
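The routing described above can be sketched as a small decision helper. This is only an illustration of the choice between the four routes; the `Route` enum, the `choose_route` function and its argument names are hypothetical and are not part of any MaxVideoAI or LTX API.

```python
from enum import Enum


class Route(Enum):
    GENERATE = "Generate video"
    AUDIO_TO_VIDEO = "Audio-to-video"
    EXTEND = "Extend"
    RETAKE = "Retake"


def choose_route(has_source_clip: bool, needs_more_footage: bool,
                 section_needs_fix: bool, audio_drives_timing: bool) -> Route:
    """Pick a route from four questions about the job (hypothetical helper)."""
    if has_source_clip and section_needs_fix:
        return Route.RETAKE          # replace only the broken time window
    if has_source_clip and needs_more_footage:
        return Route.EXTEND          # continue before or after the clip
    if audio_drives_timing:
        return Route.AUDIO_TO_VIDEO  # uploaded audio sets the timing
    return Route.GENERATE            # fresh shot from text or a start image


print(choose_route(True, False, True, False).value)  # Retake
```

The order of the checks is a design assumption: source-clip edits are tested first because they preserve existing footage, and a fresh generate pass is the fallback.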

Generate prompt

Use when LTX is creating a fresh shot from text or a start image.

Subject:
Rugged desert adventurer holding an ancient brass compass at golden hour.

Action:
He opens the compass and a glowing dust-map unfolds above it.

Camera:
Wide rear establishing shot over dunes, then a slow push-in to his eyes reflecting the map.

Look:
Warm sun, drifting dust, bronze textures, clean premium adventure-film grade.

Audio:
Desert wind, soft metallic compass click, low atmospheric rumble.

Format:
10 seconds, 16:9, no text, no logos.
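If you build prompts programmatically, the six-section layout above can be kept as structured data and rendered on demand. The sketch below assumes nothing about the app itself: `GeneratePrompt` and `render` are hypothetical names, and the output is plain text meant to be pasted into the prompt field.

```python
from dataclasses import dataclass, fields


@dataclass
class GeneratePrompt:
    subject: str
    action: str
    camera: str
    look: str
    audio: str
    format: str

    def render(self) -> str:
        # Emit "Label:" on one line, the value on the next,
        # with a blank line between sections.
        parts = [f"{f.name.capitalize()}:\n{getattr(self, f.name)}"
                 for f in fields(self)]
        return "\n\n".join(parts)


prompt = GeneratePrompt(
    subject="Rugged desert adventurer holding an ancient brass compass at golden hour.",
    action="He opens the compass and a glowing dust-map unfolds above it.",
    camera="Wide rear establishing shot over dunes, then a slow push-in to his eyes reflecting the map.",
    look="Warm sun, drifting dust, bronze textures, clean premium adventure-film grade.",
    audio="Desert wind, soft metallic compass click, low atmospheric rumble.",
    format="10 seconds, 16:9, no text, no logos.",
)
text = prompt.render()
print(text)
```

Keeping one field per section makes it easy to swap a single element, such as the camera move, without touching the rest of the shot.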


Global principles

Define one subject, one core beat and one camera move before layering style.
Use the workflow that matches the job: generate, audio, extend or retake.
For retakes, describe only the section that must change instead of rewriting the full video.

Engine quirks / what to watch for

Text and image generation respond best to direct, cinematic prompts.
Audio-to-Video works best when the soundtrack carries timing and the prompt steers staging.
Extend and Retake are strongest when the prompt is specific about continuity and what must remain unchanged.

Demo prompt - cinematic compass reveal

Text-to-video generate prompt

Subject: Adventurer at a desert cliff • Action: Opens an ancient compass that reveals a glowing map
Camera: Wide rear shot, then slow push-in toward the eyes • Style: Golden hour, drifting dust, bronze textures, premium adventure tone
Audio: Desert wind, metallic compass click, low atmospheric rumble


A rugged adventurer stands on the edge of a vast desert cliff at golden hour, sunburned face, worn leather jacket, scarf moving in the wind. He opens an ancient brass compass in his hand, and a glowing map made of floating light and dust unfolds above it, illuminating his face and the sand around him. The camera starts wide behind him to show the endless dunes, then pushes in slowly as he raises the compass, ending on a dramatic close view of his eyes reflecting the glowing map. Warm sun, drifting dust, rich bronze textures, epic but clean composition, premium cinematic adventure tone, simple and iconic. Wind over the dunes, faint metallic click from the compass, low atmospheric desert rumble, no text, no logos, no watermark.

10s · 16:9 · Audio on


Before you generate

Prepare the frame before video

Lock the character, fix the viewpoint, or build the source still before you spend credits on motion.

Keep the character consistent

Lock identity, outfit, and reference quality.

Change the camera angle before video

Change the viewpoint before you spend video credits.

Build the source still in Image

Build or clean the source still first.

Practical strengths and boundaries

Best practices, common fixes, and important limitations to help you get the strongest results with LTX 2.3 Pro.

What works best

Most complete LTX workflow surface: generate, audio-driven motion, extension and retake in one place.
Useful for vertical and landscape planning when the same concept must ship to multiple placements.
Extend and Retake reduce waste because you can preserve the strong parts of a clip.

Common problems → fast fixes

Audio-to-Video feels visually disconnected → reduce the scene complexity and let the soundtrack drive the pacing.
Extend drifts too far from the original clip → describe continuity anchors and tighten the new action.
Retake changes more than the target section → shorten the retake window and be explicit about what must stay the same.
Image-to-Video feels generic → make the start frame stronger and simplify the motion request.
4K runs feel expensive for ideation → validate the motion in a cheaper pass first, then rerun the winner.

Hard limits to keep in mind

4K, 9:16 and fps controls currently apply to the standard generate modes, not every editing workflow.
Audio-to-Video, Extend and Retake are available only in LTX 2.3 Pro, not in Fast.
Retake still needs a precise source window and a clear prompt to avoid drifting too far from the original clip.
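A pre-flight check along these lines can catch settings that conflict with the limits listed on this page. It is a sketch under stated assumptions: the function name, mode strings and warning texts are hypothetical, the 6–10s and 20s figures come from the specs listed on this page, and the live app remains the authority on what each workflow accepts.

```python
GENERATE_MODES = {"text-to-video", "image-to-video"}
EDIT_MODES = {"audio-to-video", "extend", "retake"}


def check_settings(engine, mode, duration_s, resolution="1080p", aspect="16:9"):
    """Return a list of warnings; an empty list means nothing was flagged."""
    if mode not in GENERATE_MODES | EDIT_MODES:
        return [f"unknown mode: {mode}"]
    warnings = []
    if engine == "fast" and mode in EDIT_MODES:
        warnings.append(f"{mode} is listed as Pro only, not Fast")
    if mode in GENERATE_MODES:
        # Only the Pro generate range is checked here.
        if engine == "pro" and not 6 <= duration_s <= 10:
            warnings.append("Pro generate presets are listed as 6-10s")
    else:
        if duration_s > 20:
            warnings.append("Audio/Extend/Retake are listed as up to 20s")
        if resolution == "4k" or aspect == "9:16":
            warnings.append("4K and 9:16 are scoped to generate modes; confirm in the app")
    return warnings


print(check_settings("pro", "extend", 25, resolution="4k"))
```

The helper returns warnings instead of raising errors because the page describes the editing workflows as having workflow-specific limits without stating them exactly.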

Compare LTX 2.3 Pro vs other AI video models

These side-by-side comparisons break down price, resolution, audio, speed, and motion style so you can pick the right engine fast.

Each page includes real outputs and practical best-use cases.

LTX 2.3 Pro vs LTX 2.3 Fast

Generate fast AI video with LTX 2.3 Fast on MaxVideoAI. Text and image workflows support 6–20s clips, 1080p/1440p/4K, native audio, and 25/50 fps options.


LTX 2.3 Pro vs LTX Video 2.0 Fast

Generate fast cinematic AI videos with LTX-2 Fast. Text and image to video with synchronized audio, up to 4K, ideal for rapid iteration and social content.


LTX 2.3 Pro vs OpenAI Sora 2

Create rich AI-generated videos from text or image prompts using Sora 2. Native voice-over, ambient effects, and motion sync via MaxVideoAI.


Real Specs - LTX 2.3 Pro in MaxVideoAI

The limits that shape your renders.


Price / second

1080p $0.08/s · 1440p $0.16/s · 4K $0.31/s

Text-to-Video

Image-to-Video

Video-to-Video

Supported (extend / retake workflows)

First/Last frame

Supported (start + end image in I2V)

Start / reference image

Supported (single start image; no separate style-reference stack)

Reference video

Supported (source clip for extend / retake)

Max resolution

4K on T2V/I2V generate; workflow-specific limits for Audio/Extend/Retake

Max duration

Generate 6–10s; Audio/Extend/Retake up to 20s

Aspect ratios

16:9 and 9:16 (generate modes)

FPS options

24, 25, 48 and 50 fps (generate modes)

Output format

MP4

Audio output

Native audio generation

Lip sync

Camera / motion controls

Prompt-based only

Watermark

No (MaxVideoAI)
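For rough budgeting, the per-second rates listed above can be multiplied by clip length. The sketch below is an approximation only: `estimate_cost` is a hypothetical helper, the rates are the rounded display values from this page, and the result can differ by a few cents from the preset totals on the pricing card ($0.47 for 6s at 1080p, $3.12 for 10s at 4K). The live quote in the app also depends on workflow, fps and audio settings.

```python
from decimal import Decimal

# Rounded per-second display rates from this page (USD credits).
RATE_PER_SECOND = {
    "1080p": Decimal("0.08"),
    "1440p": Decimal("0.16"),
    "4k": Decimal("0.31"),
}


def estimate_cost(duration_s: int, resolution: str) -> Decimal:
    """Rough estimate: seconds times the rounded per-second rate."""
    return RATE_PER_SECOND[resolution] * duration_s


draft = estimate_cost(6, "1080p")   # 0.48
final = estimate_cost(10, "4k")     # 3.10

# Three 1080p drafts plus one 4K final, versus four 4K passes.
draft_first = 3 * estimate_cost(10, "1080p") + estimate_cost(10, "4k")
all_4k = 4 * estimate_cost(10, "4k")
print(draft, final, draft_first, all_4k)  # 0.48 3.10 5.50 12.40
```

The last comparison illustrates the advice to validate motion in a cheaper pass before rerunning the winner at 4K.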

Generate workflows

Text-to-Video and Image-to-Video are the broadest modes: 6 to 10 seconds, 1080p to 4K, vertical or landscape, with native audio available.


Use Text-to-Video for net-new story beats.
Use Image-to-Video when the first frame must stay on-brand.
Add an end image only when the final composition matters.
Pick 9:16 when the clip is designed for short-form social.

Audio, extend and retake

The biggest difference versus simpler engines is not just image quality. It is that LTX 2.3 Pro covers timing-led animation and source-clip edits from the same page.


Audio-to-Video turns uploaded audio into the timing spine of the clip.
Extend can grow a strong shot at the head or tail.
Retake is useful when only one section needs replacement.
Context controls help preserve continuity during extension.

Safety and likeness guidance

Built-in safeguards and best practices for responsible creation with LTX 2.3 Pro.

Use original characters and owned references.
Avoid real people, celebrities and protected characters.
Do not use someone's likeness without consent.
Avoid copyrighted franchises, logos and protected IP.

FAQ

What is new in LTX 2.3 Pro compared with the older LTX 2.0 pages?

LTX 2.3 Pro adds a unified workflow surface around generate modes plus Audio-to-Video, Extend and Retake, while also supporting 16:9 and 9:16 in the generate flows.

What is the difference between Extend and Retake?

Extend continues a source video before or after the existing footage. Retake replaces a selected section inside the source video, with controls for start time, duration and replacement mode.
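The three Retake controls can be modeled as a small record that is checked against the source clip length before a job is submitted. This is a hedged sketch: `RetakeWindow`, its field names and the mode strings are hypothetical, and the real controls may use different names or units.

```python
from dataclasses import dataclass

REPLACE_MODES = {"audio", "video", "both"}


@dataclass
class RetakeWindow:
    start_s: float          # where the replaced section begins
    duration_s: float       # how long the replaced section runs
    replace: str = "both"   # replace audio, video or both

    def validate(self, source_length_s: float) -> None:
        """Raise ValueError if the window does not fit the source clip."""
        if self.replace not in REPLACE_MODES:
            raise ValueError(f"replace must be one of {sorted(REPLACE_MODES)}")
        if self.start_s < 0 or self.duration_s <= 0:
            raise ValueError("window must start at or after 0 and have positive length")
        if self.start_s + self.duration_s > source_length_s:
            raise ValueError("window runs past the end of the source clip")


# A 2-second video-only fix starting at 4s inside a 10s clip passes the check.
RetakeWindow(start_s=4.0, duration_s=2.0, replace="video").validate(10.0)
```

Keeping the window short and explicit matches the guidance elsewhere on this page to shorten the retake window when a retake changes more than intended.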

Does LTX 2.3 Pro support native audio?

Yes. Text-to-Video and Image-to-Video expose a native audio toggle, and Audio-to-Video uses an uploaded audio file as the driving input.

Does LTX 2.3 Fast include Audio-to-Video, Extend or Retake?

No. LTX 2.3 Fast is limited to text-to-video and image-to-video in the current MaxVideoAI route.

How much does LTX 2.3 Pro cost?

Use the pricing card for preset MaxVideoAI display totals, then confirm the exact live quote in Generate. Price varies by workflow, duration, resolution, fps and audio settings.

Does 4K work with Audio-to-Video, Extend and Retake?

4K, 9:16 and fps controls are scoped to standard Text-to-Video and Image-to-Video generate modes in the current route. Audio-to-Video, Extend and Retake can have workflow-specific limits.