LTX 2.3 Fast audio-enabled video example: A powerful boxer walks alone through
This LTX 2.3 Fast text-to-video example shows a powerful boxer walking alone through a dark arena tunnel toward the ring. The render is a 10-second, 16:9, 1080p clip with audio enabled.
Prompt breakdown
Prompt used to generate this render.
A powerful boxer walks alone through a dark arena tunnel toward the ring, athletic frame wrapped in a hooded robe, jaw set, shoulders relaxed but dangerous. As he steps forward, a shower of golden sparks rains from above and bright stadium light pours in ahead of him, turning the final moment into a bold silhouette reveal. The camera tracks backward in front of him in one steady move, ending in a dramatic medium-wide shot as he enters the light. Dense haze, warm gold against deep black shadows, sweat catching the light, premium sports-cinema aesthetic, simple, intense, and highly usable for social ads. Crowd roar in the distance, chain rattle, footsteps on concrete, no text, no logos, no watermark.
Workflow: Text to video
Camera: Audio Enabled
Output: 10s · 16:9 · 1080p
Estimated price: $0.52
Audio: Enabled
Constraints: Text To Video, Audio Enabled
Prompt improvement notes
Note 1
Keep the subject, camera move, lighting, duration, aspect ratio and audio requirement grouped so the render has one clear production brief.
Note 2
Change one variable at a time when cloning this prompt: model, duration, camera motion or reference input. That makes quality and price differences easier to compare.
Note 3
Add a short negative prompt if you need to block text overlays, logos, distorted hands, face warping or unwanted camera shake.
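The one-variable-at-a-time advice in Note 2 can be sketched in a few lines of Python. This is a minimal illustration, not a MaxVideoAI or LTX API: the `RenderBrief` class, its field names, and the helper functions are hypothetical, and the baseline values are taken from this example's listed settings.

```python
from dataclasses import dataclass, fields, replace


@dataclass(frozen=True)
class RenderBrief:
    # Hypothetical container for render settings; field names are illustrative.
    model: str
    duration_s: int
    aspect_ratio: str
    resolution: str
    audio: bool
    negative_prompt: str = ""


# Baseline taken from the settings listed for this example.
BASELINE = RenderBrief(
    model="LTX 2.3 Fast",
    duration_s=10,
    aspect_ratio="16:9",
    resolution="1080p",
    audio=True,
)


def single_variable_variants(base, changes):
    """Return one variant per (field, value) pair, each changing only that field."""
    return [replace(base, **{name: value}) for name, value in changes]


def changed_fields(base, variant):
    """List the field names whose values differ between base and variant."""
    return [
        f.name
        for f in fields(base)
        if getattr(base, f.name) != getattr(variant, f.name)
    ]


variants = single_variable_variants(
    BASELINE,
    [
        ("model", "LTX 2.3 Pro"),
        ("duration_s", 6),
        ("negative_prompt", "text overlays, logos, distorted hands, face warping, camera shake"),
    ],
)

for variant in variants:
    # Each variant should report exactly one changed field.
    print(changed_fields(BASELINE, variant), variant)
```

Because every variant differs from the baseline in a single field, any change in quality or price can be attributed to that field alone. The third variant also shows where a negative prompt like the one in Note 3 might be attached.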
Compare this model
Review this example beside nearby engines before choosing a render path.
Why LTX 2.3 Fast fits this shot
Generate fast AI video with LTX 2.3 Fast on MaxVideoAI. Text and image workflows support 6–20s clips, 1080p/1440p/4K, native audio, and 25/50 fps options.
Image input
Audio option
20s max
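The limits quoted above (6–20s clips, 1080p/1440p/4K, 25/50 fps) can be checked before submitting a render. The sketch below is a hedged illustration only: `check_settings` and the constant names are hypothetical and do not belong to any MaxVideoAI or LTX API, and the limits are copied from the text above, not from official documentation.

```python
# Limits as stated on this page; treat them as assumptions, not an official spec.
SUPPORTED_RESOLUTIONS = {"1080p", "1440p", "4K"}
SUPPORTED_FPS = {25, 50}
MIN_DURATION_S, MAX_DURATION_S = 6, 20


def check_settings(duration_s, resolution, fps=None):
    """Return a list of problems; an empty list means the settings fit the stated limits."""
    problems = []
    if not MIN_DURATION_S <= duration_s <= MAX_DURATION_S:
        problems.append(
            f"duration {duration_s}s is outside {MIN_DURATION_S}-{MAX_DURATION_S}s"
        )
    if resolution not in SUPPORTED_RESOLUTIONS:
        problems.append(f"resolution {resolution} is not one of {sorted(SUPPORTED_RESOLUTIONS)}")
    # fps is optional because this example does not state a frame rate.
    if fps is not None and fps not in SUPPORTED_FPS:
        problems.append(f"fps {fps} is not one of {sorted(SUPPORTED_FPS)}")
    return problems


# This example's settings: 10s at 1080p, frame rate not stated.
print(check_settings(10, "1080p"))

# A deliberately out-of-range request to show the reported problems.
print(check_settings(30, "720p", fps=24))
```

Returning a list of problems instead of raising on the first failure lets a caller report every mismatch at once.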
Related examples
LTX 2.3 Pro: LTX 2.3 Pro rooftop lightning fashion shot example
This LTX 2.3 Pro page shows a rooftop fashion prompt with storm lighting, neon city atmosphere and cinematic subject isolation.
LTX 2.3 Fast: LTX 2.3 Fast neon racer reveal example
This LTX 2.3 Fast example shows a neon racer reveal beside a futuristic motorcycle, testing character motion and product-ad framing.
Wan 2.5 Text & Image to Video: Wan 2.5 vertical spy-to-Zoom comedy video example
This Wan 2.5 watch page shows a vertical comedy prompt that opens like a spy action scene and ends with a Zoom-call reveal.
OpenAI Sora 2: Sora 2 gorilla dance video example with strobe lighting
This Sora 2 watch page shows a gorilla-mask dance prompt rendered with strobe lighting, changing camera angles, native audio and a 16:9 output.