LTX 2.3 Fast image-to-video example: Use the uploaded image as the
This LTX 2.3 Fast image to video example shows Use the uploaded image as the. It highlights audio-enabled output with 14-second timing · 16:9 · 1080p output.
Prompt breakdown
Text-to-video prompt used to generate this render.
Subject
Use the uploaded image as the strict start-frame anchor. Preserve the exact same crowded metro carriage, the same gorilla in a dark tailored suit, the same alpaca in a formal suit and glasses, and the same surrounding c…
Workflow
Image to video
Camera
Image To Video
Output
14s · 16:9 · 1080p
Audio
Enabled
Constraints
Image To Video, Audio Enabled, Reference Image
Reference image
Provided
Show full promptHide full prompt
Use the uploaded image as the strict start-frame anchor. Preserve the exact same crowded metro carriage, the same gorilla in a dark tailored suit, the same alpaca in a formal suit and glasses, and the same surrounding commuters. Keep the framing realistic and cinematic. The train is moving steadily through the tunnel with subtle carriage sway, soft metallic rattling, low rail noise, distant tunnel rumble, fluorescent hum, and realistic motion blur outside the windows. No exaggerated action. The entire scene is driven by performance, timing, breathing, silence, and eye contact. The gorilla and the alpaca stand face to face in the middle of the crowded metro, both completely serious, tired, and slightly awkward, like two strangers who are not sure whether a social interaction has just happened. Performance direction: - very subtle body movement only - natural breathing visible in the chest and shoulders - tiny eye movements - slight hesitation before each line - uncomfortable but controlled silence - deadpan British-style social awkwardness - surrounding commuters remain mostly quiet and serious, with minimal reaction Dialogue timing and acting: 0:00–0:03 The train sways gently. The gorilla briefly glances toward the alpaca, then away, then back again. Gorilla, low voice, awkward, almost apologetic: “Sorry… were you talking to me?” 0:03–0:05 A short silence. The alpaca blinks once, keeps a straight face, tiny inhale. Alpaca, calm and dry: “No.” 0:05–0:07 Another pause. The gorilla looks slightly confused, shifts his grip, breathes out through the nose. Gorilla: “Right… and you?” 0:07–0:09 The alpaca gives the smallest possible side glance, still perfectly serious. Alpaca: “No, not really.” 0:09–0:11 A longer silence. The train rattles. One nearby commuter subtly looks up, then looks away again. Gorilla, almost to himself: “No one talks anymore anyway.” 0:11–0:14 Silence. The alpaca stares forward, then gives a tiny thoughtful nod. Alpaca, quietly: “That’s true, actually.” Audio direction: - realistic moving metro ambience throughout - soft rail clatter and low tunnel rumble - fluorescent carriage hum - subtle clothing movement and breathing during pauses - dialogue clean, dry, understated, intimate, no theatrical projection - leave natural silence between lines - no music - no subtitles - no text on screen - no logos - no extra fantasy elements Visual direction: prestige cinematic realism, restrained performance comedy, subtle depth of field, grounded lighting, natural commuter stillness, premium film look, humor comes entirely from timing, silence, and serious acting.
Why LTX 2.3 Fast fits this shot
Generate fast AI video with LTX 2.3 Fast on MaxVideoAI. Text and image workflows support 6–20s clips, 1080p/1440p/4K, native audio, and 25/50 fps options.
Image input
Audio option
20s max
Key frames



Related examples
View all examples
LTX 2.3 FastLTX 2.3 Pro office image-to-video transition example
This LTX 2.3 Pro example uses image-to-video controls to preserve a scene across a directed office transition with strong frame continuity.
LTX 2.3 FastLTX 2.3 Fast neon racer reveal example
This LTX 2.3 Fast example shows a neon racer reveal beside a futuristic motorcycle, testing character motion and product-ad framing.
LTX 2.3 FastKling 2.6 Pro futuristic hangar duel example
This Kling 2.6 Pro watch page shows a futuristic hangar duel with glowing weapons, wet metal surfaces and audio-enabled action pacing.
LTX 2.3 FastVeo 3.1 Fast living room TV commercial example
This Veo 3.1 Fast watch page shows a bright living-room TV commercial prompt with native audio, controlled staging and a 16:9 ad format.