LTX 2.3 Pro image-to-video example: Cinematic medium close-up of the same
This LTX 2.3 Pro image to video example shows Cinematic medium close-up of the same. It highlights audio-enabled output and camera motion control with 10-second timing · 16:9 · 1080p output.
Prompt
Cinematic medium close-up of the same character from the start image, standing in an urban skatepark environment. Keep the exact same character identity, face, hairstyle, bandana, headphones, outfit, and framing from th…
Show full promptHide full prompt
Cinematic medium close-up of the same character from the start image, standing in an urban skatepark environment. Keep the exact same character identity, face, hairstyle, bandana, headphones, outfit, and framing from the start image. No redesign, no variation. The character is listening to music through over-ear headphones. At the beginning: eyes gently closed, calm and introspective expression, completely still. Subtle natural motion throughout the shot: slow breathing, slight finger movement adjusting the headphones, minimal head tilt, very subtle body sway. After 2–3 seconds: the character slowly opens their eyes. Expression evolves: from calm and introspective to more focused and emotionally engaged, as if reacting to the music. Slight tension appears in the eyes, stronger presence. After 5–6 seconds: a subtle energy shift happens, as if the music intensifies. The character slightly lifts their head, gaze more confident and grounded. Environmental motion: soft wind gently moves the hair and t-shirt, natural and realistic. Camera: very slow cinematic push-in toward the face during the entire shot, smooth and stable. Depth of field: shallow depth of field, background softly blurred with cinematic bokeh. Lighting: golden hour, soft warm tones, natural skin rendering, cinematic contrast, subtle rim light. Style: high-end music commercial (Nike / Apple / Spotify), ultra-realistic, grounded, premium production quality. Motion style: no sudden movements, no exaggerated animation, everything remains subtle, natural, and believable. Mood: immersive, emotional, quiet build-up turning into confidence. Important: - preserve exact identity and outfit - no scene change - no fantasy elements - focus on facial performance and micro-expression - smooth continuous single shot
Render details
Workflow
Image-to-video workflow
10-second render in 16:9
Audio-enabled output
Single reference image
Push-in camera move
Controls
Reference image
Provided
Engine
LTX 2.3 Pro
Use LTX 2.3 Pro on MaxVideoAI for text-to-video, image-to-video, audio-to-video, extend-video and retake-video workflows with Fal’s official 1080p/1440p/4K and 24/25/48/50 fps options.
Specs
Engine
LTX 2.3 Pro
Mode
Image to video
Duration
10s
Aspect ratio
16:9
Resolution
1080p
FPS
25
Audio
Enabled
Render cost
$0.78
Created
2026-03-23
Related examples

Same example family
LTX 2.3 Pro image-to-video example: city camera move
This LTX 2.3 Pro image to video example shows city camera move. It highlights audio-enabled output and camera motion control with 10-second timing · 16:9 · 1080p output.

Same example family
LTX 2.3 Pro audio-enabled video example: city close-up
This LTX 2.3 Pro text to video example shows city close-up. It highlights audio-enabled output with 10-second timing · 16:9 · 1080p output.

Shared capability
Google Veo 3.1 Fast camera movement example: living room commercial
This Google Veo 3.1 Fast text to video example shows living room commercial. It highlights audio-enabled output and camera motion control with 6-second timing · 16:9 output.

Shared capability
Google Veo 3.1 Fast camera movement example: studio interview push-in
This Google Veo 3.1 Fast text to video example shows studio interview push-in. It highlights audio-enabled output and camera motion control with 8-second timing · 16:9 output.
Recreate
Load this render in the workspace
Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.