LTX 2.3 Pro image-to-video example: Cinematic medium close-up of the same

LTX 2.3 ProImage to video10s16:9Audio

This LTX 2.3 Pro image to video example shows Cinematic medium close-up of the same. It highlights audio-enabled output and camera motion control with 10-second timing · 16:9 · 1080p output.

Prompt

Cinematic medium close-up of the same character from the start image, standing in an urban skatepark environment. Keep the exact same character identity, face, hairstyle, bandana, headphones, outfit, and framing from th…

Show full prompt

Cinematic medium close-up of the same character from the start image, standing in an urban skatepark environment. Keep the exact same character identity, face, hairstyle, bandana, headphones, outfit, and framing from the start image. No redesign, no variation. The character is listening to music through over-ear headphones. At the beginning: eyes gently closed, calm and introspective expression, completely still. Subtle natural motion throughout the shot: slow breathing, slight finger movement adjusting the headphones, minimal head tilt, very subtle body sway. After 2–3 seconds: the character slowly opens their eyes. Expression evolves: from calm and introspective to more focused and emotionally engaged, as if reacting to the music. Slight tension appears in the eyes, stronger presence. After 5–6 seconds: a subtle energy shift happens, as if the music intensifies. The character slightly lifts their head, gaze more confident and grounded. Environmental motion: soft wind gently moves the hair and t-shirt, natural and realistic. Camera: very slow cinematic push-in toward the face during the entire shot, smooth and stable. Depth of field: shallow depth of field, background softly blurred with cinematic bokeh. Lighting: golden hour, soft warm tones, natural skin rendering, cinematic contrast, subtle rim light. Style: high-end music commercial (Nike / Apple / Spotify), ultra-realistic, grounded, premium production quality. Motion style: no sudden movements, no exaggerated animation, everything remains subtle, natural, and believable. Mood: immersive, emotional, quiet build-up turning into confidence. Important: - preserve exact identity and outfit - no scene change - no fantasy elements - focus on facial performance and micro-expression - smooth continuous single shot

Render details

Workflow

Image-to-video workflow

10-second render in 16:9

Audio-enabled output

Single reference image

Push-in camera move

Controls

Reference image

Provided

Engine

LTX 2.3 Pro

Use LTX 2.3 Pro on MaxVideoAI for text-to-video, image-to-video, audio-to-video, extend-video and retake-video workflows with Fal’s official 1080p/1440p/4K and 24/25/48/50 fps options.

Image input
Audio option
20s max

Specs

Engine

LTX 2.3 Pro

Mode

Image to video

Duration

10s

Aspect ratio

16:9

Resolution

1080p

FPS

25

Audio

Enabled

Render cost

$0.78

Created

2026-03-23

Related examples

Recreate

Load this render in the workspace

Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.