LTX 2.3 Fast audio-enabled video example: Vertical 9:16 TikTok-style UGC selfie

LTX 2.3 Fast · Text to video · 10s · 9:16 · Audio

This LTX 2.3 Fast text-to-video example shows a vertical 9:16 TikTok-style UGC selfie. It features audio-enabled output: 10 seconds, 9:16, 1080p.

Browse LTX video examples Open LTX 2.3 Fast model page

Prompt

Vertical 9:16 TikTok-style UGC selfie video, handheld smartphone feel, natural indoor daylight near a window. A friendly creator speaks directly to camera with natural blinking, subtle head nods, and a warm smile. Add small human imperfections: a tiny hesitation, a soft breath, a quick smile mid-sentence, and a micro-pause before the last line. Realistic skin texture, stable identity, no face warping, minimal flicker, clean audio with natural room tone. No subtitles. No on-screen text. No logos. No watermarks. The creator says (exactly, with the same pacing and hesitations): “Okay, so… um… quick thing. If you’re feeling stuck, just do the tiniest first step… like, set a two-minute timer and start. (smiles) That’s it. You’ll be surprised how fast it gets easier.”

Render details

Workflow

Text-to-video workflow

10-second render in 9:16

Audio-enabled output

Realistic styling

Scene focus: Vertical 9:16 TikTok-style UGC selfie

Engine

LTX 2.3 Fast

Generate fast AI video with LTX 2.3 Fast on MaxVideoAI. Text and image workflows support 6–20s clips, 1080p/1440p/4K, native audio, and Fal’s 25/50 fps options.

Image input

Audio option

20s max
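
The limits quoted in the engine description (6–20s clips, 1080p/1440p/4K, 25/50 fps) can be checked before submitting a render. The following is a minimal sketch, not MaxVideoAI's or Fal's actual API: `RenderSettings`, `validate`, and the constant names are hypothetical, and only limits stated on this page are enforced.

```python
from dataclasses import dataclass
from typing import List, Optional

# Limits copied from the engine description on this page.
# All names below are hypothetical and exist only for illustration.
MIN_SECONDS, MAX_SECONDS = 6, 20
RESOLUTIONS = {"1080p", "1440p", "4K"}
FPS_OPTIONS = {25, 50}


@dataclass
class RenderSettings:
    mode: str
    duration_s: int
    aspect_ratio: str
    resolution: str
    audio: bool
    fps: Optional[int] = None  # this page lists no FPS value for the render


def validate(settings: RenderSettings) -> List[str]:
    """Return a list of problems; an empty list means the settings fit the stated limits."""
    problems = []
    if not MIN_SECONDS <= settings.duration_s <= MAX_SECONDS:
        problems.append(f"duration must be {MIN_SECONDS}-{MAX_SECONDS}s")
    if settings.resolution not in RESOLUTIONS:
        problems.append(f"resolution must be one of {sorted(RESOLUTIONS)}")
    if settings.fps is not None and settings.fps not in FPS_OPTIONS:
        problems.append(f"fps must be one of {sorted(FPS_OPTIONS)}")
    return problems


# The settings of this example render, as listed on the page.
example = RenderSettings("text-to-video", 10, "9:16", "1080p", audio=True)
print(validate(example))
```

Mode and aspect ratio are carried along but not validated, because the page does not enumerate the supported values.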

Specs

Engine: LTX 2.3 Fast
Mode: Text to video
Duration: 10s
Aspect ratio: 9:16
Resolution: 1080p
FPS: (not listed)
Audio: Enabled
Render cost: $0.52
Created: 2026-03-06
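
The listed cost and duration give an effective per-second rate for this render. The sketch below works that out. Extending the rate to other durations assumes linear pricing at the same resolution and audio setting, which this page does not confirm; `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

# Values taken from the specs of this render.
render_cost = Decimal("0.52")  # USD
duration_s = 10

# Effective rate for this render: $0.52 / 10 s.
cost_per_second = render_cost / duration_s


def estimate_cost(seconds: int) -> Decimal:
    """Rough estimate ASSUMING pricing scales linearly with duration (not confirmed by the page)."""
    return cost_per_second * seconds


print(cost_per_second)    # 0.052
print(estimate_cost(20))  # estimate for the 20s maximum, under the linear assumption
```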

Related examples

Same example family

LTX Video 2.0 Fast audio-enabled video example: office

This LTX Video 2.0 Fast text-to-video example shows an office scene. It features audio-enabled output: 8 seconds, 16:9.

Same example family

LTX 2.3 Pro audio-enabled video example: city close-up

This LTX 2.3 Pro text-to-video example shows a city close-up. It features audio-enabled output: 10 seconds, 16:9, 1080p.

Same watch-page intent

Kling 2.6 Pro audio-enabled video example: 10-second 16:9 cinematic shot in

This Kling 2.6 Pro text-to-video example shows 10-second 16:9 cinematic shot in. It features audio-enabled output: 10 seconds, 16:9.

Same watch-page intent

OpenAI Sora 2 audio-enabled video example: A man wearing a gorilla head

This OpenAI Sora 2 text-to-video example shows a man wearing a gorilla head. It features audio-enabled output: 4 seconds, 16:9, 720p.

Recreate

Load this render in the workspace

Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.