LTX 2.3 Fast audio-enabled video example: Vertical 9:16 TikTok-style UGC selfie
This LTX 2.3 Fast text-to-video example shows a vertical 9:16 TikTok-style UGC selfie. It features audio-enabled output: a 10-second clip in 9:16 at 1080p.
Prompt
Vertical 9:16 TikTok-style UGC selfie video, handheld smartphone feel, natural indoor daylight near a window. A friendly creator speaks directly to camera with natural blinking, subtle head nods, and a warm smile. Add small human imperfections: a tiny hesitation, a soft breath, a quick smile mid-sentence, and a micro-pause before the last line. Realistic skin texture, stable identity, no face warping, minimal flicker, clean audio with natural room tone. No subtitles. No on-screen text. No logos. No watermarks. The creator says (exactly, with the same pacing and hesitations): “Okay, so… um… quick thing. If you’re feeling stuck, just do the tiniest first step… like, set a two-minute timer and start. (smiles) That’s it. You’ll be surprised how fast it gets easier.”
Render details
Workflow
Text-to-video workflow
10-second render in 9:16
Audio-enabled output
Realistic styling
Scene focus: Vertical 9:16 TikTok-style UGC selfie
Engine
LTX 2.3 Fast
Generate fast AI video with LTX 2.3 Fast on MaxVideoAI. Text and image workflows support 6–20s clips, 1080p/1440p/4K, native audio, and Fal’s 25/50 fps options.
Specs
Engine: LTX 2.3 Fast
Mode: Text to video
Duration: 10s
Aspect ratio: 9:16
Resolution: 1080p
FPS: 24
Audio: Enabled
Render cost: $0.52
Created: 2026-03-06
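The spec values above imply a few derived numbers that are useful when planning a remix: total frame count, pixel dimensions, and cost per second. The sketch below works them out in Python. It rests on two assumptions that the page does not state: that "1080p" names the shorter side of the frame, so 9:16 gives 1080 x 1920, and that cost scales linearly with duration. The function names are illustrative and are not part of any MaxVideoAI or LTX API.

```python
from fractions import Fraction

# Values copied from the Specs list on this page.
DURATION_S = 10
FPS = 24
ASPECT = Fraction(9, 16)   # width : height
SHORT_SIDE_PX = 1080       # assumption: "1080p" refers to the shorter side
RENDER_COST_USD = 0.52


def frame_count(duration_s: int, fps: int) -> int:
    """Total frames in a clip of the given length and frame rate."""
    return duration_s * fps


def pixel_dimensions(aspect: Fraction, short_side: int) -> tuple[int, int]:
    """Return (width, height) for a width:height aspect ratio.

    For portrait or square ratios the width is the short side;
    for landscape ratios the height is the short side.
    """
    if aspect <= 1:
        width = short_side
        height = int(short_side / aspect)
    else:
        height = short_side
        width = int(short_side * aspect)
    return width, height


def cost_per_second(total_cost: float, duration_s: int) -> float:
    """Average cost per second, assuming pricing is linear in duration."""
    return total_cost / duration_s


if __name__ == "__main__":
    print("frames:", frame_count(DURATION_S, FPS))
    print("dimensions:", pixel_dimensions(ASPECT, SHORT_SIDE_PX))
    print(f"cost per second: ${cost_per_second(RENDER_COST_USD, DURATION_S):.3f}")
```

Under these assumptions the clip has 240 frames at 1080 x 1920. The per-second cost is only an average for this one render and should not be read as a published price.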
Related examples

Same example family
LTX Video 2.0 Fast audio-enabled video example: office
This LTX Video 2.0 Fast text-to-video example shows an office scene. It features audio-enabled output: an 8-second clip in 16:9.

Same example family
LTX 2.3 Pro audio-enabled video example: city close-up
This LTX 2.3 Pro text-to-video example shows a city close-up. It features audio-enabled output: a 10-second clip in 16:9 at 1080p.

Same watch-page intent
Kling 2.6 Pro audio-enabled video example: 10-second 16:9 cinematic shot in
This Kling 2.6 Pro text-to-video example shows 10-second 16:9 cinematic shot in. It features audio-enabled output: a 10-second clip in 16:9.

Same watch-page intent
OpenAI Sora 2 audio-enabled video example: A man wearing a gorilla head
This OpenAI Sora 2 text-to-video example shows a man wearing a gorilla head. It features audio-enabled output: a 4-second clip in 16:9 at 720p.
Recreate
Load this render in the workspace
Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.