Wan 2.6 Text & Image to Video audio-enabled video example: Vertical 9 16…
This Wan 2.6 Text & Image to Video text to video example shows Vertical 9 16 TikTok-style UGC selfie. It highlights audio-enabled output with 10-second timing · 9:16 · 1080p output.
Prompt breakdown
Text-to-video prompt used to generate this render.
Subject
Vertical 9:16 TikTok-style UGC selfie video, handheld smartphone feel, natural indoor daylight near a window. A friendly creator speaks directly to camera with natural blinking, subtle head nods, and a warm smile. Add s…
Workflow
Text to video
Camera
Audio Enabled
Output
10s · 9:16 · 1080p
Audio
Enabled
Constraints
Text To Video, Audio Enabled
Show full promptHide full prompt
Vertical 9:16 TikTok-style UGC selfie video, handheld smartphone feel, natural indoor daylight near a window. A friendly creator speaks directly to camera with natural blinking, subtle head nods, and a warm smile. Add small human imperfections: a tiny hesitation, a soft breath, a quick smile mid-sentence, and a micro-pause before the last line. Realistic skin texture, stable identity, no face warping, minimal flicker, clean audio with natural room tone. No subtitles. No on-screen text. No logos. No watermarks. The creator says (exactly, with the same pacing and hesitations): “Okay, so… um… quick thing. If you’re feeling stuck, just do the tiniest first step… like, set a two-minute timer and start. (smiles) That’s it. You’ll be surprised how fast it gets easier.”
Why Wan 2.6 Text & Image to Video fits this shot
Wan 2.6 merges text, image, and reference-to-video in one card with multi-shot prompting and 720p/1080p tiers.
Text prompts
Image input
Reference video
Key frames



Related examples
View all examples
Wan 2.6 Text & Image to VideoWan 2.5 Text & Image to Video audio-enabled video example: A vertical cinematic mini…
This Wan 2.5 Text & Image to Video text to video example shows A vertical cinematic mini action scene. It highlights audio-enabled output with 5-second timing · 26:15 output.
Wan 2.6 Text & Image to VideoWan 2.5 Text & Image to Video audio-enabled video example: smartwatch runner ad
This Wan 2.5 Text & Image to Video text to video example shows smartwatch runner ad. It highlights audio-enabled output with 5-second timing · 9:16 output.
Wan 2.6 Text & Image to VideoKling 2.6 Pro audio-enabled video example: 10-second 16 9 cinematic shot in
This Kling 2.6 Pro text to video example shows 10-second 16 9 cinematic shot in. It highlights audio-enabled output with 10-second timing · 16:9 output.
Wan 2.6 Text & Image to VideoLTX Video 2.0 Fast audio-enabled video example: office
This LTX Video 2.0 Fast text to video example shows office. It highlights audio-enabled output with 8-second timing · 16:9 output.