Wan 2.5 Text & Image to Video audio-enabled video example: city camera move
This text-to-video example from Wan 2.5 Text & Image to Video shows a fast city camera move: a walking selfie shot through a busy urban street. It highlights audio-enabled output in a 10-second, 9:16 clip.
Prompt
Ultra-realistic walking selfie shot filmed with a smartphone held in one hand. The person is speed-walking through a busy urban street in daylight. Camera movement is dynamic: fast steps, sudden micro-shakes, quick tilts as the person avoids people and obstacles. Natural motion blur, realistic stabilization drift, shifting sunlight and shadows on their face. High-detail skin texture, real reflections in the eyes. The person speaks extremely fast, slightly out of breath, trying to explain something urgently while walking. Lip-sync must perfectly match the following rapid line: “Okay listen, I don’t have much time but everything’s happening way faster than I expected and I swear I’ll explain everything once I get there!” Audio: realistic city ambience (footsteps, passing cars, faint horns), wind hitting the phone mic, breath sounds, occasional clothing rustle. Keep the phone-mic quality: compressed, slightly distorted on loud peaks. Mood: energetic, chaotic, spontaneous. No filters, no beautification. Keep it raw and real.
Render details
Workflow
Text-to-video workflow
10-second render in 9:16
Audio-enabled output
Camera move
Realistic styling
Engine
Wan 2.5 Text & Image to Video
Wan 2.5 renders 5- or 10-second clips, with optional background audio and prompt expansion when you need extra detail.
Specs
Engine: Wan 2.5 Text & Image to Video
Mode: Text to video
Duration: 10 s
Aspect ratio: 9:16
Audio: Enabled
Render cost: $0.65
Created: 2025-11-16
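For readers who want to track or remix these settings in a script, the specs above can be sketched as a small Python record. This is only an illustration: `RenderSettings`, its field names, and the validation rules are hypothetical and do not come from any published Wan 2.5 API. The one constraint taken from this page is that clips are 5 or 10 seconds long.

```python
from dataclasses import dataclass, replace

# Hypothetical container for the settings listed above; the field names are
# illustrative and are not taken from any published Wan 2.5 API.
ALLOWED_DURATIONS_S = (5, 10)  # this page states Wan 2.5 handles 5 or 10 second clips


@dataclass(frozen=True)
class RenderSettings:
    engine: str
    mode: str
    duration_s: int
    aspect_ratio: str
    audio: bool

    def __post_init__(self) -> None:
        # Reject durations the page does not list for this engine.
        if self.duration_s not in ALLOWED_DURATIONS_S:
            raise ValueError(
                f"duration_s must be one of {ALLOWED_DURATIONS_S}, got {self.duration_s}"
            )
        # Accept only "W:H" strings made of digits, such as "9:16".
        width, sep, height = self.aspect_ratio.partition(":")
        if not (sep and width.isdigit() and height.isdigit()):
            raise ValueError(
                f"aspect_ratio must look like '9:16', got {self.aspect_ratio!r}"
            )


# The settings of this example, copied from the specs above.
city_camera_move = RenderSettings(
    engine="Wan 2.5 Text & Image to Video",
    mode="text-to-video",
    duration_s=10,
    aspect_ratio="9:16",
    audio=True,
)

# A remix: same engine and framing, shorter clip, audio off.
short_silent = replace(city_camera_move, duration_s=5, audio=False)
```

Keeping the record frozen means a remix is always a new object created with `replace`, so the original settings stay available for comparison.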
Related examples

Same example family
Wan 2.5 Text & Image to Video audio-enabled video example: A vertical cinematic mini…
This text-to-video example from Wan 2.5 Text & Image to Video shows a vertical cinematic mini action scene. It highlights audio-enabled output in a 5-second, 26:15 clip.

Same example family
Wan 2.5 Text & Image to Video audio-enabled video example: smartwatch runner ad
This text-to-video example from Wan 2.5 Text & Image to Video shows a smartwatch runner ad. It highlights audio-enabled output in a 5-second, 9:16 clip.

Same watch-page intent
Kling 2.6 Pro audio-enabled video example: 10-second 16 9 cinematic shot in
This text-to-video example from Kling 2.6 Pro highlights audio-enabled output in a 10-second, 16:9 clip.

Same watch-page intent
LTX Video 2.0 Fast audio-enabled video example: office
This text-to-video example from LTX Video 2.0 Fast shows an office scene. It highlights audio-enabled output in an 8-second, 16:9 clip.
Recreate
Load this render in the workspace
Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.