Wan 2.6 Text & Image to Video camera movement example: city tracking

Wan 2.6 Text & Image to VideoText to video10s16:9Audio

This Wan 2.6 Text & Image to Video text to video example shows city tracking. It highlights audio-enabled output and camera motion control with 10-second timing · 16:9 · 720p output.

Prompt

Wide 16:9 cinematic action shot, a runner sprints through a rainy city street at night, water splashes realistically with each step, reflections on wet asphalt, handheld tracking camera following from the side. Dynamic…

Show full prompt

Wide 16:9 cinematic action shot, a runner sprints through a rainy city street at night, water splashes realistically with each step, reflections on wet asphalt, handheld tracking camera following from the side. Dynamic motion with believable inertia and physics, no rubbery limbs, no wobbling background, stable scene geometry, minimal temporal flicker, sharp details despite fast movement, realistic motion blur.

Render details

Workflow

Text-to-video workflow

10-second render in 16:9

Audio-enabled output

Tracking camera move

Cinematic styling

Engine

Wan 2.6 Text & Image to Video

Wan 2.6 merges text, image, and reference-to-video in one card with multi-shot prompting and 720p/1080p tiers.

Text prompts
Image input
Reference video

Specs

Engine

Wan 2.6 Text & Image to Video

Mode

Text to video

Duration

10s

Aspect ratio

16:9

Resolution

720p

FPS

24

Audio

Enabled

Render cost

$1.30

Created

2026-01-29

Related examples

Recreate

Load this render in the workspace

Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.