Wan 2.6 Text & Image to Video camera movement example: studio push-in
This Wan 2.6 Text & Image to Video text to video example shows studio push-in. It highlights audio-enabled output and camera motion control with 10-second timing · 16:9 · 720p output.
Prompt
Wide 16:9 full-body unboxing video in a clean studio/kitchen setting. A person is fully visible (head-to-toe or at least head-to-knees) standing behind a minimalist tabletop. They unbox a small generic gadget from a pla…
Show full promptHide full prompt
Wide 16:9 full-body unboxing video in a clean studio/kitchen setting. A person is fully visible (head-to-toe or at least head-to-knees) standing behind a minimalist tabletop. They unbox a small generic gadget from a plain matte cardboard box: peel the seal, open the lid, remove the inner tray, take out the device and accessories, and lay everything neatly on the table. The person occasionally lifts the item toward the camera for a closer look, then places it back down. Realism requirements: natural body proportions, stable identity, realistic skin and clothing fabric, no face warping, no unnatural limb bending. Hands must be highly realistic: correct finger count, natural grip, believable pressure/contact with the box and device, consistent shadows, no extra fingers, no “floating” objects. Keep object geometry stable, no wobbling background, minimal temporal flicker. Camera: single continuous shot, tripod-stable, slight cinematic push-in (very slow), eye-level or slightly above table height. Natural soft daylight, clean shadows, realistic materials and textures. No logos, no brand names, no watermarks. No subtitles. Optional on-screen title at the top (perfectly readable and stable, no jitter): "UNBOXING — FIRST LOOK"
Render details
Workflow
Text-to-video workflow
10-second render in 16:9
Audio-enabled output
Push-in camera move
Cinematic styling
Engine
Wan 2.6 Text & Image to Video
Wan 2.6 merges text, image, and reference-to-video in one card with multi-shot prompting and 720p/1080p tiers.
Specs
Engine
Wan 2.6 Text & Image to Video
Mode
Text to video
Duration
10s
Aspect ratio
16:9
Resolution
720p
FPS
24
Audio
Enabled
Render cost
$1.30
Created
2026-02-02
Related examples

Same example family
Wan 2.6 Text & Image to Video camera movement example: Global look elegant thriller…
This Wan 2.6 Text & Image to Video text to video example shows Global look elegant thriller rainy night. It highlights audio-enabled output and camera motion control with 15-second timing · 16:9 · 720p output.

Same watch-page intent
Google Veo 3.1 Fast camera movement example: living room commercial
This Google Veo 3.1 Fast text to video example shows living room commercial. It highlights audio-enabled output and camera motion control with 6-second timing · 16:9 output.

Same watch-page intent
MiniMax Hailuo 02 Standard camera movement example: studio camera move
This MiniMax Hailuo 02 Standard text to video example shows studio camera move. It highlights camera motion control with 10-second timing · 16:9 output.

Same example family
Wan 2.5 Text & Image to Video audio-enabled video example: A vertical cinematic mini…
This Wan 2.5 Text & Image to Video text to video example shows A vertical cinematic mini action scene. It highlights audio-enabled output with 5-second timing · 26:15 output.
Recreate
Load this render in the workspace
Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.