OpenAI Sora 2 audio-enabled video example: city camera move
This OpenAI Sora 2 text-to-video example shows a city camera move. It highlights audio-enabled output in a 12-second, 9:16 render.
Prompt
Logline
A vertical, cinematic mini action scene where a spy-style hero runs like in a blockbuster trailer, only to reveal at the end that they are just late for a Zoom call. The tone is intense at first, then funny and self-aware.

Global style and format
Vertical 9:16, 1080x1920 if supported (otherwise 720x1280). Photorealistic, high-contrast, dramatic “blockbuster” lighting. Intended use: TikTok, Instagram Reels, YouTube Shorts. Total duration: 12 seconds.

Shot 1 – 0–4s: Rooftop chase
Exterior, early morning. Vertical shot of a generic action hero sprinting across a city rooftop, wearing casual clothes with a light tactical vibe (no specific franchise, no logos). The camera is in front of the hero, running backwards, very close to their face, cinema-style. Wind in their hair, serious expression, city skyline behind in soft focus. The motion is smooth and dynamic, like a movie trailer.

Dialogue (Shot 1)
The hero, slightly out of breath, mutters to themselves: “Okay… no pressure… totally under control…”

Shot 2 – 4–8s: Almost-mission-impossible
Hard cut to a side angle: the hero charges towards what looks like the edge of the rooftop. The camera moves with them in profile. At the last second, they make a dramatic leap in slow motion, as if jumping between two skyscrapers. We briefly see their feet flying over… a very small gap with a single air-conditioning unit and a tiny puddle. The shot is still framed and graded like a serious action scene, but the gap is obviously ridiculous.

Dialogue (Shot 2)
As they jump, we hear them think out loud: “This is just like those movies… please don’t trip now…”

Shot 3 – 8–12s: Comedic reveal and punchline
Cut to a medium shot inside a modern apartment. The hero lands with a heavy, cinematic “thud” in front of a desk with a laptop open on a video call screen. The camera settles in a vertical mid-shot. The hero straightens their shirt, grabs a headset from the desk and looks straight into the camera with a mix of intensity and embarrassment. On the laptop screen we see generic, blurred silhouettes of people on a call (no identifiable faces).

Dialogue (Shot 3)
The hero, suddenly very polite, says: “Hi everyone, sorry I’m late… little… parkour situation.”

On-screen text
At the bottom of the frame between 10 and 12 seconds, add clean white text:
TURN YOUR DAY INTO A MOVIE
Simple sans-serif font, high contrast, sharp and stable.

Audio
Generate synchronized audio:
- Big, dramatic trailer-style percussion and pulses in Shots 1 and 2, matching the running and the jump.
- Subtle city ambience under the music on the rooftop (distant traffic, wind).
- A heavy “whoosh” and impact sound as the hero lands in Shot 3.
- As soon as they deliver the punchline, the music drops into a light, cheeky sting, as if the trailer suddenly became a comedy.
No additional voiceover besides the hero’s lines.

Constraints
Keep the hero generic and non-identifiable: no specific actor likeness, no real brand logos. Do not show real software UI or a recognisable logo on the laptop, just a generic video call layout with blurred silhouettes. Avoid excessive camera shake; the energy should come from motion, framing and sound, not from random jitter.
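The prompt splits the 12 seconds into three shot windows and places the on-screen text in a 10 to 12 second window. A minimal sketch for sanity-checking that kind of timeline before rendering is shown here. The `Shot` class and both helper functions are hypothetical names introduced for illustration; only the timings come from the prompt.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Shot:
    name: str
    start: int  # seconds from the start of the clip
    end: int    # seconds from the start of the clip


# Shot windows and overlay timing copied from the prompt.
SHOTS = [
    Shot("Rooftop chase", 0, 4),
    Shot("Almost-mission-impossible", 4, 8),
    Shot("Comedic reveal and punchline", 8, 12),
]
TOTAL_DURATION = 12
OVERLAY_WINDOW = (10, 12)  # "TURN YOUR DAY INTO A MOVIE" text


def timeline_problems(shots, total):
    """Return a list of gaps, overlaps, or length mismatches (empty if none)."""
    problems = []
    cursor = 0
    for shot in shots:
        if shot.start != cursor:
            problems.append(f"{shot.name}: starts at {shot.start}s, expected {cursor}s")
        if shot.end <= shot.start:
            problems.append(f"{shot.name}: end must be after start")
        cursor = shot.end
    if cursor != total:
        problems.append(f"timeline ends at {cursor}s, expected {total}s")
    return problems


def shot_containing(shots, window):
    """Return the shot that fully contains the (start, end) window, or None."""
    start, end = window
    for shot in shots:
        if shot.start <= start and end <= shot.end:
            return shot
    return None


print(timeline_problems(SHOTS, TOTAL_DURATION))
print(shot_containing(SHOTS, OVERLAY_WINDOW).name)
```

For this prompt the shots tile the full 12 seconds with no gaps, and the overlay window falls entirely inside Shot 3.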
Render details
Workflow
Text-to-video workflow
12-second render in 9:16
Audio-enabled output
Camera move
Cinematic styling
Engine
OpenAI Sora 2
OpenAI Sora 2 handles cinematic narratives with lip-sync and audio, which makes it well suited to hero renders.
Specs
Engine: OpenAI Sora 2
Mode: Text to video
Duration: 12s
Aspect ratio: 9:16
Audio: Enabled
Render cost: $1.56
Created: 2025-11-14
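The listed render cost and duration imply a per-second rate for this render. The sketch below works through that arithmetic. It assumes cost scales linearly with duration, which this page does not state; real pricing may also depend on resolution, audio, or engine. The `estimate_cost` helper is a hypothetical name used only for illustration.

```python
from decimal import Decimal

# Values taken from the specs for this render.
RENDER_COST = Decimal("1.56")  # US dollars
DURATION_S = Decimal("12")     # seconds

# Implied rate for this specific render.
cost_per_second = RENDER_COST / DURATION_S


def estimate_cost(seconds):
    """Rough estimate for another duration, ASSUMING linear per-second pricing."""
    return (cost_per_second * Decimal(seconds)).quantize(Decimal("0.01"))


print(cost_per_second)   # 0.13
print(estimate_cost(4))  # 0.52
```

Under that linear assumption, this render works out to $0.13 per second. Treat any estimate for a different duration as a rough guide, not a quoted price.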
Related examples

Same example family
OpenAI Sora 2 audio-enabled video example: A man wearing a gorilla head
This OpenAI Sora 2 text-to-video example shows a man wearing a gorilla head. It highlights audio-enabled output in a 4-second, 16:9, 720p render.

Same example family
OpenAI Sora 2 audio-enabled video example: hallway
This OpenAI Sora 2 text-to-video example shows a hallway. It highlights audio-enabled output in a 12-second, 16:9 render.

Same watch-page intent
Kling 2.6 Pro audio-enabled video example: 10-second 16:9 cinematic shot in
This Kling 2.6 Pro text-to-video example highlights audio-enabled output in a 10-second, 16:9 cinematic render.

Same watch-page intent
LTX 2.3 Pro audio-enabled video example: city close-up
This LTX 2.3 Pro text-to-video example shows a city close-up. It highlights audio-enabled output in a 10-second, 16:9, 1080p render.
Recreate
Load this render in the workspace
Start from the same prompt and settings, then remix duration, aspect ratio, references, or audio.
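One way to picture a remix is as a copy of the original settings with a few fields changed. The sketch below is illustrative only. `RenderSettings` and its field names are hypothetical and do not correspond to any workspace or engine API; the starting values are the ones listed in the specs on this page.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class RenderSettings:
    # Hypothetical container for the settings shown on this page.
    engine: str
    mode: str
    duration_s: int
    aspect_ratio: str
    audio: bool


# Settings of the original render, as listed in the specs.
ORIGINAL = RenderSettings(
    engine="OpenAI Sora 2",
    mode="Text to video",
    duration_s=12,
    aspect_ratio="9:16",
    audio=True,
)

# A remix keeps everything from the original except the fields being changed.
landscape_silent = replace(ORIGINAL, aspect_ratio="16:9", audio=False)

print(landscape_silent)
```

Keeping the settings immutable and deriving each remix with `replace` leaves the original untouched, so different variants can be compared against the same baseline.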