Duration & Output
- Durations: 6 s or 10 s
- Resolutions: 512p or 768p
- Routes tuned for speed and cost
Generate fast, physics-aware drafts at 512p/768p from text or a single image, with optional end frame to lock the landing.
Pick 6 or 10 seconds, choose 16:9/9:16/1:1, and get silent clips ready for storyboards, loops and motion tests.
MiniMax Hailuo 02 – Text & Image-to-Video in MaxVideoAI (512p/768p, 6–10s)
Using the input image of the helicopter flying over a burning city in daylight, create an 8–10 second cinematic 16:9 shot where…
View render →Why MiniMax Hailuo 02 is powerful inside MaxVideoAI
Best use cases
MiniMax’s headline model for short sequences with strong physics and prompt adherence.
In MaxVideoAI it’s wired as a fast, silent draft engine at 512p/768p.
MaxVideoAI flow
Specs as used today in MaxVideoAI.
MiniMax Hailuo 02 is a fast, physics-savvy draft engine for cheap, silent motion tests.
See text and image runs rendered with the same configuration you have.
View all MiniMax Hailuo 02 examples →

MiniMax Hailuo 02 Standard (Image to Video) · 10s
Using the input image of the helicopter flying over a burning city in daylight, create an 8–10 second cinematic 16:9 shot where…
Recreate this shot →
MiniMax Hailuo 02 Standard (Text to Video) · 10s
A cinematic 10-second shot in 16:9. At night, the camera flies smoothly through a modern city full of soft neon lights and…
Recreate this shot →
MiniMax Hailuo 02 Standard (Text to Video) · 6s
Wide shot of a modern creative space. A male director stands beside a green-screen, gesturing with enthusiasm. Over his shoulder you glimpse…
Recreate this shot →
MiniMax Hailuo 02 Standard (Text to Video) · 6s
Close-up profile of a middle-aged man standing under a neon sign in a rainy alley at night, reflections on wet pavement, rim…
Recreate this shot →
MiniMax Hailuo 02 Standard (Text to Video) · 6s
Sweeping aerial view of a mountain range at golden hour, voluminous clouds drifting between peaks, camera height slowly rising, ultra-wide angle, photorealisti…
Recreate this shot →
Short, motion-driven prompts: subject, environment, camera, physics, style, negative prompt.
[Duration] second [aspect ratio] shot of [subject] moving through [environment]. Camera [movement]; [secondary motion] reacts naturally. Style: [realistic/stylized]. No text/logos/UI.
One idea per clip; keep wording tight.
Animate boards/renders/concepts quickly with optional end frame.
6s: 1–2 beats; 10s: 2–3 beats max in one environment.
Focus on physics-driven transitions in a single setting.
Demo Prompt – 10s city drift (9:16)
A cinematic 10-second shot in 16:9. At night, the camera flies smoothly through a modern city full of soft neon lights and warm windows, then glides towards a single bright window high on a building. Without cutting, the camera passes through the glass into a cozy creator studio with a large desk and an ultra-wide monitor glowing in the dark. The room is lit by the screen and a warm desk lamp. The camera continues to push in until the monitor fills most of the frame. On the screen there is a clean AI video workspace UI (generic, no real logos) showing four small video previews playing at the same time: one realistic city street shot, one colourful animation, one product hero shot and one abstract motion-graphics scene. The overall style is cinematic, with smooth camera motion, gentle depth of field and rich contrast.
View render →10s city drift
10s vertical shot drifting through a neon-soaked city street at night, stylized but grounded in real physics.
Camera floats forward above puddles, reflections rippling as people/umbrellas pass in soft focus.
Wind gusts send trash and steam swirling; painterly look, saturated colors, no text/logos.
Hailuo 02 is your fast, physics-savvy sketch engine—use it to explore motion before finalizing in higher-end models.
Respect guardrails; Hailuo is for professional, ethical use.
Great motion/physics at lower per-second cost, ideal for volume ads, explainers and experiments without burning budget.
2–3 sentences covering subject, environment, camera and physics; one main idea per clip.
Yes—16:9, 9:16, 1:1 are available in MaxVideoAI.
No. Hailuo 02 outputs silent video; add sound in post or pair with audio-capable engines.
Use Hailuo 02 for drafts at 512p/768p, then upscale or rerun the prompt in 1080p engines (Veo, Kling, Wan, Sora 2 Pro).
Compare pricing, latency, and output options across MaxVideoAI.
openai
Create rich AI-generated videos from text or image prompts using Sora 2. Native voice-over, ambient effects, and motion sync via MaxVideoAI.
Compare MiniMax vs Pika 2.2 →openai
Create longer, more immersive AI videos from text or images using Sora 2 Pro. Native voice, ambient sound, prompt chaining, and advanced control via MaxVideoAI.
Compare MiniMax vs Pika 2.2 →google-veo
Generate cinematic 8-second videos with native audio using Veo 3.1 by Google DeepMind on MaxVideoAI. Reference-to-video guidance, multi-image fidelity, pay-as-you-go pricing from $0.52/s.
Compare MiniMax vs Pika 2.2 →MiniMax Hailuo 02 is your fast, physics-savvy sketch engine for testing motion and transitions cheaply.
Use it to explore ideas, then promote winners to higher-end models for audio and 1080p finals.
Open Generate