CURRENT ALIBABA VIDEO MODEL

Happy Horse 1.1

Native audio, lip-sync, image-to-video and reference-to-video in one current Alibaba route.

Use Happy Horse 1.1 when a shot needs synchronized speech or sound from text, a starting image, or up to nine reference images. Keep Happy Horse 1.0 for legacy video-edit jobs.

Happy Horse 1.1 native audio reference-guided video example
Native audio
Reference video1080p

Happy Horse 1.1 example

Native-audio text, image and reference video route

View render

Native audio

Generate dialogue, ambience and SFX with the render when the route supports it.

Text or image

Start from a scene brief or a still image to lock subject and composition.

Reference images

Use up to nine references with character1 through character9 prompt anchors.

Expanded ratios

Use landscape, vertical, square, classic, wide, tall, 5:4 or 4:5 composition.

720p or 1080p

Choose the exposed MaxVideoAI resolution before generation.

Lower 1080p provider rate

Happy Horse 1.1 uses the current 1080p provider rate before MaxVideoAI margin.

Happy Horse 1.1 pricing at a glance

Preset native-audio totals - see the exact live price in the app before you generate.

View full pricing

Native-audio workflow

$0.91

5s · 720p

Common production check

$1.82

10s · 720p

Final delivery

$3.51

Most popular

15s · 1080p

Max duration

15s

Up to 1080p

All prices are MaxVideoAI display prices in USD credits for preset scenarios.

Happy Horse 1.1 model demos

Review the model page clips for native audio, lip-sync, and reference-to-video behavior. Comparison pages intentionally stay text/spec focused for this launch.

Real community renders

See what's possible with Happy Horse 1.1.

Recreate any shot

Jump into the app with one click and reuse the setup.

Native audio

Dialogue, ambience and SFX generated in sync.

Multi-shot continuity

Keep characters, style and scene consistency across sequences.

Production-aware

Built-in guardrails and safety filters for responsible review.

Happy Horse 1.1 or Seedance?

Use Happy Horse 1.1 for Alibaba native-audio text, image and reference generation. Use Seedance 2.0 when multimodal references, longer production continuity and current Seedance behavior are the priority.

Compare Happy Horse vs Seedance

Need video edit?

Use Happy Horse 1.0 only when you specifically need the legacy video-edit endpoint. New text, image and reference jobs should start on 1.1.

Open legacy Happy Horse 1.0

Working from references?

Assign each file one job: identity, wardrobe, movement, environment or audio mood.

Open Prompt Lab

Prompt Lab — Happy Horse 1.1

How Happy Horse 1.1 uses references

Text-to-video

Write the subject, action, camera, style and audio beats in a compact brief.

Image-to-video

Use a still image to anchor subject, product, wardrobe or composition.

Reference-to-video

Name each reference as character1, character2 and onward to keep roles clear.

Legacy video edit

Switch to Happy Horse 1.0 only when a source video must be edited rather than regenerated.

Audio handling

Keep dialogue short and tie SFX to visible actions for cleaner synchronized output.

Global principles

    Engine quirks / what to watch for

      Demo prompt - Happy Horse 1.1

      Text-to-video

      Subject: Night market noodle stall chef  •  Action: Flips noodles in a wok and plates the bowl after rain
      Camera: Neon wide shot, macro wok, side plate-up, slow push-in  •  Style: Cinematic food film, wet street reflections, steam and lantern bokeh
      Audio: Wok sizzle, oil whoosh, rain on the awning, no dialogue

      View full prompt
      Four-shot energetic studio food-film sequence in a small night market noodle stall after rain. Shot 1: neon reflections on wet pavement, a chef silhouette places a black wok over a blue gas flame, steam already rising. Shot 2: macro close-up of noodles flipping in the wok with orange sparks, camera locked, sizzling oil and quick whoosh. Shot 3: medium side shot as the chef slides the noodles into a ceramic bowl, steam curls across the lens, background lanterns soft and out of focus, no signs or readable text. Shot 4: slow push-in on the finished bowl on a stainless counter while rain taps the awning and steam fades into the neon light, no dialogue, no logos.
      10s16:9Audio on
      Happy Horse 1.1 AI video example: Demo prompt - Happy Horse 1.1
      10s16:9
      View full render

      Tips and boundaries

      Best practices, common fixes, and important limitations to help you get the strongest results with Happy Horse 1.1.

      What works best

      • Use T2V for fresh ideas and spokesperson-style native audio shots.
      • Use I2V when a key visual or first frame is already approved.
      • Use reference-to-video when identity, wardrobe, product shape, or character continuity matters.
      • Use Happy Horse 1.0 only when the job specifically needs legacy video edit.

      Common problems → fast fixes

      • Feels random / inconsistent → simplify to: subject + action + camera + lighting. Re-run 2–3 takes.
      • Motion looks weird → reduce movement: one camera move, slower action, fewer props.
      • Subject drifts off-brand → start from a reference image and lock palette + lighting.
      • Text looks wrong → avoid readable signage, tiny UI, micro labels. Keep text off-screen.
      • Dialogue drifts → keep lines short and punchy; avoid long monologues.

      Hard limits to keep in mind

      • Output is short-form (15s output). For longer edits, stitch multiple clips.
      • Resolution tops out at 1080p for this tier.
      • No fixed seeds — iteration = re-run + refine.

      Compare Happy Horse 1.1 vs other AI video models

      These side-by-side comparisons break down price, resolution, audio, speed, and motion style so you can pick the right engine fast.

      Each page includes real outputs and practical best-use cases.

      Technical overview

      The limits that shape your renders.

      View full specs

      Price / second

      720p $0.18/s1080p $0.24/s

      Text-to-Video

      Supported

      Image-to-Video

      Supported

      Video-to-Video

      Not supported in the current Happy Horse 1.1 route

      First/Last frame

      First frame supported via Image-to-Video; last frame not supported

      Start / reference image

      Supported (1-9 reference stills)

      Reference video

      Not supported in the current Happy Horse 1.1 route

      Max resolution

      1080p

      Max duration

      15s output

      Aspect ratios

      16:9 / 9:16 / 1:1 / 4:3 / 3:4 / 21:9 / 9:21 / 5:4 / 4:5

      FPS options

      24 fps

      Output format

      MP4

      Audio output

      Supported

      Native audio generation

      Supported

      Lip sync

      Supported

      Camera / motion controls

      Basic

      Watermark

      No (MaxVideoAI)

      Technical overview

      Details
      • Workflows: Text-to-video, image-to-video, and reference-to-video are exposed as one current Happy Horse 1.1 model in MaxVideoAI.
      • Duration: 3-15 s for generation outputs.
      • Resolution: 720p or 1080p
      • Reference images: 1-9 images, addressed as character1 through character9 in the prompt.
      • Video edit: Not exposed on Happy Horse 1.1. Use the legacy Happy Horse 1.0 video-edit route when a source clip must be edited.
      • Audio: Native synchronized audio and lip-sync are treated as part of the generation.

      Safety & people / likeness

      Built-in safeguards and best practices for responsible creation with Happy Horse 1.1.

      • Use original characters and owned references.
      • Avoid real people, celebrities and protected characters.
      • Do not use someone's likeness without consent.
      • Avoid copyrighted franchises, logos and protected IP.

      FAQ

      What inputs does Happy Horse 1.1 support?

      MaxVideoAI exposes Happy Horse 1.1 as one current model with text-to-video, image-to-video, and reference-to-video workflows.

      Does Happy Horse 1.1 support lip-sync?

      Yes. Happy Horse 1.1 is treated as a native-audio model with synchronized speech and lip-sync integrated into the generation flow.

      Does Happy Horse 1.1 support video edit?

      No. Video edit remains available through the legacy Happy Horse 1.0 route.