← Back to models

Nano Banana – Photoreal Image Generation

Generate on-brand stills, keyframes, and product shots before you hit Render. Nano Banana is the image sidekick for Sora, Veo, Kling, and the rest of the stack.

Text → ImageImage → Image1–8 outputs

Generate photoreal stills and edits to lock style, lighting, and framing before you animate in Sora, Veo, or Kling.

Text-to-image and image-to-image share the same wallet and prompt lab as your video engines, with 1–8 outputs per run.

Sora 2 preview – Nano Banana demo still from MaxVideoAI

Nano Banana – Photoreal Image Generation

Nano Banana demo still from MaxVideoAI

Why Nano Banana inside MaxVideoAI

  • Text → Image and Image → Image in one lab
  • Prep photoreal keyframes before Sora/Veo/Kling
  • Great for local edits: cleanup, relight, background replacement, small prop tweaks
  • 1–8 outputs per run with seeds for consistency
  • Wallet-based pricing shared with video engines; no separate Nano Banana account needed

Best use cases

  • Product hero stills and thumbnails
  • Brand-style keyframes for Sora and Veo
  • Cleaning and standardising client visuals

How Nano Banana works in MaxVideoAI

Choose text-to-image or image-to-image, set the batch size (1–8), write your prompt or edit instructions, and generate.

Use it to lock style, lighting, and framing before animating in video engines.

Workflow

  1. Open the Image lab with Nano Banana selected
  2. Pick Text → Image or Image → Image
  3. Set num_images (1–8) and seed if you need consistency
  4. Add your prompt; for edits, upload a clean reference
  5. Check the live price chip and generate

Technical overview

Live pricing updates inside the workspace. Check the chip before you render.

Technical overview

  • Input types: Text prompts, reference images (JPG/PNG/WebP), and edit mode for restyles or cleanups.
  • Outputs: High-res still images tuned for photorealism and text legibility.
  • Modes: Text-to-image; image-to-image for restyle, cleanup, and background replacement.
  • Typical use: Prepare 1–4 keyframes before animating in Sora, Veo, Kling, or Wan.

Nano Banana examples

Recent photoreal stills and edits used as keyframes for video workflows. View image examples →

Prompt ideas for Nano Banana

Describe subject, lighting, lens, and how you’ll use the frame. For edits, spell out the transformation.

1Subject and setting (product/person/scene)
2Lighting and lens (studio/soft/50mm/grade)
3Mood and composition (hero angle, clean background)
4Output intent (keyframe/thumbnail/reference)
5Edit instructions if applicable

Photoreal [subject] in/on [environment/surface], lit by [lighting], shot on [lens/look], clean background for [use case].

For edits: “Clean up / restyle this image: [instructions].”

    Tips & limits

    • Fast photoreal stills for keyframes and thumbnails
    • Edit mode for cleanup, relight, and background swaps
    • Batch 1–8 outputs, reuse seeds for consistency
    • Image-only; use video engines for motion
    • One prompt per batch; keep instructions concise
    • No audio or video inputs in this route

    FAQ

    Is Nano Banana for images or video?

    Images. Use it to prep photoreal references, thumbnails, and keyframes before sending shots to video engines like Sora or Veo.

    How many images per run?

    Set `num_images` to 1–8. Batch a few variants, then reuse a seed to stay consistent.

    Does edit mode need masks?

    No masks needed. Upload a clean still and describe the transformation (cleanup, background swap, relight).

    Explore other models

    google

    Nano Banana Pro

    Generate studio-quality stills with Google’s Gemini 3-powered Nano Banana Pro. 1K, 2K, and 4K outputs, multi-image reference editing, and razor-sharp typography in MaxVideoAI.

    View model →

    openai

    OpenAI Sora 2

    Create rich AI-generated videos from text or image prompts using Sora 2. Native voice-over, ambient effects, and motion sync via MaxVideoAI.

    View model →

    openai

    OpenAI Sora 2 Pro

    Create longer, more immersive AI videos from text or images using Sora 2 Pro. Native voice, ambient sound, prompt chaining, and advanced control via MaxVideoAI.

    View model →
    Generate an image