Technical overview
- Input types: Text prompts and reference images (JPG/PNG/WebP); reference images can be edited for restyles or cleanups.
- Outputs: High-resolution still images tuned for photorealism and legible in-image text.
- Modes: Text-to-image; image-to-image for restyle, cleanup, and background replacement.
- Typical use: Prepare 1–4 keyframes before animating in Sora, Veo, Kling, or Wan.
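
The constraints listed above can be checked before a keyframe batch is submitted. The following is a minimal sketch, not an actual API of any tool named here: `KeyframeRequest`, `validate_batch`, and the extension set are hypothetical names introduced for illustration, and the accepted extensions are inferred from the JPG/PNG/WebP list.

```python
from dataclasses import dataclass, field
from pathlib import Path

# Assumption: accepted extensions inferred from the JPG/PNG/WebP list above.
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp"}
# Upper end of the "1-4 keyframes" guideline in the overview.
MAX_KEYFRAMES = 4


@dataclass
class KeyframeRequest:
    """Hypothetical request shape, used only for this illustration."""
    prompt: str
    reference_images: list[str] = field(default_factory=list)

    @property
    def mode(self) -> str:
        # A request with reference images is treated as image-to-image.
        return "image-to-image" if self.reference_images else "text-to-image"


def validate_batch(requests: list[KeyframeRequest]) -> list[str]:
    """Return a list of problems; an empty list means the batch passes these checks."""
    problems = []
    if not 1 <= len(requests) <= MAX_KEYFRAMES:
        problems.append(f"expected 1-{MAX_KEYFRAMES} keyframes, got {len(requests)}")
    for i, req in enumerate(requests):
        for path in req.reference_images:
            if Path(path).suffix.lower() not in ALLOWED_EXTENSIONS:
                problems.append(f"request {i}: unsupported reference format: {path}")
    return problems
```

For example, a two-keyframe batch with one text-only request and one request carrying a `.webp` reference would pass, while a `.gif` reference or a five-keyframe batch would each produce one reported problem.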
