> For the complete documentation index, see [llms.txt](https://docs.qolaba.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.qolaba.ai/model-reference/image-models.md).

# Image Models

Qolaba provides access to a curated set of image generation models across text-to-image, image-to-image, and image editing workflows. Each model has distinct strengths, supported quality levels, credit costs, and generation limits. Use this page as a reference when selecting a model for your image generation task.

***

### How to Read This Page

| Column                   | What It Means                                                       |
| ------------------------ | ------------------------------------------------------------------- |
| **Credits / Image**      | Credits consumed per generated image at the specified quality level |
| **Quality**              | Supported resolution or quality tiers for that model                |
| **Max Generations**      | Maximum number of images producible in a single run                 |
| **Max Reference Images** | Maximum number of reference images uploadable (image-to-image only) |

{% hint style="info" %}
Models marked with ⭐ are available on **paid plans only**.
{% endhint %}

***

### Text-to-Image Models

Text-to-image models generate images from written prompts. Select based on the visual style, quality level, and credit budget required for your task.

#### **Flagship Models**

| Model                 | Quality | Credits / Image | Max Generations | Best For                                                                                   |
| --------------------- | ------- | --------------- | --------------- | ------------------------------------------------------------------------------------------ |
| **Nano Banana 2**     | 0.5K    | 15              | 4               | General-purpose generation — fast, versatile, strong quality across a wide range of styles |
|                       | 1K      | 21              |                 |                                                                                            |
|                       | 2K      | 32              |                 |                                                                                            |
|                       | 4K      | 48              |                 |                                                                                            |
| **Nano Banana Pro** ⭐ | 1K / 2K | 42              | 4               | Premium quality — higher fidelity for demanding, production-ready outputs                  |
|                       | 4K      | 75              |                 |                                                                                            |

***

#### **OpenAI Models**

GPT Image 2 uses quality tiers — **Low**, **Medium**, and **High** — instead of resolution values. These correspond to increasing levels of detail, prompt adherence, and visual fidelity:

* **Low** — fast generation, basic detail, suitable for drafts and concept testing
* **Medium** — balanced quality for most professional use cases
* **High** — maximum detail, photorealism, and prompt accuracy for final production output

| Model             | Quality | Credits / Image | Max Generations | Best For                                                                        |
| ----------------- | ------- | --------------- | --------------- | ------------------------------------------------------------------------------- |
| **GPT Image 2** ⭐ | Low     | 4               | 8               | Photorealism, text-in-image, structured scenes with strong prompt comprehension |
|                   | Medium  | 19              |                 |                                                                                 |
|                   | High    | 69              |                 |                                                                                 |

***

#### **Google Models**

| Model                  | Quality | Credits / Image | Max Generations | Best For                                                                           |
| ---------------------- | ------- | --------------- | --------------- | ---------------------------------------------------------------------------------- |
| **ImageGen 4**         | 1K / 2K | 11              | 4               | General-purpose — Google's latest image model, strong all-rounder for varied tasks |
| **ImageGen 4 Fast**    | 1K / 2K | 6               | 4               | Fast generation — quick iterations, draft testing, high-volume workflows           |
| **ImageGen 4 Ultra** ⭐ | 1K / 2K | 16              | 4               | Maximum quality — highest fidelity tier from Google for production-ready output    |

***

#### **Flux Models**

*Black Forest Labs*

| Model            | Quality  | Credits / Image | Max Generations | Best For                                                                                     |
| ---------------- | -------- | --------------- | --------------- | -------------------------------------------------------------------------------------------- |
| **Flux.2 Pro** ⭐ | Standard | 19              | 4               | Creative, high-fidelity generation — next-generation Flux model for detailed artistic output |
| **Flux.1 Dev**   | Standard | 8               | 4               | Developer-friendly experimentation — open model, great for testing and creative exploration  |

***

#### **ByteDance Models**

| Model            | Quality  | Credits / Image | Max Generations | Best For                                                                                             |
| ---------------- | -------- | --------------- | --------------- | ---------------------------------------------------------------------------------------------------- |
| **Seedream 4.5** | Standard | 13              | 6               | Creative and stylized images — ByteDance's flagship, exceptional for artistic and expressive outputs |

***

#### **Recraft Models**

| Model          | Quality  | Credits / Image | Max Generations | Best For                                                                                              |
| -------------- | -------- | --------------- | --------------- | ----------------------------------------------------------------------------------------------------- |
| **Recraft V4** | Standard | 13              | 4               | Design assets and icons — clean graphic design output, brand-consistent visuals, vector-style imagery |

***

### Image-to-Image Models

Image-to-image models accept one or more reference images to guide generation. Everything else — prompt, keywords, output controls — works the same as text-to-image. The reference images influence composition, style, subject, or structure of the output.

#### **Qolaba Flagship Models**

| Model                 | Quality | Credits / Image | Max Generations | Max Reference Images | Best For                                                                               |
| --------------------- | ------- | --------------- | --------------- | -------------------- | -------------------------------------------------------------------------------------- |
| **Nano Banana 2**     | 0.5K    | 15              | 4               | 13                   | General-purpose reference-guided generation — versatile, consistent quality            |
|                       | 1K      | 21              |                 |                      |                                                                                        |
|                       | 2K      | 32              |                 |                      |                                                                                        |
|                       | 4K      | 48              |                 |                      |                                                                                        |
| **Nano Banana Pro** ⭐ | 1K / 2K | 42              | 4               | 13                   | Premium reference-guided generation — highest fidelity for brand and production assets |
|                       | 4K      | 75              |                 |                      |                                                                                        |

***

#### **OpenAI Models**

<table><thead><tr><th width="113.29296875">Model</th><th>Quality</th><th>Credits / Image</th><th>Max Generations</th><th>Max Reference Images</th><th>Best For</th></tr></thead><tbody><tr><td><strong>GPT Image 2</strong> ⭐</td><td>Low</td><td>4</td><td>8</td><td>15</td><td>Photorealistic transformations, text-in-image, strong prompt-guided reference editing</td></tr><tr><td></td><td>Medium</td><td>19</td><td></td><td></td><td></td></tr><tr><td></td><td>High</td><td>69</td><td></td><td></td><td></td></tr></tbody></table>

***

#### **Flux Models**

| Model            | Quality  | Credits / Image | Max Generations | Max Reference Images | Best For                                                                         |
| ---------------- | -------- | --------------- | --------------- | -------------------- | -------------------------------------------------------------------------------- |
| **Flux.2 Pro** ⭐ | Standard | 19              | 4               | 8                    | Creative, high-fidelity style and content transformation with reference guidance |

***

### Image Editing Models

Image editing models are purpose-built tools used within the four editing workflows — Inpainting, Background Removal, Upscaling, and Image Variation. They run automatically based on the tool selected and are not manually chosen in the model selector.

| Model                       | Used For             | What It Does                                                                                        |
| --------------------------- | -------------------- | --------------------------------------------------------------------------------------------------- |
| **BiRefNet V2**             | Background Removal   | One-click background removal — accurately detects and isolates the subject from any background      |
| **Flux General Inpainting** | Inpainting & Cleanup | Mask-based editing — removes or replaces specific areas of an image based on a text prompt          |
| **Nano Banana 2**           | Image Variation      | Generates creative variations inspired by uploaded reference images — up to 13 references supported |

***

### Model Comparison at a Glance

| Use Case                           | Recommended Model                  | Reason                                                                           |
| ---------------------------------- | ---------------------------------- | -------------------------------------------------------------------------------- |
| **General-purpose generation**     | Nano Banana 2                      | Versatile, strong quality, lowest cost entry point for flagship output           |
| **Premium production output**      | Nano Banana Pro ⭐                  | Highest fidelity for demanding, client-facing creative work                      |
| **Photorealism and text-in-image** | GPT Image 2 ⭐                      | OpenAI's latest — best-in-class for photorealistic scenes and legible text       |
| **Creative and artistic output**   | Seedream 4.5                       | Exceptional for stylized, expressive, and artistic image generation              |
| **Design assets and icons**        | Recraft V4                         | Clean graphic design output with strong brand consistency                        |
| **Fast iteration and drafts**      | ImageGen 4 Fast or Flux.1 Dev      | Low cost, fast generation — ideal for prompt testing and concept exploration     |
| **Maximum Google quality**         | ImageGen 4 Ultra ⭐                 | Google's highest quality tier for production-ready output                        |
| **Reference-guided generation**    | Nano Banana 2 or Nano Banana Pro ⭐ | Up to 13 reference images — best for brand consistency and style matching        |
| **Most reference images**          | GPT Image 2 ⭐                      | Supports up to 15 reference images per generation                                |
| **Background removal**             | BiRefNet V2 (auto)                 | Best-in-class subject isolation — runs automatically via Background Removal tool |
| **Inpainting and cleanup**         | Flux General Inpainting (auto)     | Precise mask-based editing — runs automatically via Inpainting tool              |