# Image Models

Qolaba provides access to a curated set of image generation models across text-to-image, image-to-image, and image editing workflows. Each model has distinct strengths, supported quality levels, credit costs, and generation limits. Use this page as a reference when selecting a model for your image generation task.

***

### How to Read This Page

| Column                   | What It Means                                                       |
| ------------------------ | ------------------------------------------------------------------- |
| **Credits / Image**      | Credits consumed per generated image at the specified quality level |
| **Quality**              | Supported resolution or quality tiers for that model                |
| **Max Generations**      | Maximum number of images producible in a single run                 |
| **Max Reference Images** | Maximum number of reference images uploadable (image-to-image only) |

{% hint style="info" %}
Models marked with ⭐ are available on **paid plans only**.
{% endhint %}

***

### Text-to-Image Models

Text-to-image models generate images from written prompts. Select a model based on the visual style, quality level, and credit budget your task requires.

#### **Qolaba Flagship Models**

| Model                 | Quality | Credits / Image | Max Generations | Best For                                                                                   |
| --------------------- | ------- | --------------- | --------------- | ------------------------------------------------------------------------------------------ |
| **Nano Banana 2**     | 0.5K    | 15              | 4               | General-purpose generation — fast, versatile, strong quality across a wide range of styles |
|                       | 1K      | 21              |                 |                                                                                            |
|                       | 2K      | 32              |                 |                                                                                            |
|                       | 4K      | 48              |                 |                                                                                            |
| **Nano Banana Pro** ⭐ | 1K / 2K | 42              | 4               | Premium quality — higher fidelity for demanding, production-ready outputs                  |
|                       | 4K      | 75              |                 |                                                                                            |

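To make the credit arithmetic concrete, the sketch below estimates the cost of a single run from the table above. It assumes a run costs credits-per-image multiplied by the number of images, which the column definitions suggest but this page does not state outright. `FLAGSHIP_CREDITS` and `run_cost` are illustrative names, not part of any Qolaba API.

```python
# Credits per image, copied from the flagship table above.
FLAGSHIP_CREDITS = {
    ("Nano Banana 2", "0.5K"): 15,
    ("Nano Banana 2", "1K"): 21,
    ("Nano Banana 2", "2K"): 32,
    ("Nano Banana 2", "4K"): 48,
    ("Nano Banana Pro", "1K"): 42,
    ("Nano Banana Pro", "2K"): 42,
    ("Nano Banana Pro", "4K"): 75,
}
MAX_GENERATIONS = 4  # both flagship models allow up to 4 images per run


def run_cost(model: str, quality: str, images: int) -> int:
    """Estimate credits for one run, assuming cost scales linearly with image count."""
    if not 1 <= images <= MAX_GENERATIONS:
        raise ValueError(f"images must be between 1 and {MAX_GENERATIONS}")
    return FLAGSHIP_CREDITS[(model, quality)] * images


print(run_cost("Nano Banana 2", "1K", 4))    # 84
print(run_cost("Nano Banana Pro", "4K", 2))  # 150
```
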
***

#### **OpenAI Models**

GPT Image 2 uses quality tiers — **Low**, **Medium**, and **High** — instead of resolution values. These correspond to increasing levels of detail, prompt adherence, and visual fidelity:

* **Low** — fast generation, basic detail, suitable for drafts and concept testing
* **Medium** — balanced quality for most professional use cases
* **High** — maximum detail, photorealism, and prompt accuracy for final production output

| Model             | Quality | Credits / Image | Max Generations | Best For                                                                        |
| ----------------- | ------- | --------------- | --------------- | ------------------------------------------------------------------------------- |
| **GPT Image 2** ⭐ | Low     | 4               | 8               | Photorealism, text-in-image, structured scenes with strong prompt comprehension |
|                   | Medium  | 19              |                 |                                                                                 |
|                   | High    | 69              |                 |                                                                                 |

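The gap between tiers makes a draft-then-finalize workflow worth considering. The sketch below compares exploring at **Low** and rendering one final image at **High** against rendering everything at **High**. It assumes credits add up linearly across images; `workflow_cost` is a hypothetical helper used only for illustration.

```python
# GPT Image 2 credits per image by quality tier, from the table above.
GPT_IMAGE_2_CREDITS = {"Low": 4, "Medium": 19, "High": 69}


def workflow_cost(drafts: int, finals: int) -> int:
    """Credits for `drafts` Low-tier images plus `finals` High-tier images."""
    return drafts * GPT_IMAGE_2_CREDITS["Low"] + finals * GPT_IMAGE_2_CREDITS["High"]


# Explore 8 prompt variants at Low, then render the winner once at High.
draft_then_final = workflow_cost(drafts=8, finals=1)
# Render all 8 variants at High instead.
all_high = workflow_cost(drafts=0, finals=8)

print(draft_then_final)  # 101
print(all_high)          # 552
```
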
***

#### **Google Models**

| Model                  | Quality | Credits / Image | Max Generations | Best For                                                                           |
| ---------------------- | ------- | --------------- | --------------- | ---------------------------------------------------------------------------------- |
| **ImageGen 4**         | 1K / 2K | 11              | 4               | General-purpose — Google's latest image model, strong all-rounder for varied tasks |
| **ImageGen 4 Fast**    | 1K / 2K | 6               | 4               | Fast generation — quick iterations, draft testing, high-volume workflows           |
| **ImageGen 4 Ultra** ⭐ | 1K / 2K | 16              | 4               | Maximum quality — highest fidelity tier from Google for production-ready output    |

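For budgeting, it can help to turn the credit costs above into a number of images. The sketch below does that for a fixed credit budget, assuming each image is billed at the listed rate with no other charges. The names `GOOGLE_CREDITS` and `images_within_budget` are illustrative only.

```python
# Credits per image for the Google models, from the table above.
GOOGLE_CREDITS = {"ImageGen 4": 11, "ImageGen 4 Fast": 6, "ImageGen 4 Ultra": 16}


def images_within_budget(budget: int) -> dict[str, int]:
    """Number of whole images each model can produce for `budget` credits."""
    return {model: budget // cost for model, cost in GOOGLE_CREDITS.items()}


print(images_within_budget(100))
# {'ImageGen 4': 9, 'ImageGen 4 Fast': 16, 'ImageGen 4 Ultra': 6}
```
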
***

#### **Flux Models**

*Black Forest Labs*

| Model            | Quality  | Credits / Image | Max Generations | Best For                                                                                     |
| ---------------- | -------- | --------------- | --------------- | -------------------------------------------------------------------------------------------- |
| **Flux.2 Pro** ⭐ | Standard | 19              | 4               | Creative, high-fidelity generation — next-generation Flux model for detailed artistic output |
| **Flux.1 Dev**   | Standard | 8               | 4               | Developer-friendly experimentation — open model, great for testing and creative exploration  |

***

#### **ByteDance Models**

| Model            | Quality  | Credits / Image | Max Generations | Best For                                                                                             |
| ---------------- | -------- | --------------- | --------------- | ---------------------------------------------------------------------------------------------------- |
| **Seedream 4.5** | Standard | 13              | 6               | Creative and stylized images — ByteDance's flagship, exceptional for artistic and expressive outputs |

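Because **Max Generations** caps the images per run, larger batches need several runs. The sketch below computes the minimum number of runs for a target image count, using caps listed on this page. `runs_needed` is a hypothetical helper, and the sketch assumes every run is filled up to the cap.

```python
import math

# Max Generations per run for a few text-to-image models, from the tables on this page.
MAX_GENERATIONS = {"Seedream 4.5": 6, "Flux.1 Dev": 4, "GPT Image 2": 8}


def runs_needed(model: str, total_images: int) -> int:
    """Minimum number of runs to produce `total_images`, given the per-run cap."""
    return math.ceil(total_images / MAX_GENERATIONS[model])


for model in MAX_GENERATIONS:
    print(model, runs_needed(model, 20))
# Seedream 4.5 4
# Flux.1 Dev 5
# GPT Image 2 3
```
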
***

#### **Recraft Models**

| Model          | Quality  | Credits / Image | Max Generations | Best For                                                                                              |
| -------------- | -------- | --------------- | --------------- | ----------------------------------------------------------------------------------------------------- |
| **Recraft V4** | Standard | 13              | 4               | Design assets and icons — clean graphic design output, brand-consistent visuals, vector-style imagery |

***

### Image-to-Image Models

Image-to-image models accept one or more reference images to guide generation. Everything else (prompt, keywords, output controls) works the same as in text-to-image. The reference images influence the composition, style, subject, or structure of the output.

#### **Qolaba Flagship Models**

| Model                 | Quality | Credits / Image | Max Generations | Max Reference Images | Best For                                                                               |
| --------------------- | ------- | --------------- | --------------- | -------------------- | -------------------------------------------------------------------------------------- |
| **Nano Banana 2**     | 0.5K    | 15              | 4               | 13                   | General-purpose reference-guided generation — versatile, consistent quality            |
|                       | 1K      | 21              |                 |                      |                                                                                        |
|                       | 2K      | 32              |                 |                      |                                                                                        |
|                       | 4K      | 48              |                 |                      |                                                                                        |
| **Nano Banana Pro** ⭐ | 1K / 2K | 42              | 4               | 13                   | Premium reference-guided generation — highest fidelity for brand and production assets |
|                       | 4K      | 75              |                 |                      |                                                                                        |

***

#### **OpenAI Models**

| Model             | Quality | Credits / Image | Max Generations | Max Reference Images | Best For                                                                              |
| ----------------- | ------- | --------------- | --------------- | -------------------- | ------------------------------------------------------------------------------------- |
| **GPT Image 2** ⭐ | Low     | 4               | 8               | 15                   | Photorealistic transformations, text-in-image, strong prompt-guided reference editing |
|                   | Medium  | 19              |                 |                      |                                                                                       |
|                   | High    | 69              |                 |                      |                                                                                       |

***

#### **Flux Models**

| Model            | Quality  | Credits / Image | Max Generations | Max Reference Images | Best For                                                                         |
| ---------------- | -------- | --------------- | --------------- | -------------------- | -------------------------------------------------------------------------------- |
| **Flux.2 Pro** ⭐ | Standard | 19              | 4               | 8                    | Creative, high-fidelity style and content transformation with reference guidance |

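The reference-image limits differ by model, so the number of references you plan to upload narrows the choice. The sketch below filters the image-to-image models on this page by that limit. `MAX_REFERENCE_IMAGES` and `models_accepting` are illustrative names, not Qolaba identifiers.

```python
# Max Reference Images per model, from the image-to-image tables above.
MAX_REFERENCE_IMAGES = {
    "Nano Banana 2": 13,
    "Nano Banana Pro": 13,
    "GPT Image 2": 15,
    "Flux.2 Pro": 8,
}


def models_accepting(reference_count: int) -> list[str]:
    """Models whose reference-image limit is at least `reference_count`."""
    return [m for m, limit in MAX_REFERENCE_IMAGES.items() if reference_count <= limit]


print(models_accepting(10))  # ['Nano Banana 2', 'Nano Banana Pro', 'GPT Image 2']
print(models_accepting(14))  # ['GPT Image 2']
```
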
***

### Image Editing Models

Image editing models are purpose-built tools used within the four editing workflows — Inpainting, Background Removal, Upscaling, and Image Variation. They run automatically based on the tool selected and are not manually chosen in the model selector.

| Model                       | Used For             | What It Does                                                                                        |
| --------------------------- | -------------------- | --------------------------------------------------------------------------------------------------- |
| **BiRefNet V2**             | Background Removal   | One-click background removal — accurately detects and isolates the subject from any background      |
| **Flux General Inpainting** | Inpainting & Cleanup | Mask-based editing — removes or replaces specific areas of an image based on a text prompt          |
| **Nano Banana 2**           | Image Variation      | Generates creative variations inspired by uploaded reference images — up to 13 references supported |

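The tool-to-model mapping above can be expressed as a simple lookup. The sketch below is illustrative only, since the platform makes this selection automatically. The table lists no model for Upscaling, so the lookup returns `None` for it rather than guessing; `model_for_tool` is a hypothetical helper.

```python
from typing import Optional

# Editing tool -> model, from the table above. The table lists no model for
# Upscaling, so it is deliberately absent here rather than guessed.
EDITING_MODELS = {
    "Background Removal": "BiRefNet V2",
    "Inpainting": "Flux General Inpainting",
    "Image Variation": "Nano Banana 2",
}


def model_for_tool(tool: str) -> Optional[str]:
    """Return the model this page documents for an editing tool, or None if undocumented."""
    return EDITING_MODELS.get(tool)


print(model_for_tool("Inpainting"))  # Flux General Inpainting
print(model_for_tool("Upscaling"))   # None
```
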
***

### Model Comparison at a Glance

| Use Case                           | Recommended Model                  | Reason                                                                           |
| ---------------------------------- | ---------------------------------- | -------------------------------------------------------------------------------- |
| **General-purpose generation**     | Nano Banana 2                      | Versatile, strong quality, lowest cost entry point for flagship output           |
| **Premium production output**      | Nano Banana Pro ⭐                  | Highest fidelity for demanding, client-facing creative work                      |
| **Photorealism and text-in-image** | GPT Image 2 ⭐                      | OpenAI's latest — best-in-class for photorealistic scenes and legible text       |
| **Creative and artistic output**   | Seedream 4.5                       | Exceptional for stylized, expressive, and artistic image generation              |
| **Design assets and icons**        | Recraft V4                         | Clean graphic design output with strong brand consistency                        |
| **Fast iteration and drafts**      | ImageGen 4 Fast or Flux.1 Dev      | Low cost, fast generation — ideal for prompt testing and concept exploration     |
| **Maximum Google quality**         | ImageGen 4 Ultra ⭐                 | Google's highest quality tier for production-ready output                        |
| **Reference-guided generation**    | Nano Banana 2 or Nano Banana Pro ⭐ | Up to 13 reference images — best for brand consistency and style matching        |
| **Most reference images**          | GPT Image 2 ⭐                      | Supports up to 15 reference images per generation                                |
| **Background removal**             | BiRefNet V2 (auto)                 | Best-in-class subject isolation — runs automatically via Background Removal tool |
| **Inpainting and cleanup**         | Flux General Inpainting (auto)     | Precise mask-based editing — runs automatically via Inpainting tool              |

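The "fast iteration and drafts" recommendation can be checked against the credit costs. The sketch below ranks the text-to-image models by their lowest listed cost, filtered by plan. It assumes paid plans can also use the unstarred models, which this page implies but does not state; `cheapest` is a hypothetical helper.

```python
# (model, lowest credits per image, paid-only?) for text-to-image, from this page.
TEXT_TO_IMAGE = [
    ("Nano Banana 2", 15, False),
    ("Nano Banana Pro", 42, True),
    ("GPT Image 2", 4, True),
    ("ImageGen 4", 11, False),
    ("ImageGen 4 Fast", 6, False),
    ("ImageGen 4 Ultra", 16, True),
    ("Flux.2 Pro", 19, True),
    ("Flux.1 Dev", 8, False),
    ("Seedream 4.5", 13, False),
    ("Recraft V4", 13, False),
]


def cheapest(paid_plan: bool, count: int = 2) -> list[str]:
    """Names of the `count` cheapest models available on the given plan."""
    available = [(cost, name) for name, cost, paid_only in TEXT_TO_IMAGE
                 if paid_plan or not paid_only]
    return [name for cost, name in sorted(available)[:count]]


print(cheapest(paid_plan=False))  # ['ImageGen 4 Fast', 'Flux.1 Dev']
print(cheapest(paid_plan=True))   # ['GPT Image 2', 'ImageGen 4 Fast']
```

On these numbers, the two cheapest models without a paid plan are the same two the comparison table recommends for drafts.
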

