# Model Settings

Model Settings let you control how a model thinks and responds. Two settings are available — **Thinking Depth** and **Temperature** — each affecting a different dimension of the output.

***

#### 1. Thinking Depth

Thinking Depth controls how much internal reasoning the model applies before generating a response. When enabled, the model works through a series of reasoning steps — called **thinking tokens** — before producing its final answer.

**What Are Thinking Tokens?**

Thinking tokens are the model's internal reasoning steps — the process it goes through to interpret your prompt, evaluate different approaches, and arrive at a well-considered response before replying. They are visible in the response so you can follow how the model reasoned through your request.

Thinking tokens are counted as **output tokens** and consume credits accordingly. The deeper the thinking level, the more reasoning steps the model takes, and the more credits are used.

Thinking tokens are most valuable for:

* Complex research and analysis
* Multi-step problem solving
* Strategy planning and decision-making
* Advanced coding and debugging
* Tasks where understanding *how* the model reasoned matters as much as the answer itself

**Thinking Depth Levels**

| Level          | Reasoning Effort           | Credit Usage | Best For                                                               |
| -------------- | -------------------------- | ------------ | ---------------------------------------------------------------------- |
| **None**       | No thinking tokens         | Lowest       | Simple Q\&A, formatting, short rewrites                                |
| **Low**        | Minimal reasoning          | Low          | Basic content writing, casual prompts                                  |
| **Medium**     | Balanced reasoning         | Moderate     | Blog writing, coding assistance, structured business tasks             |
| **High**       | Deep, multi-step reasoning | High         | Complex coding, strategy planning, analytical writing                  |
| **Extra High** | Maximum reasoning effort   | Highest      | Advanced research, long-form reasoning chains, complex problem solving |

***

#### 2. Temperature

Temperature controls how creative or predictable the model's responses are — specifically, the randomness applied when the model selects words and constructs its response. In Qolaba, Temperature is set on a scale of **0 to 100**.

* **0** — Fully deterministic. The model picks the most probable word at every step. Responses are consistent, precise, and repeatable.
* **100** — Maximum randomness. The model explores less probable word choices, producing more varied, creative, and sometimes unexpected outputs.

<table><thead><tr><th width="105.53436279296875">Range</th><th width="283.85626220703125">Output Style</th><th>Best For</th></tr></thead><tbody><tr><td><strong>0 – 30</strong></td><td>Focused, deterministic, factual</td><td>Coding, legal drafts, data analysis, structured outputs (JSON, tables)</td></tr><tr><td><strong>40 – 60</strong></td><td>Balanced, natural, controlled</td><td>Blog posts, marketing copy, email drafts, general writing</td></tr><tr><td><strong>70 – 100</strong></td><td>Creative, varied, less predictable</td><td>Storytelling, brainstorming, ad copy, brand naming, ideation</td></tr></tbody></table>

***

#### Recommended Combinations

<table><thead><tr><th width="136.23748779296875">Thinking Depth</th><th width="132.08123779296875">Temperature</th><th>Output Type</th></tr></thead><tbody><tr><td><strong>High</strong></td><td><strong>0 – 30</strong></td><td>Precise and structured — technical reports, competitive analysis, complex coding</td></tr><tr><td><strong>Medium</strong></td><td><strong>40 – 60</strong></td><td>Balanced and reliable — business writing, content drafts, professional communication</td></tr><tr><td><strong>Low</strong></td><td><strong>70 – 100</strong></td><td>Fast and creative — brainstorming, ideation, headline generation, ad variations</td></tr></tbody></table>

**Examples:**

* Writing a detailed strategy document → **High Thinking Depth + Temperature 10–20**
* Generating 10 creative campaign name ideas → **Low Thinking Depth + Temperature 80–90**


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.qolaba.ai/chatbot/model-selection/model-settings.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
