> For the complete documentation index, see [llms.txt](https://docs.qolaba.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.qolaba.ai/speech-generation/voices-accents-and-style.md).

# Voices, Accents & Style

Before generating audio, configure how your output should sound — which voice delivers it, what accent and dialect it uses, and what tone and style it carries. These three settings work together to define the personality, clarity, and emotional quality of your generated audio.

***

#### Voice Library

Qolaba provides a library of 30+ voice profiles, each with distinct characteristics in tone, pitch, energy, and speaking style. Selecting the right voice is the single most impactful decision in your configuration — it defines how your audience experiences the content.

**Browsing and Previewing Voices**

Click any voice in the library to hear a preview before selecting it. This lets you evaluate tone and style before committing to a generation.

**Voice Categories**

Voices are organized by characteristic style to help you find the right fit quickly:

| Category        | Characteristic                     |
| --------------- | ---------------------------------- |
| **Bright**      | Clear, positive, and energetic     |
| **Upbeat**      | Enthusiastic and engaging          |
| **Informative** | Measured, authoritative, and clear |
| **Firm**        | Confident and direct               |
| **Excitable**   | High energy and expressive         |
| **Youthful**    | Fresh, casual, and approachable    |
| **Clear**       | Neutral and precise                |
| **Smooth**      | Warm and fluid delivery            |
| **Soft**        | Gentle and calm                    |
| **Gravelly**    | Deep and textured                  |

**Filtering Voices**

Use the search and filter options to narrow down the library by:

* **Gender** — male or female voices
* **Tone category** — filter by characteristic style (Bright, Smooth, Firm, etc.)

{% hint style="info" %}
Match the voice category to the content type — an **Informative** voice works well for product walkthroughs and instructional content, while an **Upbeat** or **Excitable** voice suits marketing and promotional audio.
{% endhint %}

***

#### Language & Accent

1. **Multi-Language Support**

The output language of your generated audio is determined entirely by the language of your script. Write your script in any language — English, Hindi, French, Arabic, Mandarin, or any other supported language — and the audio will be generated in that language automatically. There is no separate language setting to configure.

This makes Speech Generation natively multilingual — switch languages simply by changing the language of your input text.

2. **Accent & Dialect Selection**

Many languages have multiple regional dialects with distinct pronunciation patterns. Accent selection lets you specify which regional variant the voice should follow — improving clarity, naturalness, and audience relatability for region-specific content.

**Examples of available accents:**

<table><thead><tr><th width="289.35546875">Language</th><th>Available Dialects</th></tr></thead><tbody><tr><td><strong>English</strong></td><td>United States, United Kingdom, India, Australia</td></tr><tr><td><strong>French</strong></td><td>France, Canada</td></tr><tr><td><strong>Arabic</strong></td><td>Egypt, Global</td></tr><tr><td><strong>Mandarin</strong></td><td>China, Taiwan</td></tr><tr><td><strong>Hindi</strong></td><td>India</td></tr><tr><td><strong>Spanish</strong></td><td>Spain, Latin America</td></tr></tbody></table>

{% hint style="info" %}
If your script is in a language with multiple regional dialects and your audience is in a specific region, selecting the matching accent improves pronunciation accuracy and makes the audio feel more natural to that audience.
{% endhint %}

***

#### Style Instructions

Style instructions let you guide the emotional tone and delivery manner of the generated audio — going beyond voice selection to define *how* the voice speaks, not just which voice speaks.

**How to Write Style Instructions**

Enter a brief, plain-language description of the desired delivery in the **Style Prompt** field. The model interprets this and adjusts its delivery accordingly.

**Examples:**

| Intent                 | Style Instruction                                                |
| ---------------------- | ---------------------------------------------------------------- |
| Warm and personable    | *"Speak warmly and conversationally, like talking to a friend"*  |
| Professional narration | *"Clear, professional, and authoritative tone"*                  |
| Energetic marketing    | *"Enthusiastic and high-energy delivery"*                        |
| Calm instructional     | *"Calm, slow-paced, and easy to follow"*                         |
| Storytelling           | *"Engaging narrative style, with natural pauses and expression"* |

**Combining Voice and Style**

Voice selection and style instructions work best in combination. A **Smooth** voice with a *"warm and conversational"* style instruction produces a noticeably different output than the same voice with a *"professional and authoritative"* instruction.

{% hint style="info" %}
If your first generation doesn't match the intended tone, refine the style instruction before switching voices. Often a more specific style prompt produces better results than changing the voice entirely.
{% endhint %}


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.qolaba.ai/speech-generation/voices-accents-and-style.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
