# Text To Speech

Qolaba's platform offers a powerful Text-to-Speech (TTS) feature, allowing you to convert written text into high-quality audio.

### Accessing the Text-to-Speech Tool

To use the Text-to-Speech feature, click the corresponding option in the left panel of the Qolaba dashboard. This opens the dedicated TTS dashboard.

<figure><img src="https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2F2LvR6rJZu0OrnfeshG9P%2FScreenshot%202024-05-03%20183147.png?alt=media&#x26;token=bec1d8dd-8f0c-4881-b53a-b8989fdf4c4f" alt="" width="56"><figcaption></figcaption></figure>

### Entering the Text and Selecting a Voice

On the left side of the TTS dashboard, you'll find a text box where you can enter the content you want to convert to speech. Above the text box, you'll see an option to select the desired voice for the audio output.

<figure><img src="https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FpCcunRI98xEuZr7nir69%2FScreenshot%202024-03-27%20232129.png?alt=media&#x26;token=142d00b0-6648-4d24-81bc-92df0eabc26f" alt=""><figcaption></figcaption></figure>

<figure><img src="https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2F0BtOeezHTSXGKo3KMsVV%2FScreenshot%202024-03-27%20232158.png?alt=media&#x26;token=edff0cd2-54f5-43dc-ae7c-7723d7466a7b" alt=""><figcaption></figcaption></figure>

### Generating the Speech Audio

Once you've entered the text and selected the voice, click the "Generate" button at the bottom of the left panel. Qolaba's TTS model will then convert your written content into an audio file.

### Adjusting the Speech Parameters

On the right side of the TTS dashboard, you'll find four options to fine-tune the generated speech:

<figure><img src="https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FdJfoF8xMf665h7QTq6zW%2FScreenshot%202024-03-27%20232207.png?alt=media&#x26;token=50df45a2-9418-45e1-a9cc-6d82634935b3" alt="" width="186"><figcaption></figcaption></figure>

1. **Stability**: Controls how consistent the generated speech is. Higher values result in more consistent speech, while lower values can lead to more variable output.

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2F9GCbzTudOSXsrTKlcDnJ%2FHey%2C%20I%20am%20Rachel%20(8).mp3?alt=media&token=62c4a221-7b70-4420-b15c-df2284c23c74>" %}
Stability value 1
{% endfile %}

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FZ19iUl50tCL1jjhd3o2I%2FHey%2C%20I%20am%20Rachel%20(7).mp3?alt=media&token=e2a7a01d-dd6b-4e6b-92cb-3243643fd447>" %}
Stability value 0
{% endfile %}

2. **Clarity & Similarity Enhancement**: Increasing this parameter will make the generated speech clearer and more similar to the selected voice.

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FBih6EqJHT0kT2ycRGuLI%2Fsb%200.mp3?alt=media&token=4ed84ac0-8d1e-4943-86c4-f71f7885a380>" %}
Clarity & Similarity Enhancement value 0
{% endfile %}

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FIBemw94noGwbmLxfnO43%2Fsb%201.mp3?alt=media&token=1435ea4d-f78e-4a0e-8e54-2240bc4db45c>" %}
Clarity & Similarity Enhancement value 1
{% endfile %}

3. **Style Exaggeration**: Higher values exaggerate the style of the speech, making it more closely match the chosen voice. However, this may also lead to increased instability.

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FfjJxxjrUvaWGjZ5cIgS2%2FSE%200.mp3?alt=media&token=46dab71c-afc3-4f11-873b-1cc64f9337c0>" %}
Style Exaggeration value 0
{% endfile %}

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FiXr0kN2FTGA8sNcXrECN%2FSE%201.mp3?alt=media&token=dd8dfc6c-9ed8-45a3-8eaf-81a76cb3de8f>" %}
Style Exaggeration value 1
{% endfile %}

4. **Speaker Boost**: This option boosts the similarity of the synthesized speech to the selected voice, but it may slightly reduce the generation speed.

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2F65oBr5hLHAZLFGk2DHqs%2Fhey%2C%20I%20am%20rachel.mp3?alt=media&token=21e6cde1-9b07-44cd-89a8-617fe9f048f4>" %}
With Speaker Boost for Rachel Voice
{% endfile %}

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2FdLml51Nh4FBO3PfSZuM5%2Fhey%2C%20I%20am%20rachel%20(1).mp3?alt=media&token=d4d73f92-764f-4f64-812c-a4b2702931d4>" %}
Without Speaker Boost for Rachel Voice
{% endfile %}

Experiment with these parameters to achieve the desired quality and characteristics for your text-to-speech output.

{% file src="<https://2623503699-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FPYWrOrU7drWieMmcpAgk%2Fuploads%2Ffj9auVDpCnha8948eujh%2Fdf6788f9-5c96-470d-8312-aab3b3d8f50a.mp3?alt=media&token=564d1e6f-d809-49ef-bd29-578ef1a0228e>" %}
Original Rachel voice
{% endfile %}
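If you want to keep track of the combinations you try, it can help to write the four settings down in a structured form. The sketch below is only an illustration and not part of any Qolaba API: the `VoiceSettings` class, its field names, and its default values are all hypothetical. It assumes the three sliders run from 0 to 1 (matching the sample values above) and that Speaker Boost is an on/off toggle.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceSettings:
    """Hypothetical record of the four dashboard controls (not a Qolaba API)."""

    # Placeholder defaults chosen for illustration only.
    stability: float = 0.5           # higher = more consistent speech
    similarity: float = 0.5          # Clarity & Similarity Enhancement
    style_exaggeration: float = 0.0  # higher = stronger style, may be less stable
    speaker_boost: bool = False      # assumed to be an on/off toggle

    def __post_init__(self):
        # Assumption: each slider accepts values from 0 to 1 inclusive.
        for name in ("stability", "similarity", "style_exaggeration"):
            value = getattr(self, name)
            if not 0.0 <= value <= 1.0:
                raise ValueError(f"{name} must be between 0 and 1, got {value}")


# Two example combinations to compare by ear.
consistent = VoiceSettings(stability=1.0, similarity=1.0)
expressive = VoiceSettings(stability=0.0, style_exaggeration=1.0, speaker_boost=True)
```

Recording each combination alongside the audio it produced makes it easier to return to a setting that sounded right.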

### Saving and Sharing the Audio

Once you're satisfied with the generated speech, you can save the audio file or share it directly with others using the available options.
