Documentation
  • How Our System works
  • Text to Image
    • Dalle-3
    • SDXL Models
  • Image Editing
    • Replace Background
    • Inpainting and CleanUp
    • Upscaling
  • Image to Image
    • SDXL Models
    • Face Consistent
    • ControlNet
    • Illusion Diffusion
    • Image Variation
  • ChatBot
  • Knowledge Bases in Chatbot
  • Text To Speech
  • History Section
  • Qreative wall
  • How to Cancel Your Subscription
  • Extra Resources
    • Prompt Advancement
    • Private Session
    • Image Editing Options
    • Resolution
    • Generation Settings
  • API Platform
    • Text to Image
    • Image to Image
    • ControlNet
    • Inpainting
    • Replace Background
    • Face Consistency
    • Face Avatar
    • Image Variation
    • Illusion Diffusion
    • Upscaling
    • Background Removal
    • Text to Speech
    • Task Status
    • ChatBot API
Powered by GitBook
On this page
  • Accessing the Text-to-Speech Tool
  • Entering the Text and Selecting a Voice
  • Generating the Speech Audio
  • Adjusting the Speech Parameters
  • Saving and Sharing the Audio

Was this helpful?

Text To Speech

PreviousKnowledge Bases in ChatbotNextHistory Section

Last updated 1 year ago

Was this helpful?

Qolaba's platform offers a powerful Text-to-Speech (TTS) feature, allowing you to convert written text into high-quality audio.

Accessing the Text-to-Speech Tool

To use the Text-to-Speech feature, look for the corresponding option on the left panel of the Qolaba dashboard. Clicking on it will take you to the dedicated TTS dashboard.

Entering the Text and Selecting a Voice

On the left side of the TTS dashboard, you'll find a text box where you can enter the content you want to convert to speech. Above the text box, you'll see an option to select the desired voice for the audio output.

Generating the Speech Audio

Once you've entered the text and selected the voice, click the "Generate" button at the bottom of the left panel. Qolaba's TTS model will then convert your written content into an audio file.

Adjusting the Speech Parameters

On the right side of the TTS dashboard, you'll find four options to fine-tune the generated speech:

  1. Stability: Adjusting the Stability parameter can make the output more or less stable. Higher values result in more consistent speech, while lower values can lead to more variable output.

  1. Clarity & Similarity Enhancement: Increasing this parameter will make the generated speech clearer and more similar to the selected voice.

  1. Style Exaggeration: Higher values can exaggerate the style of the speech, making it more closely match the chosen voice. However, this may also lead to increased instability.

  1. Speaker Boost: This option boosts the similarity of the synthesized speech to the selected voice, but it may slightly reduce the generation speed.

Experiment with these parameters to achieve the desired quality and characteristics for your text-to-speech output.

Saving and Sharing the Audio

Once you're satisfied with the generated speech, you can save the audio file or share it directly with others using the available options.

20KB
Hey, I am Rachel (8).mp3
Stability value 1
18KB
Hey, I am Rachel (7).mp3
Stability value 0
23KB
sb 0.mp3
Clarity & Similarity Enhancement value 0
20KB
sb 1.mp3
Clarity & Similarity Enhancement value 1
22KB
SE 0.mp3
Style Exaggeration value 0
20KB
SE 1.mp3
Style Exaggeration value 1
21KB
hey, I am rachel.mp3
With Speaker Boost for Rachel Voice. (The original voice could
26KB
hey, I am rachel (1).mp3
Without Speaker Boost for Rachel Voice
14KB
df6788f9-5c96-470d-8312-aab3b3d8f50a.mp3
Original Rachel voice