Close
← Back to Glossary

Text-to-Speech

Text-to-speech (TTS) is a type of speech synthesis that converts text into spoken words.

What is text-to-speech?

Text-to-speech (TTS) is a type of speech synthesis that converts text into spoken words.

What is text-to-speech?

What is text-to-speech used for?

3 of the most common uses for text-to-speech are:

  1. Text-to-speech readers, which turn written digital content, such as web pages and documents, into speech for people who are blind or have low vision.
  2. Text-to-speech narration, which enables users to consume written media, like ebooks and articles, in the form of audio.
  3. Text-to-speech voiceovers, which convert a written script into a video voiceover without the need for microphones.
Text-to-speech function in an AI video maker
Text-to-speech functionality in Synthesia STUDIO

<highlight-start>

Interested in learning how text-to-speech can be used in videos? 💡

Click here to find out. 👇

{{related-post}}

<highlight-end>

<highlight-start>

Or just jump straight into making videos:

{{related-tool}}

<highlight-end>

Related terms

AI Voice

An AI voice (also known as text-to-speech) is a synthetic voice generated by artificial intelligence.

Text-to-Video

Text-to-video is the process of converting information from a text format into a video format, usually done with AI.

Text-to-Speech Avatar

A text-to-speech avatar is a digitally-created, human-like avatar that produces speech using text-to-speech technology.