← Back to Glossary

Text-to-Speech

Text-to-speech (TTS) is a type of speech synthesis that converts text into spoken words.

What is text-to-speech?

Text-to-speech (TTS) is a type of speech synthesis that converts text into spoken words.

What is text-to-speech?

What is text-to-speech used for?

3 of the most common uses for text-to-speech are:

  1. Text-to-speech readers, which turn written digital content, such as web pages and documents, into speech for people who are blind or have low vision.
  2. Text-to-speech narration, which enables users to consume written media, like ebooks and articles, in the form of audio.
  3. Text-to-speech voiceovers, which convert a written script into a video voiceover without the need for microphones.
Text-to-speech function in an AI video maker
Text-to-speech functionality in Synthesia STUDIO

What does TTS stand for?

TTS stands for Text-to-Speech, a type of technology that converts written text into spoken words.

Further reading

How to Make Text-to-Speech Videos in 5 Minutes

Learn how to easily create a professional-looking video with a text-to-speech voiceover, all in one browser window.

6 Best AI Voice Generators (Text-to-Speech)

Looking for the best AI voice generators? We've tested + compared the top 6 text-to-speech tools based on their price, languages, voices, UI, and more →

How to Convert Article to Video in 6 Simple Steps

Converting articles into videos isn't as simple as copy-pasting your article into a video maker. Learn how to create videos better using AI.

Related terms

Text-to-Speech Avatar

A text-to-speech avatar is a digitally-created, human-like avatar that produces speech using text-to-speech technology.

AI-Generated Text

AI generated-text is a type of text that is produced by artificial intelligence.

AI Avatar

An AI avatar is a digital representation of a human in the online space. ‘AI’ indicates that the avatar is powered by artificial intelligence.