Close
← Back to Glossary

Text-to-Speech

Text-to-speech (TTS) is a type of speech synthesis that converts text into spoken words.

What is text-to-speech?

Text-to-speech (TTS) is a type of speech synthesis that converts text into spoken words.

What is text-to-speech?

What is text-to-speech used for?

3 of the most common uses for text-to-speech are:

  1. Text-to-speech readers, which turn written digital content, such as web pages and documents, into speech for people who are blind or have low vision.
  2. Text-to-speech narration, which enables users to consume written media, like ebooks and articles, in the form of audio.
  3. Text-to-speech voiceovers, which convert a written script into a video voiceover without the need for microphones.
Text-to-speech function in an AI video maker
Text-to-speech functionality in Synthesia STUDIO

What does TTS stand for?

TTS stands for Text-to-Speech, a type of technology that converts written text into spoken words.

<highlight-start>

Jump straight into making text-to-speech videos:

{{related-tool}}

<highlight-end>

Further reading

How to Make Text-to-Speech Videos in 5 Minutes

Learn how to easily create a professional-looking video with a text-to-speech voiceover, all in one browser window.

5 Best AI Voice Generators (Text-to-Speech): An In-Depth Review

Looking for the best AI voice generators? We've tested + compared the top 5 text-to-speech tools based on their price, languages, voices, UI, and more →

How to Convert Text to Video in 5 Simple Steps

How do you convert different types of text into engaging videos? Find the answer here.

Related terms

AI Voice

An AI voice (also known as text-to-speech) is a synthetic voice generated by artificial intelligence.

Text-to-Video

Text-to-video is the process of converting information from a text format into a video format, usually done with AI.

Text-to-Speech Avatar

A text-to-speech avatar is a digitally-created, human-like avatar that produces speech using text-to-speech technology.