Blog
Synthesia
June 11, 2026

Introducing Next-Gen Pronunciation Controls in Synthesia

Product Manager at Synthesia

Create AI videos with 240+ avatars in 160+ languages

When you’re creating AI video at scale, the right pronunciation isn't a nice-to-have. The second you say something wrong, you break trust with your audience.

For teams using AI video for training, internal comms, marketing, or customer-facing content, pronunciation is both a quality problem and a brand problem. The words that tend to get mispronounced are often the most important ones like company names, industry terms and acronyms.

Today, we're releasing a rebuilt pronunciation system in Synthesia, designed to make getting words right effortless and permanent. We're the first AI video platform where teams can speak a word once and have it sound right, automatically, across every video they create.

Until now, getting pronunciation right in an AI video meant typing out a phonetic approximation of the word and hoping the output matched your intention. It was a guessing game. Teams spent too much time on single words. Some rewrote their scripts entirely to work around the problem while others gave up and published content with the mispronunciation intact.

What’s new
The rebuilt system simplifies pronunciation down to three things.

  • Speak it once. Get it right, every time. Instead of typing phonetic approximations, you can now simply record yourself saying the word. Synthesia handles the rest. 
  • A cleaner, simpler experience. Pronunciation settings are now directly accessible within the editor. The interface is designed to get out of your way so you can focus on getting the word right, not finding the tool.
  • Set it once, and you're done. Save a pronunciation to your Glossary and it applies automatically across every voice and every video your team creates from that point forward. One person sets it and everyone inherits it so you never have to start from scratch. Take a pharmaceutical company producing training videos for a new drug. Once the correct pronunciation is saved to the Glossary, every video that references that drug name will say it right, automatically, every time.

Available now
The new pronunciation controls are now available to all Synthesia customers. You'll find them directly within the video editor, just click any word in your script to access pronunciation settings, record your preferred version, and save it to your Glossary.

For a step-by-step guide to setting up your Glossary and using the speak-to-set feature, visit the Help Center.

Sundar Solai

Sundar Solai is a Product Manager at Synthesia, focused on AI video creation. With a background in computer science and statistics, he previously led YouTube’s autoplay algorithm and writes on AI topics, with work featured in TechCrunch.

Go to author's profile
Video template title
Video template
Create video from template