How to Make Synthesia Videos: A Step-by-Step Guide

Written by
Kevin Alster
January 22, 2026

Create AI videos with 240+ avatars in 160+ languages.

Synthesia makes it easy to create professional, on-brand videos without cameras, studios, or production delays.

From training and onboarding videos to product demos, internal updates, and short marketing explainers, teams use Synthesia to communicate faster and at scale.

There are 2 main ways to make a Synthesia video:

Option 1: Start from a prompt, file, or webpage

This is the quickest way to create a Synthesia video.

Step 1: Go to Synthesia's AI video generator

Head to Synthesia's AI video generator.

Synthesia's AI video generator

Step 2: Input your prompt or upload a file, URL, or script

You can enter a simple prompt in the Idea tab to get started.

You can also select the File tab and upload PDFs, PowerPoint slides, Word documents, or text files.

The URL tab allows you to convert webpage content into AI video.

Or you can paste a video script into the Script tab.

When you're ready hit Generate.

Choose your input method

Step 3. Sign up to Synthesia for free

Sign up for a free Synthesia account.

Log in to Synthesia

Step 4: Outline your Synthesia video

You’ll now see an overview of your video’s scenes along with a draft script for each one.

From here, you can change templates, adjust settings such as video duration, objective, and language. add, remove, or edit scenes, or recreate the outline entirely.

When you’re ready, click Continue in editor.

Outlining your Synthesia video

Step 5: Edit your Synthesia video

Now it's time to edit your Synthesia video. You can review your scenes, refine the script, and assemble all multimedia elements into a complete video.

Editing your Synthesia video

Choose an AI avatar and voice

You can select from a wide range of AI avatars, AI voices, languages, and accents to match your audience and context.

Selecting an AI avatar

Add supporting visuals and B-roll

Next, add supporting visuals to reinforce each scene.

At this stage, you can include screen recordings, stock images, or short video clips to clarify key points. Choose the visual format based on the type of video you’re creating.

To keep the video visually engaging, add B-roll between scenes or behind an avatar or voiceover to illustrate real-world context and break up longer segments. You can generate B-roll with AI video models, upload your own footage, or select clips from Synthesia’s built-in stock library.

Generating B-roll

Add interactivity

Add interactive elements such as quizzes, branching scenarios, and clickable buttons to keep viewers engaged.

For example, short knowledge checks after each section or role-based branching options allow viewers to explore different scenarios.

Adding a knowledge check

Step 6: Generate your video

When you're ready, click Generate in the top-right corner to create your video.

Generate your Synthesia video

Before generating, run through this checklist. Five minutes of review saves regeneration time and credits.

  • Script review: One clear idea per scene? Strong hook in first 5 seconds? Explicit CTA near the end? Natural conversation flow?
  • Visual consistency: Consistent typography throughout? Sufficient contrast for readability? Cohesive color scheme? Aligned elements across scenes?
  • Timing check: Preview your video using the Play button. Do transitions feel natural? Does pacing match content complexity? Are animations synchronized with narration?
  • Accessibility: Clear fonts at readable sizes? High contrast between text and background? Captions enabled for hearing-impaired viewers?

Step 7: Publish and share your video

The final step is to publish and share your video.

You can download your Synthesia video as an MP4, get a shareable link, embed your video on a webpage, or download a SCORM version of your video and upload it to your LMS.

Option 2: Start from a template

Synthesia offers a wide variety of video templates which you can edit to fit your needs.

Synthesia video templates

Below are some of my favorite templates for a range of video types. You can click 'Edit video template' to start using them.

Training and education

AI training videos work especially well when they follow a familiar classroom-style format. An AI presenter can structure the lesson, explain concepts clearly, and highlight key points, making complex topics easier to follow and more approachable.

This template shows how you can quickly create polished instructional videos without needing a real instructor on camera.

Quizzes and interactive learning

For interactive learning, AI videos can guide learners through questions, choices, and feedback in a conversational way. An AI avatar adds pacing and presence, helping users feel guided through the experience rather than simply clicking through a form.

This template is ideal for creating engaging quizzes, knowledge checks, and branching scenarios.

Product demos and tours

In AI-powered product demos, an on-screen presenter can act as a guide who walks viewers through features and workflows.

The avatar provides context for what’s happening on screen and why it matters, helping you create clear, narrative-driven walkthroughs without recording yourself.

Sales and outreach

AI sales videos make outreach more personal by letting a presenter speak directly to the viewer.

Instead of static slides or long emails, you can use an AI avatar to introduce your offer, explain value, and set the right tone for first-touch or follow-up communication at scale.

Corporate and business presentations

For company updates and presentations, AI talking head videos help structure information and keep attention.

A virtual presenter can introduce topics, transition between sections, and emphasize key messages, making it easy to produce consistent, professional-looking internal or external communications.

Knowledge base and internal comms

AI videos are a great fit for knowledge base and internal communication content.

An avatar can quickly set context, explain processes, and guide viewers through documentation, turning static information into clear, easy-to-follow video explanations.

Recruitment and HR

In recruitment and HR, AI video templates let you introduce your company, roles, and processes in a more human way.

A virtual presenter helps candidates and employees understand who the message is from and what to expect, while keeping production fast and scalable for hiring, onboarding, and policy updates.

Planning a Synthesia video

I think it's a good idea to go through these two planning steps before getting started on your video.

Step 1: Plan your video strategy

Before touching Synthesia, spend 10-15 minutes on strategic planning. This upfront investment saves hours of rework later.

Define your video's purpose and audience

Are you creating compliance training for new hires? A product demo for prospects? An executive update for global teams? Each requires a different approach.

Training videos need clear learning objectives and knowledge checks. Product demos combine avatar narration with screen recordings. Internal communications prioritize consistency and quick updates over complex visuals.

Set success metrics upfront

Track completion rates (aim for 80%+), engagement points, or knowledge retention scores. For marketing videos, measure click-through rates and conversions. For training, track assessment scores before and after viewing.

Determine optimal length

Based on customer data: 45-90 seconds for explainers, 2-4 minutes for tutorials, 5-7 minutes for detailed training. Shorter videos get higher completion rates, but ensure you cover essential information.

Plan for localization needs

If you'll need multiple language versions, structure your content for easy video translation from the start. Use simple, clear language that translates well across cultures. Avoid idioms and culturally specific references.

Step 2: Write a strategic video script

Your script determines 80% of your video's success. I recommend following the FOCA framework: Focus (hook), Outcome (what they'll learn), Content (main message), Action (clear CTA).

📝 FOCA framework for better scripts
  • Focus – Start with a hook to grab attention
  • Outcome – Clearly state what the viewer will learn
  • Content – Deliver your main message concisely
  • Action – End with a clear call-to-action (CTA)

Tip: Aim for 2–4 short sentences per scene, and keep your tone conversational for the best results.

Structure for success

Aim for 2-4 short sentences per scene, with 12-23 scenes total for optimal pacing. Start with a strong hook in your first 5 seconds—pose a question, share a surprising statistic, or address a pain point directly.

Common script mistakes I see

Long lists in narration (use on-screen text for scannable information instead), technical jargon without context, and missing clear CTAs.

Many users find Synthesia's AI-generated scripts helpful as starting points, but always refine them for accuracy and brand tone, especially for technical or financial content.

I recommend writing your script in a conversational tone, as if explaining to a colleague. Read it aloud before importing—if it sounds stiff spoken, it will sound worse with an AI voice.

Scaling your video production

Once you've mastered single video creation, scale your production efficiently:

  • Create reusable templates: After perfecting a video format, save it as a template. Your team can then create consistent videos 10x faster. Set up Brand Kits first to ensure all videos maintain visual consistency.
  • Leverage bulk features: Use template variables for personalized video series. Need 50 onboarding videos with different names? Create one template with variables, then bulk generate. For enterprise needs, explore the API for programmatic video creation.
  • Establish collaboration workflows: Use Synthesia Spaces for team projects. Set up approval workflows so stakeholders review before final generation. Create different workspaces for different departments or video types.
  • Plan for localization: Structure content for easy translation from the start. Synthesia supports 140+ languages—take advantage of this for global reach. Create one master video, then generate versions in multiple languages efficiently.

Measuring success and iterating

Track these metrics to optimize future videos:

  • Completion rates: Aim for 80%+ for training videos, 60%+ for marketing content. If rates drop at specific points, that scene needs revision.
  • Engagement metrics: Monitor where viewers pause, replay, or drop off. Use this data to adjust pacing and content density.
  • Learning outcomes: For training videos, compare pre and post-assessment scores. Strong videos show measurable knowledge improvement.
  • Time to value: Track how quickly you can update videos versus traditional methods. Most users report 75% time savings—use this to justify expanded video programs.

One key advantage of Synthesia: easy content updates. When information changes, update the script and regenerate in 30 minutes rather than reshooting. This agility enables you to keep content current and relevant.

About the author

Strategic Advisor

Kevin Alster

Kevin Alster is a Strategic Advisor at Synthesia, where he helps global enterprises apply generative AI to improve learning, communication, and organizational performance. His work focuses on translating emerging technology into practical business solutions that scale.He brings over a decade of experience in education, learning design, and media innovation, having developed enterprise programs for organizations such as General Assembly, The School of The New York Times, and Sotheby’s Institute of Art. Kevin combines creative thinking with structured problem-solving to help companies build the capabilities they need to adapt and grow.

Go to author's profile
Get started

Make videos with AI avatars in 160+ languages

Get started

Create AI videos with 240+ avatars in 160+ languages.

Try out our AI Video Generator

Create a free AI video
faq

How do I use Synthesia to create a video from start to finish?

Creating a video in Synthesia follows a simple workflow that takes about 35 minutes from concept to shareable video. Start by defining your goal, audience, and call-to-action, then write a 45-60 second script using the FOCA framework (Focus, Outcome, Content, Action). Next, log into Synthesia and choose whether to start from a template, import a file, or create from scratch. Select an AI avatar that matches your content tone, paste your script scene by scene, and add visual elements like text, shapes, or screen recordings to support your message.

Once your content is ready, run a quick quality check for script flow, visual consistency, and timing before clicking 'Generate' in the top right corner. The platform will process your video in 3-10 minutes, after which you can download it as an MP4, share via direct link, or embed it on your website. This streamlined process eliminates the need for cameras, microphones, or video editing skills while delivering professional results that engage your audience.

Can I import a PowerPoint, PDF, or Word file to turn it into a Synthesia video?

Yes, Synthesia's AI video assistant can transform your existing PowerPoint presentations, PDFs, Word documents, and plain text files directly into engaging videos. Simply click 'New video' and select the file import option, then upload your document. The AI assistant automatically parses your content, converts text into natural-sounding narration, and generates on-brand scenes with appropriate pacing, transitions, and relevant visuals based on your content.

After the initial generation, you have full control to fine-tune every aspect of your video. Adjust scripts, timing, voices, avatars, and layout for each scene individually, then regenerate instantly to see your changes. This feature is particularly valuable for teams who already have training materials, presentations, or documentation they want to transform into more engaging video content without starting from scratch.

How do I choose the right AI avatar for my video, and can I create a custom avatar of myself?

Selecting the right avatar depends on your content type and audience expectations. For engaging presentations and marketing content, choose expressive AI avatars that use natural gestures and varied facial expressions to maintain viewer interest. For formal training or compliance videos where authority matters more than entertainment, professional avatars work best. You can also vary avatar framing between waist-up for introductions and chest-up for detailed explanations to add visual variety and emphasize different types of content.

If you need consistent brand representation or want executives to deliver messages personally, you can create custom avatars. Navigate to 'Avatars' then 'Create your own avatar' to choose between a 5-minute web avatar for quick needs or a studio-quality custom avatar for premium results. Custom avatars are particularly valuable for executive communications, brand consistency across video series, or when you need the same spokesperson to deliver regular updates without scheduling repeated recording sessions.

Does Synthesia support multiple languages, and how should I plan for localization?

Synthesia supports over 140 languages with AI voices that automatically match your script language, making it ideal for global teams and international audiences. When planning for localization, structure your content from the start with translation in mind by using simple, clear language that translates well across cultures and avoiding idioms or culturally specific references. The platform automatically detects your script language and suggests appropriate voices, though you should test 2-3 voice options to find the best match for your audience's preferences.

To create multilingual versions efficiently, develop one master video with your primary language, then duplicate it and replace the script with translations. This approach maintains consistent visuals and timing while allowing you to generate versions in multiple languages within minutes rather than hours. Many organizations use this capability to ensure training materials, product updates, and company communications reach their entire global workforce in their preferred language, significantly improving engagement and comprehension.

Can I try Synthesia for free before choosing a plan?

Yes, Synthesia offers a free AI video generator that lets you create and experience the platform before committing to a paid plan. You can access this by clicking 'Create free AI video' on the website, which allows you to test core features like avatar selection, script input, and basic video generation without providing credit card information. This gives you hands-on experience with the interface and helps you understand how Synthesia can meet your specific video creation needs.

The free trial is particularly useful for evaluating video quality, testing different avatars and voices, and understanding the workflow before making a purchase decision. You can create a complete video to share with stakeholders and get buy-in for larger video initiatives. Once you're ready to scale your video production with additional features like custom avatars, advanced templates, and team collaboration tools, you can explore the various pricing plans that match your organization's needs and video volume requirements.

VIDEO TEMPLATE