Vozo Help Center

This feature is available to Studio and higher-tier members.

What Is a Variant

A variant is a new output generated from the same Talking Photo project using a different audio.
It allows you to produce multiple versions while keeping the same trained model.

Why Create a Variant Instead of a New Project

Creating a variant reuses the model that was already trained in your project.
Compared with starting a new project:

⚡ Much faster — usually completes within minutes instead of tens of minutes or hours.
🎯 More consistent — keeps the same facial expressions and movements for a stable visual quality.
💰 Cost-saving — only 5 points per minute are charged, with no additional 5 base points.

How to Create a Variant

Open Your Talking Photo Project

Go to your Talking Photo project and click Create with New Audio in the top-right corner.

Upload or Input New Audio

Upload a new audio file, or enter text to generate speech via Text to Speech.

Generate

Click Generate and wait for the process to complete.

View Variants

All variants are stored under the same project.
Click the Variant option in the top-right corner to open the dropdown and switch between variants, preview, or download.

Last modified on May 21, 2026

Get Started Get Started

⌘I

Getting Started

Translate & Dub

Translate Subtitle

Visual Translate

Lip Sync

Talking Photo

Voice Studio

Long to Shorts

Labs

Create Variant with New Audio

What Is a Variant

Why Create a Variant Instead of a New Project

How to Create a Variant

​What Is a Variant

​Why Create a Variant Instead of a New Project

​How to Create a Variant

What Is a Variant

Why Create a Variant Instead of a New Project

How to Create a Variant