# Voice Cloning
Create custom AI voices trained on your own audio samples for a branded, personalized voice agent experience.
---
Create a custom AI voice that sounds like you (or anyone you have permission to clone). Voice cloning uses ElevenLabs Instant Voice Cloning to synthesize voices from short audio samples -- no training phase, ready in seconds.
> **Warning:** Voice cloning requires a Studio plan or higher.
## How It Works
1. Record or upload a short audio sample (1-2 minutes)
2. ElevenLabs analyzes vocal characteristics (pitch, timbre, prosody, accent)
3. A custom voice is generated instantly
4. Select it as your voice agent's voice
Cloned voices are shared across all apps in your organization.
## Creating a Custom Voice
**1.**
Open Voice Settings
Go to your app's **Build** page and open the **Voice** card. Voice mode must be enabled.
**2.**
Add Custom Voice
Find the **Custom Voices** section and click **Add Custom Voice**.
**3.**
Record or Upload Audio
Choose one method:
**Browser recording** (recommended):
- Click Record and speak naturally for 1-2 minutes
- Re-record if needed
**File upload**:
- Upload a pre-recorded audio file
- Supported formats: WAV, MP3, M4A, OGG, WebM, FLAC
- Maximum file size: 50MB
**4.**
Name and Create
Give your voice a memorable name (e.g., "CEO Rachel" or "Support Persona"). Click **Create Voice**. The voice is available immediately.
**5.**
Select the Voice
In the voice selection dropdown, your custom voices appear at the top. Select it to use with your voice agent.
## Recording Tips for Best Quality
The quality of your clone depends entirely on the quality of your audio sample.
### Duration
- **Optimal:** 1-2 minutes
- **Too short** (<30 seconds): May lack vocal variety
- **Too long** (>5 minutes): Can introduce instability
- The AI captures voice characteristics best from concise, focused samples
### Environment
- Record in a quiet room with soft furnishings (curtains, carpets reduce echo)
- Turn off fans, air conditioning, and notifications
- Close windows to block outside noise
- The AI replicates **everything** it hears -- background noise becomes part of the voice
### Microphone Technique
- Position the microphone about 20cm away (two fists distance)
- Speak slightly off-axis to reduce plosive sounds (hard P's and B's)
- Use a pop filter if available
- Avoid breathing directly into the mic
### Audio Quality
- Peak levels: -6dB to -3dB (loud parts don't clip)
- Avoid clipping/distortion at all costs -- the AI can't recover from it
- Standard sample rate (44.1kHz or 48kHz) works well
### Delivery
- Maintain consistent tone and energy throughout
- Don't switch between animated and subdued delivery
- Read with natural pacing -- not robotic, not theatrical
- Use your own writing or scripts for natural rhythm
## Managing Custom Voices
- **Delete a voice:** Remove it from your organization's voice library at any time. This also removes it from ElevenLabs.
- **Multiple voices:** Create as many custom voices as your plan allows
- **Cross-app usage:** All apps in your organization can use any custom voice
- **Audio privacy:** Your recording is stored encrypted and never shared publicly
## Troubleshooting
**Voice doesn't sound right?**
- Re-record with better audio quality (less background noise, no clipping)
- Try a longer sample (aim for 1-2 minutes of natural speech)
- Ensure consistent tone throughout the recording
**Voice not appearing in selection?**
- Check that the creation completed successfully
- Refresh the voice settings page
- Verify you're on a Studio plan or higher
**Upload failing?**
- Check file size (max 50MB)
- Verify the file format (WAV, MP3, M4A, OGG, WebM, FLAC)
- Try a different format or re-export the audio
> **Note:** For more about voice agent configuration, see the [Voice Agents guide](/docs/integrations/voice-agents).