Prerequisites: API key, audio sample (MP3, WAV, or PCM format)
Clone the Voice
Upload your audio file directly using multipart/form-data. See the Clone Voice endpoint for details.
Check Voice Status
Check the voice status using the Status values:
voice_id from the clone response. See the Get Voice endpoint for details.PENDING- Voice cloning has startedPROCESSING- Voice is being processedAVAILABLE- Voice is ready to useFAILED- Voice cloning failed
List Voices
List all voices owned by the authenticated user. See the List Voices endpoint for details.Request Parameters
file(required) - Audio file (MP3, WAV, or OGG format, max 7.5MB)name(optional) - Name for the voicevoice_visibility(optional) -"PUBLIC"or"PRIVATE"(default:"PUBLIC")language(optional) - Language code in ISO 639-1 format (default:"en"). Supported languages:en- Englishes- Spanishfr- Frenchde- Germanit- Italianpt- Portuguesepl- Polishru- Russiannl- Dutchsv- Swedishca- Catalan
The endpoint accepts
multipart/form-data. Upload the audio file directly - no base64 encoding needed.Update Voice
Update voice metadata (name and/or visibility). See the Update Voice endpoint for details.Delete Voice
Delete a voice (owner-only). See the Delete Voice endpoint for details.Audio Requirements
- Formats: MP3, WAV, or PCM
- Max size: 7.5MB
- Quality: Clear, single speaker audio works best
- Duration: 10-60 seconds recommended