Prerequisites: API key, audio sample (MP3, WAV, or PCM format)
1
Convert Audio to Base64
Encode your audio file to base64 format:
2
Clone the Voice
Send the base64-encoded audio to the API to create your custom voice:
3
Check Voice Status
Check the voice status using the Status values:
voice_id from the clone response:PENDING- Voice cloning has startedPROCESSING- Voice is being processedAVAILABLE- Voice is ready to useFAILED- Voice cloning failed
List Voices
List all voices owned by the authenticated user:Request Parameters
base64_audio(required) - Base64-encoded audio file (max 7.5MB)name(optional) - Name for the voicevoice_visibility(optional) -"PUBLIC"or"PRIVATE"(default:"PUBLIC")
Update Voice
Update voice metadata (name and/or visibility):Delete Voice
Delete a voice (owner-only):Audio Requirements
- Formats: MP3, WAV, or PCM
- Max size: 7.5MB
- Quality: Clear, single speaker audio works best
- Duration: 10-60 seconds recommended