Single voice stream per WebSocket connection. Voice settings persist for the session. First message is an init message (sets voice, model, language). Subsequent messages are text-only.
Authentication Required: Include your API key in the Authorization header when establishing the WebSocket connection: Authorization: Bearer YOUR_API_KEY.