Overview
The KA Voicemail API detects voicemail greetings in real-time audio using Whisper.cpp transcription and phrase matching. Submit audio chunks during a live call — get back a detection decision in under 2 seconds.
Base URL
Authentication
All endpoints require authentication via one of the following methods.
Send as a header or Bearer token:
Standard HTTP Basic Authentication:
Upload an audio file. Returns real-time voicemail detection result with matched phrases and dead air tracking.
| Parameter | Type | Required | Description |
|---|---|---|---|
| audio | file | yes | WAV audio file to analyze |
| language | string | no | Language code (default: en) |
| decision_window_ms | int | no | Audio duration to analyze in ms (default: 2000) |
| session_id | string | no | Unique call identifier for dead air tracking |
| dead_air_threshold | int | no | Consecutive blank chunks before dead_air=true (default: 3) |
Request
Response
Send audio as a base64-encoded string in a JSON body. Also supports passing pre-transcribed text directly. Includes DNC (Do Not Call) phrase detection.
Request Body
Response
Transcribe audio to text without voicemail detection. Returns full transcript and segment-level detail.
Request
Response
Returns the API health status. Use this to verify the service is running.
Request
Response
Error Codes
| Code | Meaning | Description |
|---|---|---|
| 401 | Unauthorized | Missing or invalid API key / Basic Auth credentials. |
| 400 | Bad Request | Missing required fields (e.g. no audio file or audio_base64). |
| 500 | Internal Server Error | Transcription failed or whisper.cpp encountered an error. |