GPT Audio
Access GPT Audio through one API key. Audio-capable multimodal model.
OpenAI's audio-capable model. Processes audio input and generates text or audio output for multimodal applications.
Context Window
128K tokens
Max Output
16K tokens
Input Price
$2.50 / 1M tokens
Output Price
$10.00 / 1M tokens
Try it live
Send a message and see GPT Audio respond in real time.
Maximum tokens in the response.
Real-time token streaming
Strengths
Quick start
Copy this snippet and start making calls with GPT Audio.
const res = await fetch('https://api.yepapi.com/v1/ai/openai/gpt-audio', {
method: 'POST',
headers: {
'x-api-key': 'YOUR_API_KEY',
'Content-Type': 'application/json',
},
body: JSON.stringify({
"messages": [
{
"role": "user",
"content": "Explain API gateways in 2 sentences."
}
],
"maxTokens": 256
}),
});
const data = await res.json();
console.log(data);Why use GPT Audio through YepAPI?
Frequently asked questions
OpenAI's audio-capable model. Processes audio input and generates text or audio output for multimodal applications.
Input tokens cost $2.50 per 1M tokens and output tokens cost $10.00 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.
Sign up for a free API key, then send requests to the /v1/ai/openai/gpt-audio endpoint. YepAPI is OpenAI SDK compatible, so you can use it with any tool that supports OpenAI — just change the base URL.
GPT Audio supports a 128K token context window with up to 16K output tokens per request.
Ready to use GPT Audio?
Get your API key and start making calls in 30 seconds. No credit card required.
Explore more models
GPT-4o Mini
OpenAIAccess GPT-4o Mini through one API key. Fast, cheap, and OpenAI-compatible.
GPT-4o
OpenAIAccess GPT-4o through one API key. Flagship reasoning and multimodal capabilities.
GPT-5.4
OpenAIAccess GPT-5.4 through one API key. OpenAI's latest and most capable model.