Llama 4 Maverick
Access Llama 4 Maverick through one API key. Meta's most capable open-weight model.
Meta's larger open-weight model. More capable than Scout with a massive 1M context window.
Context Window
1.0M tokens
Max Output
16K tokens
Input Price
$0.15 / 1M tokens
Output Price
$0.60 / 1M tokens
Try it live
Send a message and see Llama 4 Maverick respond in real time.
Maximum tokens in the response.
Real-time token streaming
Strengths
Quick start
Copy this snippet and start making calls with Llama 4 Maverick.
const res = await fetch('https://api.yepapi.com/v1/ai/meta-llama/llama-4-maverick', {
method: 'POST',
headers: {
'x-api-key': 'YOUR_API_KEY',
'Content-Type': 'application/json',
},
body: JSON.stringify({
"messages": [
{
"role": "user",
"content": "Explain API gateways in 2 sentences."
}
],
"maxTokens": 256
}),
});
const data = await res.json();
console.log(data);Why use Llama 4 Maverick through YepAPI?
Frequently asked questions
Meta's larger open-weight model. More capable than Scout with a massive 1M context window.
Input tokens cost $0.15 per 1M tokens and output tokens cost $0.60 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.
Sign up for a free API key, then send requests to the /v1/ai/meta-llama/llama-4-maverick endpoint. YepAPI is OpenAI SDK compatible, so you can use it with any tool that supports OpenAI — just change the base URL.
Llama 4 Maverick supports a 1.0M token context window with up to 16K output tokens per request.
Ready to use Llama 4 Maverick?
Get your API key and start making calls in 30 seconds. No credit card required.
Explore more models
Llama 4 Scout
MetaAccess Llama 4 Scout through one API key. Meta's open-weight model with 512K context.
GPT-4o Mini
OpenAIAccess GPT-4o Mini through one API key. Fast, cheap, and OpenAI-compatible.
GPT-4o
OpenAIAccess GPT-4o through one API key. Flagship reasoning and multimodal capabilities.