Llama 4 Scout
Access Llama 4 Scout through one API key. Meta's open-weight model with 512K context.
Meta's latest open-weight model. Strong performance on reasoning, coding, and instruction-following with competitive pricing.
Context Window
512K tokens
Max Output
8K tokens
Input Price
$0.15 / 1M tokens
Output Price
$0.60 / 1M tokens
Strengths
Quick start
Copy this snippet and start making calls with Llama 4 Scout.
const res = await fetch('https://api.yepapi.com/v1/ai/meta-llama/llama-4-scout', {
  method: 'POST',
  headers: {
    'x-api-key': 'YOUR_API_KEY',
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    messages: [
      {
        role: 'user',
        content: 'Explain API gateways in 2 sentences.',
      },
    ],
    maxTokens: 256,
  }),
});
const data = await res.json();
console.log(data);
Why use Llama 4 Scout through YepAPI?
Frequently asked questions
Meta's latest open-weight model. Strong performance on reasoning, coding, and instruction-following with competitive pricing.
Input tokens cost $0.15 per 1M tokens and output tokens cost $0.60 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.
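To make the pricing concrete, here is a minimal sketch that estimates the cost of a single request from the per-token rates quoted above. The function name and constants are hypothetical, and the token counts are assumed to come from your own usage data.

```javascript
// Rates taken from the pricing quoted on this page (USD per 1M tokens).
const INPUT_USD_PER_MILLION = 0.15;
const OUTPUT_USD_PER_MILLION = 0.60;

// Hypothetical helper: estimates the cost of one request in USD.
function estimateCostUsd(inputTokens, outputTokens) {
  const inputCost = (inputTokens / 1e6) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1e6) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: a 200,000-token prompt with a 4,000-token reply.
console.log(estimateCostUsd(200000, 4000).toFixed(4)); // prints "0.0324"
```

At these rates a large prompt dominates the bill only when it is far bigger than the reply, since output tokens cost four times as much as input tokens.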
Sign up for a free API key, then send requests to the /v1/ai/meta-llama/llama-4-scout endpoint. YepAPI is OpenAI SDK compatible, so you can use it with any tool that supports OpenAI — just change the base URL.
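As a rough sketch of what a reusable call to that endpoint might look like, the helper below wraps the quick start request and adds a basic status check. The URL, the x-api-key header, and the messages and maxTokens fields come from the quick start snippet. The function name askScout and the fetchImpl option are hypothetical, and the response and error formats are not described on this page, so they are passed through untouched.

```javascript
// Endpoint as shown in the quick start snippet.
const ENDPOINT = 'https://api.yepapi.com/v1/ai/meta-llama/llama-4-scout';

// Hypothetical helper. fetchImpl is an illustrative option that lets you
// swap in a different fetch implementation (for example, in tests).
async function askScout(prompt, { apiKey, maxTokens = 256, fetchImpl = fetch } = {}) {
  if (!apiKey) throw new Error('apiKey is required');

  const res = await fetchImpl(ENDPOINT, {
    method: 'POST',
    headers: {
      'x-api-key': apiKey,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      messages: [{ role: 'user', content: prompt }],
      maxTokens,
    }),
  });

  if (!res.ok) {
    // The error body format is an unknown here, so surface it as raw text.
    const detail = await res.text();
    throw new Error(`Request failed with status ${res.status}: ${detail}`);
  }

  // The response shape is not specified on this page; return it as parsed JSON.
  return res.json();
}
```

Usage would look like `const data = await askScout('Explain API gateways in 2 sentences.', { apiKey: 'YOUR_API_KEY' });`, assuming a runtime with a global fetch such as Node 18+ or a modern browser.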
Llama 4 Scout supports a 512K token context window with up to 8K output tokens per request.
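A quick pre-flight check can catch oversized requests before they are sent. The sketch below rests on two assumptions that this page does not confirm: that "512K" and "8K" mean 512,000 and 8,000 tokens (rather than 524,288 and 8,192), and that the requested output counts toward the context window. The function name is hypothetical, and counting the prompt tokens is left to whatever tokenizer you use.

```javascript
// Assumed numeric values for the limits quoted above.
const CONTEXT_WINDOW_TOKENS = 512000;
const MAX_OUTPUT_TOKENS = 8000;

// Hypothetical helper: reports whether a request fits the stated limits.
function checkRequestSize(promptTokens, maxTokens) {
  if (maxTokens > MAX_OUTPUT_TOKENS) {
    return { ok: false, reason: 'maxTokens exceeds the output limit' };
  }
  // Assumption: prompt and output share the same context window.
  if (promptTokens + maxTokens > CONTEXT_WINDOW_TOKENS) {
    return { ok: false, reason: 'prompt plus output exceeds the context window' };
  }
  return { ok: true, reason: null };
}

console.log(checkRequestSize(100000, 256).ok);  // prints true
console.log(checkRequestSize(100000, 9000).ok); // prints false
```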
Ready to use Llama 4 Scout?
Get your API key and start making calls in 30 seconds. No credit card required.
Explore more models
Llama 4 Maverick
Meta
Access Llama 4 Maverick through one API key. Meta's most capable open-weight model.
GPT-4o Mini
OpenAI
Access GPT-4o Mini through one API key. Fast, cheap, and OpenAI-compatible.
GPT-4o
OpenAI
Access GPT-4o through one API key. Flagship reasoning and multimodal capabilities.