Gemma 4 26B
Access Gemma 4 26B through one API key. Efficient MoE model at the lowest price.
Google's efficient open-weight MoE model. 26B total parameters with 4B active for fast inference.
Context Window
262K tokens
Max Output
262K tokens
Input Price
$0.13 / 1M tokens
Output Price
$0.40 / 1M tokens
Try it live
Send a message and see Gemma 4 26B respond in real time.
Maximum tokens in the response.
Real-time token streaming
Strengths
Quick start
Copy this snippet and start making calls with Gemma 4 26B.
const res = await fetch('https://api.yepapi.com/v1/ai/google/gemma-4-26b-a4b-it', {
method: 'POST',
headers: {
'x-api-key': 'YOUR_API_KEY',
'Content-Type': 'application/json',
},
body: JSON.stringify({
"messages": [
{
"role": "user",
"content": "Explain API gateways in 2 sentences."
}
],
"maxTokens": 256
}),
});
const data = await res.json();
console.log(data);Why use Gemma 4 26B through YepAPI?
Frequently asked questions
Google's efficient open-weight MoE model. 26B total parameters with 4B active for fast inference.
Input tokens cost $0.13 per 1M tokens and output tokens cost $0.40 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.
Sign up for a free API key, then send requests to the /v1/ai/google/gemma-4-26b-a4b-it endpoint. YepAPI is OpenAI SDK compatible, so you can use it with any tool that supports OpenAI — just change the base URL.
Gemma 4 26B supports a 262K token context window with up to 262K output tokens per request.
Ready to use Gemma 4 26B?
Get your API key and start making calls in 30 seconds. No credit card required.
Explore more models
Gemini 2.5 Flash
GoogleAccess Gemini 2.5 Flash through one API key. Google's fastest model with 1M context.
Gemini 2.5 Pro
GoogleAccess Gemini 2.5 Pro through one API key. Google's most capable model with 1M context.
Gemini 3.1 Pro
GoogleAccess Gemini 3.1 Pro through one API key. Google's next-generation model.