Gemma 4 31B
Access Gemma 4 31B through one API key. Google's open-weight model at ultra-low cost.
Google's open-weight 31B model. Strong performance at low cost, with up to 131K output tokens per request.
Context Window
262K tokens
Max Output
131K tokens
Input Price
$0.14 / 1M tokens
Output Price
$0.40 / 1M tokens
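To make the per-token prices concrete, here is a minimal sketch of a cost estimate for one request. The function name `estimateCostUsd` is hypothetical and not part of any YepAPI SDK; only the two prices come from the listing above.

```javascript
// Prices copied from the listing above, in US dollars per 1M tokens.
const INPUT_PRICE_PER_MILLION = 0.14;
const OUTPUT_PRICE_PER_MILLION = 0.40;

// Hypothetical helper: estimate the cost of one request in US dollars.
function estimateCostUsd(inputTokens, outputTokens) {
  const inputCost = (inputTokens / 1_000_000) * INPUT_PRICE_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_PRICE_PER_MILLION;
  return inputCost + outputCost;
}

// Example: a 10,000-token prompt with a 2,000-token reply.
console.log(estimateCostUsd(10_000, 2_000).toFixed(4)); // "0.0022"
```

At these rates, a request that used a full 1M input tokens and 1M output tokens would come to about $0.54.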
Strengths
Quick start
Copy this snippet and start making calls with Gemma 4 31B.
const res = await fetch('https://api.yepapi.com/v1/ai/google/gemma-4-31b-it', {
  method: 'POST',
  headers: {
    'x-api-key': 'YOUR_API_KEY',
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    "messages": [
      {
        "role": "user",
        "content": "Explain API gateways in 2 sentences."
      }
    ],
    "maxTokens": 256
  }),
});
const data = await res.json();
console.log(data);
Why use Gemma 4 31B through YepAPI?
Frequently asked questions
Input tokens cost $0.14 per 1M tokens and output tokens cost $0.40 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.
Sign up for a free API key, then send requests to the /v1/ai/google/gemma-4-31b-it endpoint. YepAPI is compatible with the OpenAI SDK, so it works with any tool that supports OpenAI once you change the base URL.
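As a rough sketch of what sending requests to that endpoint can look like in a reusable form, the helpers below wrap the quick start call and add an HTTP status check. The names `buildGemmaRequest` and `askGemma` are hypothetical, and the endpoint, header names, and body fields are simply those shown in the quick start snippet. The OpenAI-compatible base URL is not stated on this page, so the sketch does not use the OpenAI SDK.

```javascript
// Endpoint and header names are taken from the quick start snippet.
const ENDPOINT = 'https://api.yepapi.com/v1/ai/google/gemma-4-31b-it';

// Hypothetical helper: build the fetch arguments for a single-turn prompt.
function buildGemmaRequest(apiKey, prompt, maxTokens = 256) {
  return {
    url: ENDPOINT,
    init: {
      method: 'POST',
      headers: {
        'x-api-key': apiKey,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        messages: [{ role: 'user', content: prompt }],
        maxTokens,
      }),
    },
  };
}

// Hypothetical helper: send the request and fail loudly on HTTP errors.
async function askGemma(apiKey, prompt, maxTokens = 256) {
  const { url, init } = buildGemmaRequest(apiKey, prompt, maxTokens);
  const res = await fetch(url, init);
  if (!res.ok) {
    throw new Error(`Request failed with HTTP ${res.status}`);
  }
  // The response shape is not documented here, so return the parsed JSON as-is.
  return res.json();
}
```

Keeping request construction separate from the network call makes it easy to inspect or log the payload before anything is sent.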
Gemma 4 31B supports a 262K token context window with up to 131K output tokens per request.
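One way to stay inside these limits is to cap `maxTokens` before sending a request. The sketch below is illustrative only: `clampMaxTokens` is a hypothetical helper, the rounded limits stand in for exact values this page does not give, and it assumes the prompt and the reply share the same context window.

```javascript
// Limits from the listing above. The exact token counts behind "262K" and
// "131K" are not stated, so these rounded values are assumptions.
const CONTEXT_WINDOW_TOKENS = 262_000;
const MAX_OUTPUT_TOKENS = 131_000;

// Hypothetical helper: cap a requested maxTokens so that prompt plus reply
// stays within the context window and the reply stays within the output limit.
function clampMaxTokens(promptTokens, requestedMaxTokens) {
  const roomLeft = CONTEXT_WINDOW_TOKENS - promptTokens;
  if (roomLeft <= 0) {
    throw new RangeError('Prompt fills or exceeds the assumed context window');
  }
  return Math.min(requestedMaxTokens, MAX_OUTPUT_TOKENS, roomLeft);
}

console.log(clampMaxTokens(200_000, 100_000)); // 62000
console.log(clampMaxTokens(1_000, 500_000));   // 131000
```

The prompt token count has to come from a tokenizer or an estimate of your own; this page does not describe one.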
Ready to use Gemma 4 31B?
Get your API key and start making calls in 30 seconds. No credit card required.
Explore more models
Gemini 2.5 Flash
Google
Access Gemini 2.5 Flash through one API key. Google's fastest model with 1M context.
Gemini 2.5 Pro
Google
Access Gemini 2.5 Pro through one API key. Google's most capable model with 1M context.
Gemini 3.1 Pro
Google
Access Gemini 3.1 Pro through one API key. Google's next-generation model.