Command Palette

Search for a command to run...

YepAPI
Google/v1/ai/google/gemma-4-31b-it

Gemma 4 31B

Access Gemma 4 31B through one API key. Google's open-weight model at ultra-low cost.

Google's open-weight 31B model. Strong performance at ultra-low cost with 131K output tokens.

Context Window

262K tokens

Max Output

131K tokens

Input Price

$0.14 / 1M tokens

Output Price

$0.40 / 1M tokens

Try it live

Send a message and see Gemma 4 31B respond in real time.

POST /v1/ai/google/gemma-4-31b-itLive testing
Loading...

Maximum tokens in the response.

Real-time token streaming

Hit "Run" to see the response

Strengths

Open-weight
Ultra low cost
131K output
262K context

Quick start

Copy this snippet and start making calls with Gemma 4 31B.

const res = await fetch('https://api.yepapi.com/v1/ai/google/gemma-4-31b-it', {
  method: 'POST',
  headers: {
    'x-api-key': 'YOUR_API_KEY',
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    "messages": [
      {
        "role": "user",
        "content": "Explain API gateways in 2 sentences."
      }
    ],
    "maxTokens": 256
  }),
});
const data = await res.json();
console.log(data);

Why use Gemma 4 31B through YepAPI?

One API key for all models — no separate accounts
OpenAI SDK compatible — just change the base URL
No monthly minimums — pay per token
Switch models with one line of code
Full provider passthrough — citations, search results, and all extras included
Streaming and non-streaming support on every model
Works with Cursor, Claude, LangChain, and any LLM tool
Unified billing across all providers

Frequently asked questions

Google's open-weight 31B model. Strong performance at ultra-low cost with 131K output tokens.

Input tokens cost $0.14 per 1M tokens and output tokens cost $0.40 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.

Sign up for a free API key, then send requests to the /v1/ai/google/gemma-4-31b-it endpoint. YepAPI is OpenAI SDK compatible, so you can use it with any tool that supports OpenAI — just change the base URL.

Gemma 4 31B supports a 262K token context window with up to 131K output tokens per request.

Ready to use Gemma 4 31B?

Get your API key and start making calls in 30 seconds. No credit card required.

Explore more models