Grok 4.1 Fast

Access Grok 4.1 Fast through one API key. It is xAI's fast inference model with a 2M-token context window, priced at $0.20 per 1M input tokens and $0.50 per 1M output tokens.

Context Window: 2M tokens
Max Output: 30K tokens
Input Price: $0.20 / 1M tokens
Output Price: $0.50 / 1M tokens

Strengths

✓ Speed

✓ Large context

✓ Cost-effective

✓ Real-time

Quick start

Copy this snippet and start making calls with Grok 4.1 Fast.

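// Requires a runtime with fetch and top-level await (e.g. Node 18+ ES modules).
const res = await fetch('https://api.yepapi.com/v1/ai/x-ai/grok-4.1-fast', {
  method: 'POST',
  headers: {
    'x-api-key': 'YOUR_API_KEY', // replace with your YepAPI key
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    messages: [
      { role: 'user', content: 'Explain API gateways in 2 sentences.' },
    ],
    maxTokens: 256, // maximum tokens in the response
  }),
});
// Surface HTTP errors instead of treating an error body as a result.
if (!res.ok) {
  throw new Error(`Request failed: ${res.status} ${await res.text()}`);
}
const data = await res.json();
console.log(data);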

Why use Grok 4.1 Fast through YepAPI?

✓ One API key for all models, with no separate accounts

✓ OpenAI SDK compatible: just change the base URL

✓ No monthly minimums: pay per token

✓ Switch models with one line of code

✓ Full provider passthrough: citations, search results, and all extras included

✓ Streaming and non-streaming support on every model

✓ Works with Cursor, Claude, LangChain, and any LLM tool

✓ Unified billing across all providers
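As a rough illustration of switching models with one line, the sketch below wraps the quick-start request in a hypothetical helper, buildChatRequest, which is not part of any YepAPI SDK. It assumes other models are served under the same /v1/ai/<provider>/<model> path pattern as Grok 4.1 Fast, which the page does not state explicitly.

```javascript
// Hypothetical helper: builds the fetch arguments for a YepAPI chat call.
// Assumption: every model lives under /v1/ai/<provider>/<model>.
const BASE_URL = 'https://api.yepapi.com/v1/ai';

function buildChatRequest(modelPath, apiKey, prompt, maxTokens = 256) {
  return {
    url: `${BASE_URL}/${modelPath}`,
    init: {
      method: 'POST',
      headers: {
        'x-api-key': apiKey,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        messages: [{ role: 'user', content: prompt }],
        maxTokens,
      }),
    },
  };
}

// Switching models means changing only this string.
const MODEL = 'x-ai/grok-4.1-fast';
const request = buildChatRequest(
  MODEL,
  'YOUR_API_KEY',
  'Explain API gateways in 2 sentences.'
);
console.log(request.url);
```

The result can be passed straight to fetch(request.url, request.init), so the network call itself stays identical across models.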

Frequently asked questions

What is Grok 4.1 Fast?

Grok 4.1 Fast is xAI's fast inference model with a 2M-token context window. It is positioned as a budget-friendly option with strong performance.

How much does Grok 4.1 Fast cost?

Through YepAPI, input tokens cost $0.20 per 1M tokens and output tokens cost $0.50 per 1M tokens. There are no monthly minimums; you pay only for what you use.
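To make the per-token pricing concrete, here is a minimal cost estimate using the two listed rates. The function name estimateCostUSD is illustrative, and the token counts are made-up inputs. For a request with 100,000 input tokens and 2,000 output tokens, the arithmetic is 0.1 × $0.20 + 0.002 × $0.50 = $0.021.

```javascript
// Listed YepAPI rates for Grok 4.1 Fast, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.20;
const OUTPUT_USD_PER_MILLION = 0.50;

// Illustrative helper: estimated cost of one request in USD.
function estimateCostUSD(inputTokens, outputTokens) {
  return (
    (inputTokens / 1e6) * INPUT_USD_PER_MILLION +
    (outputTokens / 1e6) * OUTPUT_USD_PER_MILLION
  );
}

// Example: 100,000 input tokens and 2,000 output tokens.
const cost = estimateCostUSD(100000, 2000);
console.log(cost.toFixed(4)); // prints "0.0210"
```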

How do I get started with Grok 4.1 Fast?

Sign up for a free API key, then send POST requests to the /v1/ai/x-ai/grok-4.1-fast endpoint. YepAPI is OpenAI SDK compatible, so it works with any tool that supports OpenAI once you change the base URL.

What is the context window of Grok 4.1 Fast?

Grok 4.1 Fast supports a 2M-token context window with up to 30K output tokens per request.
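A client can guard against these limits before sending a request. The sketch below is one possible pre-flight check, not documented YepAPI behavior. It assumes "2M" means 2,000,000 tokens, "30K" means 30,000 tokens, and that prompt and output tokens share the same context window. The prompt token count is a caller-supplied estimate, and checkTokenBudget is a hypothetical name.

```javascript
// Assumption: "2M" and "30K" are round decimal figures.
const CONTEXT_WINDOW_TOKENS = 2000000;
const MAX_OUTPUT_TOKENS = 30000;

// Hypothetical pre-flight check: clamps maxTokens to the output limit and
// reports whether prompt plus output would fit in the context window
// (assuming both count against the same window).
function checkTokenBudget(promptTokens, requestedMaxTokens) {
  const maxTokens = Math.min(requestedMaxTokens, MAX_OUTPUT_TOKENS);
  const fits = promptTokens + maxTokens <= CONTEXT_WINDOW_TOKENS;
  return { maxTokens, fits };
}

// A 50,000-token request is clamped to the 30,000-token output cap.
console.log(checkTokenBudget(1000, 50000));
```

The clamped maxTokens value can then be used as the maxTokens field of the request body.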