Command Palette

Search for a command to run...

YepAPI
84 models

AI API

The cheapest LLM API and AI gateway for GPT-4o, Claude, Gemini, DeepSeek, Llama, and Qwen. One API key, OpenAI compatible API endpoint, pay-per-token pricing. No subscriptions.

No credit card required. Takes 30 seconds.

Astronaut hamster planting a flag on the moon

2,400+

Developers

1.2M+

API calls served

100+

Endpoints

$0.01

Per call

Yep, that's it.

Why use a unified AI API?

One API key, 60+ AI models

YepAPI is an AI models API for text, image, and video generation. Access GPT-5.4, Claude, Gemini, DeepSeek, Llama, Qwen — all through one LLM API key. No juggling providers.

OpenAI compatible API endpoint

Our OpenAI compatible API endpoint means zero code changes. Same request format, same response shape. Drop in YepAPI's base URL and switch models by changing one parameter.

AI gateway & LLM router

YepAPI works as an AI gateway, routing requests to 60+ models through a single endpoint. An LLM API router that handles authentication, rate limits, and failover across all providers.

Cheapest AI API pricing

The cheapest LLM API on the market — the cheapest AI API for production workloads. No markup on most models. Access GPT-4o API, DeepSeek API, and Claude API pricing at direct rates.

Start calling AI models in 30 seconds

$5 free credit on signup. No credit card required.

One API call. Any model.

OpenAI-compatible endpoint — same format you already use.

JavaScriptRequest
const res = await fetch("https://api.yepapi.com/v1/ai/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer yep_sk_your_key",
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    model: "deepseek/deepseek-chat-v3-0324",
    messages: [{ role: "user", content: "Explain quantum computing" }],
  }),
});
const data = await res.json();
console.log(data.choices[0].message.content);
PythonRequest
from openai import OpenAI

client = OpenAI(
    base_url="https://api.yepapi.com/v1/ai",
    api_key="yep_sk_your_key",
)

response = client.chat.completions.create(
    model="deepseek/deepseek-chat-v3-0324",
    messages=[{"role": "user", "content": "Explain quantum computing"}],
)
print(response.choices[0].message.content)

Switch models by changing one parameter. Works with the OpenAI SDK, LangChain, Vercel AI SDK, or raw HTTP.

Text Generation

GPT-4o Mini

OpenAI

Access GPT-4o Mini through one API key. Fast, cheap, and OpenAI-compatible.

128K ctx$0.15/M in

GPT-4o

OpenAI

Access GPT-4o through one API key. Flagship reasoning and multimodal capabilities.

128K ctx$2.50/M in

Claude Sonnet 4

Anthropic

Access Claude Sonnet 4 through one API key. Anthropic's best balance of speed and intelligence.

200K ctx$3.00/M in

Claude Haiku 4

Anthropic

Access Claude Haiku 4 through one API key. Anthropic's fastest model for high-volume tasks.

200K ctx$0.80/M in

Gemini 2.5 Flash

Google

Access Gemini 2.5 Flash through one API key. Google's fastest model with 1M context.

1M ctx$0.15/M in

Gemini 2.5 Pro

Google

Access Gemini 2.5 Pro through one API key. Google's most capable model with 1M context.

1M ctx$2.50/M in

Llama 4 Scout

Meta

Access Llama 4 Scout through one API key. Meta's open-weight model with 512K context.

512K ctx$0.15/M in

DeepSeek V3

DeepSeek

Access DeepSeek V3 through one API key. Frontier-level coding at a fraction of the cost.

128K ctx$0.27/M in

DeepSeek R1

DeepSeek

Access DeepSeek R1 through one API key. Reasoning-specialized model for complex problems.

128K ctx$0.55/M in

GPT-5.4

OpenAI

Access GPT-5.4 through one API key. OpenAI's latest and most capable model.

1.1M ctx$2.50/M in

GPT-5.4 Mini

OpenAI

Access GPT-5.4 Mini through one API key. Smart and affordable with 400K context.

400K ctx$0.75/M in

GPT-5.4 Nano

OpenAI

Access GPT-5.4 Nano through one API key. Ultra-cheap for high-volume tasks.

400K ctx$0.20/M in

Claude Opus 4.7

Anthropic

Access Claude Opus 4.7 through one API key. Anthropic's newest and most intelligent model.

1M ctx$5.00/M in

Claude Opus 4.6

Anthropic

Access Claude Opus 4.6 through one API key. Anthropic's previous-generation flagship.

1M ctx$5.00/M in

Claude Sonnet 4.6

Anthropic

Access Claude Sonnet 4.6 through one API key. Anthropic's latest balanced model with 1M context.

1M ctx$3.00/M in

Grok 4.20

xAI

Access Grok 4.20 through one API key. xAI's flagship model with the largest context window.

2M ctx$2.00/M in

Gemini 3.1 Pro

Google

Access Gemini 3.1 Pro through one API key. Google's next-generation model.

1.0M ctx$2.00/M in

Qwen 3.6 Plus

Qwen

Access Qwen 3.6 Plus through one API key. Powerful reasoning with 1M context at a great price.

1M ctx$0.33/M in

Mistral Small 4

Mistral

Access Mistral Small 4 through one API key. Europe's leading AI model at a great price.

262K ctx$0.15/M in

Devstral 2

Mistral

Access Devstral 2 through one API key. Mistral's coding-specialized model for developers.

262K ctx$0.40/M in

GPT-5.4 Pro

OpenAI

Access GPT-5.4 Pro through one API key. OpenAI's most powerful reasoning model.

1.1M ctx$30.00/M in

Sonar Pro

Perplexity

Access Sonar Pro through one API key. Search-augmented AI with citations built in.

200K ctx$3.00/M in

Sonar

Perplexity

Access Sonar through one API key. Affordable search-augmented AI with citations.

200K ctx$1.00/M in

Qwen 3 Coder

Qwen

Access Qwen 3 Coder through one API key. Coding-specialized model at an ultra-low price.

262K ctx$0.12/M in

Mistral Small Creative

Mistral

Access Mistral Small Creative through one API key. Creative writing at the lowest price.

262K ctx$0.10/M in

Gemini 3 Flash

Google

Access Gemini 3 Flash through one API key. Google's next-generation fast model.

1M ctx$0.50/M in

Grok 4.20 Multi-Agent

xAI

Access Grok 4.20 Multi-Agent through one API key. Multi-agent orchestration with 2M context.

2M ctx$2.00/M in

GPT-5.3 Chat

OpenAI

Access GPT-5.3 Chat through one API key. Reliable previous-generation OpenAI model.

128K ctx$1.75/M in

GPT-5.3 Codex

OpenAI

Access GPT-5.3 Codex through one API key. Previous-gen coding model with massive output.

400K ctx$1.75/M in

GPT-5.2 Pro

OpenAI

Access GPT-5.2 Pro through one API key. Previous-gen premium reasoning model.

400K ctx$21.00/M in

GPT-5.2

OpenAI

Access GPT-5.2 through one API key. Previous-generation flagship model.

400K ctx$1.75/M in

GPT-5.2 Chat

OpenAI

Access GPT-5.2 Chat through one API key. Reliable chat model.

128K ctx$1.75/M in

GPT-5.2 Codex

OpenAI

Access GPT-5.2 Codex through one API key. Reliable coding model with massive context.

400K ctx$1.75/M in

GPT-5.1 Codex Max

OpenAI

Access GPT-5.1 Codex Max through one API key. Max-output coding model.

400K ctx$1.25/M in

GPT Audio

OpenAI

Access GPT Audio through one API key. Audio-capable multimodal model.

128K ctx$2.50/M in

GPT Audio Mini

OpenAI

Access GPT Audio Mini through one API key. Affordable audio-capable model.

128K ctx$0.60/M in

Claude Opus 4.6 Fast

Anthropic

Access Claude Opus 4.6 Fast through one API key. Fastest Opus with 128K output.

1M ctx$30.00/M in

Gemini 3.1 Flash Lite

Google

Access Gemini 3.1 Flash Lite through one API key. Ultra-affordable with 1M context.

1.0M ctx$0.25/M in

Gemma 4 31B

Google

Access Gemma 4 31B through one API key. Google's open-weight model at ultra-low cost.

262K ctx$0.14/M in

Gemma 4 26B

Google

Access Gemma 4 26B through one API key. Efficient MoE model at the lowest price.

262K ctx$0.13/M in

Qwen 3.5 Plus

Qwen

Access Qwen 3.5 Plus through one API key. Strong model with 1M context.

1M ctx$0.26/M in

Qwen 3.5 397B

Qwen

Access Qwen 3.5 397B through one API key. Alibaba's largest MoE model.

262K ctx$0.39/M in

Qwen 3.5 122B

Qwen

Access Qwen 3.5 122B through one API key. Efficient MoE model.

262K ctx$0.26/M in

Qwen 3.5 35B

Qwen

Access Qwen 3.5 35B through one API key. Ultra-fast MoE model.

262K ctx$0.16/M in

Qwen 3.5 27B

Qwen

Access Qwen 3.5 27B through one API key. Dense model for consistent performance.

262K ctx$0.20/M in

Qwen 3.5 9B

Qwen

Access Qwen 3.5 9B through one API key. Cheapest model for high-volume tasks.

256K ctx$0.05/M in

Qwen 3.5 Flash

Qwen

Access Qwen 3.5 Flash through one API key. Ultra-fast with 1M context.

1M ctx$0.07/M in

Qwen 3 Max Thinking

Qwen

Access Qwen 3 Max Thinking through one API key. Reasoning model for complex problems.

262K ctx$0.78/M in

Llama 4 Maverick

Meta

Access Llama 4 Maverick through one API key. Meta's most capable open-weight model.

1.0M ctx$0.15/M in

MiniMax M2.7

MiniMax

Access MiniMax M2.7 through one API key. Bilingual flagship model with massive output.

205K ctx$0.30/M in

MiniMax M2.5

MiniMax

Access MiniMax M2.5 through one API key. Affordable bilingual model.

197K ctx$0.12/M in

GLM 5.1

Z.ai

Access GLM 5.1 through one API key. Zhipu AI's latest bilingual model.

203K ctx$1.26/M in

GLM 5

Z.ai

Access GLM 5 through one API key. Balanced bilingual model from Zhipu AI.

80K ctx$0.72/M in

GLM 5 Turbo

Z.ai

Access GLM 5 Turbo through one API key. Fast bilingual model from Zhipu AI.

203K ctx$1.20/M in

Seed 2.0 Lite

ByteDance

Access Seed 2.0 Lite through one API key. ByteDance's latest bilingual model.

262K ctx$0.25/M in

Seed 2.0 Mini

ByteDance

Access Seed 2.0 Mini through one API key. Ultra-cheap from ByteDance.

262K ctx$0.10/M in

Seed 1.6

ByteDance

Access Seed 1.6 through one API key. Reliable model from ByteDance.

262K ctx$0.25/M in

Seed 1.6 Flash

ByteDance

Access Seed 1.6 Flash through one API key. Cheapest model from ByteDance.

262K ctx$0.07/M in

MiMo V2 Pro

Xiaomi

Access MiMo V2 Pro through one API key. Xiaomi's flagship AI model.

1.0M ctx$1.00/M in

MiMo V2 Omni

Xiaomi

Access MiMo V2 Omni through one API key. Xiaomi's multimodal model.

262K ctx$0.40/M in

MiMo V2 Flash

Xiaomi

Access MiMo V2 Flash through one API key. Cheapest Xiaomi model.

262K ctx$0.09/M in

DeepSeek V3.2

DeepSeek

Access DeepSeek V3.2 through one API key. Latest DeepSeek model.

164K ctx$0.26/M in

Claude Sonnet 4.5

Anthropic

Access Claude Sonnet 4.5 through one API key. Previous-gen Anthropic flagship.

1M ctx$3.00/M in

Grok 4.1 Fast

xAI

Access Grok 4.1 Fast through one API key. Fast xAI model with 2M context.

2M ctx$0.20/M in

Gemini 2.5 Flash Lite

Google

Access Gemini 2.5 Flash Lite through one API key. Google's cheapest model.

1.0M ctx$0.10/M in

GPT-OSS 120B

OpenAI

Access GPT-OSS 120B through one API key. OpenAI's open-source model.

131K ctx$0.04/M in

Step 3.5 Flash

StepFun

Access Step 3.5 Flash through one API key. StepFun's fast model.

262K ctx$0.10/M in

Kimi K2.5

Moonshot AI

Access Kimi K2.5 through one API key. Moonshot AI's flagship model.

262K ctx$0.38/M in

Nemotron 3 Super

NVIDIA

Access Nemotron 3 Super through one API key. NVIDIA's free model.

262K ctx$0.00/M in

DeepSeek V4 Flash

DeepSeek

Access DeepSeek V4 Flash through one API key. 1M context, MoE speed, budget pricing.

1.0M ctx$0.20/M in

DeepSeek V4 Pro

DeepSeek

Access DeepSeek V4 Pro through one API key. Frontier reasoning at a fraction of flagship pricing.

1.0M ctx$0.61/M in

Image Generation

Video Generation

Music Generation

Image Analysis

Video Analysis

The LLM API that gives you 60+ models through one endpoint

YepAPI's LLM API is a unified AI gateway and AI models API that gives you access to 60+ AI models through a single OpenAI compatible API endpoint. Use the GPT-4o API, DeepSeek API, Claude API, Gemini API, Llama API, and Qwen API — all through one LLM API endpoint. Send the same request format you use with OpenAI and switch models by changing one parameter.

Looking for the cheapest AI API? YepAPI's LLM API has no markup on most models — you pay the same per-token rate as going direct. The DeepSeek API starts at $0.14/M input tokens. The GPT-4o API costs $2.50/M input. Claude API pricing starts at $0.80/M for Haiku. No monthly minimums makes this the cheap LLM API developers want for pay-as-you-go pricing. The cheapest AI API with the broadest model coverage.

YepAPI works as an AI gateway and LLM router — one key, one bill, one dashboard. Building an AI agent, chatbot, or content tool? This LLM API works with any framework — LangChain, LlamaIndex, Vercel AI SDK, or raw HTTP calls. Plus, your same API key covers web scraping, SEO data, SERP results, and YouTube APIs. Compare OpenRouter pricing to YepAPI — same models, better value.

OpenRouter alternative — better pricing, more APIs

Compare OpenRouter pricing to YepAPI: same LLM API access, but YepAPI bundles web scraping, SEO, SERP, and YouTube APIs under one key. The cheapest AI API with no monthly minimums.

Provider
Models
Catch
OpenRouter
Limited
Limited to AI models only
OpenAI direct
Limited
Only OpenAI models
Anthropic direct
Limited
Only Claude models
Together AI
Limited
Open-source models only
YepAPI
60+ models
No minimums

AI API — Frequently asked questions

An AI API gives you programmatic access to large language models (LLMs) like GPT-5.4, Claude, Gemini, and DeepSeek. You send a prompt via HTTP request and get back generated text, code, or structured data. YepAPI's AI API is a unified endpoint — one API key accesses 60+ models from OpenAI, Anthropic, Google, Meta, and more.

YepAPI gives you $5 free credit on signup — no credit card required. That's enough for thousands of API calls on cheaper models like DeepSeek, Llama, or Qwen. There's no free tier with hard limits — you get real credits to use on any model, then pay as you go.

An LLM gateway is a unified API layer that sits between your app and multiple AI model providers. Instead of integrating with OpenAI, Anthropic, Google, and Meta separately, you connect to one gateway. YepAPI acts as an LLM gateway and router — one endpoint, one key, one billing dashboard. Switch between models by changing a single parameter.

Through YepAPI, DeepSeek V3.2 costs $0.14/M input tokens and $0.28/M output tokens. DeepSeek R1 (reasoning model) costs $0.55/M input and $2.19/M output. No monthly minimums — pay per token, starting from your $5 free credit.

An OpenAI-compatible API uses the same request and response format as OpenAI's API. Any code written for the OpenAI SDK works with YepAPI — just change the base URL and API key. This means you can access Claude, Gemini, DeepSeek, and Llama models using the same OpenAI client libraries you already use.

YepAPI serves a similar role to OpenRouter — both are LLM gateways that give you access to multiple AI models through one API. The differences: YepAPI bundles web scraping, SEO, SERP, and YouTube APIs under the same key, uses straightforward pay-per-token pricing, and gives you $5 free credit to start. If you want an OpenRouter alternative with broader API coverage, YepAPI is it.

Claude API pricing through YepAPI uses per-token rates with no markup. Claude Sonnet 4.6 costs $3/M input and $15/M output. Claude Haiku 4.5 costs $0.80/M input and $4/M output. Claude Opus 4.6 costs $15/M input and $75/M output. No monthly minimums — pay only for the tokens you use. The cheapest way to access Claude API without an Anthropic enterprise contract.

The GPT-4o API through YepAPI costs $2.50/M input tokens and $10/M output tokens — the same rate as OpenAI direct. GPT-4o mini costs $0.15/M input and $0.60/M output. No monthly subscriptions or minimums. Your $5 free credit covers thousands of GPT-4o API calls. Switch between GPT-4o and any other model by changing one parameter.

YepAPI is the cheapest LLM API for accessing multiple AI models. DeepSeek API starts at $0.14/M tokens. Llama models start at $0.03/M tokens. Qwen models start at $0.05/M tokens. No markup on most models, no monthly minimums, and $5 free credit on signup. Compare that to OpenRouter pricing or going direct to each provider — YepAPI is the cheapest AI API with the broadest model coverage.

OpenRouter pricing and YepAPI pricing are both per-token with no monthly minimums. The key differences: YepAPI bundles web scraping, SEO, SERP, and YouTube APIs under the same key — OpenRouter is LLM-only. YepAPI gives you $5 free credit on signup. Both offer the same models (GPT-4o API, DeepSeek API, Claude API, Gemini), but YepAPI is a more complete AI gateway for developers building data-driven apps.

DeepSeek API pricing through YepAPI: DeepSeek V3.2 costs $0.14/M input and $0.28/M output tokens. DeepSeek R1 (reasoning model) costs $0.55/M input and $2.19/M output. The DeepSeek API is one of the cheapest LLM APIs available — $5 free credit covers tens of thousands of calls. No monthly minimums, no subscriptions.

Ready to build with AI?

All 84 models included with one API key. $5 free credit on signup. No credit card required.