Qwen/v1/ai/qwen/qwen3.5-9b

Qwen 3.5 9B

Access Qwen 3.5 9B through one API key. Cheapest model for high-volume tasks.

Alibaba's smallest and cheapest model. Ideal for high-volume classification, extraction, and simple tasks.

Full Docs Get API Key

Context Window

256K tokens

Max Output

33K tokens

Input Price

$0.05 / 1M tokens

Output Price

$0.15 / 1M tokens

Try it live

Send a message and see Qwen 3.5 9B respond in real time.

POST /v1/ai/qwen/qwen3.5-9bLive testing

API Key

Message *

Max Tokens

Maximum tokens in the response.

Stream

Real-time token streaming

Hit "Run" to see the response

Strengths

✓Cheapest Qwen

✓Ultra low latency

✓High throughput

✓256K context

Quick start

Copy this snippet and start making calls with Qwen 3.5 9B.

const res = await fetch('https://api.yepapi.com/v1/ai/qwen/qwen3.5-9b', {
  method: 'POST',
  headers: {
    'x-api-key': 'YOUR_API_KEY',
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    "messages": [
      {
        "role": "user",
        "content": "Explain API gateways in 2 sentences."
      }
    ],
    "maxTokens": 256
  }),
});
const data = await res.json();
console.log(data);

Why use Qwen 3.5 9B through YepAPI?

✓One API key for all models — no separate accounts

✓OpenAI SDK compatible — just change the base URL

✓No monthly minimums — pay per token

✓Switch models with one line of code

✓Full provider passthrough — citations, search results, and all extras included

✓Streaming and non-streaming support on every model

✓Works with Cursor, Claude, LangChain, and any LLM tool

✓Unified billing across all providers

Frequently asked questions

Alibaba's smallest and cheapest model. Ideal for high-volume classification, extraction, and simple tasks.

Input tokens cost $0.05 per 1M tokens and output tokens cost $0.15 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.

Sign up for a free API key, then send requests to the /v1/ai/qwen/qwen3.5-9b endpoint. YepAPI is OpenAI SDK compatible, so you can use it with any tool that supports OpenAI — just change the base URL.

Qwen 3.5 9B supports a 256K token context window with up to 33K output tokens per request.