GLM 5
Access GLM 5 through one API key. Balanced bilingual model from Zhipu AI.
Zhipu AI's balanced model. Good performance across Chinese and English tasks at a lower price.
Context Window
80K tokens
Max Output
131K tokens
Input Price
$0.72 / 1M tokens
Output Price
$2.30 / 1M tokens
Try it live
Send a message and see GLM 5 respond in real time.
Maximum tokens in the response.
Real-time token streaming
Strengths
Quick start
Copy this snippet and start making calls with GLM 5.
const res = await fetch('https://api.yepapi.com/v1/ai/z-ai/glm-5', {
method: 'POST',
headers: {
'x-api-key': 'YOUR_API_KEY',
'Content-Type': 'application/json',
},
body: JSON.stringify({
"messages": [
{
"role": "user",
"content": "Explain API gateways in 2 sentences."
}
],
"maxTokens": 256
}),
});
const data = await res.json();
console.log(data);Why use GLM 5 through YepAPI?
Frequently asked questions
Zhipu AI's balanced model. Good performance across Chinese and English tasks at a lower price.
Input tokens cost $0.72 per 1M tokens and output tokens cost $2.30 per 1M tokens through YepAPI. No monthly minimums — you only pay for what you use.
Sign up for a free API key, then send requests to the /v1/ai/z-ai/glm-5 endpoint. YepAPI is OpenAI SDK compatible, so you can use it with any tool that supports OpenAI — just change the base URL.
GLM 5 supports a 80K token context window with up to 131K output tokens per request.
Ready to use GLM 5?
Get your API key and start making calls in 30 seconds. No credit card required.
Explore more models
GLM 5.1
Z.aiAccess GLM 5.1 through one API key. Zhipu AI's latest bilingual model.
GLM 5 Turbo
Z.aiAccess GLM 5 Turbo through one API key. Fast bilingual model from Zhipu AI.
GPT-4o Mini
OpenAIAccess GPT-4o Mini through one API key. Fast, cheap, and OpenAI-compatible.