Models/zhipu/GLM-4 Flash
Z

zhipu

GLM-4 Flash

glm-4-flash131K contextFree / Free per Mtool calling

Context Window

131K

Input Price / M

Free

Output Price / M

Free

Max Completion

Input Modalities

text

Output Modalities

text

About This Model

Zhipu GLM-4 ultra-fast edition, free to use

Available Providers

AllToken can route requests to the providers below based on route priority and policy.

Provider
Context Length
Input Price
Output Price
Cache Read
Priority

Z.AI (Zhipu)

glm-4-flash
131K
Free / M
Free / M
1

Status Snapshot

Quick route status for this model. Open the full Status page to compare it against the wider market and current incident patterns.

Open Status

Effective Pricing

Input

Free

per 1M input tokens

Output

Free

per 1M output tokens

Cached Read

Free

if prompt caching applies

Best For

免费体验快速对话
ChatToolsStreaming

How To Use This Model

Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.

Exact request field

"model": "glm-4-flash"
curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-4-flash",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

Supported Parameters

temperaturetop_pmax_tokenstools

API Key Setup

Smart Routing

Let the platform choose the best provider path automatically.

Default Model

If a request does not specify a model, default the key to glm-4-flash.

Forced Model

Always override incoming requests and lock the key to glm-4-flash.