ZhipuGLM 4Arena #100Jan 19, 2026
Z.ai: GLM 4.7 Flash
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.
Context Window
203K
tokens
Max Output
—
tokens
Released
Jan 19, 2026
Arena Rank
#100
of 305 models
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Frequency Penalty
Reduce repetition
Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
min_p
Presence Penalty
Encourage new topics
Reasoning
Extended thinking
Repetition Penalty
Penalize repeated tokens
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top K
Top-K sampling
Top P
Nucleus sampling
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| OpenRouter | $0.06 | $0.40 | $0.01 |
Benchmarks
Arena ELOLMSYS Chatbot Arena
1,305/1,500Model IDs
OpenRouter
z-ai/glm-4.7-flashHugging Facezai-org/GLM-4.7-Flash ↗
Tags
reasoningtool-calling