ZhipuGLM 4Arena #100Jan 19, 2026

Z.ai: GLM 4.7 Flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

Context Window
203K
tokens
Max Output
tokens
Released
Jan 19, 2026
Arena Rank
#100
of 305 models

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Frequency Penalty
Reduce repetition
Include Reasoning
Show reasoning tokens
Max Tokens
Output length limit
min_p
Presence Penalty
Encourage new topics
Reasoning
Extended thinking
Repetition Penalty
Penalize repeated tokens
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouter$0.06$0.40$0.01

Benchmarks

Arena ELOLMSYS Chatbot Arena
1,305/1,500

Model IDs

OpenRouterz-ai/glm-4.7-flash

Tags

reasoningtool-calling
Compare with another model

Related Models