GLM 4.6
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.
Capabilities
Supported Parameters
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| Requesty★ | $0.60 | $2.20 | $0.11 |
| OpenRouter | $0.35 | $1.50 | $0.17 |
| Vercel AI | $0.45 | $1.80 | — |
| Martian | $0.35 | $1.50 | $0.17 |
| DeepInfra | $0.43 | $1.74 | $0.08 |
Model IDs
zai/GLM-4.6z-ai/glm-4.6