ZhipuGLM 4Arena #89Dec 8, 2025

Z.ai: GLM 4.6V

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts and charts directly as visual inputs, and integrates native multimodal function calling to connect perception with downstream tool execution. The model also enables interleaved image-text generation and UI reconstruction workflows, including screenshot-to-HTML synthesis and iterative visual editing.

Context Window
131K
tokens
Max Output
131K
tokens
Released
Dec 8, 2025
Arena Rank
#89
of 305 models

Capabilities

👁Vision
🧠Reasoning
🔧Tool Calling
Prompt Caching
🖥Computer Use
🎨Image Generation

Supported Parameters

Frequency Penalty
Reduce repetition
Include Reasoning
Show reasoning tokens
Logit Bias
Adjust token weights
Max Tokens
Output length limit
min_p
Presence Penalty
Encourage new topics
Reasoning
Extended thinking
Repetition Penalty
Penalize repeated tokens
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Tool Choice
Control tool usage
Tool Calling
Function calling support
Top K
Top-K sampling
Top P
Nucleus sampling

Pricing Comparison

RouterInput / 1MOutput / 1MCached Input / 1M
OpenRouter$0.30$0.90
Vercel AI$0.30$0.90

Benchmarks

Arena ELOLMSYS Chatbot Arena
1,326/1,500

Model IDs

OpenRouterz-ai/glm-4.6v

Tags

visionreasoningtool-calling
Compare with another model

Related Models