z-ai

glm-4.7

High-throughput model suitable for real-time applications and code completion

z-ai 198K context

Value rank

#12 of 23

intelligence per dollar in our catalogue

Context rank

#4 tied

198K token window

Agentic strength

96%

tau2-bench tool-use success

Benchmarks

Independent scores by Artificial Analysis, compared with the strongest models in our catalogue.

Intelligence index

glm-5.2
51.1
minimax-m3
44.4
kimi-k2.6
42.8
glm-4.7
33.8

Coding index

glm-5.2
50.7
kimi-k2.6
47.1
minimax-m3
43.4
glm-4.7
36.3

Agentic (tau2-bench)

glm-4.7
95.9%
kimi-k2.6
95.9%
minimax-m3
88.9%

Best for

Where this model earns its keep.

Agentic pipelines and tool use Vision and image-aware tasks Prompt-cached workloads

The numbers

Pricing is live from our platform. Prices per 1M tokens, zero data retention on every request.

Input price$0.60
Cache read price$0.15
Output price$2.20
Context window198K tokens
Intelligence / coding index33.8 / 36.3
Agentic: tau2 / terminal-bench96% / 32%
GPQA / MMLU-Pro86% / 86%

Or consider

Close alternatives in the catalogue.

Quick start

OpenAI-compatible. Switch in one line.

# pip install openai
client = OpenAI(base_url="https://api.tensorx.ai/v1", api_key="tsx-...")
r = client.chat.completions.create(
    model="z-ai/glm-4.7",
    messages=[{"role": "user", "content": "Hello"}],
)

Benchmark data from the Artificial Analysis Intelligence Index v4.1, measured independently. Pricing live from the TensorX platform. All inference on EU-sovereign infrastructure with zero data retention.