Model Library

Every model we serve, with independent quality benchmarks and transparent pricing.

SmartestiRanked by the Artificial Analysis Intelligence Index, a blended score across reasoning, maths, coding and knowledge benchmarks. Higher is smarter.

1glm-5.2 51.1
2minimax-m3 44.4
4kimi-k2.6 42.8

Most agenticiRanked by tau2-bench, which measures how reliably a model completes multi-step tool-use tasks. Shown as a percentage.

1glm-5-turbo 98.5%
2glm-5v-turbo 98.5%
3glm-5 98.2%
4glm-5.1 97.7%

Best valueiIntelligence per dollar: the Intelligence Index divided by blended price, (3x input + output) / 4 per million tokens. Higher means more capability for your spend.

1gpt-oss-120b 23.8 @ $0.04 in
2deepseek-v4-flash 40.3 @ $0.15 in
3qwen3.5-9b 25 @ $0.15 in
4qwen3-coder-30b-a3b-instruct 13.6 @ $0.06 in
5qwen3-235b-a22b-2507 18.2 @ $0.07 in

z-ai/glm-5.2

New z-ai 1M
Input$1.50 / 1M
Cache Read$0.38 / 1M
Output$4.50 / 1M
Intel
51.1
Coding
50.7
Functions Tool Choice Reasoning

minimax/minimax-m3

New minimax 1M
Input$0.40 / 1M
Cache Read$0.10 / 1M
Output$2.00 / 1M
Intel
44.4
Coding
43.4
Agentic
89%
Functions Tool Choice Reasoning Vision

deepseek/deepseek-v4-pro

New deepseek 1M
Input$1.75 / 1M
Cache Read$0.44 / 1M
Output$3.50 / 1M
Intel
44.3
Coding
47.5
Agentic
96%
Functions Tool Choice Reasoning

moonshotai/kimi-k2.6

New moonshotai 256K
Input$1.00 / 1M
Cache Read$0.25 / 1M
Output$4.00 / 1M
Intel
42.8
Coding
47.1
Agentic
96%
Functions Tool Choice Reasoning Vision

moonshotai/kimi-k2.7-code

New moonshotai 256K
Input$1.25 / 1M
Cache Read$0.31 / 1M
Output$4.50 / 1M
Intel
41.9
Coding
45.8
Functions Tool Choice Reasoning Vision

deepseek/deepseek-v4-flash

New deepseek 1M
Input$0.15 / 1M
Cache Read$0.04 / 1M
Output$0.30 / 1M
Intel
40.3
Coding
38.7
Agentic
95%
Functions Tool Choice Reasoning

z-ai/glm-5.1

New z-ai 198K
Input$1.40 / 1M
Cache Read$0.35 / 1M
Output$4.40 / 1M
Intel
40.2
Coding
43.4
Agentic
98%
Functions Tool Choice Reasoning

z-ai/glm-5-turbo

New z-ai 198K
Input$1.20 / 1M
Cache Read$0.30 / 1M
Output$4.00 / 1M
Intel
38.1
Coding
36.8
Agentic
99%
Functions Tool Choice Reasoning

z-ai/glm-5v-turbo

New z-ai 198K
Input$1.20 / 1M
Cache Read$0.30 / 1M
Output$4.00 / 1M
Intel
34.5
Coding
36.2
Agentic
99%
Functions Tool Choice Reasoning Vision

z-ai/glm-5

z-ai 198K
Input$1.00 / 1M
Cache Read$0.25 / 1M
Output$3.20 / 1M
Intel
39.5
Coding
44.2
Agentic
98%
Functions Tool Choice Reasoning

moonshotai/kimi-k2.5

moonshotai 256K
Input$0.50 / 1M
Cache Read$0.13 / 1M
Output$2.80 / 1M
Intel
38.1
Coding
39.6
Agentic
96%
Functions Tool Choice Vision

z-ai/glm-4.7

z-ai 198K
Input$0.60 / 1M
Cache Read$0.15 / 1M
Output$2.20 / 1M
Intel
33.8
Coding
36.3
Agentic
96%
Functions Tool Choice Reasoning Vision

minimax/minimax-m2.5

minimax 64K
Input$0.30 / 1M
Cache Read$0.08 / 1M
Output$1.20 / 1M
Intel
33.7
Coding
37.4
Agentic
95%
Functions Tool Choice Reasoning

qwen/qwen3.5-122b-a10b

qwen 256K
Input$0.50 / 1M
Cache Read$0.13 / 1M
Output$3.50 / 1M
Intel
32.3
Coding
34.7
Agentic
94%
Functions Tool Choice Reasoning Vision

nvidia/nemotron-3-super-120b-a12b

nvidia 256K
Input$0.30 / 1M
Cache Read$0.08 / 1M
Output$0.90 / 1M
Intel
25.4
Coding
31.2
Agentic
68%
Functions Tool Choice Reasoning

qwen/qwen3.5-9b

qwen 256K
Input$0.15 / 1M
Cache Read$0.04 / 1M
Output$0.20 / 1M
Intel
25
Coding
25.3
Agentic
87%
Functions Tool Choice Reasoning

deepseek/deepseek-v3.2

deepseek 160K
Input$0.30 / 1M
Cache Read$0.08 / 1M
Output$0.50 / 1M
Intel
24.7
Coding
34.6
Agentic
79%
Functions Tool Choice Reasoning

openai/gpt-oss-120b

openai 131K
Input$0.04 / 1M
Cache Read$0.01 / 1M
Output$0.20 / 1M
Intel
23.8
Coding
28.6
Agentic
66%
Functions Tool Choice Reasoning

deepseek/deepseek-chat-v3.1

deepseek 164K
Input$0.20 / 1M
Cache Read$0.05 / 1M
Output$0.80 / 1M
Intel
21
Coding
28.4
Agentic
35%
Functions Tool Choice Reasoning

deepseek/deepseek-r1-0528

deepseek 164K
Input$0.66 / 1M
Cache Read$0.17 / 1M
Output$2.60 / 1M
Intel
20.1
Coding
24
Agentic
37%
Functions Tool Choice Reasoning

qwen/qwen3-235b-a22b-2507

qwen 131K
Input$0.07 / 1M
Cache Read$0.02 / 1M
Output$0.46 / 1M
Intel
18.2
Coding
22.1
Agentic
33%
Functions Tool Choice

qwen/qwen3-vl-235b-a22b-instruct

qwen 131K
Input$0.21 / 1M
Cache Read$0.05 / 1M
Output$1.90 / 1M
Intel
14.3
Coding
16.5
Agentic
35%

qwen/qwen3-coder-30b-a3b-instruct

qwen 262K
Input$0.06 / 1M
Cache Read$0.02 / 1M
Output$0.25 / 1M
Intel
13.6
Coding
19.4
Agentic
35%
Functions Tool Choice

qwen/qwen3-embedding-8b

qwen 32K
Input$0.01 / 1M
Cache Read$0.00 / 1M
Output$0.00 / 1M

No models match your search.

Benchmark scores from the Artificial Analysis Intelligence Index v4.1. Pricing live from the TensorX platform. All inference on EU-sovereign infrastructure with zero data retention.