qwen3.5-9b

Compact, cost-effective Qwen model for fast, high-volume general tasks.

qwen · 256K context

Value rank	#3 of 23	intelligence per dollar in our catalogue
Context rank	#2 tied	256K token window
Agentic strength	87%	tau2-bench tool-use success

Benchmarks

Independent scores by Artificial Analysis, compared with the strongest models in our catalogue.

Intelligence index

glm-5.2	51.1
minimax-m3	44.4
deepseek-v4-pro	44.3
kimi-k2.6	42.8
qwen3.5-9b	25

Coding index

glm-5.2	50.7
deepseek-v4-pro	47.5
kimi-k2.6	47.1
minimax-m3	43.4
qwen3.5-9b	25.3

Agentic (tau2-bench)

deepseek-v4-pro	96.2%
kimi-k2.6	95.9%
minimax-m3	88.9%
qwen3.5-9b	86.8%

Best for

Where this model earns its keep.

Prompt-cached workloads

The numbers

Pricing is live from our platform. Prices are per 1M tokens, with zero data retention on every request.

Input price	$0.15
Cache read price	$0.04
Output price	$0.20
Context window	256K tokens
Intelligence / coding index	25 / 25.3
Agentic: tau2 / terminal-bench	87% / 24%
GPQA / MMLU-Pro	81% / -
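
To make the per-token prices concrete, here is a minimal sketch that estimates the cost of a single request from the figures in the table above. It assumes that input tokens served from the prompt cache are billed at the cache read price instead of the input price; the page does not spell out the billing rule, so treat that as an assumption. The function name `estimate_cost` and its parameters are illustrative, not part of any TensorX API.

```python
# Prices in USD per 1M tokens, copied from the table above.
INPUT_PRICE = 0.15
CACHE_READ_PRICE = 0.04
OUTPUT_PRICE = 0.20
TOKENS_PER_UNIT = 1_000_000


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request.

    Assumption (not confirmed by the page): cached_tokens is the part of
    input_tokens served from the prompt cache, billed at the cache read
    price; the remaining input tokens are billed at the input price.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PRICE
        + cached_tokens * CACHE_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / TOKENS_PER_UNIT


# A 200K-token prompt with a 1K-token reply, without and with a cached prefix.
cold = estimate_cost(200_000, 1_000)
warm = estimate_cost(200_000, 1_000, cached_tokens=150_000)
print(f"cold: ${cold:.4f}  warm: ${warm:.4f}")
```

Under these assumptions, caching 150K of the 200K prompt tokens brings the request from about $0.0302 down to about $0.0137, which is why prompt-cached workloads are listed under "Best for".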

Or consider

Close alternatives in the catalogue.

deepseek-v3.2	Intelligence 24.7 · agentic 79%
nemotron-3-super-120b-a12b	Intelligence 25.4 · agentic 68%
gpt-oss-120b	Intelligence 23.8 · agentic 66%

Quick start

OpenAI-compatible. Switch in one line.

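# pip install openai
from openai import OpenAI

# Point the standard OpenAI client at the TensorX endpoint.
client = OpenAI(base_url="https://api.tensorx.ai/v1", api_key="tsx-...")
r = client.chat.completions.create(
    model="qwen/qwen3.5-9b",
    messages=[{"role": "user", "content": "Hello"}],
)
print(r.choices[0].message.content)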
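
One way to keep the switch to a single place in an existing codebase is to separate the client settings from the request payload. The sketch below only builds plain dictionaries, so it runs without network access. The environment variable name `TENSORX_API_KEY` and the helpers `client_settings` and `chat_request` are hypothetical names chosen for illustration; only the base URL and model id come from the quick start above.

```python
import os

# Base URL and model id as given in the quick start above.
TENSORX_BASE_URL = "https://api.tensorx.ai/v1"
MODEL_ID = "qwen/qwen3.5-9b"


def client_settings(env=None):
    """Return keyword arguments for openai.OpenAI(...).

    TENSORX_API_KEY is a hypothetical variable name, not one documented
    by the platform; use whatever secret storage you already have.
    """
    env = os.environ if env is None else env
    return {
        "base_url": TENSORX_BASE_URL,
        "api_key": env.get("TENSORX_API_KEY", ""),
    }


def chat_request(prompt, system=None):
    """Build keyword arguments for client.chat.completions.create(...)."""
    messages = []
    if system is not None:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})
    return {"model": MODEL_ID, "messages": messages}


settings = client_settings({"TENSORX_API_KEY": "tsx-..."})
request = chat_request("Hello", system="You are a concise assistant.")
print(settings["base_url"], request["model"], len(request["messages"]))
```

With the `openai` package installed, these would be used as `client = OpenAI(**client_settings())` followed by `client.chat.completions.create(**chat_request("Hello"))`.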

Benchmark data from the Artificial Analysis Intelligence Index v4.1, measured independently. Pricing live from the TensorX platform. All inference on EU-sovereign infrastructure with zero data retention.
