qwen

qwen3.5-9b

Compact, cost-effective Qwen model for fast, high-volume general tasks.

qwen 256K context

Value rank

#3 of 23

intelligence per dollar in our catalogue

Context rank

#2 tied

256K token window

Agentic strength

87%

tau2-bench tool-use success

Benchmarks

Independent scores by Artificial Analysis, compared with the strongest models in our catalogue.

Intelligence index

glm-5.2
51.1
minimax-m3
44.4
kimi-k2.6
42.8

Coding index

glm-5.2
50.7
kimi-k2.6
47.1
minimax-m3
43.4
qwen3.5-9b
25.3

Agentic (tau2-bench)

kimi-k2.6
95.9%
minimax-m3
88.9%
qwen3.5-9b
86.8%

Best for

Where this model earns its keep.

Prompt-cached workloads

The numbers

Pricing is live from our platform. Prices per 1M tokens, zero data retention on every request.

Input price$0.15
Cache read price$0.04
Output price$0.20
Context window256K tokens
Intelligence / coding index25 / 25.3
Agentic: tau2 / terminal-bench87% / 24%
GPQA / MMLU-Pro81% / -

Or consider

Close alternatives in the catalogue.

Quick start

OpenAI-compatible. Switch in one line.

# pip install openai
client = OpenAI(base_url="https://api.tensorx.ai/v1", api_key="tsx-...")
r = client.chat.completions.create(
    model="qwen/qwen3.5-9b",
    messages=[{"role": "user", "content": "Hello"}],
)

Benchmark data from the Artificial Analysis Intelligence Index v4.1, measured independently. Pricing live from the TensorX platform. All inference on EU-sovereign infrastructure with zero data retention.