deepseek

deepseek-v4-flash

Fast, efficient DeepSeek model for high-volume, long-context tasks.

deepseek 1M context ★ Featured

Value rank

#2 of 23

intelligence per dollar in our catalogue

Context rank

#1 tied

1M token window

Agentic strength

95%

tau2-bench tool-use success

Benchmarks

Independent scores by Artificial Analysis, compared with the strongest models in our catalogue.

Intelligence index

glm-5.2
51.1
minimax-m3
44.4
kimi-k2.6
42.8

Coding index

glm-5.2
50.7
kimi-k2.6
47.1
minimax-m3
43.4

Agentic (tau2-bench)

kimi-k2.6
95.9%
minimax-m3
88.9%

Best for

Where this model earns its keep.

Agentic pipelines and tool use Long-document analysis Prompt-cached workloads

The numbers

Pricing is live from our platform. Prices per 1M tokens, zero data retention on every request.

Input price$0.15
Cache read price$0.04
Output price$0.30
Context window1M tokens
Intelligence / coding index40.3 / 38.7
Agentic: tau2 / terminal-bench95% / 36%
GPQA / MMLU-Pro89% / -

Or consider

Close alternatives in the catalogue.

Quick start

OpenAI-compatible. Switch in one line.

# pip install openai
client = OpenAI(base_url="https://api.tensorx.ai/v1", api_key="tsx-...")
r = client.chat.completions.create(
    model="deepseek/deepseek-v4-flash",
    messages=[{"role": "user", "content": "Hello"}],
)

Benchmark data from the Artificial Analysis Intelligence Index v4.1, measured independently. Pricing live from the TensorX platform. All inference on EU-sovereign infrastructure with zero data retention.