From pay-as-you-go developer inference to fully dedicated sovereign GPU clusters — all on EU infrastructure with zero data retention.
Enterprise-grade EU-sovereign GPU infrastructure with predictable pricing. Contact us for a custom quote.
Your own isolated GPUs. EU-sovereign. Zero data retention.
Prices shown per 1 million tokens. Create an account, buy credits, and start running private inference instantly.
| Description | Context | |||||
|---|---|---|---|---|---|---|
| Open-weight frontier model with strong coding, agentic performance and a... | z-ai | 1M | $1.50 | $0.38 | $4.50 | |
| Open-weight model built for coding, long-context agent work and multimodal... | minimax | 1M | $0.40 | $0.10 | $2.00 | |
| DeepSeek's most capable model for complex reasoning, coding, and agentic... | deepseek | 1M | $1.75 | $0.44 | $3.50 | |
| Latest Moonshot Kimi model with strong reasoning and agentic tool... | moonshotai | 256K | $1.00 | $0.25 | $4.00 | |
| Open-weight model built for end-to-end coding and multi-step agent workflows. | moonshotai | 256K | $1.25 | $0.31 | $4.50 | |
| Fast, efficient DeepSeek model for high-volume, long-context tasks. | deepseek | 1M | $0.15 | $0.04 | $0.30 | |
| Enhanced GLM-5 release with improved reasoning and tool-use performance. | z-ai | 198K | $1.40 | $0.35 | $4.40 | |
| Faster, lower-cost GLM-5 variant tuned for real-time, high-throughput workloads. | z-ai | 198K | $1.20 | $0.30 | $4.00 | |
| Multimodal GLM model that handles vision and text for fast,... | z-ai | 198K | $1.20 | $0.30 | $4.00 | |
| Z.ai's flagship GLM model for advanced reasoning, coding, and agentic... | z-ai | 198K | $1.00 | $0.25 | $3.20 | |
| Capable Moonshot Kimi model for general assistant, reasoning, and long-context... | moonshotai | 256K | $0.50 | $0.13 | $2.80 | |
| High-throughput model suitable for real-time applications and code completion | z-ai | 198K | $0.60 | $0.15 | $2.20 | |
| Strong general-purpose model from MiniMax, well suited to reasoning and... | minimax | 64K | $0.30 | $0.08 | $1.20 | |
| Large Qwen mixture-of-experts model for demanding reasoning, coding, and multilingual... | qwen | 256K | $0.50 | $0.13 | $3.50 | |
| NVIDIA Nemotron mixture-of-experts model balancing strong reasoning with efficient inference. | nvidia | 256K | $0.30 | $0.08 | $0.90 | |
| Compact, cost-effective Qwen model for fast, high-volume general tasks. | qwen | 256K | $0.15 | $0.04 | $0.20 | |
| Well-rounded DeepSeek model for everyday reasoning, coding, and chat. | deepseek | 160K | $0.30 | $0.08 | $0.50 | |
| OpenAI's open-weight 120B model for capable general-purpose reasoning and coding. | openai | 131K | $0.04 | $0.01 | $0.20 | |
| Versatile DeepSeek chat model offering strong general reasoning at low... | deepseek | 164K | $0.20 | $0.05 | $0.80 | |
| DeepSeek R1 reasoning model built for step-by-step problem solving and... | deepseek | 164K | $0.66 | $0.17 | $2.60 | |
| Large Qwen mixture-of-experts model for advanced reasoning, coding, and multilingual... | qwen | 131K | $0.07 | $0.02 | $0.46 | |
| Qwen multimodal model that understands both images and text for... | qwen | 131K | $0.21 | $0.05 | $1.90 | |
| Optimized for programming and software development tasks | qwen | 262K | $0.06 | $0.02 | $0.25 | |
| Qwen embedding model for semantic search, retrieval, and RAG pipelines. | qwen | 32K | $0.01 | $0.00 | $0.00 |
Select a model, enter your monthly token usage, and see your estimated cost compared to OpenAI GPT-4o. All models run on EU-sovereign infrastructure with zero data retention.