Inloggen Demo plannen Aan de slag

Product

EU Router HostYourAI Code OpenAI-compatible API Anthropic-compatible API Model Garden Dedicated Instances Playground Fine-tuning (Loes)Connect je GPU-pool

Oplossingen

Use cases

HostYourAI Code LLM Inference RAG pipelines Chatbots AI agents Fine-tuning

Sectoren

Overheid Zorg Finance Juridisch

Modellen

DeepSeek V4 Pro DeepSeek V4 Flash GLM 5.2 Llama 3.1 405B Qwen3.5 397B Llama 3.3 70B Mistral DeepSeek R1 Alle modellen →

Vergelijk

Azure OpenAI AWS Bedrock Claude API ChatGPT OpenAI

Resources

Documentatie Gids: migreren naar de EU Router Gids: eigen LLM deployen (vLLM)Gids: RAG bouwen op EU-GPUs Modelcatalogus

Prijzen

NL EN DE

Inloggen Demo plannen Aan de slag

Model garden Router · beschikbaar Dedicated · beschikbaar

DeepSeek R1 Distill Qwen 32B

Name: DeepSeek R1 Distill Qwen 32B hosting (EU)
Brand: HostYourAI
Price: 0.15 EUR
Availability: InStock

Direct via de EU-router of als dedicated GPU-deployment. Data blijft in Europa.

We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning. With R...

Start gratis ← Alle modellen

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B vLLM ready

text->text · deepseek-ai · EU-hosted

33B

Parameters

131K

Contextvenster

80GB

Minimale VRAM

POST /api/v1/chat/completions 200 OK

Specificaties

Parameters 33B

Contextvenster 131,072 tokens

Minimale VRAM 80 GB

Architectuur Qwen2ForCausalLM (vLLM)

Licentie mit

Modaliteit text->text

Uitgebracht January 2025

Uitgever deepseek-ai ↗

Prijzen

Gedeelde router · per token

€0.15

Input (per 1M tokens)

€0.40

Output (per 1M tokens)

Dedicated GPU · per uur

vanaf €3,58 per uur

Eigen vLLM-instance op Europese cloud (80 GB VRAM), per uur afgerekend.

Gedeelde EU-router, pay-per-token, scale-to-zero. Dedicated GPU-deployments worden per uur afgerekend, zie prijzen.

✓ Werkend geverifieerd op 16-07-2026, respons in 1323 ms op onze EU-infrastructuur.

Direct aanroepen

Drop-in vervanger voor OpenAI: wijzig alleen de base-URL en de API-key. Ook het Anthropic-formaat (/v1/messages) wordt ondersteund.

curl https://hostyourai.com/api/v1/chat/completions \
  -H "Authorization: Bearer hyai-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Veelgestelde vragen

Kan ik DeepSeek R1 Distill Qwen 32B in de EU draaien?

Ja. HostYourAI draait DeepSeek R1 Distill Qwen 32B op GPU's in Europese datacenters via vLLM. Prompts en outputs verlaten de EU niet en er is geen Amerikaanse cloudprovider in de keten.

Is DeepSeek R1 Distill Qwen 32B hosten AVG/GDPR-compliant?

Ja. Alle verwerking vindt plaats binnen de EU, er is een verwerkersovereenkomst (DPA) beschikbaar en de subprocessor-lijst is openbaar. Open-source gewichten betekenen ook: geen training op jouw data.

Wat kost DeepSeek R1 Distill Qwen 32B?

Via de gedeelde EU-router betaal je €0.15 per miljoen input-tokens en €0.40 per miljoen output-tokens, zonder vaste kosten. Voor hoge volumes of isolatie kun je DeepSeek R1 Distill Qwen 32B ook als dedicated GPU-instance per uur draaien.

Is de API compatibel met OpenAI?

Ja. Je gebruikt de standaard OpenAI-SDK's met een aangepaste base-URL (https://hostyourai.com/api/v1). Ook de Anthropic Messages API wordt ondersteund als drop-in.

Andere modellen van DeepSeek

DeepSeek V4 Pro

Note: DeepSeek-V4-Pro-DSpark is not a new model. It is the same checkpoint with an additional speculative decoding module attached. A minimal inference example is available in the inference folder. For more details, refer to: https://github.com/deepseek-ai/DeepSpec

1M context Bekijk model →

DeepSeek V4 Flash

Note: DeepSeek-V4-Flash-DSpark is not a new model. It is the same checkpoint with an additional speculative decoding module attached. A minimal inference example is available in the inference folder. For more details, refer to: https://github.com/deepseek-ai/DeepSpec

1M context Bekijk model →

DeepSeek V4 Pro

We present a preview version of DeepSeek-V4 series, including two strong Mixture-of-Experts (MoE) language models — DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated) — both supporting a context length of one million tokens.

1M context Bekijk model →

DeepSeek V4 Flash

158B 1M context Bekijk model →

DeepSeek OCR 2

Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.12.9 + CUDA11.8：

3.4B 8K context Bekijk model →

DeepSeek V3.2

We introduce DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance. Our approach is built upon three key technical breakthroughs:

164K context Bekijk model →

Probeer DeepSeek R1 Distill Qwen 32B gratis

Account aanmaken duurt een minuut. Test DeepSeek R1 Distill Qwen 32B direct in de playground.

Start gratis