Inloggen Demo plannen Aan de slag

Product

EU Router HostYourAI Code OpenAI-compatible API Anthropic-compatible API Model Garden Dedicated Instances Playground Fine-tuning (Loes)Connect je GPU-pool

Oplossingen

Use cases

HostYourAI Code LLM Inference RAG pipelines Chatbots AI agents Fine-tuning

Sectoren

Overheid Zorg Finance Juridisch

Modellen

DeepSeek V4 Pro DeepSeek V4 Flash GLM 5.2 Llama 3.1 405B Qwen3.5 397B Llama 3.3 70B Mistral DeepSeek R1 Alle modellen →

Vergelijk

Azure OpenAI AWS Bedrock Claude API ChatGPT OpenAI

Resources

Documentatie Gids: migreren naar de EU Router Gids: eigen LLM deployen (vLLM)Gids: RAG bouwen op EU-GPUs Modelcatalogus

Prijzen

NL EN DE

Inloggen Demo plannen Aan de slag

Model garden Router · beschikbaar Dedicated · beschikbaar

Phi 4 Mini (3.8B)

Name: Phi 4 Mini (3.8B) hosting (EU)
Brand: HostYourAI
Price: 0.03 EUR
Availability: InStock

Direct via de EU-router of als dedicated GPU-deployment. Data blijft in Europa.

Phi 4 Mini (3.8B) is een open-source taalmodel van Microsoft met 3.8B parameters en een contextvenster van 131K tokens, gehost op Europese GPU's via een OpenAI-compatibele API.

Start gratis ← Alle modellen

phi-4-mini vLLM ready

text->text · microsoft · EU-hosted

3.8B

Parameters

131K

Contextvenster

8GB

Minimale VRAM

POST /api/v1/chat/completions 200 OK

Specificaties

Parameters 3.8B

Contextvenster 131,072 tokens

Minimale VRAM 8 GB

Architectuur Phi3ForCausalLM (vLLM)

Licentie open-weights

Modaliteit text->text

Uitgever microsoft ↗

Prijzen

Gedeelde router · per token

€0.03

Input (per 1M tokens)

€0.06

Output (per 1M tokens)

Dedicated GPU · per uur

vanaf €1,16 per uur

Eigen vLLM-instance op Europese cloud (8 GB VRAM), per uur afgerekend.

Gedeelde EU-router, pay-per-token, scale-to-zero. Dedicated GPU-deployments worden per uur afgerekend, zie prijzen.

✓ Werkend geverifieerd op 15-07-2026, respons in 679 ms op onze EU-infrastructuur.

Direct aanroepen

Drop-in vervanger voor OpenAI: wijzig alleen de base-URL en de API-key. Ook het Anthropic-formaat (/v1/messages) wordt ondersteund.

curl https://hostyourai.com/api/v1/chat/completions \
  -H "Authorization: Bearer hyai-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "phi-4-mini",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Veelgestelde vragen

Kan ik Phi 4 Mini (3.8B) in de EU draaien?

Ja. HostYourAI draait Phi 4 Mini (3.8B) op GPU's in Europese datacenters via vLLM. Prompts en outputs verlaten de EU niet en er is geen Amerikaanse cloudprovider in de keten.

Is Phi 4 Mini (3.8B) hosten AVG/GDPR-compliant?

Ja. Alle verwerking vindt plaats binnen de EU, er is een verwerkersovereenkomst (DPA) beschikbaar en de subprocessor-lijst is openbaar. Open-source gewichten betekenen ook: geen training op jouw data.

Wat kost Phi 4 Mini (3.8B)?

Via de gedeelde EU-router betaal je €0.03 per miljoen input-tokens en €0.06 per miljoen output-tokens, zonder vaste kosten. Voor hoge volumes of isolatie kun je Phi 4 Mini (3.8B) ook als dedicated GPU-instance per uur draaien.

Is de API compatibel met OpenAI?

Ja. Je gebruikt de standaard OpenAI-SDK's met een aangepaste base-URL (https://hostyourai.com/api/v1). Ook de Anthropic Messages API wordt ondersteund als drop-in.

Andere modellen van Microsoft

GELab Zero 4B preview Sico Evolution

GELab Zero 4B preview Sico Evolution is een multimodaal taalmodel van Microsoft met 4.4B parameters, gehost op Europese GPU's via een OpenAI-compatibele API.

4.4B Bekijk model →

X Reasoner 7B

We introduce X-Reasoner, a vision-language model posttrained solely on general-domain text for generalizable reasoning, using a twostage approach: an initial supervised fine-tuning phase with distilled long chainof-thoughts, followed by reinforcement learning with verifiable rewards. Experiments show that X-Reasoner successfully transfers reasoning capabilities to both multimodal and out-of-domain settings, outperforming existing state-of-theart models trained with in-domain and multimodal data across various general and medical benchmarks. More details can be found in the paper: X-Reasoner: T

8.3B 128K context Bekijk model →

OptiMind SFT

OptiMind-SFT is a specialized 20B parameter model designed to bridge the gap between natural language and executable optimization solvers. It automates the translation of complex decision-making problems—such as supply chain planning, scheduling, and resource allocation—into correct MILP formulations.

21B 131K context Bekijk model →

Fara 7B

Description: Fara-7B is Microsoft's first agentic small language model (SLM) designed specifically for computer use. With only 7 billion parameters, Fara-7B is an ultra-compact Computer Use Agent (CUA) that achieves state-of-the-art performance within its size class and is competitive with larger, more resource-intensive agentic systems.

8.3B 128K context Bekijk model →

UserLM 8b

Unlike typical LLMs that are trained to play the role of the "assistant" in conversation, we trained UserLM-8b to simulate the “user” role in conversation (by training it to predict user turns in a large corpus of conversations called WildChat). This model is useful in simulating more realistic conversations, which is in turn useful in the development of more robust assistants.

8B 8K context Bekijk model →

MediPhi Instruct

The MediPhi Model Collection comprises 7 small language models of 3.8B parameters from the base model Phi-3.5-mini-instruct specialized in the medical and clinical domains. The collection is designed in a modular fashion. Five MediPhi experts are fine-tuned on various medical corpora (i.e. PubMed commercial, Medical Wikipedia, Medical Guidelines, Medical Coding, and open-source clinical documents) and merged back with the SLERP method in their base model to conserve general abilities. One model combined all five experts into one general expert with the multi-model merging method BreadCrumbs. F

3.8B 131K context Bekijk model →

Probeer Phi 4 Mini (3.8B) gratis

Account aanmaken duurt een minuut. Test Phi 4 Mini (3.8B) direct in de playground.

Start gratis