HostYourAI

Phi 3 medium 128k instruct hosting in the EU — OpenAI-compatible API

Instantly via the EU router or as a dedicated GPU deployment. Data stays in Europe.

✓ Verified working on 10-06-2026 — responded in 2174 ms on our EU infrastructure.

🎉 Phi-3.5: [[mini-instruct]](https://huggingface.co/microsoft/Phi-3.5-mini-instruct); [[MoE-instruct]](https://huggingface.co/microsoft/Phi-3.5-MoE-instruct) ; [[vision-instruct]](https://huggingface.co/microsoft/Phi-3.5-vision-instruct)

Specifications

Parameters14B
Context window131,072 tokens
Minimum VRAM33 GB
ArchitecturePhi3ForCausalLM (vLLM)
Licensemit
Modalitytext->text
ReleasedMay 2024
Publishermicrosoft (Hugging Face)

Pricing

Input (per 1M tokens)€ 0.15
Output (per 1M tokens)€ 0.25

Shared EU router, pay-per-token, scale-to-zero. Dedicated GPU deployments are billed hourly — see pricing.

Call it now

Drop-in replacement for OpenAI: change only the base URL and API key. The Anthropic format (/v1/messages) is supported too.

curl https://hostyourai.com/api/v1/chat/completions \
  -H "Authorization: Bearer hyai-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "microsoft/Phi-3-medium-128k-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Frequently asked questions

Can I run Phi 3 medium 128k instruct in the EU?

Yes. HostYourAI runs Phi 3 medium 128k instruct on GPUs in European datacenters via vLLM. Prompts and outputs never leave the EU and there is no US cloud provider in the chain.

Is hosting Phi 3 medium 128k instruct GDPR-compliant?

Yes. All processing happens inside the EU, a Data Processing Agreement (DPA) is available and the subprocessor list is public. Open-source weights also mean: no training on your data.

How much does Phi 3 medium 128k instruct cost?

Via the shared EU router you pay €0.15 per million input tokens and €0.25 per million output tokens, with no fixed costs. For high volume or isolation you can also run Phi 3 medium 128k instruct as a dedicated hourly GPU instance.

Is the API OpenAI-compatible?

Yes. You use the standard OpenAI SDKs with a custom base URL (https://hostyourai.com/api/v1). The Anthropic Messages API is supported as a drop-in as well.

Try Phi 3 medium 128k instruct for free

Creating an account takes a minute. Test Phi 3 medium 128k instruct straight away in the playground.

Start for free

← All models