EU Inference Router

One base URL for all your open models, served from the EU on infrastructure you control.

Diagram: your app (OpenAI · Anthropic client) → EU Router (one base URL) → Qwen3-8B (warm), Loes (NL, sovereign), Llama-3.3 (warm)

What is the EU Inference Router?

The Router is a shared, OpenAI-compatible inference gateway. You point your existing client at a single base URL and the Router sends each request to an open model running on European GPUs. You change only the base URL and the API key; your code stays the same.

Because the Router speaks both the OpenAI and Anthropic APIs, it works directly with the SDKs and tools your team already uses. No rewrite, no vendor lock-in.
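As a rough illustration of the Anthropic-compatible side, the sketch below builds (but does not send) a request in the shape of the Anthropic Messages API using only the Python standard library. The header names, the `max_tokens` field and the `2023-06-01` version string come from the Anthropic API. The base URL is taken from the OpenAI example on this page, and the assumption that the Router serves a `/messages` route under it is not confirmed by this page, so check the documentation before relying on it.

```python
import json
import urllib.request

# Assumption: the Anthropic-style route lives under the same base URL
# that this page uses for the OpenAI client.
BASE_URL = "https://api.hostyour.ai/v1"


def build_messages_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an Anthropic Messages API style request."""
    body = {
        "model": model,
        "max_tokens": 256,  # required field in the Anthropic Messages API
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/messages",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "x-api-key": api_key,
            "anthropic-version": "2023-06-01",
            "content-type": "application/json",
        },
        method="POST",
    )


req = build_messages_request("hyai_...", "llama-3.3-70b", "Hello!")
print(req.get_method(), req.full_url)
# To actually send it, call urllib.request.urlopen(req) with a real key.
```

In practice you would more likely keep using the Anthropic SDK and change only its base URL and key; the raw request is shown here only to make the wire format visible.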

EU Inference Router

98.7% ↗ 12%

4,931 of 5,000 requests served warm

EU-hosted: models run on European GPUs

Drop-in: OpenAI and Anthropic compatible

Scale to zero: GPUs idle when nobody is online

How does it work?

A request hits the Router, is authenticated with your API key, and is forwarded to a model that is already warm. Responses stream back the same way they do from the OpenAI API. Popular models are kept in a warm pool, so a first request does not wait on a cold start.

Every request records usage, latency and cost in your activity log, so you see exactly what happens.

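from openai import OpenAI

# Point the standard OpenAI client at the Router instead of api.openai.com.
client = OpenAI(
    base_url="https://api.hostyour.ai/v1",
    api_key="hyai_...",
)
response = client.chat.completions.create(
    model="llama-3.3-70b",
    messages=[{"role": "user", "content": "Hallo!"}],
)
print(response.choices[0].message.content)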
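
The request above returns the whole answer at once. Since responses are said to stream like OpenAI's, here is a minimal sketch of reading an OpenAI-style stream: server-sent event lines prefixed with `data:`, each carrying a JSON chunk, ending with a `[DONE]` sentinel. The sample lines are hand-written for illustration and are not real Router output; the helper name `iter_stream_text` is hypothetical.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style server-sent event lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and anything that is not data
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text


# Hand-written sample in the OpenAI streaming format (an assumption about
# what the Router emits, based on its stated OpenAI compatibility).
sample = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "Hal"}}]}',
    'data: {"choices": [{"delta": {"content": "lo!"}}]}',
    "data: [DONE]",
]
print("".join(iter_stream_text(sample)))  # prints "Hallo!"
```

With the OpenAI Python SDK you would not parse these lines yourself: passing `stream=True` to `client.chat.completions.create` returns an iterator of already-parsed chunks.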

Why route through it?

Drop-in OpenAI- and Anthropic-compatible: only the base URL changes
Open models on European GPUs you control
Your prompts and data never leave the EU
Pay-as-you-go per token on one prepaid credit balance
Per-request insight into usage, latency and cost
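
To make per-token billing concrete, the sketch below estimates the cost of one request from its token counts. The field names `prompt_tokens` and `completion_tokens` match the `usage` object in OpenAI-style responses. The prices are hypothetical placeholders, not HostYourAI's price list, and the activity log remains the authoritative record.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    prompt_tokens: int
    completion_tokens: int


# Hypothetical prices in euros per million tokens, for illustration only.
PRICE_PER_M_INPUT = 0.50
PRICE_PER_M_OUTPUT = 1.50


def estimate_cost_eur(usage: Usage) -> float:
    """Estimate the cost of one request from its token counts."""
    input_cost = usage.prompt_tokens / 1_000_000 * PRICE_PER_M_INPUT
    output_cost = usage.completion_tokens / 1_000_000 * PRICE_PER_M_OUTPUT
    return input_cost + output_cost


usage = Usage(prompt_tokens=1_200, completion_tokens=300)
print(f"{estimate_cost_eur(usage):.6f} EUR")  # prints "0.001050 EUR"
```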

Everything you need for AI

From model hosting to a customer-facing API, HostYourAI is built for developers and businesses who want their AI running on infrastructure they actually control, inside the EU.

100%

EU-hosted

Your data and your models stay on European GPUs. GDPR-friendly by design.

200+

Verified models, ready to serve

Llama, Qwen, DeepSeek, Mistral, FLUX and plenty more. Pick one and it is warm in minutes, with no DevOps on your end.

2 SDKs

OpenAI & Anthropic compatible

Point your existing client at the Router and keep your tools. No rewrite, no lock-in.

From zero to a warm endpoint in minutes

No infra to manage. Pick a model, get an OpenAI-compatible URL, ship.

Pick a model

Choose from the Model Garden or paste any Hugging Face model ID. Set the VRAM and pick an EU GPU.

Get your endpoint

We deploy vLLM, run readiness probes, and hand you a warm OpenAI- and Anthropic-compatible URL plus an API key.

Route and ship

Point your client at the Router. It auto-routes to a warm instance, idles GPUs when nobody is online, and logs every request.
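
The routing behaviour described in these steps (serve from a warm instance, idle GPUs after a quiet period, pay a cold start on the next request) can be pictured with a toy model. This is not HostYourAI's implementation; the class names, the 300-second timeout and the single-instance-per-model simplification are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Instance:
    model: str
    last_used: float
    warm: bool = True


@dataclass
class ToyRouter:
    idle_timeout: float  # seconds without traffic before an instance is idled
    instances: dict[str, Instance] = field(default_factory=dict)

    def route(self, model: str, now: float) -> str:
        """Return 'warm' if a warm instance served the request, else 'cold'."""
        inst = self.instances.get(model)
        if inst is not None and inst.warm:
            inst.last_used = now
            return "warm"
        # Cold start: bring up a fresh instance, then serve the request.
        self.instances[model] = Instance(model=model, last_used=now)
        return "cold"

    def idle_sweep(self, now: float) -> None:
        """Mark instances with no recent traffic as idle (scale to zero)."""
        for inst in self.instances.values():
            if now - inst.last_used > self.idle_timeout:
                inst.warm = False


router = ToyRouter(idle_timeout=300)
print(router.route("llama-3.3-70b", now=0))    # cold: nothing is running yet
print(router.route("llama-3.3-70b", now=10))   # warm: instance is already up
router.idle_sweep(now=400)                     # 390 s without traffic > 300 s
print(router.route("llama-3.3-70b", now=401))  # cold: instance was idled
```

A warm pool, as described earlier on this page, would amount to exempting popular models from the idle sweep so their first request is served warm.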

Built for teams that can't send data away

If a US cloud is off the table, HostYourAI gives you the same developer experience on European infrastructure.

Public sector & government

Citizen data that legally has to stay in the EU, with full auditability.

Regulated enterprise

Finance, healthcare and legal teams under GDPR, DORA and the AI Act.

EU SaaS & scale-ups

Ship AI features your customers trust, without a US sub-processor.

Agencies & integrators

Deliver private AI for clients on infrastructure you can stand behind.

Private by Default

HostYourAI keeps your models, prompts and data on European GPUs. It is built for teams that care about compliance, reliability and real control.

EU-hosted · GDPR-friendly · OpenAI-compatible · vLLM-powered · No lock-in

Full data sovereignty

GPUs and data residency inside Europe. Your prompts never leave the EU.

Open

Models you can audit

Run open-weight models with no black boxes or hidden telemetry.

€0

Scale to zero

GPUs idle when nobody is online, so you only pay for what you run.

Yours

No vendor lock-in

Your infra, your keys, your models. Leave whenever you want.

Frequently asked questions

Can I run this in the EU?

Yes. HostYourAI runs open models on GPUs in European datacenters via vLLM. Your prompts and outputs never leave the EU and there is no US cloud provider in the chain.

Is it GDPR-compliant?

Yes. All processing happens inside the EU, a Data Processing Agreement (DPA) is available and the subprocessor list is public. Open weights also mean no training on your data.

Is the API OpenAI-compatible?

Yes. Point your existing OpenAI or Anthropic client at our Router (https://hostyourai.com/api/v1) and change only the base URL and API key. No rewrite, no lock-in.

What does it cost?

Pay-as-you-go on one prepaid credit balance: the shared router per token or a dedicated GPU per hour. Free to start, no minimum, no fixed monthly fee.

Model garden

Works with 100+ open models

Text and image models on dedicated EU GPUs. Every model tested on our own hardware.

Llama 3.3 70B, DeepSeek R1, Qwen 2.5 72B, Mistral 7B, Mixtral 8x22B, Gemma 2 27B, DeepSeek Coder, Qwen Coder 32B, CodeLlama 34B, Command R+

Host. Route. Ship.

No credit card required. Pay as you go, cancel anytime.

Start Hosting Free Today