Pricing · HostYourAI

HostYourAI offers three execution modes side by side on a single account and credit balance. Start with EU Hosted inference, move to dedicated capacity when the workload or compliance profile requires it.

1. EU Hosted Gateway: pay per token

One OpenAI-compatible API key, model catalog, hyai/auto, and scale-to-zero shared capacity. Best for SaaS integrations, agencies, agent apps, and experimentation. Indicative tariffs (EUR per million tokens):

Model class	Input €/M	Output €/M
Tiny (≤2B)	0.02	0.05
Small (≤4B)	0.03	0.08
8B class (Llama 3.1 8B, Qwen3 8B)	0.05	0.12
12–14B class (Qwen3 14B, Phi-4)	0.06	0.15
30B class, quantised (Qwen3 32B FP8, Mistral Small)	0.08	0.25
32B fp16 / 70B quantised	0.15	0.40
70B fp16 (Llama 3.3 70B)	Dedicated deployment, on request
Large MoE (DeepSeek V4 Flash, GLM, Qwen 235B+)	1.20	3.40

Current beta framing: EU Hosted means EU-located GPU processing with shared router capacity. EU Sovereignty Mode is sold separately once a fully EU-sovereign provider chain, DPA, subprocessors, audit export, and support-access controls are active.

2. Dedicated EU Deployment: billed per minute

You pick a GPU class and region, deploy your own vLLM instance, and pay for as long as it runs. Best for custom Hugging Face models, BYOK upstreams, steady high-volume workloads, or when you need full control over the deployment.

The prices below are hourly rates, but you are billed per minute, with no rounding up to the full hour. Stop an instance after six minutes and you pay for six minutes.

GPU class	Typical use	From (EUR / hr)
1x L40S / RTX 4090 (24-48 GB)	Models up to ~20B	€ 2.22
1x RTX PRO 6000 (96 GB)	Models up to ~32B	€ 2.04
1x A100 / H100 (80 GB)	Models up to ~70B quantised	€ 3.58
2x RTX PRO 6000 (192 GB)	70B fp16 / high throughput	€ 5.83
2x H100 (160 GB)	70B fp16 / HA	€ 7.16
4x H100 (320 GB)	Large models / high throughput	€ 12.89
8x H100 (640 GB)	Dense 405B quantised / large MoE	€ 21.48
8x B200 (1.4 TB)	Frontier MoE, max headroom	€ 54.00

These are the cheapest EU offers our providers had when this page was last refreshed (26 Jul 2026). GPU classes with no EU availability are left out rather than quoted. The exact price for each offer is shown before you deploy.

3. Private single-tenant: on request

Need an isolated runtime with dedicated GPUs per customer, at-rest encryption, and a private network policy? For healthcare, government, legal, finance, and workloads that cannot use shared capacity, we scope and price this per project. The configurations below are typical starting points, not a self-serve product.

Configuration	VRAM	Indicative / month	Setup (one-off)
1× L40S	48 GB	from € 1,200	€ 500
1× H100	80 GB	from € 3,500	€ 1,000
2× H100	160 GB	from € 6,500	€ 1,000
4× H100	320 GB	from € 12,500	€ 1,500

Indicative, scoped per project. Talk to us via /contact. Confidential computing (TEE) is on the roadmap; we will not price what we have not yet validated.

BYOK: bring your own API key

You can attach your own OpenAI, Anthropic, Google or Mistral API key to an instance. We forward your traffic to the upstream under your contract with them. BYOK currently carries no platform fee: you only pay your own provider. Useful for hybrid setups that mix EU-hosted open-weights with frontier closed models.

Getting started

Creating an account is free. No credit card to sign up.
Pay as you go from a single prepaid credit balance. No subscription, no minimum.
Top up with iDEAL, card or SEPA, then call the Router or deploy an instance.

Billing

Currency: EUR. VAT added where applicable; reverse-charge for EU B2B with valid VAT number.
Method: Stripe (credit card, iDEAL, SEPA direct debit). Invoices auto-issued from your dashboard.
Credits: top up in advance; balance is consumed by all three modes from a single pool.
Volume / partner tier: for €> 2 000 / month in tokens or one or more single-tenant deployments, we offer a partner tier with discounts, SLAs, and a dedicated technical contact. Contact info@hostyourai.com.

What's not on the price list

Bespoke procurement, custom contracts, NEN 7510 / BIO audit packages, white-label / reseller arrangements, and confidential-computing deployments are quoted per project. Talk to us via /contact.