HostYourAI

Product

EU Router HostYourAI Code OpenAI-compatible API Anthropic-compatible API Model Garden Dedicated Instances Playground Fine-tuning (Loes)Connect your GPU pool

Solutions

Use cases

HostYourAI Code LLM Inference RAG pipelines Chatbots AI agents Fine-tuning

Industries

Government Healthcare Finance Legal

Models

DeepSeek V4 Pro DeepSeek V4 Flash GLM 5.2 Llama 3.1 405B Qwen3.5 397B Llama 3.3 70B Mistral DeepSeek R1 All models →

Compare

Azure OpenAI AWS Bedrock Claude API ChatGPT OpenAI

Resources

Documentation Guide: migrate to the EU Router Guide: deploy your own LLM (vLLM)Guide: build RAG on EU GPUs Model catalog

Pricing

NL EN DE

Model garden Router · on request Dedicated · on request

Llama Guard 3 11B Vision

Name: Llama Guard 3 11B Vision hosting (EU)
Brand: HostYourAI
Price: 0.06 EUR
Availability: LimitedAvailability

This model runs as a dedicated deployment on large GPUs and isn't in the shared playground by default. Get in touch and we'll set it up for you.

Llama Guard 3 11B Vision is an multimodal language model from Meta with 11B parameters, hosted on EU GPUs via an OpenAI-compatible API.

Request access ← All models

meta-llama/Llama-Guard-3-11B-Vision On request

text+image->text · meta-llama · EU-hosted

11B

Parameters

—

Context window

32GB

Minimum VRAM

POST /api/v1/chat/completions On request

Specifications

Parameters 11B

Minimum VRAM 32 GB

Architecture MllamaForConditionalGeneration (vLLM)

License llama3.2

Modality text+image->text

Released September 2024

Publisher meta-llama ↗

Pricing

Shared router · per token

On request

Not available on the shared router. Pricing on request as a dedicated GPU deployment.

Dedicated GPU · per hour

On request

Dedicated deployment, from 32 GB of VRAM. Billed per GPU-hour.

Shared EU router, pay-per-token, scale-to-zero. Dedicated GPU deployments are billed hourly, see pricing.

Call it now

Drop-in replacement for OpenAI: change only the base URL and API key. The Anthropic format (/v1/messages) is supported too.

curl https://hostyourai.com/api/v1/chat/completions \
  -H "Authorization: Bearer hyai-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "meta-llama/Llama-Guard-3-11B-Vision",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Frequently asked questions

Can I run Llama Guard 3 11B Vision in the EU?

Yes. HostYourAI runs Llama Guard 3 11B Vision on GPUs in European datacenters via vLLM. Prompts and outputs never leave the EU and there is no US cloud provider in the chain.

Is hosting Llama Guard 3 11B Vision GDPR-compliant?

Yes. All processing happens inside the EU, a Data Processing Agreement (DPA) is available and the subprocessor list is public. Open-source weights also mean: no training on your data.

How much does Llama Guard 3 11B Vision cost?

Via the shared EU router you pay €0.06 per million input tokens and €0.15 per million output tokens, with no fixed costs. For high volume or isolation you can also run Llama Guard 3 11B Vision as a dedicated hourly GPU instance.

Is the API OpenAI-compatible?

Yes. You use the standard OpenAI SDKs with a custom base URL (https://hostyourai.com/api/v1). The Anthropic Messages API is supported as a drop-in as well.

More models from Meta

Llama Guard 4 12B

Llama Guard 4 12B is an multimodal language model from Meta with 12B parameters, hosted on EU GPUs via an OpenAI-compatible API.

12B View model →

Llama 4 Maverick 17B 128E

Llama 4 Maverick 17B 128E is an open-source language model from Meta with 17B parameters, hosted on EU GPUs via an OpenAI-compatible API.

17B View model →

Llama 4 Scout 17B 16E

Llama 4 Scout 17B 16E is an multimodal language model from Meta with 109B parameters, hosted on EU GPUs via an OpenAI-compatible API.

109B View model →

Llama 4 Scout 17B 16E Instruct

Llama 4 Scout 17B 16E Instruct is an multimodal language model from Meta with 109B parameters, hosted on EU GPUs via an OpenAI-compatible API.

109B View model →

Llama 4 Maverick 17B 128E Instruct

Llama 4 Maverick 17B 128E Instruct is an open-source language model from Meta with 17B parameters, hosted on EU GPUs via an OpenAI-compatible API.

17B View model →

Llama 4 Maverick 17B 128E Instruct FP8

Llama 4 Maverick 17B 128E Instruct FP8 is an open-source language model from Meta with 17B parameters, hosted on EU GPUs via an OpenAI-compatible API.

17B View model →

Request access

Llama Guard 3 11B Vision isn't available by default yet. Leave your details and we'll arrange a dedicated deployment.

Request access