The GLM family welcomes new members, the GLM-4-32B-0414 series models, featuring 32 billion parameters. Its performance is comparable to OpenAI’s GPT series and DeepSeek’s V3/R1 series. It also supports very user-friendly local deployment features. GLM-4-32B-Base-0414 was pre-trained on 15T of high-quality data, including substantial reasoning-type synthetic data. This lays the foundation for subs...
Specifications
| Parameters | 9.4B |
|---|---|
| Context window | 32,768 tokens |
| Minimum VRAM | 22 GB |
| Architecture | Glm4ForCausalLM (vLLM) |
| License | mit |
| Modality | text->text |
| Released | April 2025 |
| Publisher | zai-org (Hugging Face) |
Pricing
| Input (per 1M tokens) | € 0.10 |
|---|---|
| Output (per 1M tokens) | € 0.18 |
Shared EU router, pay-per-token, scale-to-zero. Dedicated GPU deployments are billed hourly — see pricing.
Call it now
Drop-in replacement for OpenAI: change only the base URL and API key. The Anthropic format (/v1/messages) is supported too.
curl https://hostyourai.com/api/v1/chat/completions \
-H "Authorization: Bearer hyai-..." \
-H "Content-Type: application/json" \
-d '{
"model": "zai-org/GLM-4-9B-0414",
"messages": [{"role": "user", "content": "Hello!"}]
}'
Frequently asked questions
Can I run GLM 4 9B 0414 in the EU?
Yes. HostYourAI runs GLM 4 9B 0414 on GPUs in European datacenters via vLLM. Prompts and outputs never leave the EU and there is no US cloud provider in the chain.
Is hosting GLM 4 9B 0414 GDPR-compliant?
Yes. All processing happens inside the EU, a Data Processing Agreement (DPA) is available and the subprocessor list is public. Open-source weights also mean: no training on your data.
How much does GLM 4 9B 0414 cost?
Via the shared EU router you pay €0.10 per million input tokens and €0.18 per million output tokens, with no fixed costs. For high volume or isolation you can also run GLM 4 9B 0414 as a dedicated hourly GPU instance.
Is the API OpenAI-compatible?
Yes. You use the standard OpenAI SDKs with a custom base URL (https://hostyourai.com/api/v1). The Anthropic Messages API is supported as a drop-in as well.
Creating an account takes a minute. Test GLM 4 9B 0414 straight away in the playground.
Start for free