Pricing
Pricing. Transparent. Scalable.
Four tiers from experiment to enterprise. All with full access to 27 LLMs, audit trail, and DK hosting. Subscription-based — no CAPEX.
Partitions
Pick your partition.
All tiers include full access to the multi-LLM stack. The differentiator is compute capacity and isolation.
Shared
For experiments and PoC
- Shared GPU capacity
- Access to all 27 LLMs
- 10M tokens included
- Audit trail and DK hosting
- Community support
Dedicated
For production
- Dedicated H100 GPUs
- All 27 LLMs + custom deploy
- 100M tokens included
- 99.5% SLA
- Priority support
- Fine-tuning hours included
Private Cloud
For regulated enterprise
- Dedicated multi-GPU partition
- Isolated network and VPN
- ISAE 3000 audit report included
- 99.9% SLA
- Named onboarding engineer
- Custom DPA and compliance review
Enterprise
Scalable, multi-node
- Multi-node DGX allocation
- Multi-region possible
- 24/7 on-call support
- Custom SLA
- Dedicated solutions team
- Air-gapped on-prem deployment possible
All prices are indicative placeholders. 12-month contract. Upgrade anytime. Contact us for final pricing.
Configure add-ons
Comparison
What you get per tier.
The key features per tier — so the choice is straightforward.
| Feature | Shared | Dedicated | Private Cloud | Enterprise |
|---|---|---|---|---|
| Access to 27 LLMs | ||||
| Dedicated H100 GPUs | ||||
| Custom model deployment | ||||
| SLA | ||||
| Isolated network and VPN | ||||
| ISAE 3000 audit report | ||||
| Named onboarding engineer | ||||
| Multi-region deployment | ||||
| Air-gapped on-prem |
Questions
Pricing & terms.
12 months by default. Billing monthly or annually. Enterprise agreements can be negotiated for 24 or 36 months with discount.
Yes. Upgrade from Shared to Dedicated or higher can happen anytime without code changes. API keys and endpoints are stable across tiers. Downgrade requires 30 days notice.
Compute, model access, audit trail, DK hosting, and included token volume. Add-ons (fine-tuning hours, custom deployment, ISAE 3000 report) are billed separately. Token overage is billed per model.
Yes. Shared tier has a free trial period with 10M tokens included. You get an API key the same day and can run against all 27 LLMs. Proof-of-concept migrates to Dedicated without code changes.