TCO Calculator Expert
Compare Cloud API, Cloud GPU Rental, and On-Premise deployments with real-time hardware validation and cost breakdown. Built for senior architects making production decisions.
Start here
Three steps, one clean comparison
- 01 Set the workload once. Enter queries and tokens in Cloud API. The same demand syncs across Cloud GPU and On-Premise.
- 02 Align the model pair. Select the provider and the open-source match. VRAM sizing and GPU validation update automatically.
- 03 Review results. Check the 3-year chart, breakeven, and recommendations before exporting.
Quick profiles
Pick a profile to prefill everything
Profiles sync across all tabs and can be edited anytime.
LLM-based knowledge assistant, 500K queries/month.
- Claude 3.7 Sonnet with Llama 3.3 70B FP8 baseline.
- Compares a Cloud GPU H100 commitment against dual H200 racks.
Streaming telemetry, 1.2M queries/month.
- High volume, GPT-4o paired with DeepSeek 67B.
- Auto-suggests 4x H100 or MI300X sizing.
80K queries/month, strict sovereignty.
- Gemini 1.5 Pro paired with Qwen 32B.
- On-Prem discounts removed, opex increased.
Cloud API Configuration
Guided view hides advanced assumptions. Toggle above for full detail.
📊 Workload
💰 Pricing
💵 Cost Summary
Cloud GPU Configuration
📊 Workload
- Throughput: 480 tokens/sec
- Max queries/hour: 1,570
- GPU utilization: 87%
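The throughput and max queries/hour figures above are linked by the average number of tokens per query. The sketch below shows one plausible way to derive the hourly ceiling. The function name and the 1,100 tokens/query value are assumptions for illustration, not the calculator's documented internals, and GPU utilization is not modeled here.

```python
import math


def max_queries_per_hour(tokens_per_sec: float, tokens_per_query: float) -> int:
    """Whole queries servable per hour at a sustained token throughput."""
    if tokens_per_query <= 0:
        raise ValueError("tokens_per_query must be positive")
    tokens_per_hour = tokens_per_sec * 3600
    return math.floor(tokens_per_hour / tokens_per_query)


# 480 tokens/sec comes from the panel above.
# 1,100 tokens/query is an assumed average (input + output tokens).
print(max_queries_per_hour(480, 1100))  # 1570
```

With these inputs the result matches the 1,570 queries/hour shown in the panel, which suggests an average query size of about 1,100 tokens.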
🤖 Model Selection
- Model weights: 70 GB
- KV cache: 18 GB
- Safety margin (20%): 18 GB
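The three figures above add up as weights plus KV cache plus a 20% margin on their sum: 70 + 18 = 88 GB, and 20% of 88 GB is 17.6 GB, shown rounded to 18 GB. A minimal sketch of that sizing follows. The function names are hypothetical, and the GPU count ignores tensor-parallel overhead and activation memory, so treat it as a lower bound.

```python
import math


def required_vram_gb(weights_gb: float, kv_cache_gb: float,
                     safety_margin: float = 0.20) -> float:
    """Weights + KV cache, plus a safety margin applied to their sum."""
    base = weights_gb + kv_cache_gb
    return base * (1 + safety_margin)


def min_gpus(total_vram_gb: float, vram_per_gpu_gb: float) -> int:
    """Smallest GPU count whose combined memory covers the requirement."""
    return math.ceil(total_vram_gb / vram_per_gpu_gb)


total = required_vram_gb(70, 18)   # values from the panel above
print(round(total, 1))             # total requirement in GB
print(min_gpus(total, 80))         # 80 GB per GPU, e.g. an H100 80GB
```

Under these assumptions the model needs about 105.6 GB, which does not fit on a single 80 GB card.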
🖥️ Hardware
💾 Storage & Networking
💵 Cost Summary
On-Premise Configuration
🤖 Model Selection
- Model weights: 70 GB
- KV cache: 18 GB
- Safety margin (20%): 18 GB
🖥️ Hardware Capex
⚡ Power & Cooling
Total TDP: 0 W × PUE × hours (energy in Wh)
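The TDP × PUE × hours product gives energy drawn at the facility level. Turning it into a cost also needs an electricity price. The sketch below assumes a flat price per kWh and constant draw at full TDP. The function name, the 4 × 700 W load, the PUE of 1.4, and the €0.25/kWh rate are all illustrative assumptions.

```python
HOURS_PER_YEAR = 8760  # 365 days x 24 hours


def annual_power_cost(total_tdp_w: float, pue: float,
                      price_per_kwh: float,
                      hours: float = HOURS_PER_YEAR) -> float:
    """Energy cost for a constant IT load, scaled by PUE for cooling overhead."""
    if pue < 1.0:
        raise ValueError("PUE cannot be below 1.0")
    energy_kwh = (total_tdp_w / 1000) * pue * hours
    return energy_kwh * price_per_kwh


# Assumed example: four 700 W accelerators, PUE 1.4, EUR 0.25 per kWh.
cost = annual_power_cost(total_tdp_w=4 * 700, pue=1.4, price_per_kwh=0.25)
print(f"EUR {cost:,.2f} per year")
```

Real draw is usually below TDP, so this figure is an upper estimate for the GPUs alone and excludes CPUs, networking, and storage.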
🔧 Operational Costs (Annual)
💵 Cost Summary
📈 3-Year TCO Comparison
| Metric | Cloud API | Cloud GPU | On-Premise |
|---|---|---|---|
| Monthly Cost | €0 | €0 | €0 |
| 3-Year TCO | €0 | €0 | €0 |
| Breakeven vs API | - | - | - |
| ROI over 3 years | - | - | - |
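The rows of this table can be reproduced with simple arithmetic once each option has an upfront cost and a monthly cost. The sketch below is one possible formulation, not the calculator's confirmed logic: the function names are hypothetical, costs are assumed flat over time, and ROI is defined here as savings against the API divided by the option's own 3-year TCO.

```python
import math
from typing import Optional


def three_year_tco(capex: float, monthly_cost: float, months: int = 36) -> float:
    """Upfront spend plus recurring cost over the horizon."""
    return capex + monthly_cost * months


def breakeven_month(capex: float, monthly_cost: float,
                    api_monthly_cost: float) -> Optional[int]:
    """First month in which cumulative savings vs the API cover the capex."""
    monthly_savings = api_monthly_cost - monthly_cost
    if monthly_savings <= 0:
        return None  # the option never catches up with the API
    return math.ceil(capex / monthly_savings)


def roi_vs_api(capex: float, monthly_cost: float,
               api_monthly_cost: float, months: int = 36) -> float:
    """Savings against the API as a fraction of the option's own TCO."""
    option_tco = three_year_tco(capex, monthly_cost, months)
    api_tco = three_year_tco(0.0, api_monthly_cost, months)
    return (api_tco - option_tco) / option_tco


# Assumed figures: EUR 120,000 capex, EUR 4,000/month on-premise opex,
# EUR 10,000/month Cloud API spend.
print(three_year_tco(120_000, 4_000))           # 264000
print(breakeven_month(120_000, 4_000, 10_000))  # 20
print(round(roi_vs_api(120_000, 4_000, 10_000), 3))
```

A breakeven of `None` corresponds to the "-" shown in the table: the option has no payback point against the API within the model.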