Instance Tiers
Choose the right GPU instance for your model size and performance needs.Tier Overview
| Tier | GPU | VRAM | Price | Best For |
|---|---|---|---|---|
| XS | 1x L4 | 24GB | ~$0.20/h | 7B-13B models |
| S | 1x L40S | 48GB | ~$0.60/h | 13B-34B models |
| M | 4x A10G | 96GB | ~$1.80/h | 30B-70B INT4 |
| L | 4x L40S | 192GB | ~$3.50/h | 70B FP16 |
| XL | 8x A100 | 320-640GB | ~$12/h | 70B-180B |
| XXL | 8x H100/H200 | 640-1128GB | ~$20/h | 405B |
Tier XS - Entry Level
Best for small models (7B-13B parameters).GPU XS
| Spec | Value |
|---|---|
| GPU | 1x NVIDIA L4 |
| VRAM | 24GB |
| vCPUs | 4 |
| RAM | 16GB |
| Storage | 250GB NVMe |
| Network | 10 Gbps |
| Price | ~$0.20/h |
GPU XS (2x)
| Spec | Value |
|---|---|
| GPU | 1x NVIDIA L4 |
| VRAM | 24GB |
| vCPUs | 8 |
| RAM | 32GB |
| Storage | 450GB NVMe |
| Network | 15 Gbps |
| Price | ~$0.35/h |
Tier S - Small Production
Best for medium models (13B-34B parameters).GPU S
| Spec | Value |
|---|---|
| GPU | 1x NVIDIA L40S |
| VRAM | 48GB |
| vCPUs | 4 |
| RAM | 16GB |
| Storage | 250GB NVMe |
| Network | 10 Gbps |
| Price | ~$0.60/h |
GPU S (2x)
| Spec | Value |
|---|---|
| GPU | 1x NVIDIA L40S |
| VRAM | 48GB |
| vCPUs | 8 |
| RAM | 32GB |
| Storage | 450GB NVMe |
| Network | 15 Gbps |
| Price | ~$1.00/h |
Tier M - Medium Production
Best for large quantized models (30B-70B INT4).GPU M
| Spec | Value |
|---|---|
| GPU | 4x NVIDIA A10G |
| VRAM | 96GB |
| vCPUs | 48 |
| RAM | 192GB |
| Storage | 3.8TB NVMe |
| Network | 40 Gbps |
| Price | ~$1.80/h |
GPU M (2x)
| Spec | Value |
|---|---|
| GPU | 4x NVIDIA A10G |
| VRAM | 96GB |
| vCPUs | 96 |
| RAM | 384GB |
| Storage | 3.8TB NVMe |
| Network | 50 Gbps |
| Price | ~$3.00/h |
GPU M (4x)
| Spec | Value |
|---|---|
| GPU | 8x NVIDIA A10G |
| VRAM | 192GB |
| vCPUs | 192 |
| RAM | 768GB |
| Storage | 7.6TB NVMe |
| Network | 100 Gbps |
| Price | ~$5.00/h |
Tier L - Large Production
Best for full-precision large models (70B FP16).GPU L
| Spec | Value |
|---|---|
| GPU | 4x NVIDIA L40S |
| VRAM | 192GB |
| vCPUs | 48 |
| RAM | 384GB |
| Storage | 3.8TB NVMe |
| Network | 40 Gbps |
| Price | ~$3.50/h |
GPU L (2x)
| Spec | Value |
|---|---|
| GPU | 4x NVIDIA L40S |
| VRAM | 192GB |
| vCPUs | 96 |
| RAM | 768GB |
| Storage | 3.8TB NVMe |
| Network | 50 Gbps |
| Price | ~$6.00/h |
GPU L (4x)
| Spec | Value |
|---|---|
| GPU | 8x NVIDIA L40S |
| VRAM | 384GB |
| vCPUs | 192 |
| RAM | 1536GB |
| Storage | 7.6TB NVMe |
| Network | 100 Gbps |
| Price | ~$10.00/h |
Tier XL - Enterprise
Best for very large models (70B-180B).GPU XL
| Spec | Value |
|---|---|
| GPU | 8x NVIDIA A100 (40GB) |
| VRAM | 320GB |
| vCPUs | 96 |
| RAM | 1152GB |
| Storage | 8TB NVMe |
| Network | 400 Gbps EFA |
| Price | ~$12.00/h |
GPU XL (80GB)
| Spec | Value |
|---|---|
| GPU | 8x NVIDIA A100 (80GB) |
| VRAM | 640GB |
| vCPUs | 96 |
| RAM | 1152GB |
| Storage | 8TB NVMe |
| Network | 400 Gbps EFA |
| Price | ~$18.00/h |
Tier XXL - Colossal
Best for the largest models (405B).GPU XXL
| Spec | Value |
|---|---|
| GPU | 8x NVIDIA H100 (80GB) |
| VRAM | 640GB |
| vCPUs | 192 |
| RAM | 2048GB |
| Storage | 8TB NVMe |
| Network | 3200 Gbps EFA v2 |
| Price | ~$20.00/h |
GPU XXL (H200)
| Spec | Value |
|---|---|
| GPU | 8x NVIDIA H200 (141GB) |
| VRAM | 1128GB |
| vCPUs | 192 |
| RAM | 2048GB |
| Storage | 8TB NVMe |
| Network | 3200 Gbps EFA v2 |
| Price | ~$30.00/h |
Choosing the Right Tier
By Model Size
| Model Parameters | Precision | Recommended Tier |
|---|---|---|
| 1B - 8B | FP16 | XS |
| 7B - 13B | INT4 | XS |
| 13B - 30B | FP16 | S |
| 30B - 34B | INT4/FP16 | S / S-2x |
| 70B | INT4/AWQ | M |
| 70B | FP16 | L |
| 70B - 180B | FP16 | XL |
| 405B | FP8 | XXL |
| 405B | FP16 | XXL-H200 |
By Use Case
| Use Case | Recommended Tier |
|---|---|
| Development/Testing | XS |
| Small production | S |
| Cost-optimized production | M (quantized) |
| High-quality production | L |
| Enterprise/Maximum quality | XL / XXL |