Instance Types
Two instance types are available, both built on H100 SXM GPUs:

| Instance Type | GPUs | vCPU | RAM | VRAM | Storage |
|---|---|---|---|---|---|
| 1xH100-SXM | 1 | 16 | 200 GB | 80 GB | 1 TB SSD |
| 8xH100-SXM | 8 | 128 | 1,600 GB | 640 GB | 8 TB SSD |
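
As a rough way to read the VRAM column, the sketch below estimates whether a model's weights fit on each instance type. The VRAM figures come from the table above; the helper names and the 20% headroom default are hypothetical choices for illustration. It counts weights only, so treat the result as a lower bound.

```python
# Total VRAM per instance type in GB, taken from the table above.
INSTANCE_VRAM_GB = {"1xH100-SXM": 80, "8xH100-SXM": 640}

# Bytes needed to store one parameter in common dtypes.
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "fp16": 2, "int8": 1}


def weights_gb(num_params: float, dtype: str = "bf16") -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def smallest_fitting_instance(num_params: float, dtype: str = "bf16",
                              headroom: float = 0.2):
    """Return the smallest instance type whose total VRAM holds the weights
    plus a fractional headroom (hypothetical default), or None if none does."""
    needed = weights_gb(num_params, dtype) * (1 + headroom)
    # Check instance types from smallest to largest total VRAM.
    for name, vram in sorted(INSTANCE_VRAM_GB.items(), key=lambda kv: kv[1]):
        if needed <= vram:
            return name
    return None


if __name__ == "__main__":
    for params in (7e9, 70e9):
        print(f"{params:.0e} params -> {smallest_fitting_instance(params)}")
```

A 7B-parameter model in bf16 needs about 14 GB for weights and fits on 1xH100-SXM, while a 70B-parameter model needs about 140 GB and only fits on 8xH100-SXM, where it has to be sharded across the GPUs. Training needs considerably more memory than this estimate because of gradients, optimizer state, and activations.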
Multi-Node and InfiniBand
When you need to distribute a workload across multiple machines, provision 8xH100 instances in the same sector. Instances within a sector are connected over InfiniBand, providing ultra-low latency and high bandwidth for frameworks like PyTorch DDP, DeepSpeed, and Horovod.

InfiniBand and sector placement are only available on 8xH100 instances. 1xH100 instances run as standalone machines without inter-node connectivity.
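
For a PyTorch DDP job spread over several 8xH100 instances in one sector, the same `torchrun` command is typically started on every node. The sketch below only builds that command. The head-node address, script name, port, and job id are hypothetical placeholders, and it assumes PyTorch is already installed on the instances. NCCL usually selects InfiniBand on its own when the drivers are present; whether the instance image is set up that way is an assumption here.

```python
import shlex


def torchrun_command(num_nodes, head_node_host, script="train.py",
                     gpus_per_node=8, port=29500, job_id="job1"):
    """Build the torchrun invocation to run on every node of the job.

    head_node_host, script, port, and job_id are placeholders; replace
    them with the address of one of your instances and your own script.
    """
    return [
        "torchrun",
        f"--nnodes={num_nodes}",              # number of instances in the job
        f"--nproc_per_node={gpus_per_node}",  # one process per GPU
        "--rdzv_backend=c10d",                # built-in rendezvous backend
        f"--rdzv_endpoint={head_node_host}:{port}",
        f"--rdzv_id={job_id}",                # must match on every node
        script,
    ]


if __name__ == "__main__":
    # Two 8xH100 instances, 16 GPUs in total.
    print(shlex.join(torchrun_command(2, "10.0.0.1")))
```

With the `c10d` rendezvous backend, every node runs an identical command, so no per-node rank has to be passed by hand.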
When to Use Compute vs Serverless
The two products serve different workload profiles:

| | Compute | Serverless |
|---|---|---|
| Billing | Per-hour, fixed rate | Per-second of runner lifetime |
| Scaling | Manual (you manage instances) | Automatic (runners scale with traffic) |
| Access | Full SSH access to the machine | Code runs inside managed runners |
| Best for | Training, fine-tuning, batch jobs, research | API endpoints, on-demand inference, autoscaling |
| Cold starts | None (instance is always running) | Yes (new runners need startup time) |
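
The billing row can be turned into a quick break-even estimate. The sketch below compares a fixed per-hour rate with per-second runner billing. Both rates in the example are made-up numbers for illustration only, not actual prices, and the function names are hypothetical.

```python
def compute_cost(hours_provisioned: float, hourly_rate: float) -> float:
    """Cost of an instance billed per hour for as long as it is provisioned."""
    return hours_provisioned * hourly_rate


def serverless_cost(runner_seconds: float, per_second_rate: float) -> float:
    """Cost of runners billed per second of runner lifetime."""
    return runner_seconds * per_second_rate


def break_even_utilization(hourly_rate: float, per_second_rate: float) -> float:
    """Fraction of each hour a runner must be alive before the fixed
    per-hour instance becomes the cheaper option."""
    return hourly_rate / (per_second_rate * 3600)


if __name__ == "__main__":
    # Hypothetical rates, chosen only to make the arithmetic visible.
    HOURLY_RATE = 2.00        # per hour of provisioned instance
    PER_SECOND_RATE = 0.001   # per second of runner lifetime
    share = break_even_utilization(HOURLY_RATE, PER_SECOND_RATE)
    print(f"Break-even at {share:.0%} runner utilization")
```

Under these made-up rates, a runner that is alive for more than roughly 56% of each hour costs more than an always-on instance, which matches the table's split between steady training jobs and bursty inference traffic.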
Getting Started
Provisioning an instance takes about 2-3 minutes. You choose an instance type, select a sector (for multi-node setups), paste your SSH public key, and click create. Once the instance is ready, you SSH in and have full control.

Quickstart: Provision your first instance and run a GPU workload

Pricing: Per-hour rates by instance type