Instance Types

Instance types define the compute resources available for your workloads on Phala Cloud. Each instance type provides a specific combination of vCPU, memory, and GPU resources optimized for different use cases.

We recommend specifying your desired instance type directly when creating VMs through the UI or CLI for predictable resource allocation and pricing.

CPU Instance Types

General-purpose compute instances optimized for CPU-intensive workloads, web applications, and backend services.

IDNamevCPUMemoryHourly Rate
tdx.smallSmall TDX Instance12 GB$0.058000
tdx.mediumMedium TDX Instance24 GB$0.116000
tdx.largeLarge TDX Instance48 GB$0.232000
tdx.xlargeXLarge TDX Instance816 GB$0.464000
tdx.2xlarge2XLarge TDX Instance1632 GB$0.928000
tdx.4xlarge4XLarge TDX Instance3264 GB$1.856000
tdx.8xlarge8XLarge TDX Instance64128 GB$3.712000

GPU Instance Types

GPU-accelerated instances for machine learning, AI inference, graphics rendering, and other GPU-intensive workloads.

IDNamevCPUMemoryGPU
h200.smallH200 SXM 141GB24192 GB-
h200.16xlargeH200 SXM 141GB x 864256 GB-
h200.8x.largeH200 SXM 141GB x 81921536 GB-

Choosing the Right Instance Type

When selecting an instance type for your workload, consider these factors:

  • Workload Type - Choose CPU instances for general-purpose computing, GPU instances for ML/AI workloads
  • Resource Requirements - Estimate the vCPU and memory needs based on your application's requirements
  • Cost Optimization - Choose the smallest instance type that meets your requirements
  • Scalability - Start with smaller instances and scale up as needed

Legacy Resource Matching

For backward compatibility, if you specify custom vCPU and memory values instead of an instance type, Phala Cloud automatically matches your request to the nearest instance type that can satisfy your needs (nearest-upper-neighbor matching).

For example, if you request 3 vCPUs and 6GB of memory, the system will select the smallest instance type that provides at least 3 vCPUs and 6GB of memory.

Pricing Formula

For custom resource specifications, pricing is calculated based on Compute Units (CU):

  • CU = max(vCPU, memory_GB / 2)
  • Compute Rate = CU × $0.058/hr
  • Storage Rate = disk_GB × $0.000139/hr

For example, a 4 vCPU, 8GB memory instance has CU = max(4, 8/2) = 4, resulting in a compute rate of 4 × $0.058 = $0.232/hr (plus storage costs).