Get GPU cluster by cluster ID
Retrieve information about a specific GPU cluster.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Path Parameters
The ID of the cluster to retrieve.
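As a rough illustration of the authorization header and path parameter described above, the sketch below builds (but does not send) a request using only the Python standard library. The base URL, the `/gpu_clusters/` path, and the function name `build_get_cluster_request` are hypothetical placeholders, not taken from this reference; only the `Bearer <token>` header form comes from the text.

```python
import urllib.parse
import urllib.request

# Hypothetical base URL and route; replace with the real endpoint.
BASE_URL = "https://api.example.com/v1"


def build_get_cluster_request(cluster_id: str, token: str) -> urllib.request.Request:
    """Build a GET request for a single GPU cluster without sending it."""
    # Percent-encode the ID so it is safe to place in the URL path.
    safe_id = urllib.parse.quote(cluster_id, safe="")
    url = f"{BASE_URL}/gpu_clusters/{safe_id}"
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


req = build_get_cluster_request("cluster-123", "my-token")
```

Passing `req` to `urllib.request.urlopen` would send it; that step is omitted here because the real URL is not given in this section.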
Response
OK
Type of cluster.
KUBERNETES, SLURM
H100_SXM, H200_SXM, RTX_6000_PCI, L40_PCIE, B200_SXM, H100_SXM_INF
Current status of the GPU cluster.
WaitingForControlPlaneNodes, WaitingForDataPlaneNodes, WaitingForSubnet, WaitingForSharedVolume, InstallingDrivers, RunningAcceptanceTests, Paused, OnDemandComputePaused, Ready, Degraded, Deleting
Number of CPU-only worker nodes in the cluster.
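A client will often want to collapse the cluster status values listed above into a few coarse buckets. The sketch below shows one possible grouping. The bucket names and the decision to treat the `Waiting...`, `InstallingDrivers`, and `RunningAcceptanceTests` values as "provisioning" are assumptions made for illustration; this reference does not define such groups.

```python
# Assumed grouping of the documented status values; not defined by the API itself.
PROVISIONING = frozenset({
    "WaitingForControlPlaneNodes",
    "WaitingForDataPlaneNodes",
    "WaitingForSubnet",
    "WaitingForSharedVolume",
    "InstallingDrivers",
    "RunningAcceptanceTests",
})
PAUSED = frozenset({"Paused", "OnDemandComputePaused"})


def describe_status(status: str) -> str:
    """Map a raw cluster status string to a coarse, client-side bucket."""
    if status == "Ready":
        return "ready"
    if status in PROVISIONING:
        return "provisioning"
    if status in PAUSED:
        return "paused"
    if status == "Degraded":
        return "degraded"
    if status == "Deleting":
        return "deleting"
    # Unrecognized values are reported rather than raised, so that a newly
    # added status does not break the client.
    return "unknown"
```

Returning `"unknown"` for unrecognized values is a defensive choice in case the list of statuses grows.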
Cluster-level phase transition history.
Customer's requested number of preemptible GPUs. Set on cluster create or update; persists until changed.
Actual number of preemptible GPUs currently allocated to the cluster. Updated asynchronously by the fulfillment and reclamation workers; may be less than desired_preemptible_gpus when capacity is constrained.
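Because the allocated count is updated asynchronously and may lag the requested count, a client might compute the current shortfall as sketched below. `desired_preemptible_gpus` is named in this reference; the key `actual_preemptible_gpus` is a hypothetical name for the allocated count and should be replaced with the real field name.

```python
def preemptible_shortfall(cluster: dict) -> int:
    """Return how many requested preemptible GPUs are not yet allocated."""
    desired = cluster.get("desired_preemptible_gpus") or 0
    # Hypothetical field name for the allocated count; adjust to the real schema.
    actual = cluster.get("actual_preemptible_gpus") or 0
    # Clamp at zero: during reclamation the allocated count could briefly
    # exceed the requested count.
    return max(desired - actual, 0)


example = {"desired_preemptible_gpus": 8, "actual_preemptible_gpus": 5}
shortfall = preemptible_shortfall(example)
```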
Billing type for the cluster (RESERVED, ON_DEMAND, or SCHEDULED_CAPACITY).
RESERVED, ON_DEMAND, SCHEDULED_CAPACITY
Enabled add-ons on this cluster. Only add-ons with enabled=true in their config are returned.
Number of GPUs to draw from a capacity pool. A component of the overall num_gpus, alongside num_reserved_gpus.
Number of prepaid reserved GPUs for this cluster. A component of the overall num_gpus, alongside num_capacity_pool_gpus.
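The two descriptions above say that `num_capacity_pool_gpus` and `num_reserved_gpus` are each a component of `num_gpus`. The sketch below breaks a response down accordingly. It does not assume the two components add up to the total, since this reference does not say so; anything left over is reported under an `other` key, which is an illustrative name only.

```python
def gpu_breakdown(cluster: dict) -> dict:
    """Split num_gpus into its documented components plus any remainder."""
    total = cluster.get("num_gpus") or 0
    reserved = cluster.get("num_reserved_gpus") or 0
    pool = cluster.get("num_capacity_pool_gpus") or 0
    return {
        "reserved": reserved,
        "capacity_pool": pool,
        # GPUs in num_gpus not explained by the two documented components.
        "other": total - reserved - pool,
    }


breakdown = gpu_breakdown(
    {"num_gpus": 16, "num_reserved_gpus": 8, "num_capacity_pool_gpus": 6}
)
```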
ID of the machine cluster backing this GPU cluster.
Timestamp when the cluster first reached the Ready phase.
Whether the cluster is managed inside a substrate environment.
Whether the control plane is currently ready.
UMS project ID associated with this cluster.
UMS organization ID associated with this cluster.
Data-volume image name for GPU worker nodes.
Internal NVIDIA version ID for this cluster's driver and CUDA combination.
GPU worker nodes retained after they left the live data plane. These are separate from gpu_worker_nodes and must not be counted as live capacity.
Recent node lifecycle events such as scale-up, scale-down, and preemption. Combine these with live and deleted node lists to render the cluster timeline.