Update a GPU cluster
Update the configuration of an existing GPU cluster.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
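As a minimal sketch of the header format described above, the helper below builds the Authorization header from a token. The function name `bearer_headers` and the `Content-Type` entry are assumptions made for illustration; this reference only specifies the `Bearer <token>` form.

```python
def bearer_headers(token: str) -> dict[str, str]:
    """Build request headers that carry the auth token as a Bearer credential."""
    # Reject empty or whitespace-padded tokens early; they would yield a malformed header.
    if not token or token != token.strip():
        raise ValueError("token must be a non-empty string without surrounding whitespace")
    return {
        # Form documented above: "Bearer <token>"
        "Authorization": f"Bearer {token}",
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }
```

The returned dictionary can be passed as the headers argument of whichever HTTP client you use.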
Path Parameters
The ID of the cluster to update.
Body
Type of cluster to update.
Allowed values: KUBERNETES, SLURM
Target GPU count for the cluster. When omitted, the server keeps the current GPU count from cluster metadata (use for config-only or decommission-time-only updates).
Timestamp at which the cluster should be decommissioned. Only accepted for prepaid clusters.
Number of reserved GPUs to update to. This field is only applicable for clusters with RESERVED billing type.
Updated desired number of preemptible GPUs for the cluster. When omitted, the current value is preserved. Must be a multiple of 8.
Number of GPUs to draw from the cluster's capacity pool. Only valid for clusters created with a capacity_pool_id. Must be a multiple of 8 and not exceed num_gpus. When omitted, the current value is preserved.
Add-ons to update on the cluster. Each entry identifies an existing add-on by name and provides the new external config to merge.
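The constraints on the GPU count fields above can be checked client-side before sending the request. The sketch below is illustrative only: the body keys `desired_preemptible_gpus` and `num_capacity_pool_gpus` are assumed to match the names used in the response descriptions, and `build_update_body` and `current_num_gpus` are hypothetical helpers, not part of the API.

```python
from typing import Optional

GPU_STEP = 8  # both counts below are documented as "a multiple of 8"


def _check_multiple(name: str, value: int) -> None:
    """Raise if value is negative or not a multiple of GPU_STEP."""
    if value < 0 or value % GPU_STEP != 0:
        raise ValueError(f"{name} must be a non-negative multiple of {GPU_STEP}")


def build_update_body(
    num_gpus: Optional[int] = None,
    desired_preemptible_gpus: Optional[int] = None,
    num_capacity_pool_gpus: Optional[int] = None,
    current_num_gpus: Optional[int] = None,  # hypothetical: last known server-side value
) -> dict:
    """Return a body holding only the fields the caller set.

    Omitted fields are left out so the server preserves their current values.
    """
    body: dict = {}
    if num_gpus is not None:
        if num_gpus < 0:
            raise ValueError("num_gpus must be non-negative")
        body["num_gpus"] = num_gpus
    if desired_preemptible_gpus is not None:
        _check_multiple("desired_preemptible_gpus", desired_preemptible_gpus)
        body["desired_preemptible_gpus"] = desired_preemptible_gpus
    if num_capacity_pool_gpus is not None:
        _check_multiple("num_capacity_pool_gpus", num_capacity_pool_gpus)
        # Compare against the new target if given, else the caller's last known count.
        limit = num_gpus if num_gpus is not None else current_num_gpus
        if limit is not None and num_capacity_pool_gpus > limit:
            raise ValueError("num_capacity_pool_gpus must not exceed num_gpus")
        body["num_capacity_pool_gpus"] = num_capacity_pool_gpus
    return body
```

Leaving unset fields out of the body, rather than sending null, mirrors the "when omitted, the current value is preserved" behavior described above. The server remains the authority on validation.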
Response
OK
Type of cluster.
Allowed values: KUBERNETES, SLURM
Allowed values (GPU types): H100_SXM, H200_SXM, RTX_6000_PCI, L40_PCIE, B200_SXM, H100_SXM_INF
Current status of the GPU cluster.
Allowed values: WaitingForControlPlaneNodes, WaitingForDataPlaneNodes, WaitingForSubnet, WaitingForSharedVolume, InstallingDrivers, RunningAcceptanceTests, Paused, OnDemandComputePaused, Ready, Degraded, Deleting
Number of CPU-only worker nodes in the cluster.
Cluster-level phase transition history.
Customer's requested number of preemptible GPUs. Set on cluster create or update; persists until changed.
Actual number of preemptible GPUs currently allocated to the cluster. Updated asynchronously by the fulfillment and reclamation workers; may be less than desired_preemptible_gpus when capacity is constrained.
Billing type for the cluster (RESERVED, ON_DEMAND, or SCHEDULED_CAPACITY).
Allowed values: RESERVED, ON_DEMAND, SCHEDULED_CAPACITY
Enabled add-ons on this cluster. Only add-ons with enabled=true in their config are returned.
Number of GPUs to draw from a capacity pool. A component of the overall num_gpus, alongside num_reserved_gpus.
Number of prepaid reserved GPUs for this cluster. A component of the overall num_gpus, alongside num_capacity_pool_gpus.
ID of the machine cluster backing this GPU cluster.
Timestamp when the cluster first reached the Ready phase.
Whether the cluster is managed inside a substrate environment.
Whether the control plane is currently ready.
UMS project ID associated with this cluster.
UMS organization ID associated with this cluster.
Data-volume image name for GPU worker nodes.
Internal NVIDIA version ID for this cluster's driver and CUDA combination.
GPU worker nodes retained after they left the live data plane. These are separate from gpu_worker_nodes and must not be counted as live capacity.
Recent node lifecycle events such as scale-up, scale-down, and preemption. Combine these with live and deleted node lists to render the cluster timeline.
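To illustrate how a client might read a few of the response fields described above, the sketch below summarizes a parsed response. It assumes the response is a JSON object already decoded into a dictionary. The keys `status` and `actual_preemptible_gpus` are hypothetical names chosen for this example; `desired_preemptible_gpus`, `num_reserved_gpus`, and `num_capacity_pool_gpus` are the names used in the descriptions above.

```python
# Status values as listed in this reference.
KNOWN_STATUSES = {
    "WaitingForControlPlaneNodes", "WaitingForDataPlaneNodes", "WaitingForSubnet",
    "WaitingForSharedVolume", "InstallingDrivers", "RunningAcceptanceTests",
    "Paused", "OnDemandComputePaused", "Ready", "Degraded", "Deleting",
}


def summarize_cluster(cluster: dict) -> dict:
    """Derive a few convenience values from a decoded cluster response."""
    status = cluster.get("status")  # hypothetical key name
    desired = cluster.get("desired_preemptible_gpus") or 0
    actual = cluster.get("actual_preemptible_gpus") or 0  # hypothetical key name
    reserved = cluster.get("num_reserved_gpus") or 0
    pool = cluster.get("num_capacity_pool_gpus") or 0
    return {
        "is_ready": status == "Ready",
        "known_status": status in KNOWN_STATUSES,
        # Allocation is asynchronous, so actual may lag behind desired.
        "preemptible_shortfall": max(desired - actual, 0),
        # Both fields are described as components of the overall num_gpus.
        "reserved_plus_pool_gpus": reserved + pool,
    }


example = {
    "status": "Ready",
    "desired_preemptible_gpus": 16,
    "actual_preemptible_gpus": 8,
    "num_reserved_gpus": 8,
    "num_capacity_pool_gpus": 8,
}
summary = summarize_cluster(example)
```

A non-zero `preemptible_shortfall` is expected whenever capacity is constrained, so it is better treated as a signal to keep polling than as an error.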