curl -X POST \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
--data '{ "region": "us-central-8", "gpu_type": "H100_SXM", "num_gpus": 8, "cluster_name": "my-gpu-cluster", "duration_days": 7, "driver_version": "CUDA_12_6_560" }' \
https://manager.cloud.together.ai/api/v1/gpu_cluster

{
"cluster_id": "<string>",
"status": "UNKNOWN_STATUS"
}

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
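The sample response above carries a cluster_id and a status. As a minimal sketch, the cluster_id could be pulled out of the response in a shell script as follows. The helper name extract_cluster_id is hypothetical and not part of any Together tooling, and the pattern assumes the value is a plain quoted string with no escaped quotes, as in the sample. A JSON-aware tool such as jq would be more robust if it is available.

```shell
# Hypothetical helper: reads a JSON response on stdin and prints the value of
# the "cluster_id" field. Assumes the value is a simple quoted string with no
# escaped quotes, and that the key and its value sit on the same line.
extract_cluster_id() {
  sed -n 's/.*"cluster_id"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}
```

The output of the curl command shown at the top could be piped into extract_cluster_id to capture the identifier in a variable for later calls.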
GPU Cluster create request

region: Region to create the GPU cluster in. Valid values are us-central-8 and us-central-4.
gpu_type: Type of GPU to use in the cluster.
Available options: UNKNOWN_GPU_TYPE, H100_SXM, H200_SXM, RTX_6000_PCI
num_gpus: Number of GPUs to allocate in the cluster. This must be a multiple of 8, for example 8, 16, or 24.
cluster_name: Name of the GPU cluster.
duration_days: Duration in days to keep the cluster running.
driver_version: NVIDIA driver version to use in the cluster.
Available options: UNKNOWN_DRIVER, CUDA_12_5_555, CUDA_12_6_560, CUDA_12_6_565, CUDA_12_8_570
Available options: UNSPECIFIED, RESERVED, ON_DEMAND
Available options: UNKNOWN_TYPE, KUBERNETES, SLURM
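The constraint that the number of GPUs must be a multiple of 8 can be checked locally before the request is sent. The following is a minimal sketch under stated assumptions: the helper names is_valid_num_gpus and build_cluster_payload are hypothetical, the check additionally assumes the count must be positive, and the payload contains only the six fields that appear in the sample request. String arguments are not JSON-escaped, so they should not contain quotes or backslashes.

```shell
# Hypothetical helpers; the names are illustrative and not part of the API.

# Succeeds only when the argument is a positive multiple of 8 (8, 16, 24, ...).
is_valid_num_gpus() {
  case "$1" in
    ''|*[!0-9]*|0*) return 1 ;;  # reject empty, non-numeric, or leading-zero input
  esac
  [ $(( $1 % 8 )) -eq 0 ]
}

# Prints a JSON request body built from the six fields in the sample request.
# Arguments: region gpu_type num_gpus cluster_name duration_days driver_version
# Assumption: string arguments contain no characters that need JSON escaping.
build_cluster_payload() {
  is_valid_num_gpus "$3" || { echo "num_gpus must be a multiple of 8" >&2; return 1; }
  printf '{ "region": "%s", "gpu_type": "%s", "num_gpus": %s, "cluster_name": "%s", "duration_days": %s, "driver_version": "%s" }\n' \
    "$1" "$2" "$3" "$4" "$5" "$6"
}
```

For example, the output of build_cluster_payload us-central-8 H100_SXM 8 my-gpu-cluster 7 CUDA_12_6_560 could be passed to the --data option of the curl command shown at the top. A count such as 12 is rejected before any request is made.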