Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.together.ai/llms.txt

Use this file to discover all available pages before exploring further.

GPU clusters are a beta feature. Behavior, flags, and supported hardware can change. Reach out to your Together AI contact or contact sales with feedback.

Create a cluster

tg beta clusters create

Parameters

FlagDescription
--name [string]Name for the cluster.
--num-gpus [integer]Number of GPUs to allocate in the cluster.
--region [string]Region to create the cluster in. Valid regions can be found with tg beta clusters list-regions.
--billing-type [ON_DEMAND|RESERVED]Cluster reservation approach.

  • ON_DEMAND begins billing the moment the cluster is created. Billing continues until you delete the cluster.
  • RESERVED starts billing immediately. The cluster is automatically torn down after the --duration-days length elapses.
--nvidia-driver-version [string]NVIDIA driver version. Valid versions can be found with tg beta clusters list-regions.
--cuda-version [string]CUDA version. Valid versions can be found with tg beta clusters list-regions.
--duration-days [number]Only used with RESERVED billing. Specifies how many days the cluster is reserved for.
--gpu-type [string]GPU type to use for the cluster. One of H100_SXM, H200_SXM, RTX_6000_PCI, L40_PCIE, B200_SXM, H100_SXM_INF. Available types vary by region; see tg beta clusters list-regions.
--cluster-type [KUBERNETES|SLURM]Cluster workload manager or orchestrator.
--volume [string]Storage volume ID to attach to the cluster. List existing volumes with tg beta clusters storage list.
Run tg beta clusters create with no flags to launch an interactive prompt that walks through the required fields. Pass --non-interactive (or --json) to skip prompts in CI.

Update a cluster

tg beta clusters update [CLUSTER_ID]

Parameters

FlagDescription
--num-gpus [integer]Number of GPUs to allocate in the cluster.
--cluster-type [KUBERNETES|SLURM]Cluster workload manager or orchestrator.

Retrieve a cluster

tg beta clusters retrieve [CLUSTER_ID]

Delete a cluster

tg beta clusters delete [CLUSTER_ID]

List clusters

tg beta clusters list

List regions

Get configuration information per region for creating a GPU cluster.
tg beta clusters list-regions

Example output

{
  "regions": [
    {
      "driver_versions": [
        {
          "cuda_version": "12.9",
          "nvidia_driver_version": "575"
        },
        {
          "cuda_version": "12.8",
          "nvidia_driver_version": "570"
        },
        {
          "cuda_version": "13.1",
          "nvidia_driver_version": "590"
        },
        {
          "cuda_version": "13.1",
          "nvidia_driver_version": "580"
        }
      ],
      "name": "us-central-8",
      "supported_instance_types": [
        "H100_SXM",
        "H200_SXM"
      ]
    }
  ]
}

Get cluster credentials

Download the cluster’s configuration and credentials to your local .kube/config file to manage Kubernetes resources.
tg beta clusters get-credentials [CLUSTER_ID]

Parameters

FlagDescription
--file [Path|-]Override the path to write the kubeconfig to. Pass - to print the config to stdout instead of writing to a file. Default: ~/.kube/config.
--context-name [string]Name of the context to add to the kubeconfig. Defaults to the cluster name.
--overwrite-existingIf there is a conflict with the existing kubeconfig, overwrite it instead of raising an error.
--set-default-contextChange the current context for kubectl to the new context.

Create cluster storage

tg beta clusters storage create

Parameters

FlagDescription
--region [string]Region to create the storage volume in. required
--size-tib [integer]Size of the storage volume in TiB. required
--volume-name [string]Name for the storage volume. required

Retrieve cluster storage

tg beta clusters storage retrieve [VOLUME_ID]

List cluster storage

tg beta clusters storage list

Delete cluster storage

tg beta clusters storage delete [VOLUME_ID]