Together GPU Clusters provides multiple storage options. It is critical to understand which storage is persistent and which is ephemeral so you can architect your workloads to avoid data loss.
Local NVMe disks and node-local storage are ephemeral. Data on these drives can be lost at any time during node migrations/recreations, maintenance, or other cluster operations. Always use shared volumes (PVC-backed storage) for any data you need to keep.

Storage Types at a Glance

Use this to decide where to store your data:
  • Shared volumes (PVC) — Persistent. Survives pod restarts, node reboots/migrations/recreations, cluster operations, and even cluster deletion. Use this for training data, checkpoints, model weights, and anything you cannot lose.
  • Local NVMe disks — Ephemeral. Fast local storage on each node. Data can be lost during node migrations/recreations or cluster operations. Use only for temporary scratch data (e.g., intermediate computation files).
  • /home directory — Persistence depends on cluster type (see below).

Persistent Storage: Shared Volumes

Shared volumes are remote-attached, high-speed filesystems. They are created during cluster setup (or attached from an existing volume) and are accessible from all nodes. Data on a shared volume persists across:
  • Pod restarts and rescheduling
  • Node reboots, migrations, recreations, and maintenance
  • Cluster scaling operations
  • Cluster deletion (volumes persist independently; reserved volumes move to on-demand pricing and can be reattached to other clusters)
How to use shared volumes:
  • Kubernetes clusters — We provide a static PersistentVolume (PV) with the same name as your shared volume. Create a PersistentVolumeClaim (PVC) referencing it, then mount it in your pods. Step-by-step setup →
  • Slurm clusters — The shared volume is mounted at /home and accessible from all compute and login nodes.
Best practice: Always store training data, checkpoints, model weights, logs, and application state on shared volumes. This ensures your data survives any cluster event.
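On Kubernetes clusters, the claim-and-mount flow might look like the following sketch. The volume name my-shared-volume, the storage size, the image, and the mount path are placeholders; substitute your actual shared volume name.

```yaml
# Sketch: claim the pre-provisioned static PV that matches your shared
# volume, then mount the claim in a pod. Names and sizes are placeholders.
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: shared-data
spec:
  volumeName: my-shared-volume   # must match the static PV's name
  storageClassName: ""           # often required for static binding; check your cluster
  accessModes:
    - ReadWriteMany              # shared volumes are accessible from all nodes
  resources:
    requests:
      storage: 100Gi
---
apiVersion: v1
kind: Pod
metadata:
  name: trainer
spec:
  containers:
    - name: trainer
      image: your-training-image:latest   # placeholder image
      volumeMounts:
        - name: shared
          mountPath: /data                # checkpoints, datasets, weights
  volumes:
    - name: shared
      persistentVolumeClaim:
        claimName: shared-data
```

Anything the pod writes under /data then survives pod restarts, node events, and cluster operations.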

Ephemeral Storage: Local NVMe Disks

Each node has local NVMe drives that provide high-speed read/write performance.
Data on local NVMe disks is not durable. It can be lost without warning during:
  • Node migrations/recreations (scheduled or unscheduled)
  • Cluster maintenance operations
  • Hardware failures
  • Pod rescheduling to a different node
Do not rely on local NVMe for any data you need to keep. Use it only for temporary scratch files that can be regenerated.
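In Kubernetes, one common way to use node-local scratch space is an emptyDir volume, which is deleted whenever the pod leaves the node. This is a sketch only; whether an emptyDir actually lands on local NVMe depends on how the node's ephemeral storage is configured.

```yaml
# Sketch: ephemeral scratch space via emptyDir. Contents are lost when the
# pod is evicted or rescheduled. Image and size are placeholders.
apiVersion: v1
kind: Pod
metadata:
  name: scratch-job
spec:
  containers:
    - name: worker
      image: your-image:latest
      volumeMounts:
        - name: scratch
          mountPath: /scratch    # regenerable intermediate files only
  volumes:
    - name: scratch
      emptyDir:
        sizeLimit: 50Gi
```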

/home Directory

The behavior of /home differs between cluster types:

Slurm Clusters

On Slurm clusters, /home is a persistent NFS-backed file system shared across all nodes (compute and login). It is mounted from the head node and is suitable for:
  • Code and scripts
  • Configuration files
  • Logs
  • Small datasets
  • Model weights and training data
We recommend logging into the Slurm head node first to set up your user folder with the correct permissions.
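As a sketch of that first-login setup (the HOME_ROOT override is hypothetical so the snippet can be tried anywhere; on the Slurm head node you would use /home directly):

```shell
# Sketch: create your user folder on the shared /home with owner-only access.
# HOME_ROOT defaults to a temp path here for safe experimentation; on the
# head node it is simply /home.
HOME_ROOT="${HOME_ROOT:-${TMPDIR:-/tmp}/home-demo}"
USER_NAME="${USER_NAME:-$(whoami)}"
mkdir -p "$HOME_ROOT/$USER_NAME"
chmod 700 "$HOME_ROOT/$USER_NAME"   # read/write/execute for the owner only
ls -ld "$HOME_ROOT/$USER_NAME"
```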

Kubernetes Clusters

On Kubernetes clusters, /home is local to each node and ephemeral. It is not shared across nodes and is subject to the same data loss risks as local NVMe storage.
On Kubernetes clusters, do not store important data in /home. Use a shared volume (PVC) instead.

Which Storage Should I Use?

  • Training data, datasets → Shared volume (PVC), or /home on Slurm clusters
  • Checkpoints, model weights → Shared volume (PVC), or /home on Slurm clusters
  • Application state, databases → Shared volume (PVC), or /home on Slurm clusters
  • Code, configs → Shared volume (PVC), or /home on Slurm clusters
  • Temporary scratch files → Local NVMe (acceptable to lose)
  • Intermediate computation artifacts → Local NVMe (acceptable to lose)

Upload Your Data

For small datasets:
  1. Create a PVC that references the shared volume name as its volumeName, and a pod that mounts the claim
  2. Run kubectl cp LOCAL_FILENAME YOUR_POD_NAME:/data/
For large datasets: Schedule a pod on the cluster that downloads directly from S3 or your data source. See example →
Learn more about GPU Clusters →
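The small-dataset steps above might be sketched as the following data-loader pod (the PVC name shared-data and the mount path /data are placeholders); once the pod is Running, kubectl cp can target it.

```yaml
# Sketch: a minimal pod that mounts the shared-volume PVC at /data so
# "kubectl cp" has a destination. Names are placeholders.
apiVersion: v1
kind: Pod
metadata:
  name: data-loader
spec:
  containers:
    - name: loader
      image: busybox:1.36
      command: ["sleep", "86400"]   # keep the pod alive while copying
      volumeMounts:
        - name: shared
          mountPath: /data
  volumes:
    - name: shared
      persistentVolumeClaim:
        claimName: shared-data      # PVC bound to your shared volume
```

Then run kubectl cp LOCAL_FILENAME data-loader:/data/. For large datasets, run a similar pod whose command downloads straight from S3 (for example, with the aws CLI) instead of sleeping.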