Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.together.ai/llms.txt

Use this file to discover all available pages before exploring further.

Upload

Upload a model from Hugging Face or S3 for inference on a dedicated endpoint.
tg models upload \
    --model-name [TEXT] \
    --model-source [URI]

Parameters

FlagTypeDescription
--model-namestringThe name to give your uploaded model. required
--model-sourcestringThe source URI of the model. required
--model-typemodel or adapterWhether the model is a full model or an adapter.
--hf-tokenstringHugging Face token, used when uploading from Hugging Face.
--descriptionstringA description of your model.
--base-modelstringThe base model for an adapter when running against a serverless pool. Only used with --model-type adapter.
--lora-modelstringThe LoRA pool for an adapter when running against a dedicated pool. Only used with --model-type adapter.

List all models

# List models
tg models list

Options

FlagTypeDescription
--typededicatedFilter to models that can be deployed on dedicated endpoints. dedicated is the only available value.
--afterstringThe cursor to start from for pagination.