Requirements
Currently, we support models that meet the following criteria.
Source: We support uploads from Hugging Face or S3.
Type: We support text generation models.
Parameters: Models must have a parameter count of 300 billion or less.
Base models: Uploads currently work with the following base models:
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
google/gemma-2-27b-it
meta-llama/Llama-3.3-70B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
meta-llama/Llama-3-8b-chat-hf
meta-llama/Llama-2-70b-hf
meta-llama/LlamaGuard-2-8b
mistralai/Mistral-7B-Instruct-v0.3
mistralai/Mixtral-8x7B-Instruct-v0.1
Qwen/Qwen2.5-72B-Instruct-Turbo
Qwen/Qwen2-VL-72B-Instruct
Qwen/Qwen2-72B-Instruct
Salesforce/Llama-Rank-V1
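The criteria above can be checked locally before attempting an upload. The following is a minimal sketch, not part of the Together tooling: the `ModelInfo` class, the `requirement_problems` function, and the source and type labels (`"huggingface"`, `"s3"`, `"text-generation"`) are all hypothetical names chosen for illustration. Only the 300 billion parameter limit and the base model names come from this page.

```python
from __future__ import annotations

from dataclasses import dataclass

# Limit taken from the requirements: "300 billion or less".
MAX_PARAMETERS = 300_000_000_000
# Hypothetical labels for the two supported sources and the supported type.
SUPPORTED_SOURCES = {"huggingface", "s3"}
SUPPORTED_TYPE = "text-generation"


@dataclass
class ModelInfo:
    """Hypothetical description of a model you intend to upload."""
    source: str
    model_type: str
    parameter_count: int
    base_model: str


def requirement_problems(model: ModelInfo, supported_base_models: set[str]) -> list[str]:
    """Return human-readable problems; an empty list means every check passed."""
    problems = []
    if model.source not in SUPPORTED_SOURCES:
        problems.append(f"unsupported source: {model.source}")
    if model.model_type != SUPPORTED_TYPE:
        problems.append(f"unsupported model type: {model.model_type}")
    if model.parameter_count > MAX_PARAMETERS:
        problems.append(f"too many parameters: {model.parameter_count}")
    if model.base_model not in supported_base_models:
        problems.append(f"unsupported base model: {model.base_model}")
    return problems


# Usage: pass in the base models from the list above (two shown here).
base_models = {"mistralai/Mistral-7B-Instruct-v0.3", "Qwen/Qwen2-72B-Instruct"}
candidate = ModelInfo(
    source="huggingface",
    model_type="text-generation",
    parameter_count=7_000_000_000,  # illustrative figure, not an exact count
    base_model="mistralai/Mistral-7B-Instruct-v0.3",
)
print(requirement_problems(candidate, base_models))  # prints []
```

Returning a list of problems rather than a single boolean makes it possible to report every failed criterion at once.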
Getting Started
Upload the model
Currently, model uploads can be done via the API or the Together web interface.
Web Interface
To upload via the web, log in and navigate to models > add custom model.
API
S3
To upload a model from S3, list your model name and provide a presigned URL.
Hugging Face
To upload a model from Hugging Face, list your model name and Hugging Face token.
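To make the difference between the two API upload paths concrete, here is a minimal sketch that assembles a request body for each case without sending anything. The function name `build_upload_payload` and the field names `model_name`, `model_source`, and `hf_token` are assumptions made for illustration, as is the idea of passing a Hugging Face repository ID as the source. Consult the Together API reference for the actual request schema and endpoint.

```python
import json


def build_upload_payload(model_name, *, presigned_url=None, hf_repo=None, hf_token=None):
    """Build a JSON-serializable request body (hypothetical field names).

    Exactly one source must be given: a presigned S3 URL, or a
    Hugging Face repository ID together with a Hugging Face token.
    """
    if (presigned_url is None) == (hf_repo is None):
        raise ValueError("provide exactly one of presigned_url or hf_repo")

    payload = {"model_name": model_name}
    if presigned_url is not None:
        # S3 path: the model name plus a presigned URL.
        payload["model_source"] = presigned_url
    else:
        # Hugging Face path: the model name plus a token.
        if not hf_token:
            raise ValueError("hf_token is required for Hugging Face uploads")
        payload["model_source"] = hf_repo
        payload["hf_token"] = hf_token
    return payload


# Usage with placeholder values; nothing is sent over the network.
s3_body = build_upload_payload(
    "my-custom-model",
    presigned_url="https://example.com/placeholder-presigned-url",
)
print(json.dumps(s3_body, indent=2))
```

Keeping payload construction separate from the HTTP call lets you validate inputs before any credentials or tokens leave your machine.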
Deploy the model
Uploaded models are treated like any other dedicated endpoint models. Deploying a custom model can be done via the CLI, API, or the UI.
Deploying a custom model on the UI
All models, including custom and fine-tuned models as well as any model that has a dedicated endpoint, are listed under My Models. To deploy a custom model, select the model to open the model page.

