Requirements
Currently, we support models that meet the following criteria.- Source: We support uploads from Hugging Face or S3.
- Type: We support text generation and embedding models.
- Scale: We currently only support models that fit in a single node. Multi-node models are not supported when you upload a custom model.
Getting Started
Upload the model
Model uploads can be done via the UI, API or the CLI. The API reference can be found here.UI
To upload via the web, just log in and navigate to models > add custom model to reach this page:
CLI
Upload a model from Hugging Face or S3:Deploy the model
Uploaded models are treated like any other dedicated endpoint models. Deploying a custom model can be done via the UI, API or the CLI. The API reference can be found here.UI
All models, custom and finetuned models as well as any model that has a dedicated endpoint will be listed under My Models. To deploy a custom model: Select the model to open the model page.

