Upload
Upload a model from Hugging Face or S3 to for inference on a dedicated endpoint.Parameters
| Flag | Type | Description |
|---|---|---|
--model-name | string | The name to give to your uploaded model [required] |
--model-source | string | The source URI of the model [required] |
--model-type | model or adapter | Whether the model is a full model or an adapter |
--hf-token | string | Hugging Face token (if uploading from Hugging Face) |
--description | string | A description of your model |
--base-model | string | The base model to use for an adapter if setting it to run against a serverless pool. Only used with --model_type adapter. |
--lora-model | string | The LoRA pool to use for an adapter if setting it to run against, say, a dedicated pool. Only used with --model-type adapter. |
List all models
Options
| Flag | Type | Description |
|---|---|---|
--type | dedicated | Filter models by type. dedicated is the only available option |
--after | string | The cursor to start from for pagination. |