Get Deployment

Together AI SDK (v2)

from together import Together
client = Together()

deployment = client.beta.jig.retrieve("my-deployment")
print(deployment)

{
  "args": [
    "<string>"
  ],
  "autoscaling": {},
  "command": [
    "<string>"
  ],
  "cpu": 123,
  "created_at": "<string>",
  "description": "<string>",
  "desired_replicas": 123,
  "environment_variables": [
    {
      "name": "<string>",
      "value": "<string>",
      "value_from_secret": "<string>"
    }
  ],
  "gpu_count": 123,
  "gpu_type": "h100-80gb",
  "health_check_path": "<string>",
  "id": "<string>",
  "image": "<string>",
  "max_replicas": 123,
  "memory": 123,
  "min_replicas": 123,
  "name": "<string>",
  "object": "<unknown>",
  "port": 123,
  "ready_replicas": 123,
  "replica_events": {},
  "status": "Updating",
  "storage": 123,
  "updated_at": "<string>",
  "volumes": [
    {
      "mount_path": "<string>",
      "name": "<string>"
    }
  ]
}

GET

deployments

{id}

Together AI SDK (v2)

from together import Together
client = Together()

deployment = client.beta.jig.retrieve("my-deployment")
print(deployment)

{
  "args": [
    "<string>"
  ],
  "autoscaling": {},
  "command": [
    "<string>"
  ],
  "cpu": 123,
  "created_at": "<string>",
  "description": "<string>",
  "desired_replicas": 123,
  "environment_variables": [
    {
      "name": "<string>",
      "value": "<string>",
      "value_from_secret": "<string>"
    }
  ],
  "gpu_count": 123,
  "gpu_type": "h100-80gb",
  "health_check_path": "<string>",
  "id": "<string>",
  "image": "<string>",
  "max_replicas": 123,
  "memory": 123,
  "min_replicas": 123,
  "name": "<string>",
  "object": "<unknown>",
  "port": 123,
  "ready_replicas": 123,
  "replica_events": {},
  "status": "Updating",
  "storage": 123,
  "updated_at": "<string>",
  "volumes": [
    {
      "mount_path": "<string>",
      "name": "<string>"
    }
  ]
}

Authorizations

Authorization

string

header

default:default

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

string

required

Deployment ID or name

Response

Deployment details

args

string[]

Args are the arguments passed to the container's command

autoscaling

object

Autoscaling contains autoscaling configuration parameters for this deployment

Show child attributes

command

string[]

Command is the entrypoint command run in the container

cpu

number

CPU is the amount of CPU resource allocated to each replica in cores (fractional value is allowed)

created_at

string

CreatedAt is the ISO8601 timestamp when this deployment was created

description

string

Description provides a human-readable explanation of the deployment's purpose or content

desired_replicas

integer

DesiredReplicas is the number of replicas that the orchestrator is targeting

environment_variables

object[]

EnvironmentVariables is a list of environment variables set in the container

Show child attributes

gpu_count

integer

GPUCount is the number of GPUs allocated to each replica in this deployment

gpu_type

enum<string>

GPUType specifies the type of GPU requested (if any) for this deployment

Available options:

h100-80gb,

a100-80gb

health_check_path

string

HealthCheckPath is the HTTP path used for health checks of the application

string

ID is the unique identifier of the deployment

image

string

Image specifies the container image used for this deployment

max_replicas

integer

MaxReplicas is the maximum number of replicas to run for this deployment

memory

number

Memory is the amount of memory allocated to each replica in GiB (fractional value is allowed)

min_replicas

integer

MinReplicas is the minimum number of replicas to run for this deployment

name

string

Name is the name of the deployment

object

any

The object type, which is always deployment.

port

integer

Port is the container port that the deployment exposes

ready_replicas

integer

ReadyReplicas is the current number of replicas that are in the Ready state

replica_events

object

ReplicaEvents is a mapping of replica names or IDs to their status events

Show child attributes

status

enum<string>

Status represents the overall status of the deployment (e.g., Updating, Scaling, Ready, Failed)

Available options:

Updating,

Scaling,

Ready,

Failed

storage

integer

Storage is the amount of storage (in MB or units as defined by the platform) allocated to each replica

updated_at

string

UpdatedAt is the ISO8601 timestamp when this deployment was last updated

volumes

object[]

Volumes is a list of volume mounts for this deployment

Show child attributes

Create Deployment

Update Deployment

⌘I

Together APIs

Command Line Interface

General

Authorizations

Path Parameters

Response