Get a deployment by ID or name
Retrieve details of a specific deployment by its ID or name
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Path Parameters
Deployment ID or name
Response
Deployment details
Args are the arguments passed to the container's command
Autoscaling contains autoscaling configuration parameters for this deployment. Omitted when autoscaling is disabled (nil)
- Option 1
- Option 2
- Option 3
Command is the entrypoint command run in the container
CPU is the amount of CPU resource allocated to each replica in cores (fractional value is allowed)
CreatedAt is the ISO8601 timestamp when this deployment was created
Description provides a human-readable explanation of the deployment's purpose or content
DesiredReplicas is the number of replicas that the orchestrator is targeting
EnvironmentVariables is a list of environment variables set in the container
GPUCount is the number of GPUs allocated to each replica in this deployment
GPUType specifies the type of GPU requested (if any) for this deployment
h100-80gb, h100-40gb-mig, h200-140gb, b200-192gb HealthCheckPath is the HTTP path used for health checks of the application
ID is the unique identifier of the deployment
Image specifies the container image used for this deployment
MaxReplicas is the maximum number of replicas to run for this deployment
Memory is the amount of memory allocated to each replica in GiB (fractional value is allowed)
MinReplicas is the minimum number of replicas to run for this deployment
Name is the name of the deployment
The object type, which is always deployment.
Port is the container port that the deployment exposes
ReadyReplicas is the current number of replicas that are in the Ready state
ReplicaEvents is a mapping of replica names or IDs to their status events
Status represents the overall status of the deployment (e.g., Updating, Scaling, Ready, Failed)
Updating, Scaling, Ready, Failed, ScaledToZero Storage is the amount of storage (in MB or units as defined by the platform) allocated to each replica
TerminationGracePeriodSeconds is the time in seconds to wait for graceful shutdown before forcefully terminating the replica
UpdatedAt is the ISO8601 timestamp when this deployment was last updated
Volumes is a list of volume mounts for this deployment