Get started
Quickstart
Deploy and call your first endpoint in 5 minutes.
Manage endpoints
Create, start, stop, update, and delete via the UI or API.
Endpoint settings
Configure endpoint hardware, autoscaling, decoding, prompt caching.
Inference APIs
Explore the API surface for chat, vision, audio, embeddings, and more.
Available models
Browse Together-hosted models you can deploy on dedicated endpoints.
Upload a custom model
Upload your own model weights.
Pricing
Dedicated endpoints bill per-minute by hardware while the endpoint is running, regardless of your model or request volume.| Hardware type | Cost/hour |
|---|---|
| 1x H100 80GB | $3.99 |
| 1x H200 141GB | $5.49 |
| 1x B200 180GB | $9.95 |