Jump to Content

Documentation API Reference

Web Playground Log In

Documentation

Web Playground Log In

Documentation API Reference

Getting Started

Introduction
Quickstart
OpenAI Compatibility

Inference

Serverless models
Dedicated models
- Dedicated Inference
- Uploading a fine-tuned model
Batch Inference
DeepSeek-R1 Quickstart
Llama 4 Quickstart

Capabilities

Chat
Structured outputs
Function calling
Images
Vision
Code Execution
- Together Code Interpreter
- Together Code Sandbox
Other modalities

Examples

Agent Workflows
Together Cookbooks
Example Apps
Integrations

Training

Fine-tuning
GPU Clusters

Guides

How to Build Coding Agents
Getting started with logprobs
Quickstart: Retrieval Augmented Generation (RAG)
Quickstart: Next.js
Quickstart: Using Hugging Face Inference with Together
Quickstart: Using Vercel's AI SDK with Together AI
Together Mixture-Of-Agents (MoA)
Quickstart: How to do OCR
Search and RAG
Apps
How to use Cline with DeepSeek V3 to build faster

frequently asked questions

Deployment Options
Rate limits
Error codes
Deploy dedicated endpoints in the web
Priority Support
- Create tickets in Slack
- Customer Ticket Portal
Deprecations
Playground
Inference FAQs
Fine-tuning FAQs
Multiple API Keys
Dedicated endpoints

Examples

Use these examples to learn best practices for using the inference API.

Together Mixture-Of-Agents (MoA)
Building an AI data analyst
How to build a Claude Artifacts Clone with Llama 3.1 405B

Updated 8 months ago