Instant Clusters