Models - Serverless endpoints
The following models are available through the Together Playgrounds and inference API as serverless endpoints. A dedicated instance(s) for these models can also be created for you at your request. See inference pricing.
Run the following command to see the correct prompt format and stop sequence(s).
together models info $MODEL_API_STRING
Chat Models
Organization | Model Name | Model String for API | Max Seq Length |
---|---|---|---|
Stanford | Alpaca (7B) | togethercomputer/alpaca-7b | 2048 |
Austism | Chronos Hermes (13B) | Austism/chronos-hermes-13b | 2048 |
Meta | Code Llama Instruct (13B) | togethercomputer/CodeLlama-13b-Instruct | 8192 |
Meta | Code Llama Instruct (34B) | togethercomputer/CodeLlama-34b-Instruct | 8192 |
Meta | Code Llama Instruct (7B) | togethercomputer/CodeLlama-7b-Instruct | 8192 |
TII UAE | Falcon Instruct (40B) | togethercomputer/falcon-40b-instruct | 2048 |
TII UAE | Falcon Instruct (7B) | togethercomputer/falcon-7b-instruct | 2048 |
Together | GPT-NeoXT-Chat-Base (20B) | togethercomputer/GPT-NeoXT-Chat-Base-20B | 2048 |
Meta | LLaMA-2 Chat (13B) | togethercomputer/llama-2-13b-chat | 4096 |
Meta | LLaMA-2 Chat (70B) | togethercomputer/llama-2-70b-chat | 4096 |
Meta | LLaMA-2 Chat (7B) | togethercomputer/llama-2-7b-chat | 4096 |
Together | LLaMA-2-7B-32K-Instruct (7B) | togethercomputer/Llama-2-7B-32K-Instruct | 32768 |
mistralai | Mistral (7B) Instruct | mistralai/Mistral-7B-Instruct-v0.1 | 4096 |
Gryphe | MythoMax-L2 (13B) | Gryphe/MythoMax-L2-13b | 4096 |
NousResearch | Nous Hermes LLaMA-2 (7B) | NousResearch/Nous-Hermes-llama-2-7b | 4096 |
NousResearch | Nous Hermes Llama-2 (13B) | NousResearch/Nous-Hermes-Llama2-13b | 4096 |
NousResearch | Nous Hermes Llama-2 (70B) | NousResearch/Nous-Hermes-Llama2-70b | 4096 |
NousResearch | Nous Capybara v1.9 (7B) | NousResearch/Nous-Capybara-7B-V1p9 | 8192 |
teknium | OpenHermes-2-Mistral (7B) | teknium/OpenHermes-2-Mistral-7B | 4096 |
teknium | OpenHermes-2.5-Mistral (7B) | teknium/OpenHermes-2p5-Mistral-7B | 4096 |
OpenOrca | OpenOrca Mistral (7B) 8K | Open-Orca/Mistral-7B-OpenOrca | 8192 |
garage-bAInd | Platypus2 Instruct (70B) | garage-bAInd/Platypus2-70B-instruct | 4096 |
Together | Pythia-Chat-Base (7B) | togethercomputer/Pythia-Chat-Base-7B-v0.16 | 2048 |
Qwen | Qwen-Chat (7B) | togethercomputer/Qwen-7B-Chat | 8192 |
Together | RedPajama-INCITE Chat (3B) | togethercomputer/RedPajama-INCITE-Chat-3B-v1 | 2048 |
Together | RedPajama-INCITE Chat (7B) | togethercomputer/RedPajama-INCITE-7B-Chat | 2048 |
Upstage | SOLAR v0 (70B) | upstage/SOLAR-0-70b-16bit | 4096 |
LM Sys | Vicuna v1.5 (7B) | lmsys/vicuna-7b-v1.5 | 4096 |
LM Sys | Vicuna v1.5 (13B) | lmsys/vicuna-13b-v1.5 | 4096 |
LM Sys | Vicuna v1.5 16K (13B) | lmsys/vicuna-13b-v1.5-16k | 16384 |
Language Models
Organization | Model Name | Model String for API | Max Seq Len |
---|---|---|---|
TII UAE | Falcon (40B) | togethercomputer/falcon-40b | 2048 |
TII UAE | Falcon (7B) | togethercomputer/falcon-7b | 2048 |
Together | GPT-JT (6B) | togethercomputer/GPT-JT-6B-v1 | 2048 |
Together | GPT-JT-Moderation (6B) | togethercomputer/GPT-JT-Moderation-6B | 2048 |
Meta | LLaMA (65B) | huggyllama/llama-65b | 2048 |
Meta | LLaMA-2 (13B) | togethercomputer/llama-2-13b | 4096 |
Meta | LLaMA-2 (70B) | togethercomputer/llama-2-70b | 4096 |
Meta | LLaMA-2 (7B) | togethercomputer/llama-2-7b | 4096 |
Together | LLaMA-2-32K (7B) | togethercomputer/LLaMA-2-7B-32K | 32768 |
EleutherAI | Llemma (7B) | EleutherAI/llemma_7b | 4096 |
mistralai | Mistral (7B) | mistralai/Mistral-7B-v0.1 | 4096 |
Qwen | Qwen (7B) | togethercomputer/Qwen-7B | 8192 |
Together | RedPajama-INCITE (3B) | togethercomputer/RedPajama-INCITE-Base-3B-v1 | 2048 |
Together | RedPajama-INCITE (7B) | togethercomputer/RedPajama-INCITE-7B-Base | 2048 |
Together | RedPajama-INCITE Instruct (3B) | togethercomputer/RedPajama-INCITE-Instruct-3B-v1 | 2048 |
Together | RedPajama-INCITE Instruct (7B) | togethercomputer/RedPajama-INCITE-7B-Instruct | 2048 |
WizardLM | WizardLM v1.0 (70B) | WizardLM/WizardLM-70B-V1.0 | 4096 |
Image Models
Organization | Model Name | Model String for API |
---|---|---|
Wavymulder | Analog Diffusion | wavymulder/Analog-Diffusion |
Prompt Hero | Openjourney v4 | prompthero/openjourney |
SG161222 | Realistic Vision 3.0 | SG161222/Realistic_Vision_V3.0_VAE |
Runway ML | Stable Diffusion 1.5 | runwayml/stable-diffusion-v1-5 |
Stability AI | Stable Diffusion 2.1 | stabilityai/stable-diffusion-2-1 |
Stability AI | Stable Diffusion XL 1.0 | stabilityai/stable-diffusion-xl-base-1.0 |
Code Models
Organization | Model Name | Model String for API | Max Seq Len |
---|---|---|---|
Meta | Code Llama (13B) | togethercomputer/CodeLlama-13b | 16384 |
Meta | Code Llama (34B) | togethercomputer/CodeLlama-34b | 16384 |
Meta | Code Llama (7B) | togethercomputer/CodeLlama-7b | 16384 |
Meta | Code Llama Python (13B) | togethercomputer/CodeLlama-13b-Python | 16384 |
Meta | Code Llama Python (34B) | togethercomputer/CodeLlama-34b-Python | 16384 |
Meta | Code Llama Python (7B) | togethercomputer/CodeLlama-7b-Python | 16384 |
Numbers Station | NSQL LLaMA-2 (7B) | NumbersStation/nsql-llama-2-7B | 4096 |
Phind | Phind Code LLaMA Python v1 (34B) | Phind/Phind-CodeLlama-34B-Python-v1 | 16384 |
Phind | Phind Code LLaMA v2 (34B) | Phind/Phind-CodeLlama-34B-v2 | 16384 |
WizardLM | WizardCoder v1.0 (15B) | WizardLM/WizardCoder-15B-V1.0 | 8192 |
Updated 3 days ago