Dedicated models

Chat models

OrganizationModel NameAPI Model StringContext lengthQuantization
01.AI01-ai Yi Chat (34B)zero-one-ai/Yi-34B-Chat4096FP16
AllenAIOLMo Instruct (7B)allenai/OLMo-7B-Instruct2048FP16
AustismChronos Hermes (13B)Austism/chronos-hermes-13b2048FP16
carsoncarson ml318brcarson/ml318br8192FP16
cognitivecomputationsDolphin 2.5 Mixtral 8x7bcognitivecomputations/dolphin-2.5-mixtral-8x7b32768FP16
DatabricksDBRX Instructdatabricks/dbrx-instruct32768FP16
DeepSeekDeepSeek LLM Chat (67B)deepseek-ai/deepseek-llm-67b-chat4096FP16
DeepSeekDeepseek Coder Instruct (33B)deepseek-ai/deepseek-coder-33b-instruct16384FP16
garage-bAIndPlatypus2 Instruct (70B)garage-bAInd/Platypus2-70B-instruct4096FP16
googleGemma-2 Instruct (9B)google/gemma-2-9b-it8192FP16
GoogleGemma Instruct (2B)google/gemma-2b-it8192FP16
GoogleGemma-2 Instruct (27B)google/gemma-2-27b-it8192FP16
GoogleGemma Instruct (7B)google/gemma-7b-it8192FP16
gradientaiLlama-3 70B Instruct Gradient 1048Kgradientai/Llama-3-70B-Instruct-Gradient-1048k1048576FP16
GrypheMythoMax-L2 (13B)Gryphe/MythoMax-L2-13b4096FP16
GrypheGryphe MythoMax L2 Lite (13B)Gryphe/MythoMax-L2-13b-Lite4096FP16
Haotian LiuLLaVa-Next (Mistral-7B)llava-hf/llava-v1.6-mistral-7b-hf4096FP16
HuggingFaceZephyr-7B-ßHuggingFaceH4/zephyr-7b-beta32768FP16
LM SysKoala (7B)togethercomputer/Koala-7B2048FP16
LM SysVicuna v1.3 (7B)lmsys/vicuna-7b-v1.32048FP16
LM SysVicuna v1.5 16K (13B)lmsys/vicuna-13b-v1.5-16k16384FP16
LM SysVicuna v1.5 (13B)lmsys/vicuna-13b-v1.54096FP16
LM SysVicuna v1.3 (13B)lmsys/vicuna-13b-v1.32048FP16
LM SysKoala (13B)togethercomputer/Koala-13B2048FP16
LM SysVicuna v1.5 (7B)lmsys/vicuna-7b-v1.54096FP16
MetaCode Llama Instruct (34B)codellama/CodeLlama-34b-Instruct-hf16384FP16
MetaLlama3 8B Chat HF INT4togethercomputer/Llama-3-8b-chat-hf-int48192FP16
MetaMeta Llama 3.2 90B Vision Instruct Turbometa-llama/Llama-3.2-90B-Vision-Instruct-Turbo131072FP16
MetaMeta Llama 3.2 11B Vision Instruct Turbometa-llama/Llama-3.2-11B-Vision-Instruct-Turbo131072FP16
MetaMeta Llama 3.2 3B Instruct Turbometa-llama/Llama-3.2-3B-Instruct-Turbo131072FP16
MetaTogethercomputer Llama3 8B Instruct Int8togethercomputer/Llama-3-8b-chat-hf-int88192FP16
MetaMeta Llama 3.1 70B Instruct Turbometa-llama/Meta-Llama-3.1-70B-Instruct-Turbo32768FP8
MetaLLaMA-2 Chat (13B)meta-llama/Llama-2-13b-chat-hf4096FP16
MetaMeta Llama 3 70B Instruct Litemeta-llama/Meta-Llama-3-70B-Instruct-Lite8192INT4
MetaMeta Llama 3 8B Instruct Referencemeta-llama/Llama-3-8b-chat-hf8192FP16
MetaMeta Llama 3 70B Instruct Referencemeta-llama/Llama-3-70b-chat-hf8192FP16
MetaMeta Llama 3 8B Instruct Turbometa-llama/Meta-Llama-3-8B-Instruct-Turbo8192FP8
MetaMeta Llama 3 8B Instruct Litemeta-llama/Meta-Llama-3-8B-Instruct-Lite8192INT4
MetaMeta Llama 3.1 405B Instruct Turbometa-llama/Meta-Llama-3.1-405B-Instruct-Lite-Pro4096FP16
MetaLLaMA-2 Chat (7B)meta-llama/Llama-2-7b-chat-hf4096FP16
MetaMeta Llama 3.1 405B Instruct Turbometa-llama/Meta-Llama-3.1-405B-Instruct-Turbo130815FP8
MetaMeta Llama Vision Freemeta-llama/Llama-Vision-Free131072FP16
MetaMeta Llama 3 70B Instruct Turbometa-llama/Meta-Llama-3-70B-Instruct-Turbo8192FP8
MetaMeta Llama 3.1 8B Instruct Turbometa-llama/Meta-Llama-3.1-8B-Instruct-Turbo32768FP8
MetaCode Llama Instruct (7B)togethercomputer/CodeLlama-7b-Instruct16384FP16
MetaCode Llama Instruct (34B)togethercomputer/CodeLlama-34b-Instruct16384FP16
MetaCode Llama Instruct (13B)codellama/CodeLlama-13b-Instruct-hf16384FP16
MetaCode Llama Instruct (13B)togethercomputer/CodeLlama-13b-Instruct16384FP16
MetaLLaMA-2 Chat (13B)togethercomputer/llama-2-13b-chat4096FP16
MetaLLaMA-2 Chat (7B)togethercomputer/llama-2-7b-chat4096FP16
MetaMeta Llama 3 8B Instructmeta-llama/Meta-Llama-3-8B-Instruct8192FP16
MetaMeta Llama 3 70B Instructmeta-llama/Meta-Llama-3-70B-Instruct8192FP16
MetaCode Llama Instruct (70B)codellama/CodeLlama-70b-Instruct-hf4096FP16
MetaLLaMA-2 Chat (70B)togethercomputer/llama-2-70b-chat4096FP16
MetaCode Llama Instruct (7B)codellama/CodeLlama-7b-Instruct-hf16384FP16
MetaLLaMA-2 Chat (70B)meta-llama/Llama-2-70b-chat-hf4096FP16
MetaMeta Llama 3.1 8B Instructmeta-llama/Meta-Llama-3.1-8B-Instruct-Reference16384FP16
MetaMeta Llama 3.1 70B Instruct Turboalbert/meta-llama-3-1-70b-instruct-turbo131072FP16
MetaMeta Llama 3.1 70B Instructmeta-llama/Meta-Llama-3.1-70B-Instruct-Reference8192FP16
microsoftWizardLM-2 (8x22B)microsoft/WizardLM-2-8x22B65536FP16
mistralaiMistral (7B) Instructmistralai/Mistral-7B-Instruct-v0.14096FP16
mistralaiMistral (7B) Instruct v0.2mistralai/Mistral-7B-Instruct-v0.232768FP16
mistralaiMistral (7B) Instruct v0.3mistralai/Mistral-7B-Instruct-v0.332768FP16
mistralaiMixtral-8x7B Instruct v0.1mistralai/Mixtral-8x7B-Instruct-v0.132768FP16
mistralaiMixtral-8x22B Instruct v0.1mistralai/Mixtral-8x22B-Instruct-v0.165536FP16
NousResearchNous Hermes 2 - Mixtral 8x7B-DPONousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO32768FP16
NousResearchNous Hermes LLaMA-2 (70B)NousResearch/Nous-Hermes-Llama2-70b4096FP16
NousResearchNous Hermes 2 - Mixtral 8x7B-SFTNousResearch/Nous-Hermes-2-Mixtral-8x7B-SFT32768FP16
NousResearchNous Hermes Llama-2 (13B)NousResearch/Nous-Hermes-Llama2-13b4096FP16
NousResearchNous Hermes 2 - Mistral DPO (7B)NousResearch/Nous-Hermes-2-Mistral-7B-DPO32768FP16
NousResearchNous Hermes LLaMA-2 (7B)NousResearch/Nous-Hermes-llama-2-7b4096FP16
NousResearchNous Capybara v1.9 (7B)NousResearch/Nous-Capybara-7B-V1p98192FP16
NousResearchHermes 2 Theta Llama-3 70BNousResearch/Hermes-2-Theta-Llama-3-70B8192FP16
OpenChatOpenChat 3.5openchat/openchat-3.5-12108192FP16
OpenOrcaOpenOrca Mistral (7B) 8KOpen-Orca/Mistral-7B-OpenOrca8192FP16
QwenQwen 2 Instruct (72B)Qwen/Qwen2-72B-Instruct32768FP16
QwenQwen2.5 72B Instruct TurboQwen/Qwen2.5-72B-Instruct-Turbo32768FP8
QwenQwen2.5 7B Instruct TurboQwen/Qwen2.5-7B-Instruct-Turbo32768FP8
QwenQwen 1.5 Chat (110B)Qwen/Qwen1.5-110B-Chat32768FP16
QwenQwen 1.5 Chat (72B)Qwen/Qwen1.5-72B-Chat32768FP16
QwenQwen 2 Instruct (1.5B)Qwen/Qwen2-1.5B-Instruct32768FP16
QwenQwen 2 Instruct (7B)Qwen/Qwen2-7B-Instruct32768FP16
QwenQwen 1.5 Chat (14B)Qwen/Qwen1.5-14B-Chat32768FP16
QwenQwen 1.5 Chat (1.8B)Qwen/Qwen1.5-1.8B-Chat32768FP16
QwenQwen 1.5 Chat (32B)Qwen/Qwen1.5-32B-Chat32768FP16
QwenQwen 1.5 Chat (7B)Qwen/Qwen1.5-7B-Chat32768FP16
QwenQwen 1.5 Chat (0.5B)Qwen/Qwen1.5-0.5B-Chat32768FP16
QwenQwen 1.5 Chat (4B)Qwen/Qwen1.5-4B-Chat32768FP16
Snorkel AISnorkel Mistral PairRM DPO (7B)snorkelai/Snorkel-Mistral-PairRM-DPO32768FP16
SnowflakeSnowflake Arctic InstructSnowflake/snowflake-arctic-instruct4096FP16
StanfordAlpaca (7B)togethercomputer/alpaca-7b2048FP16
tekniumOpenHermes-2-Mistral (7B)teknium/OpenHermes-2-Mistral-7B8192FP16
tekniumOpenHermes-2.5-Mistral (7B)teknium/OpenHermes-2p5-Mistral-7B8192FP16
testTest 11test/test114096FP16
Tim DettmersGuanaco (65B)togethercomputer/guanaco-65b2048FP16
Tim DettmersGuanaco (13B)togethercomputer/guanaco-13b2048FP16
Tim DettmersGuanaco (33B)togethercomputer/guanaco-33b2048FP16
Tim DettmersGuanaco (7B)togethercomputer/guanaco-7b2048FP16
Undi95ReMM SLERP L2 (13B)Undi95/ReMM-SLERP-L2-13B4096FP16
Undi95Toppy M (7B)Undi95/Toppy-M-7B4096FP16
upstageUpstage SOLAR Instruct v1 (11B)upstage/SOLAR-10.7B-Instruct-v1.04096FP16
upstageUpstage SOLAR Instruct v1 (11B)-Int4togethercomputer/SOLAR-10.7B-Instruct-v1.0-int44096FP16
WizardLMWizardLM v1.2 (13B)WizardLM/WizardLM-13B-V1.24096FP16

Image models

OrganizationModel NameAPI Model String
Black Forest LabsFLUX.1 [pro]black-forest-labs/FLUX.1-pro
Black Forest LabsFLUX.1 [schnell]black-forest-labs/FLUX.1-schnell
Black Forest LabsFLUX1.1 [pro]black-forest-labs/FLUX.1.1-pro
Black Forest LabsFLUX.1 [schnell] Freeblack-forest-labs/FLUX.1-schnell-Free
Prompt HeroOpenjourney v4prompthero/openjourney
Runway MLStable Diffusion 1.5runwayml/stable-diffusion-v1-5
SG161222Realistic Vision 3.0SG161222/Realistic_Vision_V3.0_VAE
Stability AIStable Diffusion XL 1.0stabilityai/stable-diffusion-xl-base-1.0
Stability AIStable Diffusion 2.1stabilityai/stable-diffusion-2-1
WavymulderAnalog Diffusionwavymulder/Analog-Diffusion

Language models

OrganizationModel NameAPI Model StringContext length
01.AI01-ai Yi Base (34B)zero-one-ai/Yi-34B4096
01.AI01-ai Yi Base (6B)zero-one-ai/Yi-6B4096
AllenAIOLMo (7B)allenai/OLMo-7B2048
EleutherAILlemma (7B)EleutherAI/llemma_7b4096
googleGemma 2 (9B)google/gemma-2-9b8192
GoogleGemma (7B)google/gemma-7b8192
GoogleGemma (2B)google/gemma-2b8192
MetaMeta Llama 3 8Bmeta-llama/Meta-Llama-3-8B8192
MetaLLaMA-2 (70B)meta-llama/Llama-2-70b-hf4096
MetaLLaMA-2 (7B)togethercomputer/llama-2-7b4096
MetaLLaMA (7B)huggyllama/llama-7b2048
MetaLLaMA (65B)huggyllama/llama-65b2048
MetaLLaMA-2 (13B)togethercomputer/llama-2-13b4096
MetaLLaMA-2 (70B)togethercomputer/llama-2-70b4096
MetaLLaMA-2 (13B)meta-llama/Llama-2-13b-hf4096
MetaLLaMA (13B)huggyllama/llama-13b2048
MetaLLaMA (30B)huggyllama/llama-30b2048
MetaMeta Llama 3 70Bmeta-llama/Meta-Llama-3-70B8192
MetaMeta Llama 3 8Bmeta-llama/Llama-3-8b-hf8192
MetaLLaMA-2 (7B)meta-llama/Llama-2-7b-hf4096
MetaMeta Llama 3 70B HFmeta-llama/Llama-3-70b-hf8192
MetaMeta Llama 3.1 8Bmeta-llama/Meta-Llama-3.1-8B-Reference8192
MetaMeta Llama 3.1 70Bmeta-llama/Meta-Llama-3.1-70B-Reference8192
MicrosoftMicrosoft Phi-2microsoft/phi-22048
mistralaiMixtral-8x7B v0.1mistralai/Mixtral-8x7B-v0.132768
mistralaiMistral (7B)mistralai/Mistral-7B-v0.14096
mistralaiMixtral-8x22Bmistralai/Mixtral-8x22B65536
NexusflowNexusRaven (13B)Nexusflow/NexusRaven-V2-13B16384
Nous ResearchNous Hermes (13B)NousResearch/Nous-Hermes-13b2048
QwenQwen 2 (72B)Qwen/Qwen2-72B32768
QwenQwen 1.5 (0.5B)Qwen/Qwen1.5-0.5B32768
QwenQwen 1.5 (1.8B)Qwen/Qwen1.5-1.8B32768
QwenQwen 1.5 (4B)Qwen/Qwen1.5-4B32768
QwenQwen 1.5 (7B)Qwen/Qwen1.5-7B32768
QwenQwen 1.5 (72B)Qwen/Qwen1.5-72B4096
QwenQwen 2 (7B)Qwen/Qwen2-7B32768
QwenQwen 2 (1.5B)Qwen/Qwen2-1.5B32768
QwenQwen 1.5 (32B)Qwen/Qwen1.5-32B32768
QwenQwen 1.5 (14B)Qwen/Qwen1.5-14B32768
TogetherStripedHyena Hessian (7B)togethercomputer/StripedHyena-Hessian-7B32768
TogetherLLaMA-2-32K (7B)togethercomputer/LLaMA-2-7B-32K32768
TogetherEvo-1 Base (131K)togethercomputer/evo-1-131k-base131073
TogetherEvo-1 Base (8K)togethercomputer/evo-1-8k-base8192
WizardLMWizardLM v1.0 (70B)WizardLM/WizardLM-70B-V1.04096

Code models

OrganizationModel NameAPI Model StringContext length
MetaCode Llama Python (34B)codellama/CodeLlama-34b-Python-hf16384
MetaCode Llama Python (70B)codellama/CodeLlama-70b-Python-hf4096
MetaCode Llama Python (34B)togethercomputer/CodeLlama-34b-Python16384
MetaCode Llama (34B)togethercomputer/CodeLlama-34b16384
MetaCode Llama (13B)codellama/CodeLlama-13b-hf16384
MetaCode Llama (34B)codellama/CodeLlama-34b-hf16384
MetaCode Llama Python (7B)togethercomputer/CodeLlama-7b-Python16384
MetaCode Llama (70B)codellama/CodeLlama-70b-hf16384
MetaCode Llama Python (13B)togethercomputer/CodeLlama-13b-Python16384
MetaCode Llama (7B)codellama/CodeLlama-7b-hf16384
MetaCode Llama Python (13B)codellama/CodeLlama-13b-Python-hf16384
MetaCode Llama Python (7B)codellama/CodeLlama-7b-Python-hf16384
Numbers StationNSQL LLaMA-2 (7B)NumbersStation/nsql-llama-2-7B4096
PhindPhind Code LLaMA v2 (34B)Phind/Phind-CodeLlama-34B-v216384
PhindPhind Code LLaMA Python v1 (34B)Phind/Phind-CodeLlama-34B-Python-v116384
WizardLMWizardCoder Python v1.0 (34B)WizardLM/WizardCoder-Python-34B-V1.08192

Moderation models

OrganizationModel NameAPI Model StringContext length
MetaMeta Llama Guard 3 8Bmeta-llama/Meta-Llama-Guard-3-8B8192
MetaMeta Llama Guard 2 8Bmeta-llama/LlamaGuard-2-8b8192
MetaMeta Llama Guard 3 11B Vision Turbometa-llama/Llama-Guard-3-11B-Vision-Turbo131072
MetaLlama Guard (7B)Meta-Llama/Llama-Guard-7b4096

Embedding models

OrganizationModel NameAPI Model StringContext length
BAAIBAAI-Bge-Base-1p5BAAI/bge-base-en-v1.5undefined
BAAIBAAI-Bge-Large-1p5BAAI/bge-large-en-v1.5undefined
GoogleBert Base Uncasedbert-base-uncasedundefined
HazyResearchM2-BERT 2K Retrieval Encoder V1hazyresearch/M2-BERT-2k-Retrieval-Encoder-V12048
TogetherM2-BERT-Retrieval-32ktogethercomputer/m2-bert-80M-32k-retrieval32768
TogetherM2-BERT-Retrieval-2Ktogethercomputer/m2-bert-80M-2k-retrievalundefined
TogetherM2-BERT-Retrieval-8ktogethercomputer/m2-bert-80M-8k-retrieval8192
TogetherSentence-BERTsentence-transformers/msmarco-bert-base-dot-v5512
WhereIsAIUAE-Large-V1WhereIsAI/UAE-Large-V1undefined

Rerank models

OrganizationModel NameAPI Model StringMax Doc Size (tokens)Max Docs
salesforceSalesforce Llama Rank V1 (8B)Salesforce/Llama-Rank-V181921024