Chat models
Organization | Model name | API model name | Context length |
---|---|---|---|
DeepSeek | DeepSeek R1 (orignal) | deepseek-ai/DeepSeek-R1-DE | 163840 |
DeepSeek | DeepSeek R1-0528 | deepseek-ai/DeepSeek-R1 | 163840 |
DeepSeek | DeepSeek R1 Distill Llama 70B | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 131072 |
DeepSeek | DeepSeek R1 Distill Qwen 14B | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 131072 |
DeepSeek | DeepSeek V3-0324 | deepseek-ai/DeepSeek-V3 | 131072 |
Meta | Meta Llama 3.3 70B Instruct Turbo | meta-llama/Llama-3.3-70B-Instruct-Turbo | 131072 |
Meta | Meta Llama 3.2 3B Instruct Turbo | meta-llama/Llama-3.2-3B-Instruct-Turbo | 131072 |
Meta | Meta Llama 3.1 70B Instruct Turbo | meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | 131072 |
Meta | Meta Llama 3 70B Instruct Turbo | meta-llama/Meta-Llama-3-70B-Instruct-Turbo | 8192 |
Meta | Llama 4 Maverick Instruct (17Bx128E) | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 1048576 |
Meta | Llama 4 Scout Instruct (17Bx16E) | meta-llama/Llama-4-Scout-17B-16E-Instruct | 1048576 |
Meta | Meta Llama 3.1 8B Instruct Turbo | meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | 131072 |
Meta | Meta Llama 3.1 405B Instruct | meta-llama/Llama-3.1-405B-Instruct | 4096 |
Meta | Meta Llama 3.2 1B Instruct | meta-llama/Llama-3.2-1B-Instruct | 131072 |
Meta | Meta Llama 3 8B Instruct | meta-llama/Meta-Llama-3-8B-Instruct | 8192 |
mistralai | Mistral (7B) Instruct v0.3 | mistralai/Mistral-7B-Instruct-v0.3 | 32768 |
mistralai | Mixtral-8x7B Instruct v0.1 | mistralai/Mixtral-8x7B-Instruct-v0.1 | 32768 |
mistralai | Mistral (7B) Instruct | mistralai/Mistral-7B-Instruct-v0.1 | 32768 |
mistralai | Mistral (7B) Instruct v0.2 | mistralai/Mistral-7B-Instruct-v0.2 | 32768 |
OpenAI | OpenAI GPT-OSS 20B | openai/gpt-oss-20b | 131072 |
OpenAI | OpenAI GPT-OSS 120B | openai/gpt-oss-120b | 131072 |
Qwen | Qwen2.5-VL (72B) Instruct | Qwen/Qwen2.5-VL-72B-Instruct | 32768 |
Qwen | Qwen 2.5 Coder 32B Instruct | Qwen/Qwen2.5-Coder-32B-Instruct | 16384 |
Qwen | Qwen2.5 72B Instruct Turbo | Qwen/Qwen2.5-72B-Instruct-Turbo | 131072 |
Qwen | Qwen QwQ-32B | Qwen/QwQ-32B | 131072 |
Qwen | Qwen2.5 7B Instruct Turbo | Qwen/Qwen2.5-7B-Instruct-Turbo | 32768 |
Qwen | Qwen3 Coder 480B A35B Instruct Fp8 | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262144 |
Qwen | Qwen2.5 72B Instruct | Qwen/Qwen2.5-72B-Instruct | 32768 |
Rerank models
Organization | Model name | API model name | Context length |
---|---|---|---|
salesforce | Salesforce Llama Rank V1 (8B) | Salesforce/Llama-Rank-V1 | 8192 |