Recommended Models by Use Case
| Use Case | Recommended Model | Model String | Alternatives | Learn More |
|---|---|---|---|---|
| Chat | Kimi K2.5 (instant mode) | moonshotai/Kimi-K2.5 | deepseek-ai/DeepSeek-V3.1, openai/gpt-oss-120b | Chat |
| Reasoning | Kimi K2.5 (reasoning mode) | moonshotai/Kimi-K2.5 | deepseek-ai/DeepSeek-R1, Qwen/Qwen3-235B-A22B-Instruct-2507-tput | Reasoning Guide, DeepSeek R1 |
| Coding Agents | GLM-5.1 | zai-org/GLM-5.1 | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8, deepseek-ai/DeepSeek-V3.1 | Building Agents |
| Small & Fast | Gemma 4 31B IT | google/gemma-4-31B-it | openai/gpt-oss-20b, Qwen/Qwen3.5-9B | - |
| Medium General Purpose | GPT-OSS 120B | openai/gpt-oss-120b | MiniMaxAI/MiniMax-M2.5, meta-llama/Llama-3.3-70B-Instruct-Turbo | - |
| Function Calling | GLM-5.1 | zai-org/GLM-5.1 | moonshotai/Kimi-K2.5, deepseek-ai/DeepSeek-V3.1 | Function Calling |
| Vision | Kimi K2.5 | moonshotai/Kimi-K2.5 | google/gemma-4-31B-it, Qwen/Qwen3.5-397B-A17B, Qwen/Qwen3.5-9B | Vision, OCR |
| Image Generation | Flash Image 2.5 (Nano Banana) | google/flash-image-2.5 | black-forest-labs/FLUX.2-pro, ByteDance-Seed/Seedream-4.0 | Images |
| Image-to-Image | Flash Image 2.5 (Nano Banana) | google/flash-image-2.5 | black-forest-labs/FLUX.1-kontext-max, google/gemini-3-pro-image | Flux Kontext |
| Text-to-Video | Sora 2 | openai/sora-2-pro | google/veo-3.0, ByteDance/Seedance-1.0-pro | Video Generation |
| Image-to-Video | Veo 3.0 | google/veo-3.0 | ByteDance/Seedance-1.0-pro, kwaivgI/kling-2.1-master | Video Generation |
| Text-to-Speech | Cartesia Sonic 3 | cartesia/sonic-3 | deepgram/aura-2, canopylabs/orpheus-3b-0.1-ft, hexgrad/Kokoro-82M | Text-to-Speech |
| Speech-to-Text | Whisper Large v3 | openai/whisper-large-v3 | nvidia/parakeet-tdt-0.6b-v3, deepgram/deepgram-nova-3, deepgram/deepgram-flux, mistralai/Voxtral-Mini-3B-2507 | Speech-to-Text |
| Embeddings | Multilingual E5 Large | intfloat/multilingual-e5-large-instruct | - | Embeddings |
| Rerank | MixedBread Rerank Large | mixedbread-ai/Mxbai-Rerank-Large-V2 | Only available as Dedicated Endpoint | Rerank, Guide |
| Moderation | Virtue Guard | VirtueAI/VirtueGuard-Text-Lite | meta-llama/Llama-Guard-4-12B | - |
Need Help Choosing?
- Check our Serverless Models page for complete specifications
- See our WhichLLM page which provides categorical benchmarks for the above usecases
- Review Rate Limits for your tier
- See Pricing for cost information
- Visit Inference FAQs for common questions