Table of Contents
- Quick Start
- Audio Transcription
- Audio Translation
- Response Formats
- Advanced Features
- Async Support
- Best Practices
Quick Start
- Register for an account and set your API key as the TOGETHER_API_KEY environment variable
- Install your preferred library
- Run your first transcription
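As a minimal sketch of the setup step, the snippet below reads the key from the TOGETHER_API_KEY environment variable and fails early with a clear message if it is missing. The helper name `load_api_key` is hypothetical and not part of any SDK.

```python
import os


def load_api_key(env=None):
    """Return the API key stored in TOGETHER_API_KEY, or raise if it is unset.

    `env` defaults to os.environ; passing a plain dict makes the helper testable.
    """
    env = os.environ if env is None else env
    key = env.get("TOGETHER_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "TOGETHER_API_KEY is not set; export it in your shell before running."
        )
    return key
```

Checking the key up front tends to give a clearer error than a failed request later on.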
Audio Transcription
Audio transcription converts speech to text in the same language as the source audio. Supported file formats:
- .wav (audio/wav)
- .mp3 (audio/mpeg)
- .m4a (audio/mp4)
- .webm (audio/webm)
- .flac (audio/flac)
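The format list above can be turned into a small pre-upload check. This is only a sketch: the names `SUPPORTED_AUDIO_TYPES` and `audio_mime_type` are hypothetical, and the check looks at the file extension only, not at the file contents.

```python
from pathlib import Path

# Extension -> MIME type, taken from the supported formats listed above.
SUPPORTED_AUDIO_TYPES = {
    ".wav": "audio/wav",
    ".mp3": "audio/mpeg",
    ".m4a": "audio/mp4",
    ".webm": "audio/webm",
    ".flac": "audio/flac",
}


def audio_mime_type(filename):
    """Return the MIME type for a supported audio file, or raise ValueError."""
    suffix = Path(filename).suffix.lower()
    try:
        return SUPPORTED_AUDIO_TYPES[suffix]
    except KeyError:
        raise ValueError(f"Unsupported audio format: {suffix or '(none)'}") from None
```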
Input Methods
Local File Path
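A hedged sketch of transcribing a local file. It assumes a client object that exposes `audio.transcriptions.create(file=..., model=...)` and returns an object with a `.text` attribute, as the Together Python SDK is understood to do. The helper name and the default model identifier are assumptions; check the API Reference for the exact signature and available models.

```python
from pathlib import Path

# Assumption: a Whisper model identifier; substitute the model you actually use.
DEFAULT_MODEL = "openai/whisper-large-v3"


def transcribe_local_file(client, path, model=DEFAULT_MODEL):
    """Transcribe a local audio file and return the recognized text.

    Assumes `client.audio.transcriptions.create(file=..., model=...)` exists
    and returns an object with a `.text` attribute.
    """
    audio_path = Path(path).expanduser()
    # Fail locally before any request is made if the path is wrong.
    if not audio_path.is_file():
        raise FileNotFoundError(f"Audio file not found: {audio_path}")
    response = client.audio.transcriptions.create(file=str(audio_path), model=model)
    return response.text
```

Passing the client in as an argument keeps the helper independent of how the client is constructed and makes it easy to exercise with a stub.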
Language Support
Specify the audio language using ISO 639-1 language codes:
- “en” - English
- “es” - Spanish
- “fr” - French
- “de” - German
- “ja” - Japanese
- “zh” - Chinese
- “auto” - Auto-detect (default)
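The language codes above can be normalized before they are sent with a request. The sketch below only checks the shape of the code (two letters, or "auto") and does not verify it against the ISO 639-1 registry or against the languages the service actually supports. The helper name `normalize_language` is hypothetical.

```python
import re

# Two ASCII letters, the shape of an ISO 639-1 code.
_ISO_639_1_SHAPE = re.compile(r"[a-z]{2}")


def normalize_language(code=None):
    """Return a lowercase language code, falling back to "auto" when omitted."""
    if code is None:
        return "auto"
    normalized = code.strip().lower()
    if normalized == "auto" or _ISO_639_1_SHAPE.fullmatch(normalized):
        return normalized
    raise ValueError(f"Expected a two-letter ISO 639-1 code or 'auto', got {code!r}")
```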
Custom Prompts
Use prompts to improve transcription accuracy for specific contexts.
Audio Translation
Audio translation converts speech from any language to English text.
Response Formats
JSON Format (Default)
Returns only the transcribed/translated text.
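A small sketch of reading the default JSON response. It assumes the body is a JSON object with a single `text` field, as in OpenAI-compatible transcription APIs. The sample payload is illustrative and was not captured from a real request.

```python
import json


def extract_text(raw_body):
    """Return the `text` field from a JSON transcription response body."""
    payload = json.loads(raw_body)
    if not isinstance(payload, dict) or "text" not in payload:
        raise ValueError("Response does not contain a 'text' field")
    return payload["text"]


# Illustrative payload, not captured from a real request.
sample = '{"text": "Hello, world."}'
print(extract_text(sample))
```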
Advanced Features
Temperature Control
Adjust randomness in the output (0.0 = deterministic, 1.0 = creative).
Async Support
All transcription and translation operations support async/await.
Async Transcription
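A minimal async sketch, assuming an asynchronous client whose `audio.transcriptions.create(...)` call is awaitable and returns an object with a `.text` attribute. The helper name and the default model identifier are assumptions.

```python
import asyncio


async def transcribe_async(client, file, model="openai/whisper-large-v3"):
    """Await one transcription request and return its text.

    Assumes `client.audio.transcriptions.create(...)` is awaitable.
    """
    response = await client.audio.transcriptions.create(file=file, model=model)
    return response.text
```

From synchronous code, the coroutine can be driven with `asyncio.run(transcribe_async(client, "audio.mp3"))`.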
Async Translation
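Translation can be awaited in the same way. The sketch below also wraps the request in `asyncio.wait_for` so that a stalled upload does not block the caller indefinitely. It assumes an asynchronous client with an awaitable `audio.translations.create(...)` method; the helper name, default model identifier, and 60-second timeout are assumptions.

```python
import asyncio


async def translate_async(client, file, model="openai/whisper-large-v3", timeout=60.0):
    """Translate speech in `file` to English text, giving up after `timeout` seconds.

    Assumes `client.audio.translations.create(...)` is awaitable and returns an
    object with a `.text` attribute.
    """
    request = client.audio.translations.create(file=file, model=model)
    # wait_for cancels the request and raises a timeout error if it takes too long.
    response = await asyncio.wait_for(request, timeout=timeout)
    return response.text
```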
Concurrent Processing
Process multiple audio files concurrently.
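One way to sketch concurrent processing is `asyncio.gather` with a semaphore that caps the number of requests in flight. As before, this assumes an asynchronous client with an awaitable `audio.transcriptions.create(...)` method. The helper name, default model identifier, and concurrency limit are assumptions.

```python
import asyncio


async def transcribe_many(client, files, model="openai/whisper-large-v3", max_concurrency=4):
    """Transcribe several files concurrently and return texts in input order."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def transcribe_one(file):
        # The semaphore keeps at most `max_concurrency` requests in flight.
        async with semaphore:
            response = await client.audio.transcriptions.create(file=file, model=model)
            return response.text

    # gather preserves the order of its arguments in the result list.
    return await asyncio.gather(*(transcribe_one(f) for f in files))
```

Bounding concurrency is a design choice rather than a requirement: it avoids flooding the service when the file list is long, at the cost of some throughput.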
Best Practices
- Use high-quality audio files for better transcription accuracy
- Minimize background noise
- Ensure clear speech with good volume levels
- Use appropriate sample rates (16 kHz or higher recommended)
- Consider file size limits for uploads
- For long audio files, consider splitting into smaller chunks
- Use streaming for real-time applications when available
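Two of the practices above, checking the sample rate and splitting long recordings, can be sketched with the standard-library `wave` module. The helper name `split_wav` is hypothetical, and the sketch handles uncompressed WAV input only; other formats would need a dedicated audio library.

```python
import wave
from pathlib import Path


def split_wav(path, chunk_seconds, out_dir):
    """Split a WAV file into chunks of at most `chunk_seconds` seconds.

    Returns the list of chunk paths in playback order.
    """
    if chunk_seconds <= 0:
        raise ValueError("chunk_seconds must be positive")
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    chunks = []
    with wave.open(str(path), "rb") as src:
        params = src.getparams()
        if params.framerate < 16000:
            print(f"warning: sample rate {params.framerate} Hz is below 16 kHz")
        frames_per_chunk = max(1, int(chunk_seconds * params.framerate))
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break
            chunk_path = out_dir / f"{Path(path).stem}_{index:03d}.wav"
            with wave.open(str(chunk_path), "wb") as dst:
                # Copy channel count, sample width, and rate from the source;
                # the frame count in the header is corrected when the file closes.
                dst.setparams(params)
                dst.writeframes(frames)
            chunks.append(chunk_path)
            index += 1
    return chunks
```

Fixed-length cuts can land in the middle of a word, so transcripts of adjacent chunks may need manual stitching at the boundaries.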
Next Steps
- Explore our API Reference for detailed parameter documentation