Supported Languages
Whisper models support a comprehensive list of languages:
European Languages
English, Spanish, French, German, Italian, Portuguese, Dutch, Polish, Swedish, Norwegian, Danish, Finnish, Greek, Czech, Romanian, Hungarian, Slovak, Slovenian, Croatian, Serbian, Bulgarian, Ukrainian, Russian, Belarusian, Turkish, and more…
Asian Languages
Chinese (Mandarin & Cantonese), Japanese, Korean, Hindi, Bengali, Tamil, Telugu, Thai, Vietnamese, Indonesian, Malay, Tagalog, Urdu, Punjabi, Gujarati, Kannada, Malayalam, Marathi, Nepali, Sinhala, and more…
Middle Eastern Languages
Arabic, Hebrew, Persian (Farsi), Turkish, Urdu, Pashto
African Languages
Swahili, Hausa, Yoruba, Amharic, Somali, Shona, Afrikaans, Malagasy
Other Languages
Armenian, Azerbaijani, Basque, Breton, Georgian, Icelandic, Kazakh, Latin, Mongolian, Sanskrit, Welsh, Yiddish, and more…
See the complete list of 99+ supported languages in the language selector within VoiceTypr.
Language Selection
Setting Your Language
Choose Your Language
Click the dropdown and select your spoken language, or search by typing the language name.
Automatic Language Detection
Whisper can automatically detect the spoken language.
How It Works
- Select “Auto” from the language dropdown (if available on your model)
- Start recording and speak naturally
- Whisper analyzes the audio and detects the language
- Transcription is returned in the detected language
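The detection step can be pictured as scoring each candidate language and keeping the most probable one. The following is a minimal sketch of that selection, not VoiceTypr's actual implementation: the function name `pick_language`, the `min_confidence` threshold of 0.5, and the sample probabilities are all assumptions made for illustration.

```python
from typing import Mapping


def pick_language(
    probs: Mapping[str, float],
    fallback: str,
    min_confidence: float = 0.5,  # assumed threshold, not a VoiceTypr setting
) -> str:
    """Return the most probable language code, or `fallback` if detection is unsure."""
    if not probs:
        return fallback
    # Highest-scoring language code among the candidates.
    best = max(probs, key=lambda code: probs[code])
    return best if probs[best] >= min_confidence else fallback


# Hypothetical scores: a clear result, then an ambiguous one.
print(pick_language({"fr": 0.91, "en": 0.06, "es": 0.03}, fallback="en"))
print(pick_language({"nl": 0.41, "de": 0.38, "af": 0.21}, fallback="de"))
```

Falling back to a manually chosen language when no candidate is clearly ahead mirrors the advice in this guide to select your language yourself when accuracy matters.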
When to Use Auto-Detection
✅ Good Use Cases
- Multilingual environments
- Unsure which language you’ll speak
- Testing different languages
- Quick voice notes in various languages
❌ Avoid For
- Professional transcription
- Maximum accuracy needed
- Mixing multiple languages in one recording
- Very short recordings (< 3 seconds)
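The guidance above can be condensed into a small check. This is only a sketch: the function and parameter names are hypothetical, and the 3-second cutoff is taken directly from the list above rather than from any VoiceTypr setting.

```python
MIN_DURATION_S = 3.0  # recordings shorter than this are listed under "Avoid For"


def auto_detect_advised(
    duration_s: float,
    needs_max_accuracy: bool = False,
    mixes_languages: bool = False,
) -> bool:
    """Return True when none of the "Avoid For" conditions apply."""
    return (
        duration_s >= MIN_DURATION_S
        and not needs_max_accuracy
        and not mixes_languages
    )


print(auto_detect_advised(12.0))                        # casual voice note
print(auto_detect_advised(2.0))                         # too short
print(auto_detect_advised(60.0, needs_max_accuracy=True))
```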
Limitations
- Code-switching: Switching languages mid-recording may produce mixed results
- Accents: Strong accents may affect detection accuracy
- Background noise: Can interfere with language detection
For best results, manually select your language instead of using auto-detection.
Model-Specific Language Support
Whisper Models
All Whisper models (Tiny, Base, Small, Medium, Large) support 99+ languages.
Exception: English-only models (ending in .en):
- Only support English
- Faster transcription for English speech
- Higher accuracy for English
- Language selector shows only “English”
Parakeet Models (macOS)
Parakeet models support 25 European languages only:
Supported Languages
- Bulgarian (bg)
- Czech (cs)
- Danish (da)
- German (de)
- Greek (el)
- English (en)
- Spanish (es)
- Estonian (et)
- Finnish (fi)
- French (fr)
- Croatian (hr)
- Hungarian (hu)
- Italian (it)
- Lithuanian (lt)
- Latvian (lv)
- Maltese (mt)
- Dutch (nl)
- Polish (pl)
- Portuguese (pt)
- Romanian (ro)
- Russian (ru)
- Slovak (sk)
- Slovenian (sl)
- Swedish (sv)
- Ukrainian (uk)
When using Parakeet models, the language selector only shows the 25 supported European languages.
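The per-model rules in this section can be expressed as a simple lookup. The sketch below is illustrative only: it covers Whisper and Parakeet names, the model name strings (such as `"base.en"` or `"parakeet-v3"`) and the prefix/suffix matching are assumptions, and the full multilingual Whisper list is not reproduced, so multilingual Whisper models are treated as accepting any language code.

```python
from typing import FrozenSet, Optional

# The 25 Parakeet language codes listed above.
PARAKEET_LANGUAGES: FrozenSet[str] = frozenset({
    "bg", "cs", "da", "de", "el", "en", "es", "et", "fi", "fr", "hr", "hu", "it",
    "lt", "lv", "mt", "nl", "pl", "pt", "ro", "ru", "sk", "sl", "sv", "uk",
})


def supported_languages(model_name: str) -> Optional[FrozenSet[str]]:
    """Return the restricted language set for a model, or None if unrestricted here."""
    name = model_name.lower()
    if name.endswith(".en"):         # English-only Whisper models
        return frozenset({"en"})
    if name.startswith("parakeet"):  # assumed naming convention
        return PARAKEET_LANGUAGES
    return None                      # multilingual Whisper: full list not enumerated


def is_supported(model_name: str, lang: str) -> bool:
    """Check a language code against the model's restricted set, if it has one."""
    langs = supported_languages(model_name)
    return langs is None or lang in langs


print(is_supported("base.en", "fr"))
print(is_supported("parakeet-v3", "mt"))
print(is_supported("large-v3", "ja"))
```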
Soniox Cloud Models
Soniox supports 48 languages, including:
- Major European: English, Spanish, French, German, Italian, Portuguese
- Asian: Chinese, Japanese, Korean, Hindi, Thai, Vietnamese
- Middle Eastern: Arabic, Hebrew, Turkish, Persian, Urdu
- And more…
Language availability depends on your cloud provider. Check provider documentation for the complete list.
Translation to English
Whisper can translate non-English speech directly to English.
How It Works
When implemented:
- Record in any supported language
- Enable the “Translate to English” option
- Transcription is returned in English
For example:
- Speak: “Bonjour, comment allez-vous?” (French)
- Receive: “Hello, how are you?” (English)
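For readers who want to see what this toggle could correspond to outside the app, here is a hedged sketch using the option names of the open-source `openai-whisper` Python package, whose `transcribe` method accepts `language` and `task` arguments. The helper `build_transcribe_options` and the file name `message.wav` are hypothetical, and how VoiceTypr wires the option internally is not documented here.

```python
from typing import Any, Dict, Optional


def build_transcribe_options(
    language: Optional[str],
    translate_to_english: bool,
) -> Dict[str, Any]:
    """Build keyword arguments for a Whisper transcription call."""
    return {
        # None leaves language detection to Whisper.
        "language": language,
        # "translate" yields English text; "transcribe" keeps the spoken language.
        "task": "translate" if translate_to_english else "transcribe",
    }


options = build_transcribe_options("fr", translate_to_english=True)
print(options)

# With openai-whisper installed, the options could be passed along like this:
#   import whisper
#   model = whisper.load_model("small")
#   result = model.transcribe("message.wav", **options)
#   print(result["text"])
```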
Use Cases
- International communication
- Learning foreign languages
- Translating voice messages
- Quick translations without switching apps
Best Practices
Select Your Language
Always manually select your language for best accuracy:
- Go to the Models tab
- Choose your spoken language
- Start recording
Speak Clearly
For non-English languages:
- Speak at a moderate pace
- Reduce background noise
- Use standard pronunciation
- Avoid heavy dialects when possible
Use Larger Models
Larger models (Medium, Large) have better multilingual performance:
- Higher accuracy for non-English languages
- Better handling of accents
- More robust language detection
Test Your Language
Before important work:
- Test with sample recordings
- Verify accuracy for your accent/dialect
- Adjust speaking style if needed
- Try different models (Small vs Large)
Language-Specific Tips
Asian Languages (Chinese, Japanese, Korean)
- Use Small model or larger for best results
- Tiny/Base models may struggle with character accuracy
- Specify exact language (e.g., “Chinese” not “Cantonese”) when available
Tonal Languages (Mandarin, Thai, Vietnamese)
- Speak with clear tonal distinctions
- Use Medium or Large models
- Reduce background noise (tones can be subtle)
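The model-size tips above can be summarized as a minimum recommended size per language. This sketch encodes only what the tips state: Small or larger for Chinese, Japanese, and Korean, and Medium or larger for the tonal languages, taking the stricter rule where both apply to Chinese. The table and function names are hypothetical, and languages without a stated tip default to the smallest model.

```python
MODEL_ORDER = ["tiny", "base", "small", "medium", "large"]

# Minimum sizes derived from the tips above (ISO 639-1 codes).
MIN_MODEL_BY_LANGUAGE = {
    "zh": "medium",  # Asian-language tip says Small+, tonal tip says Medium+
    "th": "medium",
    "vi": "medium",
    "ja": "small",
    "ko": "small",
}


def recommended_min_model(lang: str) -> str:
    """Return the smallest model size the tips recommend for a language."""
    return MIN_MODEL_BY_LANGUAGE.get(lang, "tiny")


def meets_recommendation(model: str, lang: str) -> bool:
    """True if `model` is at least as large as the recommended minimum."""
    # list.index raises ValueError for sizes not in MODEL_ORDER.
    needed = MODEL_ORDER.index(recommended_min_model(lang))
    return MODEL_ORDER.index(model) >= needed


print(recommended_min_model("zh"))
print(meets_recommendation("base", "ja"))
print(meets_recommendation("large", "vi"))
```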
Languages with Multiple Scripts
- Arabic: Transcribed in Arabic script
- Hindi: Transcribed in Devanagari script
- Chinese: Transcribed in Chinese characters (simplified)
Whisper outputs text in the native script of the language. No romanization is applied.
Troubleshooting
Wrong Language Detected
Solution:
- Manually select your language (don’t use auto-detect)
- Ensure you’re speaking consistently in one language
- Try a larger model for better detection
Poor Accuracy
Solution:
- Upgrade to a larger model (Small → Medium → Large)
- Check microphone quality
- Reduce background noise
- Speak more clearly and slowly
Language Not Available
Solution:
- Switch to a Whisper model (supports 99+ languages)
- Check whether you are using a Parakeet model (25 European languages only)
- Verify model is downloaded and selected
Mixed Languages in Output
Solution:
- Record each language separately
- Don’t code-switch within a single recording
- Select the primary language before recording