Multi-Language Support

VoiceTypr supports transcription in 99+ languages out of the box using Whisper AI models. No additional downloads or configuration required.

Supported Languages

Whisper models support a comprehensive list of languages:

European Languages

English, Spanish, French, German, Italian, Portuguese, Dutch, Polish, Swedish, Norwegian, Danish, Finnish, Greek, Czech, Romanian, Hungarian, Slovak, Slovenian, Croatian, Serbian, Bulgarian, Ukrainian, Russian, Belarusian, Turkish, and more…

Asian Languages

Chinese (Mandarin & Cantonese), Japanese, Korean, Hindi, Bengali, Tamil, Telugu, Thai, Vietnamese, Indonesian, Malay, Tagalog, Urdu, Punjabi, Gujarati, Kannada, Malayalam, Marathi, Nepali, Sinhala, and more…

Middle Eastern Languages

Arabic, Hebrew, Persian (Farsi), Turkish, Urdu, Pashto

African Languages

Swahili, Hausa, Yoruba, Amharic, Somali, Shona, Afrikaans, Malagasy

Other Languages

Armenian, Azerbaijani, Basque, Breton, Georgian, Icelandic, Kazakh, Latin, Mongolian, Sanskrit, Welsh, Yiddish, and more…

See the complete list of 99+ supported languages in the language selector within VoiceTypr.

Language Selection

Setting Your Language

Open Models Tab

Navigate to the Models section in VoiceTypr

Find Language Selector

Located below the header, above the model list

Choose Your Language

Click the dropdown and select your spoken language, or search by typing the language name

Start Recording

Your language selection applies immediately to new recordings

You can change language at any time. Each recording uses the language selected at the time of recording.
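One way to picture this behavior is that the language is copied when a recording starts, so changing the setting later never affects earlier recordings. The Python sketch below illustrates the idea only; Settings, Recording, and start_recording are hypothetical names, not VoiceTypr's actual code.

```python
from dataclasses import dataclass


@dataclass
class Settings:
    """Hypothetical app settings; only the language field matters here."""
    language: str = "en"


@dataclass(frozen=True)
class Recording:
    """Hypothetical recording record that stores its own language."""
    recording_id: int
    language: str


def start_recording(recording_id: int, settings: Settings) -> Recording:
    # Copy the language at start time so later setting changes
    # do not affect this recording.
    return Recording(recording_id=recording_id, language=settings.language)


settings = Settings(language="fr")
first = start_recording(1, settings)
settings.language = "de"  # the user switches language afterwards
second = start_recording(2, settings)
print(first.language, second.language)
```

The first recording keeps the language that was selected when it started, while the second picks up the new selection.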

Automatic Language Detection

Whisper can automatically detect the spoken language:

How It Works

Select “Auto” from the language dropdown (if available on your model)
Start recording and speak naturally
Whisper analyzes the audio and detects the language
Transcription is returned in the detected language
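
The reference Whisper implementation reports a probability for each candidate language, and detection amounts to picking the most likely one. The sketch below shows that selection step in Python. The confidence threshold and the fallback language are assumptions added for illustration, and pick_language is a hypothetical helper rather than part of VoiceTypr or Whisper.

```python
def pick_language(
    probabilities: dict[str, float],
    fallback: str = "en",
    min_confidence: float = 0.5,
) -> str:
    """Return the most probable language code, or the fallback if unsure.

    `probabilities` maps language codes to scores between 0 and 1.
    `min_confidence` is an assumed cutoff, not a documented Whisper value.
    """
    if not probabilities:
        return fallback
    code, score = max(probabilities.items(), key=lambda item: item[1])
    return code if score >= min_confidence else fallback


print(pick_language({"fr": 0.91, "en": 0.06, "es": 0.03}))
print(pick_language({"fr": 0.40, "es": 0.35, "it": 0.25}))
```

A clear winner is returned directly. When no language stands out, as can happen with very short or noisy audio, the sketch falls back to a manually chosen language.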

Auto-detection works best for:

Single-language recordings
Common languages with distinct characteristics
Recordings longer than 5-10 seconds

When to Use Auto-Detection

✅ Good Use Cases

Multilingual environments
Unsure which language you’ll speak
Testing different languages
Quick voice notes in various languages

❌ Avoid For

Professional transcription
Maximum accuracy needed
Mixing multiple languages in one recording
Very short recordings (< 3 seconds)
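
The guidance above can be condensed into a small rule of thumb. The Python sketch below is one possible encoding of it: the 3-second cutoff comes from the list above, while the function name and parameters are hypothetical.

```python
def auto_detect_is_reasonable(
    duration_seconds: float,
    single_language: bool = True,
    needs_max_accuracy: bool = False,
) -> bool:
    """Rule of thumb for when auto-detection is a sensible choice."""
    # Mixed-language or accuracy-critical recordings should use a
    # manually selected language instead.
    if needs_max_accuracy or not single_language:
        return False
    # Very short clips give the detector too little audio to work with.
    return duration_seconds >= 3.0


print(auto_detect_is_reasonable(12.0))
print(auto_detect_is_reasonable(2.0))
print(auto_detect_is_reasonable(30.0, needs_max_accuracy=True))
```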

Limitations

Code-switching: Switching languages mid-recording may produce mixed results
Accents: Strong accents may affect detection accuracy
Background noise: Can interfere with language detection

For best results, manually select your language instead of using auto-detection.

Model-Specific Language Support

Whisper Models

All Whisper models (Tiny, Base, Small, Medium, Large) support 99+ languages. The exception is the English-only variants (model names ending in .en):

Only support English
Faster transcription for English speech
Higher accuracy for English
Language selector shows only “English”
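
Because English-only models are identified by the .en suffix, the languages offered for a model can be derived from its name. The Python sketch below illustrates this; the helper names and the short language list are hypothetical and do not reflect VoiceTypr's actual implementation.

```python
def is_english_only(model_name: str) -> bool:
    """English-only Whisper variants have names ending in '.en'."""
    return model_name.endswith(".en")


def selectable_languages(model_name: str, all_languages: list[str]) -> list[str]:
    """Return the language codes a selector could offer for this model."""
    return ["en"] if is_english_only(model_name) else list(all_languages)


# A short illustrative subset, not the full multilingual list.
languages = ["en", "es", "fr", "de", "ja"]
print(selectable_languages("base.en", languages))
print(selectable_languages("small", languages))
```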

Parakeet Models (macOS)

Parakeet models support 25 European languages only:


Bulgarian (bg)
Czech (cs)
Danish (da)
German (de)
Greek (el)
English (en)
Spanish (es)
Estonian (et)
Finnish (fi)
French (fr)
Croatian (hr)
Hungarian (hu)
Italian (it)
Lithuanian (lt)
Latvian (lv)
Maltese (mt)
Dutch (nl)
Polish (pl)
Portuguese (pt)
Romanian (ro)
Russian (ru)
Slovak (sk)
Slovenian (sl)
Swedish (sv)
Ukrainian (uk)

When using Parakeet models, the language selector only shows the 25 supported European languages.
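
The 25 language codes listed above can be treated as a simple lookup set. The Python sketch below checks whether a language code is covered by Parakeet. The function name and the handling of region suffixes such as "pt-BR" are assumptions made for illustration.

```python
# The 25 language codes listed above for Parakeet models.
PARAKEET_LANGUAGES = frozenset({
    "bg", "cs", "da", "de", "el", "en", "es", "et", "fi", "fr",
    "hr", "hu", "it", "lt", "lv", "mt", "nl", "pl", "pt", "ro",
    "ru", "sk", "sl", "sv", "uk",
})


def parakeet_supports(language_code: str) -> bool:
    """Check a code such as 'de' or 'pt-BR' against the Parakeet list."""
    # Assumption: a region suffix is ignored, so 'pt-BR' is treated as 'pt'.
    base = language_code.strip().lower().replace("_", "-").split("-")[0]
    return base in PARAKEET_LANGUAGES


print(len(PARAKEET_LANGUAGES))
print(parakeet_supports("de"), parakeet_supports("ja"))
```

A check like this also covers the "Language Not Available" case: if the result is false for your language, a Whisper model is the alternative.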

Soniox Cloud Models

Soniox supports 48 languages, including:

European: English, Spanish, French, German, Italian, Portuguese
Asian: Chinese, Japanese, Korean, Hindi, Thai, Vietnamese
Middle Eastern: Arabic, Hebrew, Turkish, Persian, Urdu
And more…

Language availability depends on your cloud provider. Check provider documentation for the complete list.

Translation to English

Whisper can translate non-English speech directly to English:

How It Works

This feature is currently not exposed in the UI but is available in the underlying Whisper model. It may be added in future versions.

Once implemented, the expected workflow is:

Record in any supported language
Enable “Translate to English” option
Transcription is returned in English

Example:

Speak: “Bonjour, comment allez-vous?” (French)
Receive: “Hello, how are you?” (English)
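
For reference, the openai-whisper Python package requests translation by passing task="translate" to its transcribe call. The sketch below only builds those options and is not how VoiceTypr itself is implemented, since the app may use a different Whisper binding. The build_decode_options helper and the "clip.wav" file name are hypothetical.

```python
from typing import Optional


def build_decode_options(
    translate_to_english: bool,
    language: Optional[str] = None,
) -> dict:
    """Build keyword arguments for a Whisper transcribe call."""
    options = {"task": "translate" if translate_to_english else "transcribe"}
    if language is not None:
        # The spoken (source) language; omit it to let Whisper detect it.
        options["language"] = language
    return options


options = build_decode_options(True, language="fr")
print(options)

# With the openai-whisper package installed, these could be passed as:
#   import whisper
#   result = whisper.load_model("small").transcribe("clip.wav", **options)
```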

Use Cases

International communication
Learning foreign languages
Translating voice messages
Quick translations without switching apps

Best Practices

Select Your Language

Always manually select your language for best accuracy:

Go to the Models tab
Choose your spoken language
Start recording

Don’t rely on auto-detection for important transcriptions.

Speak Clearly

For non-English languages:

Speak at a moderate pace
Reduce background noise
Use standard pronunciation
Avoid heavy dialects when possible

Use Larger Models

Larger models (Medium, Large) have better multilingual performance:

Higher accuracy for non-English languages
Better handling of accents
More robust language detection

Test Your Language

Before important work:

Test with sample recordings
Verify accuracy for your accent/dialect
Adjust speaking style if needed
Try different models (Small vs Large)

Language-Specific Tips

Asian Languages (Chinese, Japanese, Korean)

Use Small model or larger for best results
Tiny/Base models may struggle with character accuracy
Select the exact language entry you are speaking (e.g., “Chinese” rather than “Cantonese” for Mandarin speech) when separate entries are available

Tonal Languages (Mandarin, Thai, Vietnamese)

Speak with clear tonal distinctions
Use Medium or Large models
Reduce background noise (tones can be subtle)

Languages with Multiple Scripts

Arabic: Transcribed in Arabic script
Hindi: Transcribed in Devanagari script
Chinese: Transcribed in Chinese characters (simplified)

Whisper outputs text in the native script of the language. No romanization is applied.
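
If you want to confirm that a transcript came back in the expected script, the character names in Python's standard unicodedata module offer a rough check. The sketch below is a heuristic only, and dominant_script is a hypothetical helper. It labels text by the first word of each letter's Unicode name.

```python
import unicodedata
from collections import Counter


def dominant_script(text: str) -> str:
    """Return the most common script label among the letters in `text`."""
    counts = Counter()
    for ch in text:
        if ch.isalpha():
            # Names look like 'ARABIC LETTER MEEM' or
            # 'CJK UNIFIED IDEOGRAPH-4F60'; the first word names the script.
            name = unicodedata.name(ch, "")
            if name:
                counts[name.split()[0]] += 1
    return counts.most_common(1)[0][0] if counts else "UNKNOWN"


print(dominant_script("مرحبا"))
print(dominant_script("नमस्ते"))
print(dominant_script("你好"))
```

A result of LATIN for an Arabic or Hindi recording would suggest that the wrong language was selected or detected.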

Troubleshooting

Wrong Language Detected

Solution:

Manually select your language (don’t use auto-detect)
Ensure you’re speaking consistently in one language
Try a larger model for better detection

Poor Accuracy

Solution:

Upgrade to a larger model (Small → Medium → Large)
Check microphone quality
Reduce background noise
Speak more clearly and slowly

Language Not Available

Solution:

Switch to a Whisper model (supports 99+ languages)
Check if using Parakeet (only 25 European languages)
Verify model is downloaded and selected

Mixed Languages in Output

Solution:

Record each language separately
Don’t code-switch within a single recording
Select the primary language before recording

Switching Between Languages

For users who work in multiple languages:

Before Recording

Open VoiceTypr settings and check the current language selection

Change Language

Go to the Models tab and select the language you’ll speak

Record

Use your hotkey and speak in the selected language

Verify

Check transcription accuracy and adjust if needed for next time

Create a workflow: check the selected language before important recordings. It takes a couple of seconds and avoids transcribing in the wrong language.
