Overview
VoiceTypr supports multiple AI transcription engines with various model sizes, so you can balance speed and accuracy for your needs. Whisper and Parakeet models run locally on your device for complete privacy; Soniox is a cloud service that processes audio remotely.
Supported Engines
Whisper
OpenAI’s Whisper models provide excellent accuracy across 99+ languages with multiple size options.
Parakeet
NVIDIA Parakeet models optimized for Apple Silicon, offering fast transcription using the Neural Engine.
Soniox
Cloud-based speech recognition API offering fast, accurate transcription without local model downloads.
Model Types
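VoiceTypr distinguishes between local models (Whisper and Parakeet), which run on your device, and cloud models (Soniox), which send audio to a remote API.
Whisper Models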
Available Sizes
Whisper models come in multiple sizes, each with different speed and accuracy tradeoffs:
- Tiny
- Base
- Small
- Medium
- Large
Whisper Tiny (~75 MB)
- Speed Score: 10/10 ⚡
- Accuracy Score: 6/10
- Best for: Quick drafts, testing, low-power devices
- Languages: Multilingual
English-Only Models
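Whisper also offers English-only variants (.en suffix) that are optimized for English.
English-only models are typically more accurate for English than multilingual models of the same size, with the largest gains at the Tiny and Base sizes. Their download size and speed are about the same as their multilingual counterparts.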
Parakeet Models
Parakeet models are available on macOS only and leverage Apple’s Neural Engine for hardware acceleration.
Available Models
- Parakeet 1.1B
- Parakeet 1.1B v2
Parakeet 1.1B (~1.3 GB)
- Speed Score: 8/10
- Accuracy Score: 8/10
- Languages: Multilingual (100+ languages)
- Hardware: Apple Neural Engine
Soniox Cloud Models
Soniox is a cloud-based speech recognition service that provides fast, accurate transcription without requiring local model downloads.
Overview
Unlike Whisper and Parakeet, which run entirely on your device, Soniox processes audio in the cloud:
- No downloads required: No disk space needed for models
- Fast transcription: Cloud processing with optimized infrastructure
- Requires internet: Audio is sent to Soniox API for processing
- API key required: You need a Soniox account and API key
Setup
To use Soniox models:
Get API Key
Sign up at soniox.com and obtain an API key
Available Models
Soniox offers several optimized models:
- stt-async-v3: Latest asynchronous model with best accuracy
- stt-streaming: Real-time streaming transcription
- stt-multilingual: Support for multiple languages
Check the Soniox documentation for the latest available models and language support.
Privacy Considerations
Performance
Soniox typically provides:
- Speed: Very fast, limited by network latency
- Accuracy: High accuracy comparable to Whisper Large
- Cost: Based on Soniox API pricing
API Key Storage
Soniox API keys are stored securely in your system keychain:
- macOS: Keychain Access
- Windows: Credential Manager
Validation
VoiceTypr validates your Soniox API key by sending a request to the /v1/models endpoint to verify the key is active.
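The validation check can be sketched as a single authenticated request. This is a minimal sketch, not VoiceTypr's actual implementation: the base URL `https://api.soniox.com`, the Bearer authorization scheme, and the helper names `build_validation_request` and `is_key_active` are assumptions made for illustration. Only the `/v1/models` path comes from this page.

```python
import urllib.error
import urllib.request

# Assumption: base URL and Bearer auth scheme. Only the /v1/models path
# is stated in this document.
SONIOX_BASE_URL = "https://api.soniox.com"


def build_validation_request(api_key: str) -> urllib.request.Request:
    """Build a GET request to the models endpoint that carries the API key."""
    return urllib.request.Request(
        f"{SONIOX_BASE_URL}/v1/models",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def is_key_active(api_key: str, timeout: float = 10.0) -> bool:
    """Return True if the endpoint accepts the key, False otherwise."""
    request = build_validation_request(api_key)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except urllib.error.HTTPError:
        # Error status (for example 401 or 403): treat the key as not usable.
        return False
    except urllib.error.URLError:
        # Network failure: the key could not be verified.
        return False
```

Keeping the request construction separate from the network call makes the header and URL easy to inspect without contacting the service.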
Hardware Acceleration
VoiceTypr automatically uses hardware acceleration when available for maximum performance.
macOS
- Whisper: Uses Metal GPU acceleration via Apple’s Metal Performance Shaders
- Parakeet: Uses Apple Neural Engine for ultra-fast inference
- Requirements: macOS 13.0+ (Ventura or later)
Windows
- Whisper: Supports GPU acceleration via DirectML
- Compatible GPUs: NVIDIA, AMD, and Intel GPUs
- Fallback: Automatically uses CPU if GPU unavailable or drivers missing
Model Management
Downloading Models
Browse Available Models
Models are organized into two sections:
- Available to Use: Already downloaded and ready
- Available to Setup: Need to be downloaded first
Download Progress Tracking
Download progress is tracked in real time.
Model Verification
All downloaded models are verified using SHA-256 checksums to ensure integrity. Verification happens automatically after download. If verification fails, the download is considered corrupted and must be retried.
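A checksum check of this kind can be sketched in a few lines. This is an illustrative sketch, not VoiceTypr's own code: the function names and the way the expected hash is supplied are assumptions. The file is hashed in chunks so that multi-gigabyte models do not need to fit in memory.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1024 * 1024) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_model(path: Path, expected_sha256: str) -> bool:
    """Compare the file's digest with the expected one (case-insensitive)."""
    actual = sha256_of_file(path)
    return hmac.compare_digest(actual, expected_sha256.strip().lower())
```

If `verify_model` returns `False`, the file should be treated as corrupted and downloaded again.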
Deleting Models
To free up disk space, you can delete models you no longer need.
Model Selection
To switch between downloaded models:
- Go to the Models tab
- Click on any downloaded model to select it
- Your selection is saved in settings
Choosing the Right Model
Use this guide to select the best model for your needs:
I want the fastest possible transcription
Recommended: Whisper Tiny or Tiny.en
- Fastest inference times
- Good for quick notes and drafts
- Trade-off: Lower accuracy
I need the best accuracy
Recommended: Whisper Large or Medium
- Highest accuracy scores
- Best for professional transcription
- Trade-off: Slower processing
I want balanced performance (recommended)
Recommended: Whisper Small or Small.en ⭐
- Excellent balance of speed and accuracy
- Default recommended model
- Good for most daily use cases
I have an Apple Silicon Mac
Recommended: Parakeet 1.1B v2 (English) or Parakeet 1.1B (Multilingual)
- Optimized for Apple Neural Engine
- Faster than Whisper on M-series chips
- Great accuracy
I transcribe non-English languages
Recommended: Whisper Small, Medium, or Large (multilingual variants)
- Support for 99+ languages
- Avoid .en suffix models (English-only)
- Parakeet 1.1B also supports 100+ languages
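The guide above can be condensed into a small lookup. This is a sketch for illustration only: the `Priority` enum, the `recommend_models` function, and the choice to let the Apple Silicon recommendation take precedence over the other scenarios are assumptions, not part of VoiceTypr. The model names are the ones recommended in this section.

```python
from enum import Enum


class Priority(Enum):
    SPEED = "speed"
    ACCURACY = "accuracy"
    BALANCED = "balanced"


def recommend_models(
    priority: Priority,
    english_only: bool = True,
    apple_silicon: bool = False,
) -> list[str]:
    """Return recommended models, most preferred first."""
    if apple_silicon:
        # Assumption: Parakeet wins on Apple Silicon regardless of priority.
        return ["Parakeet 1.1B v2"] if english_only else ["Parakeet 1.1B"]
    if priority is Priority.SPEED:
        return ["Whisper Tiny.en", "Whisper Tiny"] if english_only else ["Whisper Tiny"]
    if priority is Priority.ACCURACY:
        return ["Whisper Large", "Whisper Medium"]
    # Balanced: the default recommendation. Skip .en models for other languages.
    return ["Whisper Small.en", "Whisper Small"] if english_only else ["Whisper Small"]
```

For example, `recommend_models(Priority.SPEED, english_only=False)` returns only the multilingual Tiny model, since .en variants do not support other languages.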
Disk Space Requirements
Ensure you have enough free disk space before downloading models:
| Model | Size |
|---|---|
| Whisper Tiny | ~75 MB |
| Whisper Base | ~150 MB |
| Whisper Small | ~500 MB |
| Whisper Medium | ~1.5 GB |
| Whisper Large | ~3 GB |
| Parakeet 1.1B | ~1.3 GB |
You only need one model to use VoiceTypr. The recommended Whisper Small model requires only about 500 MB.
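A free-space check before downloading can be sketched as follows. The sizes are the approximate figures from the table above (treating 1 GB as 1000 MB). The 10% safety margin and the function names are assumptions for illustration, not VoiceTypr behavior.

```python
import shutil
from pathlib import Path

# Approximate download sizes in megabytes, from the table above.
MODEL_SIZES_MB = {
    "Whisper Tiny": 75,
    "Whisper Base": 150,
    "Whisper Small": 500,
    "Whisper Medium": 1500,
    "Whisper Large": 3000,
    "Parakeet 1.1B": 1300,
}


def has_room_for(model: str, free_bytes: int, margin: float = 1.1) -> bool:
    """Return True if free_bytes covers the model size plus a safety margin."""
    required = MODEL_SIZES_MB[model] * 1_000_000 * margin
    return free_bytes >= required


def can_download(model: str, target_dir: Path) -> bool:
    """Check free space on the volume holding target_dir (which must exist)."""
    free = shutil.disk_usage(target_dir).free
    return has_room_for(model, free)
```

Calling `can_download("Whisper Small", Path.home())` checks the volume that holds your home directory.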
Model Storage Location
Downloaded models are stored in:
- macOS: ~/Library/Application Support/com.voicetypr.app/models/
- Windows: %APPDATA%\com.voicetypr.app\models\
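To inspect the storage location from a script, the two paths above can be resolved per platform. This is a minimal sketch: the helper names `models_dir` and `list_model_files` are hypothetical, and the file names inside the directory are not documented here, so the listing simply returns whatever is present.

```python
import os
import sys
from pathlib import Path

APP_ID = "com.voicetypr.app"


def models_dir(platform: str = sys.platform, env=os.environ) -> Path:
    """Resolve the model directory for macOS ("darwin") or Windows ("win32")."""
    if platform == "darwin":
        return Path.home() / "Library" / "Application Support" / APP_ID / "models"
    if platform == "win32":
        appdata = env.get("APPDATA")
        if not appdata:
            raise RuntimeError("APPDATA is not set")
        return Path(appdata) / APP_ID / "models"
    raise NotImplementedError(f"No documented model location for {platform!r}")


def list_model_files(directory: Path) -> list[str]:
    """Return the sorted names of entries in the directory, or [] if missing."""
    if not directory.is_dir():
        return []
    return sorted(entry.name for entry in directory.iterdir())
```

Running `list_model_files(models_dir())` on a supported platform shows what is currently stored, which helps when a downloaded model does not appear in the app.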
Troubleshooting
Download Failed
- Check your internet connection
- Ensure you have enough free disk space
- Try canceling and restarting the download
- Check that your firewall or antivirus isn’t blocking the download
Verification Failed
If SHA-256 verification fails:
- Delete the corrupted model
- Retry the download
- Check for disk errors if it continues failing
Model Not Appearing
If a downloaded model doesn’t appear:
- Click “Refresh Models” in the Models tab
- Restart VoiceTypr
- Check the model storage location manually
Slow Transcription
If transcription is slower than expected:
- macOS: Ensure you’re using a model compatible with Metal acceleration
- Windows: Update your GPU drivers for hardware acceleration
- Try a smaller/faster model (Tiny or Base)
- Close resource-intensive applications