Vox Documentation
Fast, accurate voice transcription with AI enhancement for macOS and Windows.
What is Vox?
Vox is a desktop application for macOS and Windows that transcribes your speech into text. It uses Whisper, an advanced open-source speech recognition technology from OpenAI, running entirely on your device. All processing happens locally, ensuring your privacy while providing fast and accurate transcriptions.
Key Features
- Local Processing - Speech recognition runs entirely on your device
- AI Enhancement - Optional post-processing with AWS Bedrock, DeepSeek, or custom LLM providers
- Always Available - System tray integration with customizable keyboard shortcuts
- Smart Dictionary - Teach Vox custom words and phrases for better accuracy
- Multi-Language - Support for English, Portuguese, Spanish, French, German, and more
- Privacy First - Your audio never leaves your device
Quick Start
- Download Vox from the official website
- Install and launch the application
- Grant required permissions (Microphone, Accessibility, Keychain)
- Download a speech model (Accurate recommended for most users)
- Start recording with
⌘ + Space(hold) or⌘ + ⌥ + Space(toggle)
New to Vox?
Follow our Getting Started guide for detailed setup instructions.
Documentation
⚙️ Configuration
Need Help?
- Issues & Bugs: GitHub Issues
- Discussions: GitHub Discussions
- Source Code: GitHub Repository