Madhur Shrimal

Navigate

← Back to projects
VoxBee

VoxBee

VoxBee is an offline speech-to-text app for macOS and Linux that handles dictation, audio transcription, and meeting recording — all processed on-device. Dictate into any app with push-to-talk across 30 languages, with grammar correction and filler word removal. Transcribe audio files via drag-and-drop or URLs from 1,800+ sites including YouTube, Vimeo, and podcast RSS feeds. Record meetings from Zoom, Google Meet, Teams, FaceTime, and Webex with AI-generated summaries. Powered by 10 on-device models (WhisperKit and NVIDIA Parakeet), no internet, account, or subscription required — just a one-time purchase.

What it does

  • Push-to-talk dictation across all apps with 30 language support
  • Fully offline — 10 on-device Whisper and NVIDIA Parakeet models
  • Transcribe files or URLs from 1,800+ sites (YouTube, Vimeo, podcasts, and more)
  • Meeting recording with auto-detection of Zoom, Meet, Teams, FaceTime, and Webex
  • AI summaries via OpenAI, Anthropic, or local Ollama
  • Grammar correction, filler word removal, and personal dictionary with phonetic matching
  • Voice notes with AI transformation templates and screenshot capture
  • 8 color themes, 8 waveform styles, and configurable hotkeys