Captures system audio and turns speech into text in real time, not after the fact — meetings, podcasts, YouTube, anything playing through your Mac. Runs locally with Whisper, NVIDIA Parakeet, Apple Foundation Models, or Qwen depending on your trade-off between speed and quality, with diarization for separating speakers. Twenty-plus European languages plus English; no cloud round-trip means no waiting and no privacy compromise.
Featured in:On-device, by design