DOSAYGO Studio

WisprNote: Offline Voice Transcription

By Cris, February 22, 2023

Transcribing voice notes offline at high quality is a challenge, and the usual alternatives are plagued by delays and privacy concerns. I created WisprNote, a Mac app that uses Silero VAD, FFmpeg, and OpenAI Whisper models for fast, private transcription. In the web’s silent grove, WisprNote is a whispering tree, capturing spoken thoughts with serene precision.

The pipeline has three stages, all running offline to keep audio private: Silero VAD strips silence, FFmpeg re-encodes the remaining speech as 16 kHz mono 16-bit PCM, and Whisper transcribes it, delivering 50–200 words per second.
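As a rough illustration of how a words-per-second figure could be checked, here is a minimal sketch. The helper name `words_per_second` is hypothetical, and this is not necessarily how the 50–200 figure was measured.

```shell
# Hypothetical helper: print words per second for a transcript.
#   $1 = path to a plain-text transcript
#   $2 = elapsed transcription time in seconds
words_per_second() {
  local transcript="$1" elapsed="$2" words
  # Refuse to continue if the transcript cannot be read.
  [ -r "$transcript" ] || return 1
  # Count whitespace-separated words in the transcript.
  words=$(wc -w < "$transcript")
  # Divide in awk so fractional results are possible; fail on zero or negative time.
  awk -v w="$words" -v s="$elapsed" \
    'BEGIN { if (s + 0 <= 0) exit 1; printf "%.1f\n", w / s }'
}
```

For example, a 6-word transcript produced in 2 seconds yields 3.0. The elapsed time could come from comparing `date +%s` before and after the transcription step.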

# Bash script snippet for WisprNote transcription
speech_file="${output_dir}/output.wav"
"$tvad" "$wav_file" "$vad_model"  # Silero VAD to remove silence
"$ffmpeg" -y -i "$speech_file" -acodec pcm_s16le -ar 16000 -ac 1 "$wav_file"  # Re-encode for Whisper
"$whisper" -m "$model" -otxt -of "$wav_file" "$wav_file"  # Transcribe the re-encoded audio with Whisper

Every step runs locally: the script strips silence, re-encodes the audio, and hands it to Whisper, so the recording never leaves the Mac.
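Each stage depends on the output of the one before it, so an early failure should stop the run instead of passing bad audio along. Here is a minimal sketch of a fail-fast wrapper, assuming a hypothetical helper named `run_step` that is not part of the original script:

```shell
# Hypothetical helper: run one pipeline step and stop with a clear message on failure.
#   $1 = human-readable label for the step
#   $2... = the command and its arguments
run_step() {
  local label="$1"
  shift
  # Check that the tool exists and is executable before running it.
  if ! command -v "$1" >/dev/null 2>&1; then
    echo "error: ${label}: '$1' not found or not executable" >&2
    return 127
  fi
  # Run the command; report which step failed if it exits non-zero.
  if ! "$@"; then
    echo "error: ${label} failed" >&2
    return 1
  fi
}

# Usage with the variables from the snippet above (the other steps wrap the same way):
#   run_step "silence removal" "$tvad" "$wav_file" "$vad_model" || exit 1
```

Wrapping each command this way keeps the pipeline readable while making it obvious which tool failed.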

WisprNote interface: a Mac screen showing transcribed voice notes, set against a backdrop of a serene grove with glowing orbs

Building WisprNote was like planting a tree in a quiet grove, nurturing thoughts into text. At DOSAYGO, we value privacy and precision, crafting tools that capture ideas securely. WisprNote is a keeper of whispers, preserving voices in a silent embrace.