Transcribe live microphone audio or uploaded files in your browser. Uses Web Speech API for live, Whisper Tiny for files.
Speech to Text offers two transcription modes. Live Mic mode uses your browser's built-in Web Speech API for real-time transcription from your microphone; it works immediately with no downloads. File mode uses OpenAI's Whisper Tiny model via Transformers.js, running entirely in your browser. The Whisper model (~39 MB) is downloaded once and cached; your audio files are never uploaded to any server.
Live mode works instantly with no downloads
File mode uses Whisper AI with no server upload
Supports 7+ languages in live mode
Whisper model cached after first use
Transcript can be copied with one click
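Live mode can be sketched roughly as follows. This is a minimal illustration of the Web Speech API, not this tool's actual source: the helper names `joinTranscript` and `startLiveDictation`, the `SpeechScope` parameter, and the locally declared types are all assumptions made for the example. Only `SpeechRecognition` / `webkitSpeechRecognition`, `lang`, `continuous`, `interimResults`, `onresult`, `isFinal`, and `transcript` come from the API itself.

```typescript
// Minimal structural types for the parts of the Web Speech API used below.
// Declared locally (an assumption of this sketch) so it does not depend on
// the DOM typings shipping SpeechRecognition.
interface Alternative { transcript: string }
interface Result { isFinal: boolean; 0: Alternative }
interface ResultEvent { results: ArrayLike<Result> }
interface Recognizer {
  lang: string;
  continuous: boolean;
  interimResults: boolean;
  onresult: ((event: ResultEvent) => void) | null;
  start(): void;
  stop(): void;
}
type RecognizerCtor = new () => Recognizer;
interface SpeechScope {
  SpeechRecognition?: RecognizerCtor;
  webkitSpeechRecognition?: RecognizerCtor; // prefixed name used by Chrome
}

// Hypothetical helper: separate committed (final) text from the interim
// tail that the recognizer may still revise.
function joinTranscript(results: ArrayLike<Result>): { finalText: string; interimText: string } {
  let finalText = "";
  let interimText = "";
  for (let i = 0; i < results.length; i++) {
    const result = results[i];
    if (!result) continue;
    const text = result[0].transcript; // index 0 is the top alternative
    if (result.isFinal) finalText += text;
    else interimText += text;
  }
  return { finalText, interimText };
}

// Hypothetical helper: start continuous dictation and report the running
// transcript. Returns null when the browser has no speech recognition.
function startLiveDictation(
  scope: SpeechScope,
  lang: string,
  onText: (text: string) => void,
): Recognizer | null {
  const Ctor = scope.SpeechRecognition ?? scope.webkitSpeechRecognition;
  if (!Ctor) return null;
  const recognizer = new Ctor();
  recognizer.lang = lang;            // e.g. "en-US"
  recognizer.continuous = true;      // keep listening across pauses
  recognizer.interimResults = true;  // stream partial text while speaking
  recognizer.onresult = (event) => {
    const { finalText, interimText } = joinTranscript(event.results);
    onText(finalText + interimText);
  };
  recognizer.start();
  return recognizer;
}
```

In a page, `scope` would be `window` (cast to `SpeechScope`) and `onText` would write into the textarea. Passing the scope in as a parameter is a design choice for this sketch only: it keeps the browser dependency explicit and lets the logic be exercised with a stand-in recognizer.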
Live mode uses the Web Speech API for real-time dictation, so it works instantly with no downloads.
Input: MP3 of a 20-minute team meeting, English, recorded on a laptop microphone
Output: Plain-text transcript with punctuation and speaker pauses indicated, ready for action-item extraction.
Input: Speaker dictating notes via headset microphone in Chrome
Output: Text streams into the textarea in real time as the speaker talks, so a 5-minute monologue takes roughly 5 minutes to capture.
Input: M4A voicemail in mixed Spanish and English, 90 seconds
Output: Whisper auto-detects the dominant language and produces a clean transcript with both languages preserved verbatim.
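For the file examples above, the File mode flow might look something like the sketch below. It is an assumption-laden illustration rather than the tool's real code: the model ID `Xenova/whisper-tiny`, the helper names `getTranscriber` and `transcribeFile`, the local types, and the 30-second chunk / 5-second stride values are all choices made for this example. The `automatic-speech-recognition` task name and the `chunk_length_s` / `stride_length_s` options are part of Transformers.js.

```typescript
// Minimal structural types for the slice of Transformers.js this sketch uses.
type WhisperOptions = { chunk_length_s: number; stride_length_s: number };
type WhisperTranscriber = (
  audio: string | Float32Array,
  options: WhisperOptions,
) => Promise<{ text: string }>;
type PipelineFactory = (task: string, model: string) => Promise<WhisperTranscriber>;

// Assumed model ID; the page only says "Whisper Tiny".
const MODEL_ID = "Xenova/whisper-tiny";

// Reuse one loading promise so the pipeline is created once per page session.
let loading: Promise<WhisperTranscriber> | null = null;

function getTranscriber(createPipeline: PipelineFactory): Promise<WhisperTranscriber> {
  if (loading === null) {
    loading = createPipeline("automatic-speech-recognition", MODEL_ID);
  }
  return loading;
}

// Hypothetical helper: transcribe an audio URL or raw samples in-browser.
// Whisper works on 30-second windows, so longer recordings (such as a
// 20-minute meeting) are processed in overlapping chunks.
async function transcribeFile(
  audio: string | Float32Array,
  createPipeline: PipelineFactory,
): Promise<string> {
  const transcriber = await getTranscriber(createPipeline);
  const output = await transcriber(audio, { chunk_length_s: 30, stride_length_s: 5 });
  return output.text.trim();
}
```

In a page, `createPipeline` would be the `pipeline` function exported by Transformers.js, and `audio` could be an object URL created from the selected file. The in-memory reuse shown here is separate from the on-disk caching of the downloaded model, which the library handles itself.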