myWhisperer

Free, open-source voice-to-text desktop app. Powered by OpenAI Whisper and GPT.

Turn your voice into perfectly formatted text — in any app, on any platform. A local, privacy-first alternative to subscription-based dictation tools like Wispr Flow, Otter.ai, and Dragon NaturallySpeaking.

About

myWhisperer is a free, open-source desktop application that turns your voice into text using OpenAI's Whisper API, then polishes it with GPT for clean, publication-ready output. It intelligently detects which application you're typing in and formats the text accordingly — emails sound professional, chat messages stay casual, code comments use the right syntax.

Bring your own OpenAI API key. No subscriptions. No recurring fees. No data collection. Your audio and text stay on your machine — only the OpenAI API calls leave your device.

Why myWhisperer?

Free forever — no subscriptions, pay only for the OpenAI API usage you consume
Private by design — no telemetry, no analytics, no data collection, no servers
Context-aware — automatically adapts formatting to your active application
Open source — MIT licensed, fully auditable, community-driven

Features

Voice-to-text transcription — accurate speech recognition in 100+ languages using OpenAI Whisper
Context-aware smart formatting — detects the active application (email, chat, code editor, notes, terminal, etc.) and adapts output formatting automatically
GPT formatting level slider — from 0% (raw transcription) to 100% (fully formatted with context-aware rewriting), with Light, Moderate, and Full tiers
Side-by-side transcription view — see both the original raw transcription and the GPT-formatted version
Two recording modes — Toggle (press to start/stop) and Push-to-Talk (hold to record, release to stop)
Customizable global hotkey — default Control+Space, configurable to any key combination
System tray integration — runs quietly in the background with minimize-to-tray
Light/Dark/System theme support — matches your OS preference or set manually
Personal dictionary — add custom terminology, names, and jargon for more accurate transcriptions
Auto-copy and auto-paste — transcribed text is automatically copied to clipboard and pasted into the active application
Transcription history — browse, search, and reuse past transcriptions
Cross-platform — macOS, Windows, and Linux
Secure — API key encryption via OS keychain (macOS Keychain, Windows DPAPI), context isolation enabled, no external data storage

Installation

Download the latest release from GitHub Releases:

Platform	Format
macOS	`.dmg` or `.zip`
Windows	NSIS installer (`.exe`) or portable `.exe`
Linux	`.AppImage` or `.deb`

Requires an OpenAI API key (pay-as-you-go, typically a few cents per transcription).

macOS: Bypassing Gatekeeper

Since myWhisperer is not signed with an Apple Developer certificate, macOS will block it on first launch. To open it:

Right-click (or Control-click) the app in Finder
Select Open from the context menu
Click Open in the confirmation dialog

macOS will remember your choice and allow the app to run normally afterward.

Alternatively, you can remove the quarantine attribute from the terminal:

xattr -cr /Applications/myWhisperer.app

macOS: Self-Signing (Optional)

If you prefer to self-sign the app to avoid Gatekeeper warnings entirely:

# Self-sign the application
codesign --force --deep --sign - /Applications/myWhisperer.app

# Verify the signature
codesign --verify --verbose /Applications/myWhisperer.app

Note: Self-signing removes the "unidentified developer" warning but does not replace a proper Apple Developer signature. The app is fully functional without signing.

Getting Started

Download and install myWhisperer for your platform.
Launch the app and open Settings.
Enter your OpenAI API key in the API Configuration section.
Choose your preferred recording mode (Toggle or Push-to-Talk) and hotkey.
Press the hotkey (default: Control+Space) to start recording. Speak naturally.
Your transcription appears with both the raw and formatted versions, and is automatically copied/pasted.

Development Setup

Prerequisites

Node.js 20+
npm
An OpenAI API key (for runtime)

Getting Started

# Clone the repository
git clone https://github.com/KunalGehlot/myWhisperer.git
cd myWhisperer

# Install dependencies
npm install

# Start in development mode (hot reload)
npm run dev

Build and Package

# Build for production
npm run build

# Package for your current platform
npm run dist

# Package for a specific platform
npm run dist:mac
npm run dist:win
npm run dist:linux

Built packages are output to the release/ directory.

Quality Checks

# TypeScript type checking
npm run typecheck

# Run tests
npm test

Configuration

All settings are accessible from the Settings panel inside the app:

Setting	Description	Default
API Key	Your OpenAI API key	--
Whisper Model	Model used for transcription	`whisper-1`
GPT Model	Model used for text formatting	`gpt-4.1`
Formatting Level	GPT formatting intensity (0-100% slider)	70%
Custom Format Prompt	Override the level-based formatting with a custom prompt	--
Recording Mode	Toggle or Push-to-Talk	Toggle
Hotkey	Global keyboard shortcut (customizable)	`Control+Space`
Language	Transcription language (20+ presets, auto-detect default)	Auto-detect
Theme	Light, Dark, or System	System
Audio Input	Microphone device	System default
Auto-copy	Copy text to clipboard after transcription	Enabled
Auto-paste	Paste text into active app after transcription	Enabled
Personal Dictionary	Custom words and phrases for better accuracy	--

Formatting Levels

The formatting slider controls how aggressively GPT processes your transcription:

Level	Behavior
0%	No formatting -- raw transcription output
1-30%	Light -- capitalization, filler word removal, basic punctuation
31-70%	Moderate -- grammar, punctuation, light sentence flow improvements
71-100%	Full -- context-aware rewriting based on the active application (email, chat, code editor, etc.)

Setting a custom format prompt overrides the level-based formatting entirely.

Architecture

myWhisperer follows standard Electron architecture with a clear separation between processes:

+-------------------+       IPC        +--------------------+
|  Main Process     | <--------------> |  Renderer Process  |
|  (Node.js)        |                  |  (React + Vite)    |
|                   |                  |                    |
|  - Audio capture  |                  |  - UI components   |
|  - Whisper API    |                  |  - Settings panel  |
|  - GPT API        |                  |  - History view    |
|  - Context detect |                  |  - Recording state |
|  - Clipboard      |                  |                    |
|  - System tray    |                  |                    |
|  - Global hotkeys |                  |                    |
|  - Settings store |                  |                    |
+-------------------+                  +--------------------+
         |
         v
  +--------------+
  |  OpenAI API  |
  |  - Whisper   |
  |  - GPT       |
  +--------------+

Main process handles all system-level operations: audio recording, API calls to OpenAI, context detection, clipboard management, system tray, and global hotkey registration.
Renderer process is a React application that provides the user interface, styled with Tailwind CSS.
IPC bridge connects the two processes via a secure preload script with context isolation enabled.

Tech Stack

Layer	Technology
Framework	Electron 35
Frontend	React 19
Language	TypeScript 5.7
Styling	Tailwind CSS 3.4
Bundler	Vite 6
AI	OpenAI SDK 4.x
Storage	electron-store 8
Packaging	electron-builder 26

Project Structure

myWhisperer/
  src/
    main/                     # Electron main process
      main.ts                 # App lifecycle, IPC handlers
      gpt-service.ts          # GPT formatting with level-based prompts
      whisper-service.ts      # Whisper API transcription
      context-detector.ts     # Active window detection (macOS/Win/Linux)
      shortcut-manager.ts     # Global shortcuts (toggle + push-to-talk)
      clipboard-manager.ts    # Clipboard and auto-paste
      settings-store.ts       # Persistent settings with electron-store
      tray-manager.ts         # System tray icon and menu
    preload/
      preload.ts              # Secure IPC bridge (context isolation)
    renderer/
      components/             # React UI components
      hooks/                  # React hooks
      styles/                 # Global CSS and Tailwind setup
      types/                  # TypeScript type definitions
      main.tsx                # React entry point
  resources/                  # App icons and platform-specific assets
  .github/workflows/          # CI/CD pipelines
  index.html                  # HTML entry point
  package.json
  tsconfig.json               # TypeScript config (renderer)
  tsconfig.main.json          # TypeScript config (main process)
  vite.config.ts              # Vite bundler config
  tailwind.config.js          # Tailwind CSS config

Contributing

Contributions are welcome.

Development happens on the main branch.
Pushing to the broad branch triggers the auto-release pipeline, which builds and publishes to GitHub Releases for all platforms.
Before submitting a PR, run npm run typecheck and npm test to ensure everything passes.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

OpenAI for the Whisper and GPT APIs
Inspired by Wispr Flow

_{myWhisperer is a free, open-source voice-to-text application, speech-to-text transcription tool, and AI-powered dictation software for macOS, Windows, and Linux. It serves as a free alternative to Wispr Flow, Otter.ai, Dragon NaturallySpeaking, and other paid dictation services. Keywords: free transcription app, voice to text, speech to text, dictation software, AI transcription, whisper transcription, open source dictation, desktop transcription tool, offline voice recognition, GPT text formatting.}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
resources		resources
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.main.json		tsconfig.main.json
vite.config.ts		vite.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

myWhisperer

About

Why myWhisperer?

Features

Installation

macOS: Bypassing Gatekeeper

macOS: Self-Signing (Optional)

Getting Started

Development Setup

Prerequisites

Getting Started

Build and Package

Quality Checks

Configuration

Formatting Levels

Architecture

Tech Stack

Project Structure

Contributing

License

Acknowledgments

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

myWhisperer

About

Why myWhisperer?

Features

Installation

macOS: Bypassing Gatekeeper

macOS: Self-Signing (Optional)

Getting Started

Development Setup

Prerequisites

Getting Started

Build and Package

Quality Checks

Configuration

Formatting Levels

Architecture

Tech Stack

Project Structure

Contributing

License

Acknowledgments

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages