Product How It Works Providers Styles GitHub Download for Free
OPEN SOURCE

Your voice, transcribed.

Dimmy sits on your screen. Press a key, speak, get text. That simple.

PRODUCT

The Pill

Always There

A tiny pill sits at the edge of your screen. Unobtrusive. Waiting.

Ready to Record

Press your hotkey. The pill expands. A waveform pulses with your voice.

Live Transcription

Your words stream onto screen in real-time. AI transcribes every syllable.

Enhanced & Pasted

Choose a style. Your text transforms. It's in your clipboard, ready to paste.

HOW IT WORKS

Three Moments

Record

Press your shortcut. Speak naturally. Dimmy captures every word with crystal clarity.

Transcribe

AI transcribes in real-time. Six providers, one interface. Pick what works for you.

Enhance

13 AI styles reshape your words instantly. Professional, creative, or completely unhinged.

PROVIDERS

Universal Intelligence

Six AI providers. Swap anytime. Some are completely free.

Groq

STT Free

Blazing-fast Whisper inference. Zero cost, zero compromise.

OpenAI

STT + LLM

The gold standard. Whisper and GPT in one provider.

Deepgram

STT $200 Free

Enterprise-grade speech recognition with generous free tier.

Google Gemini

LLM Free

Google's multimodal AI for powerful text enhancement.

Anthropic

LLM

Claude-powered enhancement. Thoughtful, nuanced text styling.

OpenRouter

LLM

Access 100+ models through a single API. Ultimate flexibility.

STYLES

13 Ways to Say It

Same voice. Different vibes. Watch your words transform in real-time.

Output

Hey team, I wanted to follow up on yesterday's meeting about the Q3 roadmap. I think we should prioritize the mobile app and maybe push the API redesign to Q4.

FEATURES

Built Different

Cross-Platform

Windows. macOS. Linux. Same experience everywhere. No compromises, no platform-specific quirks. One app, three operating systems.

Secure by Design

API keys stored in your OS keyring. Never in config files, never in plain text. Your credentials stay yours.

Real-time Preview

See your words appear as you speak. Live waveform visualization plus streaming text output.

Anti-Hallucination Guard

Preprocessing pipeline filters silence and noise before it hits the AI. No phantom words from empty audio.

Open Source

Free as in freedom. Community-driven development, transparent codebase, no telemetry. Fork it, improve it, make it yours.