Superwhisper app icon

Superwhisper

AI Models

The models that power Superwhisper

Superwhisper ships with a catalog of transcription and language models. Some run on your device and never touch the internet. Others run in the cloud when you want the fastest or most accurate output. You can switch at any time.

SOC 2 Type II certifiedHIPAA compliant
Download freeView pricing

How to read this page

Two jobs, two model types

Superwhisper runs two models back to back. A speech recognition model turns your voice into text. Then an optional language model rewrites that text in Super Mode so an email comes out like an email and a code prompt stays technical.

You can mix and match. On-device Parakeet into cloud Claude Sonnet is a common setup. So is Ultra into nothing at all for raw transcription. The speed and accuracy dots are relative rankings inside Superwhisper, not industry-wide scores.

Speech recognition

Cloud transcription

Audio is sent to the Superwhisper proxy, transcribed, and returned. Best for maximum accuracy and the fastest cloud latency.

ModelProviderSpeed / AccuracyLanguagesTier
UltraSuperwhisper
100+Pro
S1-VoiceSuperwhisper
100+Pro
Scribe V2ElevenLabs
99Pro
Nova 3Deepgram
36Pro
Nova 2Deepgram
36Pro
Nova MedicalDeepgram
EnglishPro

On-device transcription

Models run locally on your Mac, PC, or iPhone. Audio never leaves the device and nothing needs an internet connection. The Fast, Nano, and Standard Whisper models are on the free tier. The faster Parakeets and larger Whisper variants ship with Pro.

ModelProviderSpeed / AccuracyLanguagesSizeTier
Parakeet V2NVIDIA
English476 MBPro
Parakeet V3NVIDIA
24494 MBPro
Ultra V3OpenAI Whisper
100+3.0 GBPro
Ultra V3 TurboOpenAI Whisper
100+1.6 GBPro
ProOpenAI Whisper
100+1.5 GBPro
StandardOpenAI Whisper
100+500 MBFree
NanoOpenAI Whisper
100+150 MBFree
FastOpenAI Whisper
100+75 MBFree

Super Mode

Cloud language models

Super Mode uses these to rewrite your transcript, translate it, or reformat it for the app you're in. Requests go through the Superwhisper proxy, so providers never see your account and nothing is retained for training.

ModelProviderSpeed / IntelligenceContextTier
Claude Sonnet 4.6Anthropic
1MPro
Claude Sonnet 4.5Anthropic
200kPro
Claude Haiku 4.5Anthropic
200kPro
GPT-5.4 miniOpenAI
400kPro
GPT-5.4 nanoOpenAI
400kPro
GPT-5.3 InstantOpenAI
128kPro
GPT-5.2OpenAI
400kPro
GPT-5.1OpenAI
400kPro
GPT-5 miniOpenAI
400kPro
GPT-5 nanoOpenAI
400kPro
Gemini 3 FlashGoogle
1MPro
Gemini 3.1 Flash LiteGoogle
1MPro
Grok 4.1 FastxAI
2MPro
S1-LanguageSuperwhisper
128kPro
Llama 3.1 8BMeta / Groq
128kPro

On-device language models

Run locally through llama.cpp on Apple Silicon or Windows. No internet needed. Size indicates the download on disk. Included with Pro.

ModelProviderSpeed / IntelligenceSizeTier
GPT OSS 20BOpenAI
14 GBPro
DeepSeek R1 DistillDeepSeek
5.4 GBPro
Ministral 3 8BMistral
5.2 GBPro
Llama 3 8BMeta
4.9 GBPro
Mistral 7B v0.2Mistral
4.4 GBPro
Llama 3.2 3BMeta
1.9 GBPro
Phi-2 3BMicrosoft
1.8 GBPro

Methodology

Where the ratings come from

Transcription speed and accuracy are anchored to the Hugging Face Open ASR Leaderboard. Parakeet V2 leads on real-time factor with an industry-best 6.05% word error rate for English. Whisper Large V3 scores slightly higher on accuracy but runs much slower, which is why Ultra V3 earns a 2 on speed and a 5 on accuracy while the Turbo variant evens out at 4 and 4.

Language model speed and intelligence are anchored to Artificial Analysis. Claude Sonnet 4.6 and the GPT-5.4 series sit at the top of the intelligence index alongside GPT-5.1 and GPT-5.2. Gemini 3.1 Flash Lite pushes above 200 tokens per second, which is why it gets the speed crown on the lite side. Groq's Llama 3.1 8B is faster still, but its smaller parameter count caps it at a 2 on intelligence.

The dots are relative to the other models in Superwhisper, not the entire industry. A 5 is the best in its class. A 1 is there for people who want a tiny download and can live with a rougher transcript.

Privacy

On-device by default, cloud by choice

Every on-device model in this list runs locally. Your microphone input never leaves the machine, we do not log audio, and nothing about what you dictate is stored on our servers. You can work on a plane or inside a secure environment and the models behave the same way.

Cloud models are optional. When you pick one, audio and text go through the Superwhisper proxy to the provider and back. Providers see a proxy request, not your account. Nothing is retained for training. Enterprise customers can swap in their own API keys or host compatible models behind a VPC.

What people say

Loved by those who move fast

Andrej Karpathy

Andrej Karpathy

Founder, Eureka Labs

There's a new kind of coding I call "vibe coding", where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. I just talk to Composer with @superwhisperapp...

Pieter Levels

Pieter Levels

Serial entrepreneur, @levelsio

Tried @superwhisperapp today. Very nice. Lets me talk to Cursor and then it codes for me, just gets it right.

Support

Frequently asked questions

How do I pick a model?

For everyday voice-to-text on a Mac with Apple Silicon, Parakeet is a good starting point. It runs on your device, handles English well, and feels instant. If you want the best accuracy in any language, Ultra is the cloud transcription model we run. For Super Mode, Claude Sonnet 4.6 or GPT-5.2 will give you the best rewrites, GPT-5.4 mini is a solid default, and Haiku 4.5 or GPT-5.4 nano are the right call when you want faster output.

What's the difference between on-device and cloud models?

On-device models run locally. Your audio never leaves the computer, nothing is logged, and you can work offline. Cloud models are faster for their accuracy class because they run on larger servers, and they cover more languages. Superwhisper routes everything through its own proxy, so providers like OpenAI and Anthropic never see your account or your content.

How are the speed and accuracy scores decided?

We start with published benchmarks from the Hugging Face Open ASR Leaderboard for transcription and Artificial Analysis for LLMs, then adjust based on how the models actually feel in Superwhisper. The dots compare models inside Superwhisper, not across the whole industry. A 5 is the best in its class, not a claim that every 5-dot model is equivalent.

What do I get on the free tier?

The free tier includes the Fast, Nano, and Standard Whisper models. They run entirely on-device, work offline, and are enough for most everyday voice-to-text. Pro unlocks Parakeet, the larger Whisper variants, Ultra in the cloud, and every Super Mode language model.

Can my company host its own models?

Yes. Enterprise customers can run Superwhisper against self-hosted models or against private endpoints from OpenAI, Anthropic, Google, or AWS Bedrock. Model access is controlled centrally through the admin dashboard.

What about privacy and compliance?

Superwhisper is SOC 2 Type II certified and HIPAA compliant. On-device models never send audio anywhere. Cloud requests are proxied through our infrastructure, and providers don't retain your data for training.

When do new models get added?

We add new models as they ship. Claude Sonnet 4.6, the GPT-5.4 series, GPT-5.3 Instant, Gemini 3.1 Flash Lite, and Parakeet V3 landed within days of release. If a model you want isn't here yet, it's probably on the way.

What if I work with sensitive security or healthcare patient data?

Pick an on-device model and your audio stays on your machine — nothing is sent to us or any provider, and you can work fully offline. For teams handling PHI or regulated data, Superwhisper is HIPAA compliant and SOC 2 Type II certified, and Enterprise can route cloud requests through your own private endpoints. Read our guide on handling sensitive data.

One app, every model

Free tier runs the core Whisper models on-device. Pro unlocks Parakeet, Ultra, and every cloud model.

Voice to text in any app
Cloud and on-device AI models
100+ languages
Meeting recording and transcription
Bring-your-own enterprise models
SOC 2 Type II certified & HIPAA compliant