RocketWhisper

Voice to Text.
Instant. Accurate.

Fully Offline AI Speech Recognition & Transcription — One-Time Purchase

16 Premium Features
73 Hallucination Fixes
100% Offline Operation
Free Download

Do you have these concerns?

Privacy Worries

You want voice input but don't want to send audio data to the cloud...

Low Recognition Accuracy

Other speech recognition software makes too many errors to be useful...

Technical Terms Not Recognized

Industry-specific terminology and proper nouns aren't converted correctly...

Manual Punctuation Is Tedious

Adding periods and commas manually is time-consuming...

RocketWhisper solves it all

High-Accuracy Recognition with OpenAI Whisper

Powered by a world-class AI speech recognition engine. Industry-leading accuracy for multiple languages.

Fully Local Processing

All speech processing happens entirely on your PC. No data is ever sent externally.

Custom Dictionary for Technical Terms

Register industry jargon and proper nouns. Leverages Whisper's initial prompt feature for improved accuracy.
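
As a rough illustration of how a custom dictionary could feed Whisper's initial prompt, the sketch below joins registered terms into one prompt string. The function name `build_initial_prompt`, the "Glossary:" prefix, and the character budget are assumptions made for this example, not RocketWhisper's actual implementation.

```python
def build_initial_prompt(terms, max_chars=400):
    """Join registered terms into one prompt string, skipping duplicates.

    max_chars is an assumed budget: Whisper only uses a limited amount of
    prompt text, so terms are dropped once the budget is used up.
    """
    prefix = "Glossary: "  # hypothetical prefix, chosen for illustration
    seen = set()
    kept = []
    used = len(prefix)
    for term in terms:
        term = term.strip()
        key = term.lower()
        if not term or key in seen:
            continue  # ignore blanks and case-insensitive duplicates
        cost = len(term) + 2  # the term plus a ", " separator
        if used + cost > max_chars:
            break  # budget exhausted, drop the remaining terms
        seen.add(key)
        kept.append(term)
        used += cost
    return prefix + ", ".join(kept)
```

With the open-source `openai-whisper` Python package, a string like this could be passed as the `initial_prompt` argument of `model.transcribe(...)`; how RocketWhisper wires it into Whisper.NET is not described here.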

Auto Punctuation

AI understands context and automatically inserts punctuation at natural positions. Get polished text without typing punctuation yourself.

NEW

Blazing Fast with NVIDIA GPU

If your PC has an NVIDIA GPU, speech recognition runs several times faster, up to 10x or more.
No configuration needed — just launch the app and it automatically detects your GPU for optimal acceleration.

CPU only: normal speed
With GPU: several times to 10x+ faster
Zero config — auto detection
Works on CPU even without a GPU
NVIDIA CUDA supported
NEW

Voice-Powered AI Control & Web Search

Two powerful new features make speech recognition even more versatile.

AI Command Mode

Select text, press the hotkey, and say "translate this," "summarize," or "make it formal." AI processes your text instantly.

Select text + voice command for AI processing
Translate, summarize, rephrase and more
Works in any application

Voice Search

Just say "Search for Tokyo Tower" and a Google search opens automatically in your default browser. Research has never been smoother.

"Search for ..." for instant web search
Supports 10+ phrase patterns in English & Japanese
Opens in your default browser automatically
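
The voice search flow described above can be sketched as phrase matching followed by opening a search URL. The three patterns below are hypothetical examples (the product's own 10+ patterns are not listed on this page), and the function names are made up for illustration.

```python
import re
import webbrowser
from urllib.parse import quote_plus

# Hypothetical English phrase patterns, for illustration only.
SEARCH_PATTERNS = [
    re.compile(r"^\s*search\s+for\s+(?P<q>.+?)[\s.!?]*$", re.IGNORECASE),
    re.compile(r"^\s*look\s+up\s+(?P<q>.+?)[\s.!?]*$", re.IGNORECASE),
    re.compile(r"^\s*google\s+(?P<q>.+?)[\s.!?]*$", re.IGNORECASE),
]


def extract_query(utterance):
    """Return the search terms if the utterance matches a pattern, else None."""
    for pattern in SEARCH_PATTERNS:
        match = pattern.match(utterance)
        if match:
            return match.group("q")
    return None


def search_url(utterance):
    """Build a Google search URL for a recognized search phrase, else None."""
    query = extract_query(utterance)
    if query is None:
        return None
    return "https://www.google.com/search?q=" + quote_plus(query)


def handle_utterance(utterance, open_url=webbrowser.open):
    """Open the search in the default browser; return True if a search ran."""
    url = search_url(utterance)
    if url is None:
        return False
    open_url(url)
    return True
```

Passing `open_url` as a parameter keeps the matching logic testable without launching a browser.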

100% Offline Compatible
AI processing works fully offline & free with local LLMs (LM Studio / Ollama)!
Also supports cloud LLMs: OpenAI, Gemini, Claude, Groq, and more.

NEW

Instant Voice Input with Right Alt Hold

Like a walkie-talkie — hold the Right Alt key to record, release to start recognition.
No extra steps. Intuitive Push-to-Talk makes voice input even more comfortable.

Hold Mode

Hold Right Alt → Speak → Release → Auto recognition. Perfect for short inputs.

Double-Tap Mode

Quickly double-tap Right Alt → Continuous recording starts. Tap again to stop. Ideal for longer text.

Alt+Tab and other standard shortcuts still work
Completely suppresses Windows menu activation
Default hotkey since v1.1.1
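
The two Right Alt modes can be pictured as a small state machine. The following is a minimal sketch under stated assumptions: the timing thresholds, the class name, and the action labels are invented for illustration, and the low-level keyboard hook that would supply the events is left out.

```python
class RightAltController:
    """Toy model of Hold mode and Double-Tap mode driven by key events."""

    TAP_MAX = 0.25         # assumed: presses shorter than this are taps
    DOUBLE_TAP_GAP = 0.40  # assumed: max time between two taps

    def __init__(self):
        self.state = "idle"  # idle | holding | continuous
        self.pressed_at = None
        self.last_tap_at = None
        self.actions = []    # log of what the recorder would be told to do

    def key_down(self, t):
        if self.state == "idle":
            # Start recording right away so Hold mode loses no audio.
            self.state = "holding"
            self.pressed_at = t
            self.actions.append("start_recording")

    def key_up(self, t):
        if self.state == "holding":
            held_for = t - self.pressed_at
            if held_for >= self.TAP_MAX:
                # Hold mode: release ends recording and starts recognition.
                self.state = "idle"
                self.last_tap_at = None
                self.actions.append("stop_and_recognize")
            elif (self.last_tap_at is not None
                  and t - self.last_tap_at <= self.DOUBLE_TAP_GAP):
                # Second quick tap: keep recording until the next tap.
                self.state = "continuous"
                self.last_tap_at = None
                self.actions.append("continuous_recording")
            else:
                # First quick tap: too short to contain speech, discard it.
                self.state = "idle"
                self.last_tap_at = t
                self.actions.append("discard_recording")
        elif self.state == "continuous":
            # Any further tap stops continuous recording.
            self.state = "idle"
            self.actions.append("stop_and_recognize")
```

Feeding it `key_down(0.0)` then `key_up(1.5)` models a hold; two quick taps followed by a third tap model the double-tap flow.
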
NEW

One Hotkey, Instant Custom Instructions

Register your most-used AI tasks to dedicated hotkeys. Just speak, and translation, rephrasing, summarization and more are applied automatically.
No need to say "translate this" every time like AI Command mode. Register up to 20 custom instructions.

How It Works

Example: Assign "Translate to English" to Ctrl+Shift+1 → press the hotkey → speak in Japanese → press the hotkey again → Whisper recognizes → LLM translates to English → result auto-output

4 Built-in Presets

🌐 Translate to English / 💼 Business Style / 📝 Summary / ✔️ Grammar Fix — ready to use out of the box. Add your own custom instructions too.

Dedicated hotkey per instruction
No voice instruction needed — just speak and AI processes automatically
Auto Copy & Auto Paste supported
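
The hotkey-to-instruction flow above can be sketched as a registry that maps each hotkey to a stored prompt and then chains recognition and LLM processing. The class names and the `transcribe` and `llm` callables are placeholders assumed for this example; only the limit of 20 instructions comes from the text.

```python
from dataclasses import dataclass

MAX_INSTRUCTIONS = 20  # limit stated in the text above


@dataclass(frozen=True)
class Instruction:
    name: str
    prompt: str  # what the LLM is asked to do with the recognized text


class InstructionRegistry:
    """Maps a hotkey string to one stored instruction."""

    def __init__(self):
        self._by_hotkey = {}

    def register(self, hotkey, instruction):
        is_new = hotkey not in self._by_hotkey
        if is_new and len(self._by_hotkey) >= MAX_INSTRUCTIONS:
            raise ValueError("at most 20 custom instructions can be registered")
        self._by_hotkey[hotkey] = instruction

    def run(self, hotkey, audio, transcribe, llm):
        """Recognize the audio, then apply the hotkey's instruction to it.

        transcribe(audio) -> str and llm(prompt, text) -> str are supplied
        by the caller; they stand in for Whisper and the configured LLM.
        """
        instruction = self._by_hotkey[hotkey]
        text = transcribe(audio)
        return llm(instruction.prompt, text)
```

Because the instruction is bound to the hotkey, the spoken audio only has to contain the content, not the command.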

Rich Premium Features

01

High-Accuracy Speech Recognition

Supports Whisper small/medium/large-v3-turbo/large-v3 models. Choose the best model for your needs.

02

Fully Offline

No internet connection required. Use it on airplanes, in offline environments — anytime, anywhere.

03

Global Hotkeys

Customizable shortcuts let you start voice input instantly from any app. Right Alt hold mode also supported.

04

Batch Processing

Transcribe multiple audio and video files at once. Process meeting recordings, interviews, and video files (MP4, MKV, etc.) in bulk.

05

Recognition History

Past results are automatically saved. Search and reuse any previous transcription instantly.

06

Voice Commands

Say "new line," "new paragraph," or "delete" to format text with your voice alone.
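
One way to picture voice formatting commands is as a pass over the recognized segments that replaces command phrases with edits. This is only a sketch: treating "delete" as "remove the previous segment" is an assumption, and the function name is hypothetical.

```python
def apply_voice_commands(segments):
    """Turn recognized segments into text, interpreting spoken commands."""
    pieces = []
    for segment in segments:
        command = segment.strip().lower().rstrip(".!?")
        if command == "new line":
            pieces.append("\n")
        elif command == "new paragraph":
            pieces.append("\n\n")
        elif command == "delete":
            if pieces:
                pieces.pop()  # assumed meaning: drop the previous segment
        else:
            pieces.append(segment.strip())

    # Join with single spaces, but never put a space next to a line break.
    text = ""
    for piece in pieces:
        if text == "" or piece.startswith("\n") or text.endswith("\n"):
            text += piece
        else:
            text += " " + piece
    return text
```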

07

Auto Error Correction

Custom rules and regex-based auto-correction. 73 built-in hallucination countermeasures included.
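
Regex-based auto-correction can be sketched as an ordered list of pattern and replacement pairs applied to each result. The three rules below are illustrative assumptions, not the product's 73 built-in countermeasures; the trailing "thank you for watching" rule targets a phrase Whisper is widely reported to hallucinate on silence.

```python
import re

# Illustrative rules only; order matters because each rule sees the
# output of the previous one.
RULES = [
    # Custom vocabulary fix: normalize a commonly split product name.
    (re.compile(r"\bopen ai\b", re.IGNORECASE), "OpenAI"),
    # Collapse an immediately repeated word ("models models" -> "models").
    (re.compile(r"\b(\w+)( \1\b)+", re.IGNORECASE), r"\1"),
    # Drop a trailing stock phrase that was likely never spoken.
    (re.compile(r"\s*thank(?:s| you) for watching[.!]?\s*$", re.IGNORECASE), ""),
]


def correct(text, rules=RULES):
    """Apply each (pattern, replacement) rule in order and trim the result."""
    for pattern, replacement in rules:
        text = pattern.sub(replacement, text)
    return text.strip()
```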

08

Custom Term Registration

Register industry terms and proper nouns to boost accuracy. Leverages Whisper's initial prompt feature.

09

App-Specific Processing Modes

Automatically apply different processing settings per app. Formal for email, casual for notes.

10

AI Processing (LLM Integration)

Supports OpenAI, Gemini, Claude, plus local LLMs (LM Studio/Ollama). AI processing even fully offline.

11

Recording Indicator

When the app is minimized to the system tray, a floating window with a real-time waveform shows the recording status during hotkey recording.

12

Voice Launcher

Say a keyword to auto-launch a registered application. Say "Notepad" and Notepad opens.

13

AI Command Mode

Select text, press the dedicated hotkey, and give voice instructions to have AI translate, summarize, or rephrase instantly.

14

Voice Search

Say "Search for ..." and a Google search opens automatically in your default browser. Supports 10+ phrase patterns.

15

Right Alt Hold Mode

Hold the Right Alt key to record, release to stop automatically. Double-tap to toggle continuous recording mode.

16

Custom Instructions

Register your most-used AI tasks to dedicated hotkeys. Translation, rephrasing, summarization — up to 20 instructions, one key each.

3 Easy Steps

1

Download a Model

Launch the app and download your preferred Whisper model. Smaller models download in just a few minutes.

2

Speak into the Mic or Select a File

Press the record button and speak, or drag & drop an existing audio file.

3

Copy, Save, or Paste the Result

Copy text to clipboard, save to file, or paste directly into your active application.

Recommended For

Writers & Bloggers

Keep your thoughts flowing without stopping to type. Turn ideas into text instantly.

Meeting Note Takers

Quickly transcribe meeting recordings. Batch processing handles multiple files at once.

Privacy-Conscious Users

Audio data is never sent externally. Safely use voice input even with confidential information.

System Requirements & Specs

System Requirements

OS
  Windows 11 / Windows 10 (version 1809 or later)
  * 64-bit only. 32-bit is not supported.
CPU
  Required: x64 (64-bit) processor
  Recommended: Intel Core i5 / AMD Ryzen 5 or higher
Memory
  Minimum: 4GB RAM
  Recommended: 8GB RAM or more (8GB+ recommended for the large-v3-turbo model)
Disk Space
  Application: Approx. 200MB
  Models: 75MB to 2.9GB (depending on selected model)
Runtime
  .NET 8.0 Desktop Runtime
  * Included in Full edition. Lite edition prompts for installation on first launch.
Other
  Microphone (for real-time recognition)
  Internet connection (for model download & cloud LLM AI processing)
  * Internet not required when using local LLMs (LM Studio/Ollama)

Memory & Disk Requirements by Model

Model | File Size | Recommended RAM | Best For
medium | 1.5GB | 8GB | Short utterances under 5 seconds
large-v3-turbo | 1.6GB | 8GB | 5-20 second utterances (fast)
large-v3 | 2.9GB | 16GB | Long utterances over 20 seconds (highest accuracy)
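
The guidance in this table can be expressed as a small lookup. The function names are hypothetical; the thresholds and RAM figures are taken directly from the rows above.

```python
# Recommended RAM per model, in GB, as listed in the table above.
RECOMMENDED_RAM_GB = {
    "medium": 8,
    "large-v3-turbo": 8,
    "large-v3": 16,
}


def recommend_model(utterance_seconds):
    """Pick a model from the typical utterance length, following the table."""
    if utterance_seconds < 5:
        return "medium"
    if utterance_seconds <= 20:
        return "large-v3-turbo"
    return "large-v3"


def meets_recommended_ram(model, installed_ram_gb):
    """True if the PC has at least the RAM recommended for the model."""
    return installed_ram_gb >= RECOMMENDED_RAM_GB[model]
```

For example, a 45-second utterance maps to large-v3, which the table recommends pairing with 16GB of RAM.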

Technical Specifications

Framework: .NET 8.0 WPF
Recognition Engine: OpenAI Whisper (Whisper.NET 1.9.0)
Supported Languages: Japanese / English / Chinese / Korean / Auto-detect
Supported Input Formats: WAV / MP3 / FLAC / OGG / M4A / WMA
Output Formats: Text / Clipboard copy / Direct paste
GPU Acceleration: NVIDIA CUDA supported (auto-detected)
  Automatically accelerated with an NVIDIA GPU. Works fine on CPU without a GPU.

Download Free Now

🍎 Mac version | 🐧 Linux version

Latest Version: 1.2.0

Lite

RocketWhisper Lite

Approx. 50MB

Recommended for users with fast internet. Downloads the AI model on first launch after installation.

  • Lightweight installer (approx. 50MB)
  • Auto-downloads model on first launch
  • Choose from 4 model types
  • .NET Runtime installed separately
Requirements: Internet connection (first time only), .NET 8.0 Desktop Runtime

Lite vs Full Comparison

Item | Lite Edition | Full Edition
Installer Size | Approx. 50MB | Approx. 1.7GB
AI Model | Downloaded on first launch | large-v3-turbo included
.NET Runtime | Separate installation required | Included (auto-installed)
Offline Use | Available after model download | Available immediately after install
First Launch | Usable after model download | Ready to use immediately
Best For | Fast internet environments | Offline & security-focused

System Requirements: Windows 11 / 10 (64-bit), 8GB+ RAM recommended, .NET 8.0 Desktop Runtime

Frequently Asked Questions

Is RocketWhisper free?

Yes, all features are available for free during the trial period. After that, a one-time license purchase (Personal: ¥4,800 / ~$32) is required. No monthly subscription fees ever.

Does it work without internet?

Yes, RocketWhisper works completely offline. All audio data is processed locally on your PC and is never sent to any external server. Perfect for privacy-conscious users.

How accurate is the recognition?

Powered by OpenAI Whisper's latest model (large-v3-turbo), it delivers industry-leading accuracy. 73 hallucination countermeasures and custom vocabulary support ensure practical results.

Do I need a GPU?

No, it works on CPU alone. If you have an NVIDIA GPU, CUDA acceleration is automatically enabled for significantly faster recognition. No setup required.

Can I transcribe meetings?

Yes, you can transcribe in real-time or from recorded audio/video files via batch processing. AI processing features can automatically summarize and reformat the text.

Can I create video subtitles?

Yes, RocketWhisper can transcribe video files (MP4, MKV, AVI, MOV, WebM) and export subtitles in SRT and VTT formats for YouTube and other platforms.
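
For readers unfamiliar with the SRT format mentioned above, here is a minimal sketch of how timed segments map to subtitle blocks. The `(start, end, text)` tuple layout and the function names are assumptions for this example, not RocketWhisper's export code.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as HH:MM:SS,mmm (SRT uses a comma)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Render (start_seconds, end_seconds, text) tuples as SRT text."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Subtitle blocks are separated by one blank line.
    return "\n".join(blocks)
```

WebVTT is similar but starts with a `WEBVTT` header line and writes the milliseconds after a period instead of a comma.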