v2.1.0 — AI Computer Use Inside

RocketMouse AI

The reliability of traditional RPA meets the intelligence of AI.
RPA just got a brain and eyes.

0
Blocks
0
Features
0
Unit Tests
0
AI Providers

Key Features

Powerful automation without writing a single line of code

🧩

Scratch-Style Block Editor

Drag & drop blocks to connect them. Build macros intuitively with no coding required.

🤖

AI Computer Use NEW

AI that sees, understands, and operates your screen. Autonomous task completion via Claude Computer Use API.

🌐

Browser Automation

Control Edge/Chrome via Playwright. Automate web forms, data collection, and more.

📊

Excel Integration

18 Excel operations via COM. Read/write cells, set formulas, run macros, and manage sheets.

👁

Image Recognition

OpenCV + Windows OCR to detect images and text on screen, find positions, and interact.

🐛

Step-by-Step Debugging

Pause, step through, watch variables, and set breakpoints for reliable debugging.


App Screenshots

📷

App Screenshots — Coming Soon


217+ Block Types

Comprehensive blocks for every automation scenario

Mouse

Click, drag, scroll, and more 16 blocks

Keyboard

Key input, text paste 2 blocks

Window

Window management 8 blocks

Browser

Playwright browser automation 16 blocks

Excel

COM-based Excel operations 18 blocks

File / Folder

File and folder operations 15 blocks

Control Flow

Conditions, loops, functions 17 blocks

AI

LLM API integration + Vision 1 block

Vision / OCR

Image recognition, text reading 9 blocks

Data Processing

JSON, regex, dates, lists 22 blocks

Variable

Define, evaluate, reference 5 blocks

System / Utility

App launch, ZIP, Base64, Hash 18+ blocks


Traditional RPA vs AI Computer Use vs RocketMouse AI

RocketMouse AI is the only RPA tool that combines the best of both worlds

Traditional RPA AI Computer Use RocketMouse AI Best of Both Worlds
Execution Speed ⚡ Fast (instant) 🐢 Slow (LLM inference per step) ⚡ Routine tasks run fast + AI only when needed
API / Running Cost 💲 Zero (fully offline) 💸 High (screenshot per step) 💲 Routine tasks are free + minimal AI usage
UI Layout Resilience ❌ Fragile (coordinate/selector dependent) ✅ Strong (visually identifies elements) ✅ AI visually recognizes UI elements
Unexpected Dialog Handling ❌ Can only stop ✅ Self-decides and adapts ✅ AI Autopilot handles autonomously
Ease of Creation ⚠ Manual coordinate picking ✅ Natural language instructions ✅ Drag & drop + natural language
Reproducibility ✅ Same input → same result ❌ Different path each run ✅ Deterministic for routine + AI for judgment
Self-Healing ❌ Not possible ✅ Observes and self-corrects ✅ Built-in self-healing
Offline Operation ✅ Fully supported ❌ Requires internet ✅ RPA runs offline + local LLM support
Browser Automation ⚠ Varies by tool ⚠ Screen-only (no DOM) ✅ Playwright (DOM) + AI Vision
Excel Operations ⚠ Varies by tool ❌ Screen-only ✅ COM API (18 ops) + screen control

AI That Sees, Understands, and Operates

Dual-provider Computer Use: Anthropic Claude and OpenAI GPT-5.5. RPA now has a brain and eyes.

🎯

AI Click

"Click the Save button" — AI sees the screen, locates the element, and clicks it. No coordinates needed. 2-pass refinement detects even small icons at screen edges with high accuracy.

AI Vision
🧠

AI Autopilot

"Open Notepad, type Hello World, and save" — Multi-turn autonomous operation via Claude or OpenAI GPT-5.5 Computer Use API. Takes screenshots, decides the next action, and continues until the task is complete. Switch providers freely.

Computer Use API (Claude / GPT-5.5)
👀

AI OCR

"Read the error message on screen" — AI Vision reads any text on screen and stores it in a variable. Unlike traditional OCR, it understands context and semantics.

AI Vision

AI Smart Wait

"Wait until the download dialog appears" — AI periodically checks the screen and waits intelligently until the condition is met. No more fixed-time waits.

AI Vision

AI Validate

"Verify the file was saved correctly" — AI examines the screen after an action and returns true/false for conditional branching in your macro.

AI Vision
💫

AI Boolean Condition

"Is the login screen showing?" — Use AI judgment in if/while conditions. Build dynamic branching based on visual screen state, all with no code.

AI Vision

⚡ The Hybrid Advantage — Mix RPA and AI Freely in a Single Macro

Routine operations run via Win32 API: fast, reliable, free. AI kicks in only where the UI is unpredictable. The optimal balance of cost and reliability in one macro — only with RocketMouse AI.

⚡ Win32: Launch Excel ⚡ Enter Data (fast) 🧠 AI Click: Find Save Button ⚡ Keyboard Shortcut 🧠 AI Autopilot: Review

Built-in AI Assistant

5 LLM providers supported

💬 Natural Language → Blocks

Type "Open Notepad, type Hello World, and save" to automatically generate and place the corresponding block sequence.

⚙ 5 LLM Providers

OpenAI (GPT-5.5 series), Anthropic (Claude 4.6/4.7), Google Gemini, Groq (Llama 4 Scout, etc.), and local LLMs (LM Studio/Ollama). Choose your preferred provider.


Get Started Today

Try all features free for 15 days

 Mac Version