
Desktop Control

Advanced desktop automation with pixel-perfect mouse control, keyboard input, screen capture, window management, and clipboard operations. Built on PyAutoGUI.

by matagul

Productivity & Tasks, Automation, Desktop, PyAutoGUI
Install
npx clawhub@latest install desktop-control
Current Installs: 381

All-time Installs: 120

Version: v1.0.0


Desktop Control Overview

Desktop Control provides comprehensive desktop automation capabilities, allowing OpenClaw to interact with any application on your computer.

It offers pixel-perfect mouse control, fast keyboard input, screen capture with image recognition, window management, and clipboard operations, all backed by built-in safety features.

How It Works

Step 1 - Initialize DesktopController

The skill uses PyAutoGUI as the core engine and exposes a DesktopController class for automation.

Step 2 - Drive Input

Mouse moves, clicks, drags, scrolling, and keyboard events are routed through DesktopController.

Step 3 - Observe the Screen

Screenshots, pixel checks, and optional OpenCV image recognition help locate UI targets.

Step 4 - Stay Safe

Failsafe corners, pause controls, and optional approval mode help prevent unsafe actions.
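
The page does not document how approval mode is implemented, so the following is only a sketch of the general idea: ask for confirmation before an action runs and log the decision either way. The function name run_with_approval, the logger name, and the prompt wording are all hypothetical and are not part of the skill's API.

```python
import logging

# Hypothetical logger name; the skill's own logging setup is not documented here.
logger = logging.getLogger("desktop_actions")


def run_with_approval(description, action, approve=input):
    """Run `action` only if the approver answers yes; log the outcome.

    `approve` is any callable that takes a prompt string and returns the
    answer. It defaults to the built-in input() for interactive use.
    """
    answer = approve(f"Allow '{description}'? [y/N] ")
    if answer.strip().lower() not in ("y", "yes"):
        logger.info("Declined: %s", description)
        return False
    logger.info("Approved: %s", description)
    action()
    return True
```

Passing the approver as a parameter keeps the gate easy to exercise without a person at the keyboard, for example run_with_approval("close window", do_close, approve=lambda prompt: "y"), where do_close stands in for whatever action is being gated.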

Key Features

Mouse Control

Absolute and relative positioning, smooth movement, left/right/middle/double click, drag-and-drop, and scrolling.
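
As a rough illustration of the drag-and-drop case, the sketch below calls PyAutoGUI directly rather than going through DesktopController, whose method names are not listed on this page. The helper name drag_and_drop and its gui parameter are assumptions made for the example; moveTo and dragTo are PyAutoGUI functions.

```python
def drag_and_drop(start, end, duration=0.5, gui=None):
    """Move to `start`, then drag with the left button held down to `end`.

    `start` and `end` are (x, y) screen coordinates. `gui` defaults to the
    pyautogui module; it is a parameter only so the helper can be exercised
    with a stand-in object on machines without a display.
    """
    if gui is None:
        import pyautogui as gui  # imported lazily; needs a graphical session
    gui.moveTo(start[0], start[1], duration=duration)  # smooth absolute move
    gui.dragTo(end[0], end[1], duration=duration, button="left")
```

The other actions in the list map onto similar PyAutoGUI calls: pyautogui.move(dx, dy) for relative movement, pyautogui.click(button="right") and pyautogui.doubleClick() for clicks, and pyautogui.scroll(amount) for scrolling.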

Keyboard Input

Fast text typing with configurable WPM, hotkeys, special keys, and hold/release control.
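
One plausible way to turn a words-per-minute setting into keystroke timing is sketched below. It assumes the common convention of five characters per word; how the skill itself interprets WPM is not stated on this page. The helper names wpm_to_interval and type_text are hypothetical, while pyautogui.write and its interval argument are part of PyAutoGUI.

```python
def wpm_to_interval(wpm, chars_per_word=5):
    """Convert a words-per-minute rate to a per-character delay in seconds."""
    if wpm <= 0:
        raise ValueError("wpm must be positive")
    return 60.0 / (wpm * chars_per_word)


def type_text(text, wpm=300):
    """Type `text` at roughly `wpm` words per minute."""
    import pyautogui  # imported lazily; needs a graphical session
    pyautogui.write(text, interval=wpm_to_interval(wpm))
```

Under that convention, 60 WPM works out to 0.2 seconds between characters. Hotkeys and hold/release are separate PyAutoGUI calls: pyautogui.hotkey("ctrl", "c"), pyautogui.keyDown("shift"), and pyautogui.keyUp("shift").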

Screen Operations

Full-screen and region screenshots, pixel color detection, image recognition, and multi-monitor support.
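
The sketch below shows how these screen operations might look when written against PyAutoGUI directly. The helper names are hypothetical and the default tolerance and confidence values are arbitrary choices for illustration. Passing confidence to locateOnScreen requires OpenCV to be installed.

```python
def color_matches(actual, expected, tolerance=0):
    """Return True if each RGB channel of `actual` is within `tolerance` of `expected`."""
    return all(abs(a - e) <= tolerance for a, e in zip(actual[:3], expected[:3]))


def capture_region(left, top, width, height):
    """Screenshot a rectangular region and return it as a Pillow image."""
    import pyautogui  # imported lazily; needs a graphical session
    return pyautogui.screenshot(region=(left, top, width, height))


def find_on_screen(image_path, confidence=0.9):
    """Return the matched box for `image_path`, or None if it is not visible."""
    import pyautogui
    try:
        return pyautogui.locateOnScreen(image_path, confidence=confidence)
    except pyautogui.ImageNotFoundException:
        # Newer PyAutoGUI releases raise instead of returning None.
        return None
```

A pixel check then reads as color_matches(pyautogui.pixel(x, y), (255, 0, 0), tolerance=10). PyAutoGUI also offers pyautogui.pixelMatchesColor, which combines the two steps.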

Window Management

List open windows, activate a window by title, read window info, and minimize or maximize windows.
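
A minimal sketch of listing and activating windows with PyGetWindow follows. It is written with PyGetWindow's Windows implementation in mind, and behavior on other platforms may differ. The helper names titles_containing and activate_first are hypothetical; getAllTitles, getWindowsWithTitle, and the window activate method come from PyGetWindow.

```python
def titles_containing(titles, fragment):
    """Case-insensitive filter over window titles, skipping empty ones."""
    needle = fragment.lower()
    return [title for title in titles if title and needle in title.lower()]


def list_titles(fragment=""):
    """Return the titles of open windows, optionally filtered by a fragment."""
    import pygetwindow as gw  # imported lazily; needs a graphical session
    return titles_containing(gw.getAllTitles(), fragment)


def activate_first(fragment):
    """Bring the first window whose title contains `fragment` to the front."""
    import pygetwindow as gw
    matches = gw.getWindowsWithTitle(fragment)
    if not matches:
        return False
    matches[0].activate()
    return True
```

Window objects returned by getWindowsWithTitle also expose minimize(), maximize(), and geometry attributes such as left, top, width, and height.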

Clipboard Operations

Copy text to the system clipboard and read text back from it.

Safety Features

Failsafe corner abort, emergency pause, bounds checking, and action logging.
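
The failsafe and bounds-checking ideas can be sketched with PyAutoGUI's own settings. The helper names in_bounds and safe_move are hypothetical, and the pause value is an arbitrary choice; pyautogui.FAILSAFE, pyautogui.PAUSE, pyautogui.size, and pyautogui.moveTo are part of PyAutoGUI.

```python
def in_bounds(x, y, width, height):
    """Return True if (x, y) lies on a screen of the given size."""
    return 0 <= x < width and 0 <= y < height


def safe_move(x, y, duration=0.2):
    """Move the pointer only if the target is on the primary screen."""
    import pyautogui  # imported lazily; needs a graphical session
    # With FAILSAFE on, parking the pointer in the failsafe corner makes the
    # next PyAutoGUI call raise pyautogui.FailSafeException, aborting the run.
    pyautogui.FAILSAFE = True
    pyautogui.PAUSE = 0.1  # short delay after every PyAutoGUI call
    width, height = pyautogui.size()
    if not in_bounds(x, y, width, height):
        raise ValueError(f"target ({x}, {y}) is outside the {width}x{height} screen")
    pyautogui.moveTo(x, y, duration=duration)
```

Rejecting out-of-range targets before calling moveTo keeps a bad coordinate from silently sending the pointer to a screen edge.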

Requirements

  • No VM required.
  • Python packages: PyAutoGUI, Pillow, and PyGetWindow are required. OpenCV (opencv-python) is optional for image recognition.
  • Works on macOS, Linux, and Windows; some features may require elevated privileges.
  • A graphical desktop environment is required (not compatible with headless servers).
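
The requirements above can be checked before running anything. The sketch below is an illustration, not part of the skill: the helper names are hypothetical, and the display check is a best-effort heuristic that only inspects the DISPLAY environment variable on Linux.

```python
import importlib.util
import os
import sys

# Import names for the listed packages (Pillow imports as PIL,
# opencv-python imports as cv2).
REQUIRED = {"PyAutoGUI": "pyautogui", "Pillow": "PIL", "PyGetWindow": "pygetwindow"}
OPTIONAL = {"OpenCV": "cv2"}


def missing_packages(packages):
    """Return the display names of packages whose import name cannot be found."""
    return [name for name, module in packages.items()
            if importlib.util.find_spec(module) is None]


def has_display():
    """Best-effort check for a graphical session.

    On Linux this looks for the DISPLAY environment variable; other
    platforms are assumed to have a desktop.
    """
    if sys.platform.startswith("linux"):
        return bool(os.environ.get("DISPLAY"))
    return True


if __name__ == "__main__":
    print("Missing required:", missing_packages(REQUIRED))
    print("Missing optional:", missing_packages(OPTIONAL))
    print("Graphical session detected:", has_display())
```

An empty "Missing required" list and a detected graphical session suggest the environment is ready. A missing OpenCV entry only affects image recognition.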

Use Cases

Form Filling

Automatically fill out web forms or desktop application fields with typed input.

Application Automation

Control any desktop application through mouse and keyboard simulation.

Screen Monitoring

Capture screenshots, detect specific UI elements, and respond to visual changes.

File Management

Drag and drop files between folders and multi-select with Ctrl+click.
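
Multi-select with Ctrl+click amounts to holding a modifier key while clicking several points. The sketch below uses PyAutoGUI's keyDown, click, and keyUp directly; the helper name ctrl_click_all and its gui parameter are assumptions made for the example, and the coordinates would have to come from the caller.

```python
def ctrl_click_all(points, gui=None):
    """Click each (x, y) in `points` while holding Ctrl.

    `gui` defaults to the pyautogui module; it is a parameter only so the
    helper can be exercised with a stand-in object.
    """
    if gui is None:
        import pyautogui as gui  # imported lazily; needs a graphical session
    gui.keyDown("ctrl")
    try:
        for x, y in points:
            gui.click(x, y)
    finally:
        # Always release the key, even if a click raises, so Ctrl is not
        # left stuck down for whatever the user does next.
        gui.keyUp("ctrl")
```

On macOS the equivalent modifier for multi-select is usually "command" rather than "ctrl".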

Testing & QA

Automate manual GUI testing workflows across desktop applications.

How to Install

npx clawhub@latest install desktop-control

Run this command in your terminal, or click the Install button at the top of this page for one-click setup.

FAQ

Which Python packages are required?

PyAutoGUI, Pillow, and PyGetWindow are required. OpenCV (opencv-python) is optional for image recognition.

Does Desktop Control work on macOS, Linux, and Windows?

Yes. It supports macOS, Linux, and Windows, though some actions may need elevated privileges.

Can I run Desktop Control on a headless server?

No. A graphical desktop environment is required.

Is there a safety failsafe?

Yes. It includes a failsafe corner, pause controls, optional approval mode, and logging.

How do I enable image recognition?

Install OpenCV (opencv-python). It enables image matching in screen operations.