screen-vision
Community
macOS Screen OCR & Automation
Author: sky770825
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides a zero-token-cost way to read and interact with your macOS screen, enabling automation of applications that have no API.
Core Features & Use Cases
- Screen Text Recognition (OCR): Extract text from any part of your screen without sending data externally.
- Element Location: Precisely identify the coordinates of text elements on your screen.
- Automated Actions: Perform clicks and other interactions at specific screen coordinates.
- Use Case: Automate repetitive tasks in applications like Slack or a custom internal tool by having the AI "see" the screen, locate buttons or information, and click or type as needed.
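The locate-then-click flow described above depends on turning an OCR bounding box into a click position. The following is a minimal sketch of that step, not the skill's actual code. It assumes the OCR stage reports boxes in normalized coordinates (0 to 1) with the origin at the bottom-left, as Apple's Vision framework does, and that clicks are posted in top-left-origin screen points. The names `TextHit`, `ScreenPoint`, `locate`, and `clickPoint` are hypothetical and introduced only for illustration.

```swift
import Foundation

// Hypothetical types for illustration; the skill's real scripts may differ.

/// One OCR result: the recognized string plus its bounding box in
/// normalized coordinates (0...1, origin at the bottom-left).
struct TextHit {
    let text: String
    let x: Double       // left edge, normalized
    let y: Double       // bottom edge, normalized
    let width: Double   // normalized width
    let height: Double  // normalized height
}

/// A point in screen coordinates (origin at the top-left, y grows downward).
struct ScreenPoint: Equatable {
    let x: Double
    let y: Double
}

/// Finds the first hit whose text contains `query`, ignoring case.
func locate(_ query: String, in hits: [TextHit]) -> TextHit? {
    let needle = query.lowercased()
    return hits.first { $0.text.lowercased().contains(needle) }
}

/// Returns the center of a hit as a top-left-origin screen point.
func clickPoint(for hit: TextHit, screenWidth: Double, screenHeight: Double) -> ScreenPoint {
    let centerX = (hit.x + hit.width / 2) * screenWidth
    // The box is measured from the bottom edge, so flip the y axis.
    let centerYFromBottom = (hit.y + hit.height / 2) * screenHeight
    return ScreenPoint(x: centerX, y: screenHeight - centerYFromBottom)
}
```

For example, on a 1600 by 1000 point screen, a "Submit" box at normalized position (0.25, 0.25) with size 0.5 by 0.25 maps to the click point (800, 625). Clicking the center of the box rather than a corner leaves some tolerance for small OCR errors in the box edges.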
Quick Start
Use the screen-vision skill to find the text "Submit" on the screen and then click on it.
Dependency Matrix
Required Modules
swift
Components
scripts, references
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: screen-vision
Download link: https://github.com/sky770825/openclaw-console-hub/archive/main.zip#screen-vision
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.