gemini-computer-use
CommunityAutomates browser tasks with Gemini Computer Use.
Software Engineering#AI automation#browser automation#Playwright#web navigation#Gemini Computer Use#agent loop#safety confirmation
Authoram-will
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill enables building and running Gemini 2.5 Computer Use browser-control agents with Playwright to automate repetitive web browsing tasks. It supports an agent loop where the model suggests a function_call and the agent executes actions, then returns a function_response to continue the workflow, including optional safety confirmations for risky UI actions.
Core Features & Use Cases
- Gemini Computer Use agents controlled by Playwright for browser automation.
- Deterministic action execution with a loop (screenshot → function_call → action → function_response).
- Safety confirmation prompts for high-risk UI actions to prevent unwanted changes.
- Real-time feedback via screenshots and URL context after each action.
Quick Start
Tell the agent to visit example.com, take a screenshot, and return the page title.
Dependency Matrix
Required Modules
playwrightgoogle-generativeai
Components
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: gemini-computer-use Download link: https://github.com/am-will/codex-skills/archive/main.zip#gemini-computer-use Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.