gemini-computer-use

Community

Automates browser tasks with Gemini Computer Use.

Author: am-will
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill builds and runs Gemini 2.5 Computer Use browser-control agents with Playwright to automate repetitive web browsing tasks. It implements an agent loop: the model suggests a function_call, the agent executes the corresponding action, and the agent returns a function_response so the model can continue the workflow. Risky UI actions can optionally require a safety confirmation before they run.

Core Features & Use Cases

  • Gemini Computer Use agents controlled by Playwright for browser automation.
  • Deterministic action execution with a loop (screenshot → function_call → action → function_response).
  • Safety confirmation prompts for high-risk UI actions to prevent unwanted changes.
  • Real-time feedback via screenshots and URL context after each action.
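The loop described above (screenshot → function_call → action → function_response, with an optional confirmation step) can be sketched roughly as follows. This is a minimal illustration, not the Skill's actual code: `FunctionCall`, `FunctionResponse`, `Browser`, `run_agent_loop`, and the `requires_confirmation` flag are hypothetical names chosen for the example, and the model is represented by a plain callable rather than a real Gemini client.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Optional, Protocol


@dataclass
class FunctionCall:
    """Hypothetical stand-in for a UI action suggested by the model."""
    name: str
    args: dict[str, Any] = field(default_factory=dict)
    requires_confirmation: bool = False  # assumed flag marking risky actions


@dataclass
class FunctionResponse:
    """Hypothetical result returned to the model after each action."""
    name: str
    url: str
    screenshot: bytes
    error: Optional[str] = None


class Browser(Protocol):
    """Assumed minimal browser interface (e.g. a thin Playwright wrapper)."""
    def screenshot(self) -> bytes: ...
    def current_url(self) -> str: ...
    def execute(self, call: FunctionCall) -> None: ...


def run_agent_loop(
    next_call: Callable[[FunctionResponse], Optional[FunctionCall]],
    browser: Browser,
    confirm: Callable[[FunctionCall], bool],
    max_turns: int = 10,
) -> list[FunctionResponse]:
    """Run the loop until the model returns None or max_turns is reached."""

    def observe(name: str, error: Optional[str] = None) -> FunctionResponse:
        # Every response carries fresh URL and screenshot context.
        return FunctionResponse(name, browser.current_url(), browser.screenshot(), error)

    history: list[FunctionResponse] = []
    observation = observe("initial")
    for _ in range(max_turns):
        call = next_call(observation)  # the model suggests the next action
        if call is None:               # the model signals the task is done
            break
        if call.requires_confirmation and not confirm(call):
            # Skip the action and tell the model it was declined.
            observation = observe(call.name, error="declined by user")
        else:
            browser.execute(call)
            observation = observe(call.name)
        history.append(observation)
    return history
```

In a real agent, `next_call` would wrap the Gemini request and `confirm` would prompt the user; keeping both as injected callables makes the loop easy to exercise with fakes.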

Quick Start

Tell the agent to visit example.com, take a screenshot, and return the page title.
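For reference, the Quick Start task corresponds roughly to the following plain Playwright steps. This is only a sketch of the browser side, assuming Playwright's synchronous Python API; the helper names `visit_and_capture` and `run_with_playwright` and the `example.png` path are made up for illustration, and the Skill itself would reach the same result through model-suggested actions rather than hard-coded calls.

```python
from typing import Any


def visit_and_capture(page: Any, url: str, screenshot_path: str) -> str:
    """Open url on a Playwright-style page, save a screenshot, return the title."""
    page.goto(url)
    page.screenshot(path=screenshot_path)
    return page.title()


def run_with_playwright(
    url: str = "https://example.com",
    screenshot_path: str = "example.png",  # illustrative output path
) -> str:
    """Launch headless Chromium via Playwright and run the quick-start task."""
    # Imported lazily so the helper above stays usable without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        try:
            page = browser.new_page()
            return visit_and_capture(page, url, screenshot_path)
        finally:
            browser.close()
```

Calling `run_with_playwright()` requires the `playwright` package and an installed Chromium build (typically via `playwright install chromium`).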

Dependency Matrix

Required Modules

playwright, google-generativeai

Components

scripts, references

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: gemini-computer-use
Download link: https://github.com/am-will/codex-skills/archive/main.zip#gemini-computer-use

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.