agentic-vision

Community

Analyze images and videos with Gemini.

Author: taiyousan15
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill automates the analysis of visual content such as images and videos, producing structured observations that would otherwise take significant manual effort and expertise.

Core Features & Use Cases

  • Advanced Visual Analysis: Uses Gemini 3 Flash's Agentic Vision to analyze composition, color, text, elements, and quality.
  • Market & Content Evaluation: Ideal for market trend analysis, competitor research, and evaluating the quality of visual content for marketing or creative purposes.
  • Use Case: Analyze 100 Kindle book covers to identify common design patterns, dominant color palettes, and typography trends that may correlate with higher sales.
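
The palette part of the use case above can be sketched without calling any model. The following is a minimal illustration, not the Skill's own implementation: it assumes each cover has already been decoded into a list of RGB tuples (for example with Pillow), and the names `quantize`, `dominant_colors`, and `palette_trends` are hypothetical.

```python
from collections import Counter
from typing import Iterable

Pixel = tuple[int, int, int]


def quantize(pixel: Pixel, step: int = 64) -> Pixel:
    """Snap each RGB channel to the centre of its bucket so near-identical shades merge."""
    r, g, b = (channel // step * step + step // 2 for channel in pixel)
    return (r, g, b)


def dominant_colors(pixels: Iterable[Pixel], top_n: int = 3, step: int = 64) -> list[Pixel]:
    """Return the top_n most frequent quantized colours of one cover."""
    counts = Counter(quantize(p, step) for p in pixels)
    return [color for color, _ in counts.most_common(top_n)]


def palette_trends(covers: dict[str, list[Pixel]], top_n: int = 3, step: int = 64) -> Counter:
    """Count how many covers have each quantized colour among their top_n colours."""
    trend: Counter = Counter()
    for pixels in covers.values():
        # set() so a colour is counted at most once per cover
        trend.update(set(dominant_colors(pixels, top_n, step)))
    return trend
```

Calling `palette_trends(covers).most_common(5)` on a dictionary of 100 covers would list the five colour buckets that appear most often among the covers' dominant colours. Whether those colours relate to sales is a separate question that needs sales data.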

Quick Start

Analyze the image at path/to/image.png using the comprehensive analysis type.
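
For readers curious what such a request might look like in code, here is a rough sketch using the `google-genai` package from the dependency list. It is an assumption-laden illustration, not the Skill's actual script: `build_prompt`, `analyze_image`, the prompt wording, and the `MODEL_ID` value are all hypothetical, and the aspects are simply those named in the feature list. It sends a plain multimodal request and does not configure any Agentic Vision specific options.

```python
import mimetypes
from pathlib import Path

# Assumption: replace with whichever Gemini model id your account exposes.
MODEL_ID = "gemini-3-flash-preview"

# Aspects taken from the feature list above; the prompt wording is illustrative.
ASPECTS = ("composition", "color", "text", "elements", "quality")


def build_prompt(analysis_type: str = "comprehensive") -> str:
    """Build an instruction covering all aspects, or a single named aspect."""
    if analysis_type == "comprehensive":
        chosen = ASPECTS
    elif analysis_type in ASPECTS:
        chosen = (analysis_type,)
    else:
        raise ValueError(f"unknown analysis type: {analysis_type!r}")
    return "Analyze this image. Report on: " + ", ".join(chosen) + "."


def analyze_image(path: str, analysis_type: str = "comprehensive", model: str = MODEL_ID) -> str:
    """Send the image and prompt to Gemini and return the text of the reply."""
    # Imported lazily so the prompt helper works without the SDK installed.
    from google import genai
    from google.genai import types

    image_path = Path(path)
    mime_type = mimetypes.guess_type(image_path.name)[0] or "image/png"
    client = genai.Client()  # reads the API key from the environment
    response = client.models.generate_content(
        model=model,
        contents=[
            types.Part.from_bytes(data=image_path.read_bytes(), mime_type=mime_type),
            build_prompt(analysis_type),
        ],
    )
    return response.text
```

With an API key set in the environment, `print(analyze_image("path/to/image.png"))` would run the comprehensive variant.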

Dependency Matrix

Required Modules

  • google-genai
  • pillow
  • opencv-python
  • numpy
  • pandas
  • matplotlib
  • apify-client
  • requests

Components

scripts, references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: agentic-vision
Download link: https://github.com/taiyousan15/taisun_agent/archive/main.zip#agentic-vision

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
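
If you prefer to install by hand, the same steps can be sketched with the Python standard library. This is a hedged example, not an official installer: `extract_skill` and `install_from_url` are hypothetical helpers, and the code assumes the archive contains exactly one folder named `agentic-vision` somewhere in its tree (the listing does not say where the Skill lives inside the repository).

```python
import io
import urllib.request
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_bytes: bytes, skill_name: str, skills_dir: Path) -> Path:
    """Copy every file found under a folder called skill_name into skills_dir/skill_name."""
    target = skills_dir / skill_name
    found = False
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            parts = PurePosixPath(info.filename).parts
            if info.is_dir() or skill_name not in parts:
                continue
            # Keep only the path below the skill folder.
            relative = parts[parts.index(skill_name) + 1:]
            if not relative or ".." in relative:
                continue  # skip odd entries and path-traversal attempts
            out_path = target.joinpath(*relative)
            out_path.parent.mkdir(parents=True, exist_ok=True)
            out_path.write_bytes(archive.read(info))
            found = True
    if not found:
        raise FileNotFoundError(f"no folder named {skill_name!r} in the archive")
    return target


def install_from_url(url: str, skill_name: str, skills_dir: Path = Path(".claude/skills")) -> Path:
    """Download the archive and extract the skill folder (performs network access)."""
    with urllib.request.urlopen(url) as response:
        return extract_skill(response.read(), skill_name, skills_dir)
```

A call such as `install_from_url("https://github.com/taiyousan15/taisun_agent/archive/main.zip", "agentic-vision")` would place the files under `.claude/skills/agentic-vision/` relative to the current directory.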