multimodal-analyst

Official

Unified analysis of text, images, and video.

AuthorTECHKNOWMAD-LABS
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill tackles the challenge of analyzing diverse content types simultaneously, providing a cohesive understanding from text, images, and video inputs.

Core Features & Use Cases

  • Cross-Modal Synthesis: Integrates insights from text, image URLs, and video URLs into a single analysis.
  • Modality-Specific Analysis: Performs detailed analysis tailored to each content type.
  • Hallucination Detection: Flags potential inaccuracies or unsupported claims across modalities.
  • Use Case: Analyze a news article that includes text, an accompanying image, and an embedded video to understand the overall narrative and identify potential discrepancies or correlations between the media.

Quick Start

Analyze the provided input data containing text, image URLs, and video URLs using the multimodal-analyst skill.

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: multimodal-analyst
Download link: https://github.com/TECHKNOWMAD-LABS/cortex-research-suite/archive/main.zip#multimodal-analyst

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.