Name: gemini-audio
Availability: InStock
Author: alex-tgk

System Documentation

What problem does it solve?

Gemini Audio provides transcription, analysis, and summarization of audio, plus text-to-speech generation. It streamlines workflows for podcasts, interviews, meetings, and multimedia content by turning audio into searchable text and actionable insights.

Core Features & Use Cases

Transcription with timestamps and multi-speaker support
Audio summarization and key-point extraction
Non-speech audio analysis (music, ambient sounds)
Text-to-speech (TTS) generation with controllable voice styles
File management via a Files API workflow for reuse across tasks

Quick Start

Configure GEMINI_API_KEY, then run transcribe.py or generate-speech.py to process audio files or synthesize speech.

Please help me install this Skill: Name: gemini-audio Download link: https://github.com/alex-tgk/saasaas/archive/main.zip#gemini-audio Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

gemini-audio

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper