whisperx

Community

AI-powered speech-to-text

AuthorThePlasmak
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the process of converting spoken audio into written text, providing highly accurate transcriptions with advanced features like word-level timestamps and speaker identification.

Core Features & Use Cases

  • Accurate Transcription: Leverages WhisperX for high-quality speech-to-text conversion.
  • Word-Level Timestamps: Provides precise timing for each word, enabling features like karaoke-style subtitles.
  • Speaker Diarization: Identifies and labels different speakers within the audio.
  • Subtitle Generation: Creates SRT and VTT files for easy integration with video content.
  • Use Case: Transcribe a lengthy meeting recording, identify who said what, and generate SRT subtitles for a video summary.

Quick Start

Use the whisperx skill to transcribe the audio file 'meeting_recording.mp3' and identify the speakers.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: whisperx
Download link: https://github.com/ThePlasmak/whisperx/archive/main.zip#whisperx

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.