whisperx
CommunityAI-powered speech-to-text
AuthorThePlasmak
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill automates the process of converting spoken audio into written text, providing highly accurate transcriptions with advanced features like word-level timestamps and speaker identification.
Core Features & Use Cases
- Accurate Transcription: Leverages WhisperX for high-quality speech-to-text conversion.
- Word-Level Timestamps: Provides precise timing for each word, enabling features like karaoke-style subtitles.
- Speaker Diarization: Identifies and labels different speakers within the audio.
- Subtitle Generation: Creates SRT and VTT files for easy integration with video content.
- Use Case: Transcribe a lengthy meeting recording, identify who said what, and generate SRT subtitles for a video summary.
Quick Start
Use the whisperx skill to transcribe the audio file 'meeting_recording.mp3' and identify the speakers.
Dependency Matrix
Required Modules
None requiredComponents
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: whisperx Download link: https://github.com/ThePlasmak/whisperx/archive/main.zip#whisperx Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.