whisper-transcribe

Official

Transcribe media with context-grounded accuracy.

AuthorSpillwaveSolutions
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the transcription of audio and video files into text, using OpenAI's Whisper, and enhances accuracy with context grounding from per-directory markdown context files. It saves time, reduces manual proofreading, and supports a wide range of media formats.

Core Features & Use Cases

  • Audio/Video Transcription: Convert media files to text using OpenAI Whisper.
  • Context Grounding: Reads markdown context files in the same directory to improve terminology, names, and jargon accuracy.
  • Multi-format Support: Works with mp3, wav, m4a, mp4, webm, and more.
  • Cross-Platform: Install and run on macOS (Homebrew) or Linux.

Quick Start

Transcribe a file like '/path/to/video.mp4' and optionally ground terms from context:

  • Run basic transcription: whisper "/path/to/video.mp4" --output_dir "/path/to/output"
  • Or use the grounding script: python scripts/transcribe_with_context.py "/path/to/audio.mp3" --model base --language en

Dependency Matrix

Required Modules

None required

Components

scriptsreferencesassets

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: whisper-transcribe
Download link: https://github.com/SpillwaveSolutions/whisper-transcribe/archive/main.zip#whisper-transcribe

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.