LLM Transcription Skill

Community

Transcribe images and audio to markdown.

Authorkarstenheld3
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the conversion of visual and audio information into structured text, making it easily searchable, editable, and usable in downstream applications.

Core Features & Use Cases

  • Image to Markdown: Converts screenshots, documents, and diagrams into formatted Markdown text, preserving structure and extracting data from graphics.
  • Audio to Markdown: Transcribes audio files (meetings, lectures, interviews) into readable Markdown text, including speaker identification and formatting.
  • Batch Processing: Handles multiple files efficiently for large-scale transcription needs.
  • Use Case: Transcribe a scanned PDF document into editable Markdown for easy content reuse, or convert a recorded meeting into a searchable transcript with clear section breaks.

Quick Start

Use the LLM Transcription Skill to transcribe the audio file 'meeting_recording.mp3' into a markdown file named 'meeting_transcript.md'.

Dependency Matrix

Required Modules

openaianthropichttpx

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: LLM Transcription Skill
Download link: https://github.com/karstenheld3/OpenAI-BackendTools/archive/main.zip#llm-transcription-skill

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.