baidu-speech-to-text

Community

Convert voice to text, optimized for China.

Authorcastle-x
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill converts voice messages (ogg/opus) into text, specifically optimized for domestic Chinese servers and proxy environments, overcoming common connectivity issues with Baidu's API.

Core Features & Use Cases

  • Voice-to-Text Conversion: Accurately transcribes audio files into text.
  • Multi-Language Support: Supports Mandarin, English, Cantonese, and Sichuan dialect.
  • Proxy Bypass: Automatically bypasses proxy settings to ensure direct access to Baidu's API from within China.
  • Use Case: Automatically transcribe voice messages received on platforms like Discord or WhatsApp when your server is located in China and uses a proxy.

Quick Start

Convert the audio file located at '/path/to/your/voice.ogg' to text using the default Mandarin language.

Dependency Matrix

Required Modules

ffmpeg

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: baidu-speech-to-text
Download link: https://github.com/castle-x/skills-x/archive/main.zip#baidu-speech-to-text

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.