baidu-speech-to-text
CommunityConvert voice to text, optimized for China.
Authorcastle-x
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill converts voice messages (ogg/opus) into text, specifically optimized for domestic Chinese servers and proxy environments, overcoming common connectivity issues with Baidu's API.
Core Features & Use Cases
- Voice-to-Text Conversion: Accurately transcribes audio files into text.
- Multi-Language Support: Supports Mandarin, English, Cantonese, and Sichuan dialect.
- Proxy Bypass: Automatically bypasses proxy settings to ensure direct access to Baidu's API from within China.
- Use Case: Automatically transcribe voice messages received on platforms like Discord or WhatsApp when your server is located in China and uses a proxy.
Quick Start
Convert the audio file located at '/path/to/your/voice.ogg' to text using the default Mandarin language.
Dependency Matrix
Required Modules
ffmpeg
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: baidu-speech-to-text Download link: https://github.com/castle-x/skills-x/archive/main.zip#baidu-speech-to-text Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.