add-voice-transcription

Name: add-voice-transcription
Author: 0-CYBERDYNE-SYSTEMS-0

Community

Turn voice notes into transcripts for FFT_nano

Version: 1.0.0

Installs: 0

System Documentation

FFT_nano handles WhatsApp voice notes by converting them into readable text, enabling the agent to understand and respond to audio content in chats.

Automatic transcription of voice messages to text using OpenAI's Whisper API.
Contextual storage of transcripts so the agent can reference spoken content during conversations.
Support for registered WhatsApp groups, allowing the agent to act on voice notes consistently across chats.
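
The transcription and context-storage behavior listed above could look roughly like the following TypeScript sketch. It is not FFT_nano's actual code: it calls OpenAI's audio transcription REST endpoint directly with the built-in fetch (Node 18 or later) instead of going through the openai package, and the function names, the in-memory Map store, and the "[Voice note from ...]" label format are all illustrative assumptions.

```typescript
// Hypothetical sketch: names and storage layout are assumptions, not FFT_nano internals.

const TRANSCRIPTION_URL = "https://api.openai.com/v1/audio/transcriptions";

// Build the multipart form the transcription endpoint expects:
// a "file" part with the audio and a "model" part naming the Whisper model.
function buildTranscriptionForm(
  audio: Blob,
  filename: string,
  model: string = "whisper-1",
): FormData {
  const form = new FormData();
  form.append("file", audio, filename);
  form.append("model", model);
  return form;
}

// Send one voice note to the API and return the transcript text.
async function transcribeVoiceNote(
  audio: Blob,
  filename: string,
  apiKey: string,
): Promise<string> {
  const response = await fetch(TRANSCRIPTION_URL, {
    method: "POST",
    headers: { Authorization: `Bearer ${apiKey}` },
    body: buildTranscriptionForm(audio, filename),
  });
  if (!response.ok) {
    throw new Error(`Transcription failed with HTTP ${response.status}`);
  }
  // The default JSON response carries the transcript in a "text" field.
  const data = (await response.json()) as { text: string };
  return data.text;
}

// Per-chat transcript store, kept in memory purely for illustration.
const transcriptsByChat = new Map<string, string[]>();

// Record a transcript for a chat and return the labeled entry that was stored.
function storeTranscript(chatId: string, sender: string, text: string): string {
  const entry = `[Voice note from ${sender}] ${text.trim()}`;
  const entries = transcriptsByChat.get(chatId) ?? [];
  entries.push(entry);
  transcriptsByChat.set(chatId, entries);
  return entry;
}

// Return every transcript stored for a chat, oldest first.
function getTranscripts(chatId: string): string[] {
  return transcriptsByChat.get(chatId) ?? [];
}
```

In this sketch a handler for an incoming voice note would call transcribeVoiceNote, pass the result to storeTranscript, and let the agent read getTranscripts for that chat when composing a reply. A real deployment would likely persist transcripts alongside the chat history rather than in process memory.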

Use this skill to enable voice transcription in FFT_nano. Steps:

Obtain an OpenAI API key that can be used for Whisper transcription; it goes into the transcription config created in a later step.
Add the OpenAI npm package to your project dependencies by running npm install openai.
Create a .transcription.config.json file in the project root that sets provider to "openai", model to "whisper-1", enabled to true, and apiKey to your OpenAI API key.
Build and start FFT_nano as usual, then send a voice note in a registered group and confirm that its transcript appears as text context for the agent.
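
As a rough illustration of the config step, the TypeScript sketch below reads .transcription.config.json and checks the four fields named in the steps. The interface and function names are hypothetical, and FFT_nano may load or validate its config differently; the API key value shown in the comment is a placeholder.

```typescript
import { readFileSync } from "node:fs";

// Expected file contents, using the four fields named in the steps
// (the apiKey value here is a placeholder):
// {
//   "provider": "openai",
//   "model": "whisper-1",
//   "enabled": true,
//   "apiKey": "YOUR_OPENAI_API_KEY"
// }

// Hypothetical type for the config; the field names come from the steps.
interface TranscriptionConfig {
  provider: string;
  model: string;
  enabled: boolean;
  apiKey: string;
}

// Validate raw JSON text and return a typed config, or throw a readable error.
function parseTranscriptionConfig(raw: string): TranscriptionConfig {
  const parsed: unknown = JSON.parse(raw);
  if (typeof parsed !== "object" || parsed === null) {
    throw new Error("config must be a JSON object");
  }
  const data = parsed as Partial<TranscriptionConfig>;
  if (data.provider !== "openai") {
    throw new Error('provider must be "openai"');
  }
  if (typeof data.model !== "string" || data.model.length === 0) {
    throw new Error("model must be a non-empty string");
  }
  if (typeof data.enabled !== "boolean") {
    throw new Error("enabled must be true or false");
  }
  if (typeof data.apiKey !== "string" || data.apiKey.length === 0) {
    throw new Error("apiKey must be a non-empty string");
  }
  return {
    provider: data.provider,
    model: data.model,
    enabled: data.enabled,
    apiKey: data.apiKey,
  };
}

// Read the config from the project root (default path taken from the steps).
function loadTranscriptionConfig(
  path: string = ".transcription.config.json",
): TranscriptionConfig {
  return parseTranscriptionConfig(readFileSync(path, "utf8"));
}
```

Because the file holds a secret, it is usually worth keeping .transcription.config.json out of version control, for example by listing it in .gitignore.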