transcribe

Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.

$ 설치

git clone https://github.com/badlogic/pi-skills /tmp/pi-skills && cp -r /tmp/pi-skills/transcribe ~/.claude/skills/pi-skills

// tip: Run this command in your terminal to install the skill


name: transcribe description: Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.

Transcribe

Speech-to-text using Groq Whisper API.

Setup

The script needs GROQ_API_KEY environment variable. Check if already set:

echo $GROQ_API_KEY

If not set, guide the user through setup:

  1. Ask if they have a Groq API key
  2. If not, have them sign up at https://console.groq.com/ and create an API key
  3. Have them add to their shell profile (~/.zshrc or ~/.bashrc):
    export GROQ_API_KEY="<their-api-key>"
    
  4. Then run source ~/.zshrc (or restart terminal)

Usage

{baseDir}/transcribe.sh <audio-file>

Supported Formats

  • m4a, mp3, wav, ogg, flac, webm
  • Max file size: 25MB

Output

Returns plain text transcription with punctuation and proper capitalization to stdout.