Audio Processing
357 skills in Content & Media > Audio Processing
tutopanda-documentary-producer
Coordinate Tutopanda CLI and MCP to generate, edit, inspect, and preview documentary videos (video-audio-music blueprint with FinalVideo export). Use when users ask for Tutopanda to build, edit, or review movies inside Claude Code.
course-dialog-builder
Validates narration text, generates audio via ElevenLabs, and synchronizes timestamps.
brand-voice
Brand consistency evaluation and voice alignment guidelines.
lightning-address
Use when implementing Lightning address functionality. Provides complete patterns for resolving Lightning addresses to invoices, generating invoices from addresses, displaying Lightning addresses in UI, and integrating with QR codes.
newsletter
Create The Jungle Brief newsletter issues. Use when assembling newsletter content from ideas and posts, creating issue sections, or reviewing newsletter drafts. Includes newsletter-specific voice (curated, direct, insider feel), format guidelines, and issue templates.
livekit-prompt-builder
Guide for creating effective prompts and instructions for LiveKit voice agents. Use when building conversational AI agents with the LiveKit Agents framework, including (1) Creating new voice agent prompts from scratch, (2) Improving existing agent instructions, (3) Optimizing prompts for text-to-speech output, (4) Integrating tool/function calling capabilities, (5) Building multi-agent systems with handoffs, (6) Ensuring voice-friendly formatting and brevity for natural conversations, (7) Iteratively improving prompts based on testing and feedback, (8) Building industry-specific agents (debt collection, healthcare, banking, customer service, front desk).
vapi-skill
Guidance for mapping Vapi voice events to dashboard templates.
notebooklm-superskill
Generate slide decks, audio podcasts, infographics, and video overviews from NotebookLM notebooks. Customizable by audience, format, language (80+), orientation, and visual themes. Use when asked to generate slides, create a podcast, make an infographic or video overview, or automate NotebookLM content creation.
brand-voice
Apply Matt Palmer's voice, tone, and content pillars to any writing. Use for blog posts, social media, documentation, emails, or any content needing Matt's authentic brand voice.
philosophy-of-language
Master philosophy of language - meaning, reference, truth, speech acts. Use for: semantics, pragmatics, meaning theory, reference. Triggers: 'meaning', 'reference', 'Frege', 'sense', 'Kripke', 'speech act', 'semantics', 'pragmatics', 'truth conditions', 'propositions', 'names', 'descriptions', 'rigid designator', 'natural kind', 'context', 'indexical'.
tacosdedatos-editor
Use this skill to perform comprehensive editorial reviews of tacosdedatos content. Provides developmental editing (structure, flow, argument strength, pacing) and voice authenticity editing (ensuring content sounds like tacosdedatos, not generic AI). Use when reviewing drafts, giving feedback on submissions, or evaluating content before publication. Distinct from the writer skill (creates content) and copy-editor agent (grammar/mechanics polish).
tts-livekit-plugin
Build and deploy a self-hosted Text-to-Speech API using MeloTTS from Hugging Face and create a LiveKit plugin for voice agents. Use this skill when building TTS systems, LiveKit voice agents, or self-hosted speech synthesis solutions.
rhythm-pacing
Use when animation needs musical flow—dance sequences, action choreography, comedic timing, scene pacing, or any motion that should feel rhythmic and well-composed over time.
pure-data
Pure Data reference documentation for audio patch development. Use when writing .pd patches, working with OSC/audio/sampling, or implementing Pure Data code.
harness-runner
Run WaveCap-SDR test harness with automated parameter sweeps and validation. Use when regression testing, validating audio quality across configurations, testing SDR hardware, or benchmarking demodulation performance.
brand-voice-therapy
Apply Jesse's confident-but-vulnerable voice to all content. Use when writing service pages, blog posts, CTAs, emails, or checking if content sounds like Jesse. Reference: /docs/branding/voice/BRAND-VOICE.md
whisper-lolo-roadmap
Guide development of the whisper-lolo project based on specifications-projet.md. Use when planning or executing a sprint/PR, validating scope or constraints, or aligning architecture, statuses, and DoD for the Next.js + Vercel + Blob + Inngest + Whisper stack.
create-memo
This skill helps capture unstructured thoughts (via text or voice) across multiple turns and converts them into structured memos that are saved to a Notion inbox database. Use when the user wants to create a memo, capture thoughts, or provide content that should be documented verbatim without analysis or response.
game-developer
Expert game development and design skill for building complete, polished games. Use when creating games, game prototypes, or interactive entertainment experiences across platforms (React Native, web, Unity concepts, Godot). Covers game mechanics, physics, AI opponents, level design, progression systems, visual effects, sound integration, and player experience. Triggers on requests to build games, create game mechanics, design levels, implement game AI, add game audio, or develop interactive entertainment.
stt-integration
ElevenLabs Speech-to-Text transcription workflows with Scribe v1 supporting 99 languages, speaker diarization, and Vercel AI SDK integration. Use when implementing audio transcription, building STT features, integrating speech-to-text, setting up Vercel AI SDK with ElevenLabs, or when the user mentions transcription, STT, Scribe v1, audio-to-text, speaker diarization, or multi-language transcription.