Audio Processing

nfbs2000/vibe-with-google-ai-divorce-agent-inflearn

course-dialog-builder

Validates narration text, generates audio via ElevenLabs, and synchronizes timestamps.

uukuguy/claude-agent-framework

brand-voice

Brand consistency evaluation and voice alignment guidelines.

lightning-address

Use when implementing Lightning address functionality. Provides complete patterns for resolving Lightning addresses to invoices, generating invoices from addresses, displaying Lightning addresses in UI, and integrating with QR codes.

PerceptLabs/nostril

Create The Jungle Brief newsletter issues. Use when assembling newsletter content from ideas and posts, creating issue sections, or reviewing newsletter drafts. Includes newsletter-specific voice (curated, direct, insider feel), format guidelines, and issue templates.

livekit-prompt-builder

Okeysir198/P20251122-claude-skills

Guide for creating effective prompts and instructions for LiveKit voice agents. Use when building conversational AI agents with the LiveKit Agents framework, including (1) Creating new voice agent prompts from scratch, (2) Improving existing agent instructions, (3) Optimizing prompts for text-to-speech output, (4) Integrating tool/function calling capabilities, (5) Building multi-agent systems with handoffs, (6) Ensuring voice-friendly formatting and brevity for natural conversations, (7) Iteratively improving prompts based on testing and feedback, (8) Building industry-specific agents (debt collection, healthcare, banking, customer service, front desk).

vapi-skill

Mapping guidance for Vapi voice events to dashboard templates.

gracebotly/flowetic-app

notebooklm-superskill

Generate slide decks, audio podcasts, infographics, and video overviews from NotebookLM notebooks. Customizable by audience, format, language (80+), orientation, and visual theme. Use when asked to generate slides, create a podcast, make an infographic or video overview, or automate NotebookLM content creation.

brand-voice

Apply Matt Palmer's voice, tone, and content pillars to any writing. Use for blog posts, social media, documentation, emails, or any content needing Matt's authentic brand voice.

philosophy-of-language

Master philosophy of language: meaning, reference, truth, speech acts. Use for: semantics, pragmatics, meaning theory, reference. Triggers: 'meaning', 'reference', 'Frege', 'sense', 'Kripke', 'speech act', 'semantics', 'pragmatics', 'truth conditions', 'propositions', 'names', 'descriptions', 'rigid designator', 'natural kind', 'context', 'indexical'.

chrislemke/stoffy

tacosdedatos-editor

Use this skill to perform comprehensive editorial reviews of tacosdedatos content. Provides developmental editing (structure, flow, argument strength, pacing) and voice authenticity editing (ensuring content sounds like tacosdedatos, not generic AI). Use when reviewing drafts, giving feedback on submissions, or evaluating content before publication. Distinct from the writer skill (creates content) and copy-editor agent (grammar/mechanics polish).

chekos/bns-marketplace

tts-livekit-plugin

Okeysir198/P20251122-claude-skills

Build and deploy a self-hosted Text-to-Speech API using MeloTTS from Hugging Face, and create a LiveKit plugin for voice agents. Use this skill when building TTS systems, LiveKit voice agents, or self-hosted speech synthesis solutions.

dylantarre/animation-principles

rhythm-pacing

Use when animation needs musical flow: dance sequences, action choreography, comedic timing, scene pacing, or any motion that should feel rhythmic and well-composed over time.

pure-data

Pure Data reference documentation for audio patch development. Use when writing .pd patches, working with OSC/audio/sampling, or implementing Pure Data code.

harness-runner

Run the WaveCap-SDR test harness with automated parameter sweeps and validation. Use when regression testing, validating audio quality across configurations, testing SDR hardware, or benchmarking demodulation performance.

brand-voice-therapy

Apply Jesse's confident-but-vulnerable voice to all content. Use when writing service pages, blog posts, CTAs, emails, or checking if content sounds like Jesse. Reference: /docs/branding/voice/BRAND-VOICE.md

whisper-lolo-roadmap

Guide development of the whisper-lolo project based on specifications-projet.md. Use when planning or executing a sprint/PR, validating scope or constraints, or aligning architecture, statuses, and DoD for the Next.js + Vercel + Blob + Inngest + Whisper stack.

Lofp34/whisper-lolo

michaeldiestelberg/The-AI-enabled-Product-Builder

create-memo

This skill helps capture unstructured thoughts (via text or voice) across multiple turns and converts them into structured memos that are saved to a Notion inbox database. Use when the user wants to create a memo, capture thoughts, or provide content that should be documented verbatim without analysis or response.

johanruttens/paddle-battle

game-developer

Expert game development and design skill for building complete, polished games. Use when creating games, game prototypes, or interactive entertainment experiences across platforms (React Native, web, Unity concepts, Godot). Covers game mechanics, physics, AI opponents, level design, progression systems, visual effects, sound integration, and player experience. Triggers on requests to build games, create game mechanics, design levels, implement game AI, add game audio, or develop interactive entertainment.

stt-integration

vanman2024/ai-dev-marketplace

ElevenLabs Speech-to-Text transcription workflows with Scribe v1, supporting 99 languages, speaker diarization, and Vercel AI SDK integration. Use when implementing audio transcription, building STT features, integrating speech-to-text, setting up Vercel AI SDK with ElevenLabs, or when the user mentions transcription, STT, Scribe v1, audio-to-text, speaker diarization, or multi-language transcription.