๐ŸŽจ

Audio Processing

357 skills in Content & Media > Audio Processing

openai-latam-audiobook

Create complete audiobook with OpenAI GPT-4o-mini translation to Argentine Spanish. Full pipeline - translate, TTS, video with background image. Auto-scales parallelism based on file size. Use when user wants to create audiobook from Russian text to LATAM Spanish.

majiayu000/claude-skill-registry
0
0
Updated 4d ago

asr

Implement speech-to-text (ASR/automatic speech recognition) capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to transcribe audio files, convert speech to text, build voice input features, or process audio recordings. Supports base64 encoded audio files and returns accurate text transcriptions.

albertfast/radar_tinder
0
0
Updated 4d ago

campaign-page-copy

Generates full campaign page content structured for Kickstarter/Indiegogo using positioning, product, and voice assets.

Sheshiyer/Skills-claude-brand-genesis
0
0
Updated 4d ago

bigquery-object-table-agent

BigQuery Object Tables๋ฅผ ํ™œ์šฉํ•œ ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ(์˜ค๋””์˜ค, ์ด๋ฏธ์ง€ ๋“ฑ) ๋ถ„์„ ๋ฐ Audio Analytics Agent ๊ตฌ์ถ• ๊ฐ€์ด๋“œ. GCS ๋ฐ์ดํ„ฐ ์—ฐ๋™, ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ์บ์‹ฑ, AI ๋ชจ๋ธ ํ†ตํ•ฉ, ADK ์—์ด์ „ํŠธ ๊ตฌํ˜„ ํŒจํ„ด์„ ๋‹ค๋ฃน๋‹ˆ๋‹ค.

nfbs2000/vibe-with-google-ai-divorce-agent-inflearn
0
0
Updated 4d ago

channel-optimizer

Auto-tune channel parameters to find optimal offset, squelch, and AGC settings for best audio quality. Use when setting up new channels, improving weak signals, or finding the sweet spot for demodulation settings.

majiayu000/claude-skill-registry
0
0
Updated 4d ago

podcast-production

Marketplace

Podcast production patterns and workflows. Use when recording podcasts, editing audio, transcribing episodes, generating show notes, RSS feed management, or podcast distribution.

mindmorass/reflex
0
0
Updated 4d ago

rust-candle-whisper

Implement native Rust ML inference with Candle framework. Use when building GPU-accelerated ML pipelines without Python dependencies.

gar-ai/mallorn
0
0
Updated 4d ago

transcribe-audio-to-text

Marketplace

Transcribe audio files to text using audinota cli

MacHu-GWU/sanhe-claude-code-plugins
0
0
Updated 4d ago

wavecap-hallucination

Configure WaveCap hallucination detection and prevention. Use when Whisper outputs gibberish, repeated phrases, or phantom text on silent audio.

TobiasWooldridge/WaveCap
0
0
Updated 4d ago

brand-strategy

This skill should be used when translating research insights into actionable brand strategy frameworks. Use this when developing positioning statements, messaging architectures, audience strategies, or voice guidelines based on completed research. This skill provides strategic synthesis workflows, validation frameworks, and strategy document templates for evidence-based brand strategy development.

majiayu000/claude-skill-registry
0
0
Updated 4d ago

sag

ElevenLabs text-to-speech with mac-style say UX.

majiayu000/claude-skill-registry
0
0
Updated 4d ago

marketing-writer

Create marketing content optimized for both human readers and LLM discovery (GEO/AEO). Use when the user needs to write or improve marketing materials including landing page copy, tweet threads, launch emails, blog posts, or feature announcements. Automatically analyzes the user's codebase to understand product features and value propositions. Applies casual, direct brand voice and Generative Engine Optimization principles to maximize visibility in AI search results.

AIBPM42/hodgesfooshee-site-spark
0
0
Updated 4d ago

claude-hook-builder

Interactive hook creator for Claude Code. Triggers when user mentions creating hooks, PreToolUse, PostToolUse, hook validation, hook configuration, settings.json hooks, or wants to automate tool execution workflows.

MEDICALCOR/medicalcor-core
0
0
Updated 4d ago

livekit-voice-agent

Marketplace

Guide for building production-ready LiveKit voice AI agents with multi-agent workflows and intelligent handoffs. Use when creating real-time voice agents that need to transfer control between specialized agents, implement supervisor escalation, or build complex conversational systems.

Okeysir198/P20251122-claude-skills
0
0
Updated 4d ago

content-publishing

Automated content publishing pipeline for ID8Labs. Generates essays in Eddie's voice, publishes to id8labs.app, and distributes to social media (X, LinkedIn). Triggers on keywords like release, announce, publish, essay, research article, content pipeline.

eddiebe147/claude-settings
0
0
Updated 4d ago

writer

Generate content in your authentic voice across emails, blogs, social media, and reports

majiayu000/claude-skill-registry
0
0
Updated 4d ago

m4b-audiobook-builder

Build and merge M4B audiobooks on Linux from multiple audio files or multi-part M4B sets, with chapter generation, metadata normalization, UTF-8/Russian encoding handling, and validation. Use when combining MP3/M4A/AAC/FLAC/OGG/WAV into one M4B, merging split M4B parts, or fixing audiobook chapters and metadata.

majiayu000/claude-skill-registry
0
0
Updated 4d ago