🎨

音訊處理

357 skills in 內容與媒體 > 音訊處理

docs-applying-content-quality

Universal markdown content quality standards for active voice, heading hierarchy, accessibility compliance (alt text, WCAG AA contrast, screen reader support), and professional formatting. Essential for all markdown content creation across docs/, Hugo sites, plans/, and repository files. Auto-loads when creating or editing markdown content.

wahidyankf/open-sharia-enterprise
4
0
更新於 1d ago

f5-tts

Toolkit for installing and using F5-TTS, a neural text-to-speech system. Use this skill when users request TTS (text-to-speech) functionality, want to install or troubleshoot F5-TTS, or need to run the F5-TTS Gradio interface. This skill handles Python 3.11 requirement, FFmpeg dependencies, and library path configuration on macOS.

mbailey/claude
4
1
更新於 22h ago

hooks-development

Marketplace

Claude Code hooks development guide covering all 10 hook events lifecycle, PostToolUse visibility patterns, PreToolUse guards, Stop hook schema, and debugging. Use when creating hooks, troubleshooting hook output, understanding hook lifecycle, or when user mentions decision block, hook JSON output, stop hook, or Claude Code hooks.

terrylica/cc-skills
4
0
更新於 1d ago

strudel-music

Creating live-coded algorithmic music and ambient soundscapes using Strudel pattern language. Use this skill when the user asks to create music, generate soundscapes, build ambient tracks, or work with live coding music patterns. This skill helps with setting up Strudel with Neovim integration and composing music using pattern-based programming.

mbailey/claude
4
1
更新於 1d ago

config-builder

Create configuration systems following VoiceMode and MCPro patterns. Use when building new Python projects that need flexible configuration with .env files, environment variables, CLI arguments, and cascading precedence. Implements dotenv-based config with Click CLI integration and helper functions for path expansion and boolean parsing.

mbailey/claude
4
1
更新於 1d ago

conlang

Generate phonologically consistent constructed languages for fiction. Use when you need naming languages, alien speech, or fantasy tongues without deep linguistics knowledge.

jwynia/the-kepler-testimonies
4
1
更新於 1d ago

elevenlabs-tts

This skill converts text to high-quality audio files using ElevenLabs API. Use this skill when users request text-to-speech generation, audio narration, or voice synthesis with customizable voice parameters (stability, similarity boost) and voice presets (rachel, adam, bella, elli, josh, arnold, ava).

glebis/claude-skills
4
0
更新於 1d ago

jtbd-psychographic-research

Marketplace

Provides Jobs-to-be-Done and psychographic research frameworks for brand identity work. Auto-activates during brand positioning, voice development, messaging, and strategy phases. Use when discussing target audience, customer research, JTBD, jobs to be done, four forces, push pull anxiety habit, emotional jobs, social jobs, functional jobs, limbic types, VALS segments, psychographics, or customer motivations.

mike-coulbourn/claude-vibes
3
0
更新於 18h ago

critical-bug-detector

Automatically performs critical pre-release checks when building installers, creating releases, or packaging VoiceLite. Prevents missing files, version mismatches, and configuration errors that have caused release failures.

mikha08-rgb/VoiceLite
3
1
更新於 16h ago

hooks-builder

Marketplace

Create event-driven hooks for Claude Code automation. Use when the user wants to create hooks, automate tool validation, add pre/post processing, enforce security policies, or configure settings.json hooks. Triggers: create hook, build hook, PreToolUse, PostToolUse, event automation, tool validation, security hook

mike-coulbourn/claude-vibes
3
0
更新於 22h ago

context7

Search GitHub issues, pull requests, and discussions across any repository. Activates when researching external dependencies (whisper.cpp, NAudio), looking for similar bugs, or finding implementation examples.

mikha08-rgb/VoiceLite
3
1
更新於 18h ago

audio-engineer

Activate this skill when users need help with audio configuration, troubleshooting, or optimization in OBS. Triggers include requests like "fix my audio", "adjust microphone levels", "mute desktop audio", "balance my audio sources", "check audio levels", or diagnosing audio issues like echo, distortion, or missing sound. This skill orchestrates audio tools to ensure professional sound quality.

ironystock/agentic-obs
3
1
更新於 20h ago

notebooklm

Guide for managing Google NotebookLM from the command line using nlm CLI. Use when the user wants to create notebooks, manage sources, generate audio overviews, or mentions NotebookLM, nlm, notebook management, or research organization.

ronnycoding/.claude
3
0
更新於 11h ago

wake-word-detection

Expert skill for implementing wake word detection with openWakeWord. Covers audio monitoring, keyword spotting, privacy protection, and efficient always-listening systems for JARVIS voice assistant.

martinholovsky/claude-skills-generator
3
0
更新於 20h ago

swiftui-accessibility

Marketplace

Accessibility implementation guide for SwiftUI apps. Use when implementing VoiceOver support, adding accessibilityLabel/Hint/Value, supporting Dynamic Type, ensuring color contrast, testing accessibility, or reviewing accessibility in PRs. Covers iOS accessibility APIs, WCAG guidelines, and testing tools.

xtone/ai_development_tools
3
0
更新於 16h ago

VOICEVOX Narration System

Generate Yukkuri-style voice narration from Git commits using VOICEVOX Engine. Use when creating development progress audio guides, YouTube content, or team reports from Git history.

ShunsukeHayashi/miyabi-mcp-bundle
3
1
更新於 18h ago

elevenlabs

AI-powered audio generation using ElevenLabs API - text-to-speech with lifelike voices, sound effects generation, and music creation from text descriptions. Generate natural-sounding speech in 32 languages, create custom sound effects for games and videos, and compose royalty-free music tracks. Use this skill when the user requests: - Voice generation or text-to-speech conversion - Audio narration for content (videos, audiobooks, podcasts) - Sound effects for games, videos, or applications - Music generation from text descriptions - Multi-speaker dialogue or conversation audio - Voice cloning or custom voice creation - Audio streaming for real-time applications Capabilities: Text-to-speech (32 languages, 100+ voices), sound effects generation, music composition, voice cloning, real-time audio streaming Python SDK: elevenlabs (pip install elevenlabs)

jkitchin/skillz
3
0
更新於 16h ago

web-audio-api

Web Audio API for JARVIS audio feedback and voice processing

martinholovsky/claude-skills-generator
3
0
更新於 22h ago

speech-to-text

Expert skill for implementing speech-to-text with Faster Whisper. Covers audio processing, transcription optimization, privacy protection, and secure handling of voice data for JARVIS voice assistant.

martinholovsky/claude-skills-generator
3
0
更新於 10h ago

podcast-production-guide

Podcast expert covering recording, editing, hosting, promotion, and monetization strategies

sandraschi/advanced-memory-mcp
3
1
更新於 22h ago