Audio Processing
357 skills in Content & Media > Audio Processing
docs-applying-content-quality
Universal markdown content quality standards for active voice, heading hierarchy, accessibility compliance (alt text, WCAG AA contrast, screen reader support), and professional formatting. Essential for all markdown content creation across docs/, Hugo sites, plans/, and repository files. Auto-loads when creating or editing markdown content.
f5-tts
Toolkit for installing and using F5-TTS, a neural text-to-speech system. Use this skill when users request TTS (text-to-speech) functionality, want to install or troubleshoot F5-TTS, or need to run the F5-TTS Gradio interface. This skill handles the Python 3.11 requirement, FFmpeg dependencies, and library path configuration on macOS.
hooks-development
Claude Code hooks development guide covering the lifecycle of all 10 hook events, PostToolUse visibility patterns, PreToolUse guards, the Stop hook schema, and debugging. Use when creating hooks, troubleshooting hook output, understanding the hook lifecycle, or when the user mentions decision block, hook JSON output, stop hook, or Claude Code hooks.
strudel-music
Create live-coded algorithmic music and ambient soundscapes using the Strudel pattern language. Use this skill when the user asks to create music, generate soundscapes, build ambient tracks, or work with live coding music patterns. This skill helps with setting up Strudel with Neovim integration and composing music using pattern-based programming.
config-builder
Create configuration systems following VoiceMode and MCPro patterns. Use when building new Python projects that need flexible configuration with .env files, environment variables, CLI arguments, and cascading precedence. Implements dotenv-based config with Click CLI integration and helper functions for path expansion and boolean parsing.
conlang
Generate phonologically consistent constructed languages for fiction. Use when you need naming languages, alien speech, or fantasy tongues without deep linguistics knowledge.
elevenlabs-tts
This skill converts text to high-quality audio files using the ElevenLabs API. Use this skill when users request text-to-speech generation, audio narration, or voice synthesis with customizable voice parameters (stability, similarity boost) and voice presets (rachel, adam, bella, elli, josh, arnold, ava).
jtbd-psychographic-research
Provides Jobs-to-be-Done and psychographic research frameworks for brand identity work. Auto-activates during brand positioning, voice development, messaging, and strategy phases. Use when discussing target audience, customer research, JTBD, jobs to be done, four forces, push pull anxiety habit, emotional jobs, social jobs, functional jobs, limbic types, VALS segments, psychographics, or customer motivations.
critical-bug-detector
Automatically performs critical pre-release checks when building installers, creating releases, or packaging VoiceLite. Prevents missing files, version mismatches, and configuration errors that have caused release failures.
hooks-builder
Create event-driven hooks for Claude Code automation. Use when the user wants to create hooks, automate tool validation, add pre/post processing, enforce security policies, or configure settings.json hooks. Triggers: create hook, build hook, PreToolUse, PostToolUse, event automation, tool validation, security hook.
context7
Search GitHub issues, pull requests, and discussions across any repository. Activates when researching external dependencies (whisper.cpp, NAudio), looking for similar bugs, or finding implementation examples.
audio-engineer
Activate this skill when users need help with audio configuration, troubleshooting, or optimization in OBS. Triggers include requests like "fix my audio", "adjust microphone levels", "mute desktop audio", "balance my audio sources", "check audio levels", or diagnosing audio issues like echo, distortion, or missing sound. This skill orchestrates audio tools to ensure professional sound quality.
notebooklm
Guide for managing Google NotebookLM from the command line using the nlm CLI. Use when the user wants to create notebooks, manage sources, generate audio overviews, or mentions NotebookLM, nlm, notebook management, or research organization.
wake-word-detection
Expert skill for implementing wake word detection with openWakeWord. Covers audio monitoring, keyword spotting, privacy protection, and efficient always-listening systems for the JARVIS voice assistant.
swiftui-accessibility
Accessibility implementation guide for SwiftUI apps. Use when implementing VoiceOver support, adding accessibilityLabel/Hint/Value, supporting Dynamic Type, ensuring color contrast, testing accessibility, or reviewing accessibility in PRs. Covers iOS accessibility APIs, WCAG guidelines, and testing tools.
VOICEVOX Narration System
Generate Yukkuri-style voice narration from Git commits using VOICEVOX Engine. Use when creating development progress audio guides, YouTube content, or team reports from Git history.
elevenlabs
AI-powered audio generation using the ElevenLabs API: text-to-speech with lifelike voices, sound effects generation, and music creation from text descriptions. Generate natural-sounding speech in 32 languages, create custom sound effects for games and videos, and compose royalty-free music tracks. Use this skill when the user requests voice generation or text-to-speech conversion; audio narration for content (videos, audiobooks, podcasts); sound effects for games, videos, or applications; music generation from text descriptions; multi-speaker dialogue or conversation audio; voice cloning or custom voice creation; or audio streaming for real-time applications. Capabilities: text-to-speech (32 languages, 100+ voices), sound effects generation, music composition, voice cloning, and real-time audio streaming. Python SDK: elevenlabs (pip install elevenlabs).
web-audio-api
Web Audio API for JARVIS audio feedback and voice processing.
speech-to-text
Expert skill for implementing speech-to-text with Faster Whisper. Covers audio processing, transcription optimization, privacy protection, and secure handling of voice data for the JARVIS voice assistant.
podcast-production-guide
Podcast expert covering recording, editing, hosting, promotion, and monetization strategies.