media-processing

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

$ Instalar

git clone https://github.com/binhmuc/autobot-review /tmp/autobot-review && cp -r /tmp/autobot-review/.claude/skills/media-processing ~/.claude/skills/autobot-review

// tip: Run this command in your terminal to install the skill


name: media-processing description: Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs. license: MIT

Media Processing Skill

Process video, audio, and images using FFmpeg, ImageMagick, and RMBG CLI tools.

Tool Selection

TaskToolReason
Video encoding/conversionFFmpegNative codec support, streaming
Audio extraction/conversionFFmpegDirect stream manipulation
Image resize/effectsImageMagickOptimized for still images
Background removalRMBGAI-powered, local processing
Batch imagesImageMagickmogrify for in-place edits
Video thumbnailsFFmpegFrame extraction built-in
GIF creationFFmpeg/ImageMagickFFmpeg for video, ImageMagick for images

Installation

# macOS
brew install ffmpeg imagemagick
npm install -g rmbg-cli

# Ubuntu/Debian
sudo apt-get install ffmpeg imagemagick
npm install -g rmbg-cli

# Verify
ffmpeg -version && magick -version && rmbg --version

Essential Commands

# Video: Convert/re-encode
ffmpeg -i input.mkv -c copy output.mp4
ffmpeg -i input.avi -c:v libx264 -crf 22 -c:a aac output.mp4

# Video: Extract audio
ffmpeg -i video.mp4 -vn -c:a copy audio.m4a

# Image: Convert/resize
magick input.png output.jpg
magick input.jpg -resize 800x600 output.jpg

# Image: Batch resize
mogrify -resize 800x -quality 85 *.jpg

# Background removal
rmbg input.jpg                          # Basic (modnet)
rmbg input.jpg -m briaai -o output.png  # High quality
rmbg input.jpg -m u2netp -o output.png  # Fast

Key Parameters

FFmpeg:

  • -c:v libx264 - H.264 codec
  • -crf 22 - Quality (0-51, lower=better)
  • -preset slow - Speed/compression balance
  • -c:a aac - Audio codec

ImageMagick:

  • 800x600 - Fit within (maintains aspect)
  • 800x600^ - Fill (may crop)
  • -quality 85 - JPEG quality
  • -strip - Remove metadata

RMBG:

  • -m briaai - High quality model
  • -m u2netp - Fast model
  • -r 4096 - Max resolution

References

Detailed guides in references/:

  • ffmpeg-encoding.md - Codecs, quality, hardware acceleration
  • ffmpeg-streaming.md - HLS/DASH, live streaming
  • ffmpeg-filters.md - Filters, complex filtergraphs
  • imagemagick-editing.md - Effects, transformations
  • imagemagick-batch.md - Batch processing, parallel ops
  • rmbg-background-removal.md - AI models, CLI usage
  • common-workflows.md - Video optimization, responsive images, GIF creation
  • troubleshooting.md - Error fixes, performance tips
  • format-compatibility.md - Format support, codec recommendations