Image Processing
912 skills in Content & Media > Image Processing
image-enhancer
Improves image quality (resolution, sharpness, clarity) for screenshots, presentations, and social media. Analyzes specs and applies specific enhancements.
event-detection-temporal-intelligence-expert
Expert in temporal event detection, spatio-temporal clustering (ST-DBSCAN), and photo context understanding. Use for detecting photo events, clustering by time/location, shareability prediction, place recognition, event significance scoring, and life event detection. Activate on 'event detection', 'temporal clustering', 'ST-DBSCAN', 'spatio-temporal', 'shareability prediction', 'place recognition', 'life events', 'photo events', 'temporal diversity'. NOT for individual photo aesthetic quality (use photo-composition-critic), color palette analysis (use color-theory-palette-harmony-expert), face recognition implementation (use photo-content-recognition-curation-expert), or basic EXIF timestamp extraction.
datalab
Convert documents (PDF, EPUB, PPTX, DOCX, XLSX, HTML, images) to Markdown using Datalab cloud API. Use when user wants to use Datalab API for document conversion, or prefers cloud-based processing over local marker CLI.
computer-vision
Image processing, object detection, segmentation, and vision models. Use for image classification, object detection, or visual analysis tasks.
csharp-async-patterns
Task, ValueTask, async streams, cancellation 등 C# async/await 패턴을 사용할 때 활용합니다. 비동기 C# 코드를 작성할 때 사용합니다.
gemini-imagegen
This skill should be used when generating and editing images using the Gemini API (Nano Banana Pro). It applies when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.
page-diff
Visual regression testing for web pages. Compares screenshots to detect UI changes, generates overlay images highlighting differences. Use after making frontend changes to verify visual correctness, catch unintended side effects, or validate that changes look as expected.
astro-architecture
Technical architecture for Astro lead generation websites. Use when setting up new projects, configuring build tools, or establishing project foundations. For images use astro-images skill. For SEO use astro-seo skill.
torchvision
Computer vision library for PyTorch featuring pretrained models, advanced image transforms (v2), and utilities for handling complex data types like bounding boxes and masks. (torchvision, transforms, tvtensor, resnet, cutmix, mixup, pretrained models, vision transforms)
video-format
Process videos using ffmpeg and gifsicle to edit videos. Use when working with .mp4, .avi, or .gif files, or sequences of .png or .jpg files, and the request is to compress, convert to a different format, scale, crop, remove or extract frames, concatenate multiple videos in space or in time, or speed up or slow down the video.
panel-digitizer
Extract panel schedule data from images (photos or CAD drawings) and generate PanelCalc-compatible JSON import files. Use when user provides panel schedule images, panel directory photos, or CAD drawings that need to be digitized into structured panel data. Supports single-column, dual-column, handwritten, and typed formats.
docker-web
Containerizes web applications with Docker for consistent deployments across environments. Use when creating Dockerfiles, building container images, or setting up Docker Compose for development.
comfyui-workflow-builder
Generates optimized ComfyUI workflows for image generation, editing, and enhancement. Creates JSON workflow files using available models and provides step-by-step setup instructions.
gemini-image-generator
Generate images using Google Gemini NanoBanana via browser automation. Use this skill for general-purpose AI image generation from text prompts. Includes persistent authentication, automatic environment setup, and reference image support for style matching.
registry-cleanup
Automates cleanup of old container images from DigitalOcean registries based on retention policies. Keeps last 10 tags for staging, 30 for production, always preserves :latest.
troubleshoot
Diagnoses Kubernetes pod issues automatically. Use when pods are in CrashLoopBackOff, ImagePullBackOff, Pending, or Error state. Analyzes events, logs, and resource status to identify root causes.
ragsharp-build-code-graph
Build or update a code graph index for C#/.NET repositories using ragsharp-graph.Triggers: build index, update index, refresh index, code graph, dependency graph, static analysis, Roslyn, line numbers.
pdf-analyzer
Extract text and images from PDF files, analyze diagrams and architecture charts, and convert visual diagrams to Mermaid format. Use when the user wants to read, analyze, or convert PDF documents, especially those containing diagrams, flowcharts, or architecture visualizations.
wordpress-site-speed-auditor
Performs comprehensive WordPress site speed and performance audits. Use when analyzing WordPress themes, plugins, or sites for performance optimization opportunities, checking image/video optimization, database queries, caching, or identifying speed bottlenecks.
reviewing-dotnet-code
Reviews and generates .NET/C# code following Microsoft conventions and modern patterns.Use when reviewing C# files, writing .NET code, refactoring, or when user mentions"C#", ".NET", "dotnet", "csharp", or asks about naming conventions in .NET projects.