🎨

Image Processing

912 skills in Content & Media > Image Processing

local-rag

Index local folders and query them using RAG (Retrieval Augmented Generation). Supports PDF, DOCX, PPTX, XLSX, images with OCR, and text files.

gsannikov/claude-skills
1
0
Mis à jour 2d ago

CI/CD Pipeline Management

GitLab CI/CD pipeline optimization, Docker image building, caching strategies, and 3-stage deployment workflow

hackur/web-standards-playground-showcase
1
0
Mis à jour 2d ago

event-detection-temporal-intelligence-expert

Expert in temporal event detection, spatio-temporal clustering (ST-DBSCAN), and photo context understanding. Use for detecting photo events, clustering by time/location, shareability prediction, place recognition, event significance scoring, and life event detection. Activate on 'event detection', 'temporal clustering', 'ST-DBSCAN', 'spatio-temporal', 'shareability prediction', 'place recognition', 'life events', 'photo events', 'temporal diversity'. NOT for individual photo aesthetic quality (use photo-composition-critic), color palette analysis (use color-theory-palette-harmony-expert), face recognition implementation (use photo-content-recognition-curation-expert), or basic EXIF timestamp extraction.

erichowens/some_claude_skills
1
0
Mis à jour 2d ago

writing-dockerfiles

Guides Dockerfile creation and optimization. Use when Dockerfile or Docker Compose is detected. Supports multi-stage builds, cache optimization, security hardening, and image size minimization.

sumik5/dotfiles
1
2
Mis à jour 2d ago

ai

Marketplace

Add AI features to Glide apps using AI columns like Generate Text, Image to Text, Audio to Text. Use when adding AI-powered functionality, text generation, OCR, transcription, or auto-categorization.

glideapps/glide-code
1
0
Mis à jour 2d ago

vr-avatar-engineer

Expert in photorealistic and stylized VR avatar systems for Apple Vision Pro, Meta Quest, and cross-platform metaverse. Specializes in facial tracking (52+ blend shapes), subsurface scattering, Persona-style generation, Photon networking, and real-time LOD. Activate on 'VR avatar', 'Vision Pro Persona', 'Meta avatar', 'facial tracking', 'blend shapes', 'avatar networking', 'photorealistic avatar'. NOT for 2D profile pictures (use image generation), non-VR game characters (use game engine tools), static 3D models (use modeling tools), or motion capture hardware setup.

erichowens/some_claude_skills
1
0
Mis à jour 2d ago

reachy-mini-sdk

Comprehensive guide for programming Reachy Mini robot using Python SDK v1.2.6. Use when working with Reachy Mini robot control, motion programming, sensor access, audio/video processing, or building AI applications. Covers movement control (head, antennas, body), camera/microphone access, motion recording/playback, coordinate systems, and Hugging Face integration. Essential for robotics development, AI experimentation, and interactive applications.

jjmartres/opencode
1
0
Mis à jour 2d ago

image-comparison-tool

Compare images with SSIM similarity scoring, pixel difference highlighting, and side-by-side visualization.

dkyazzentwatwa/chatgpt-skills
1
0
Mis à jour 2d ago

image-processing

Process, transform, and analyze images using common operations

tatat/agents-playground
1
0
Mis à jour 1d ago

background-remover

Remove backgrounds from images using segmentation. Support for color-based, edge detection, and AI-assisted removal methods. Batch processing available.

dkyazzentwatwa/chatgpt-skills
1
0
Mis à jour 1d ago

ocr-document-processor

Extract text from images and scanned PDFs using OCR. Supports 100+ languages, table detection, structured output (markdown/JSON), and batch processing.

dkyazzentwatwa/chatgpt-skills
1
0
Mis à jour 1d ago

fsharp-persistence

Implement data persistence using SQLite with Dapper, JSON files, or event sourcing. Use when: "database", "save data", "store", "CRUD", "create table", "query", "SQL", "SQLite", "Dapper", "file storage", "JSON file", "event sourcing", "persistence", "read from database", "write to database", "data access", "repository". Creates code in src/Server/Persistence.fs with async I/O patterns.

heimeshoff/Cinemarco
1
0
Mis à jour 1d ago

Unnamed Skill

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing, effects, composition). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs. | Sử dụng khi: xử lý hình ảnh, video, audio, FFmpeg, ImageMagick, chuyển đổi media.

wollfoo/setup-factory
1
0
Mis à jour 1d ago

fsharp-frontend

Implement F# frontend using Elmish MVU architecture with Feliz for React components. Use when: "add UI", "create component", "build form", "frontend", "client-side", "user interface", "view", "display", "render", "Elmish", "Feliz", "button", "input", "state management". Creates Model/Msg/update in src/Client/State.fs and views in src/Client/View.fs. Follows strict MVU pattern with RemoteData for async operations and TailwindCSS/DaisyUI for styling.

heimeshoff/Cinemarco
1
0
Mis à jour 1d ago

docker-multi-stage

Marketplace

Multi-stage builds for optimized, minimal production images with build/runtime separation

pluginagentmarketplace/custom-plugin-docker
1
0
Mis à jour 1d ago

editorial-image-generator

Creates sophisticated HBR-style editorial illustrations for any content using AI understanding and visual analysis. Use when creating conceptual illustrations, analyzing generated images, or compositing logos. Works with any brand configuration. AI-native approach - Claude reasons about content rather than using rigid templates.

tallyfy/documentation
1
1
Mis à jour 1d ago

scientific-schematics

Create publication-quality scientific diagrams using Nano Banana Pro AI with iterative refinement. AI generation is the default method for all diagram types. Generates high-fidelity images with automatic quality review. Specialized in neural network architectures, system diagrams, flowcharts, biological pathways, and complex scientific visualizations.

ovachiever/droid-tings
1
0
Mis à jour 1d ago

pdf-to-markdown-converter

Converts PDF files to Markdown format using PyMuPDF, extracting text content and embedded images. Fast and lightweight. Automatically fixes LaTeX umlauts (¨a → ä, etc.) and converts ß to ss (Swiss German). Use when converting PDFs to Markdown, extracting document content, or processing PDF files for text analysis. Generates one .md file and 0..n .png files for images.

talent-factory/dotfiles
1
0
Mis à jour 1d ago

receipt-scanner

Extract vendor, date, items, amounts, and total from receipt images using OCR and pattern matching with structured JSON output.

dkyazzentwatwa/chatgpt-skills
1
0
Mis à jour 1d ago

ecosystem

Marketplace

JavaScript ecosystem including npm, build tools (Webpack, Vite), testing (Jest, Vitest), linting, and CI/CD integration.

pluginagentmarketplace/custom-plugin-javascript
1
0
Mis à jour 1d ago