🎨

Image Processing

912 skills in Content & Media > Image Processing

csharp-developer

Expert C# developer specializing in modern .NET development, ASP.NET Core, and cloud-native applications. Masters C# 12 features, Blazor, and cross-platform development with emphasis on performance and clean architecture.

zenobi-us/dotfiles

Mis à jour 3d ago

document-pptx

Create, edit, and analyze PowerPoint presentations with slides, layouts, charts, images, animations, and speaker notes. Supports python-pptx and pptxgenjs for automated presentation generation in Python and Node.js.

vasilyu1983/AI-Agents-public

Mis à jour 3d ago

replicate-integration

Integrate Replicate API for AI model deployment. Use when generating images with Flux, SDXL, or custom LoRA models via Replicate.

daniel-carreon/saas-factory-setup

Mis à jour 3d ago

nano-banana-image-combine

Combine multiple images using Gemini 2.5 Flash (Nano Banana) via OpenRouter. Use when merging 2-8 images with AI-guided composition.

daniel-carreon/saas-factory-setup

Mis à jour 3d ago

media-processing

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

binhmuc/autobot-review

Mis à jour 3d ago

Video Generation

Implement AI-powered video generation capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to generate videos from text prompts or images, create video content programmatically, or build applications that produce video outputs. Supports asynchronous task management with status polling and result retrieval.

AnswerZhao/agent-skills

Mis à jour 3d ago

VLM

Implement vision-based AI chat capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to analyze images, describe visual content, or create applications that combine image understanding with conversational AI. Supports image URLs and base64 encoded images for multimodal interactions.

AnswerZhao/agent-skills

Mis à jour 3d ago

nanobanana

Guide for generating and editing images using generative AI with the nanobanana CLI

maragudk/skills

Mis à jour 3d ago

sf-imagen

Marketplace

AI-powered visual content generation for Salesforce development. Generates ERD diagrams, LWC mockups, architecture visuals using Nano Banana Pro. Also provides Gemini as a parallel sub-agent for code review and research.

Jaganpro/sf-skills

Mis à jour 3d ago

performance-optimization

Marketplace

Expert in React Native 0.83+ performance optimization including Hermes V1, React 19.2 concurrent features, Intersection Observer, Web Performance APIs, bundle size reduction, memory management, rendering optimization, FlashList, expo-image v2, memoization, lazy loading, code splitting. Activates for performance, slow app, lag, memory leak, bundle size, optimization, flatlist performance, re-render, fps, jank, startup time, app size, hermes, concurrent rendering.

anton-abyzov/specweave

Mis à jour 3d ago

cv-pipeline-builder

Marketplace

Computer vision ML pipelines for image classification, object detection, semantic segmentation, and image generation. Activates for "computer vision", "image classification", "object detection", "CNN", "ResNet", "YOLO", "image segmentation", "image preprocessing", "data augmentation". Builds end-to-end CV pipelines with PyTorch/TensorFlow, integrated with SpecWeave increments.

anton-abyzov/specweave

Mis à jour 3d ago

creating-daiv-yml-config

Creates or updates .daiv.yml configuration file with sandbox settings (base_image and format_code commands) based on repository content. Use when users request DAIV configuration setup or sandbox configuration.

srtab/daiv

Mis à jour 3d ago

seedream-image-generator

Generate images using the Doubao SeeDream API based on text prompts. Use this skill when users request AI-generated images, artwork, illustrations, or visual content creation. The skill handles API calls, downloads generated images to the project's /pic folder, and supports batch generation of up to 4 sequential images.

eze-is/seedream-image-generator•Python

Mis à jour 3d ago

gemini-image

当用户想要生成图片、画图、绘画、创建图像、AI作画时使用此 Skill。支持文生图和图生图。

Ceeon/gemini-image-skill

Mis à jour 3d ago

image-generation

自动为文章生成配图，支持AI生成图片、公共领域图片、免费图库图片，并上传到ImgBB图床生成Markdown链接。当用户提到"配图"、"插图"、"图片"、"生成图片"、"文章配图"时使用此技能。

alchaincyf/glm-claude

Mis à jour 3d ago

Docker

Container management with Docker and Podman. USE WHEN building images, managing containers, working with compose files, debugging containers, managing networks/volumes, scanning for vulnerabilities, or optimizing images.

vdemeester/home

Mis à jour 3d ago

Tekton

Tekton Pipelines CI/CD best practices for Kubernetes-native workflows. USE WHEN working with Tekton tasks, pipelines, triggers, building container images, GitOps integration, or cloud-native CI/CD.

vdemeester/home

Mis à jour 3d ago

video-thumbnail-check

基于MrBeast策略检查视频标题、缩略图和内容钩子，优化点击率和观看时长。当用户提到"视频标题"、"封面图"、"缩略图"、"点击率"、"CTR"、"观看时长"、"视频开头"时使用此技能。

alchaincyf/glm-claude

Mis à jour 3d ago

value-prop-sharpener

Marketplace

Refine value propositions from emotional, logical, and status angles, then synthesize into a powerful 15-word statement with evolution process.

majesticlabs-dev/majestic-marketplace

Mis à jour 3d ago

install-windows-3-11

Guidance for setting up legacy Windows VMs (like Windows 3.11) in QEMU with web-based remote access via noVNC. This skill should be used when tasks involve running legacy operating systems in virtual machines, configuring QEMU for older OS images, setting up VNC/noVNC web interfaces, or establishing programmatic keyboard control via QMP. Covers VM boot verification strategies, nginx reverse proxy configuration, and websockify setup.

letta-ai/skills

Mis à jour 3d ago