Computer Vision
257 skills in Data & AI > Computer Vision
socratic-ideation-tree
Transform abstract goals into concrete, actionable tasks through Socratic questioning and hierarchical decomposition. Use when users need to (1) break down complex projects into manageable parts, (2) clarify vague ideas into specific implementations, (3) generate systematic project structures, (4) explore multiple solution paths, (5) create task hierarchies for any domain (software, business, personal goals, research, construction). Triggers include requests to "break down", "decompose", "plan", "structure", "ideate", "brainstorm systematically", or transform ideas into actionable tasks.
brainstorming
Use when creating or developing anything, before writing code or implementation plans - refines rough ideas into fully-formed designs through structured Socratic questioning, alternative exploration, and incremental validation
brainstorming
IMMEDIATELY USE THIS SKILL when creating or developing anything and before writing code or implementation plans - refines rough ideas into fully-formed designs through structured Socratic questioning, alternative exploration, and incremental validation
learning-discussion-designer
Design discussion prompts, Socratic questions, debate structures, and facilitation guides for meaningful learning conversations. Use for discussion-based learning. Activates on "discussion questions", "debate design", "Socratic method", or "forum prompts".
orchestrationcreating-workflows
Use when user says "create workflow", "create a workflow", "design workflow", "orchestrate", "automate multiple steps", "coordinate agents", "multi-agent workflow". Creates orchestration workflows from natural language using Socratic questioning to plan multi-agent workflows with visualization.
format-resume
Intelligently format CV/resume content using semantic understanding and visual verification
docrev
Document revision workflow tool (CLI: `rev`). Use when working with Word documents containing reviewer comments, importing track changes to markdown, replying to reviewer comments, building PDF/DOCX outputs, generating response letters, validating citations/DOIs, or any document revision task.
cv-manager
Expert in managing professional candidate CVs and interview records. **Use this skill whenever the user mentions 'add a CV', 'add new professional interview', 'add an interview', 'view the CV', 'summarize a candidate', 'candidate summary', or requests to process recruitment-related documents and interview notes.** Handles CV upload to Google Drive, information extraction, interview transcription, and structured documentation in Google Docs. (project, gitignored)
brainstorming
Refine ideas into detailed designs through Socratic dialogue. Use when: user has a rough idea, needs to clarify requirements, or wants to explore approaches. Triggers: "brainstorm", "discuss idea", "I'm thinking about", "what if", "help me think through", "explore options", "/brainstorm".
ai-multimodal
Process and generate multimedia content using Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understanding images (captioning, object detection, OCR, visual Q&A, segmentation), processing videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extracting from documents (PDF tables, forms, charts, diagrams, multi-page), and generating images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.
resume-creator
Create professional resumes using first-principles thinking, company research, LaTeX Harvard-style formatting, and iterative visual refinement. Use when the user wants to create, update, or optimize a resume/CV for a specific job, company, or career goal.
cve-tracking-system
CVE tracking expert. Use for vulnerability management, security advisories, and patch prioritization.
styling-with-tailwind-cva
Use when creating design systems or styling components with Tailwind CSS v4 and Class Variance Authority (CVA) - provides type-safe component variants with CSS variables for design tokens
dev-brainstorm
This skill should be used when the user asks to "brainstorm a feature", "discuss requirements", "clarify the design", "refine the idea", or as Phase 1 of the /dev workflow. Uses Socratic questioning to explore design before implementation.
cve-checker
CVE vulnerability checker for software dependencies and components. Includes KEV (Known Exploited Vulnerabilities) analysis, exploit verification, and white hat security assessment. Use when checking for known vulnerabilities, scanning dependencies, analyzing SBOM for CVEs, verifying security of packages/libraries, identifying actively exploited vulnerabilities, or performing KEV-based security assessments. Triggers on: CVE check, vulnerability scan, dependency security, SBOM analysis, security audit, package vulnerability, KEV check, exploited vulnerability, ransomware CVE, CISA KEV, exploit verification, white hat, attack vector analysis.
job-application
Job posting analysis, profile-to-role fit assessment, and generation of a tailored CV and cover letter. Activate this skill when the user talks about a job application, a job posting, a cover letter, or a tailored CV.
task-management
Task management system with Git workflows, comparative analysis, and intelligent prioritization. Activate this skill when the user talks about tasks, the backlog, or CV analyses, or asks which task to do next.