
gemini-media

Gemini media and multimodal workflows across image, audio, and video.

allowed_tools: Read, Write, Edit, Bash, Glob, Grep, Task, TodoWrite

model: sonnet

$ Install

git clone https://github.com/DNYoussef/context-cascade /tmp/context-cascade && cp -r /tmp/context-cascade/skills/platforms/gemini-media ~/.claude/skills/context-cascade

// tip: Run this command in your terminal to install the skill

SKILL.md


name: gemini-media
description: Gemini media and multimodal workflows across image, audio, and video.
allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Task, TodoWrite
model: sonnet
x-version: 3.2.0
x-category: platforms
x-vcl-compliance: v3.1.1
x-cognitive-frames:
  - HON
  - MOR
  - COM
  - CLS
  - EVD
  - ASP
  - SPC

Purpose

Design multimodal prompts and pipelines that combine text with media inputs/outputs safely.

Trigger Conditions

Use this skill when: you need Gemini to analyze or generate images, audio, or video with grounded instructions.
Reroute when: only text reasoning is required; use gemini-search or gemini-megacontext instead.

Guardrails (Inherited from Skill-Forge + Prompt-Architect)

Structure-first: every platform skill keeps SKILL.md, examples/, and tests/ populated; create resources/ and references/ as needed. Log any missing artifact and fill a placeholder before proceeding.
Confidence ceilings are mandatory in outputs: inference/report 0.70, research 0.85, observation/definition 0.95. State as Confidence: X.XX (ceiling: TYPE Y.YY).
English-only user-facing text; keep VCL markers internal. Do not leak internal notation.
Adversarial validation is required before sign-off: boundary, failure, and COV checks with notes.
MCP tagging for runs: WHO=gemini-media-{session}, WHY=skill-execution, namespace skills/platforms/gemini-media/{project}.
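
The confidence-ceiling and MCP-tagging rules above can be sketched as two small helpers. This is a minimal illustration, not part of the skill itself: the function names `format_confidence` and `mcp_tags` are hypothetical, and clamping a value down to its ceiling is an assumed reading of "ceiling". The ceiling values and the tag layout are taken from the guardrail text.

```python
# Ceiling values come from the guardrail text; everything else is illustrative.
CEILINGS = {
    "inference": 0.70,
    "report": 0.70,
    "research": 0.85,
    "observation": 0.95,
    "definition": 0.95,
}


def format_confidence(value: float, claim_type: str) -> str:
    """Clamp a confidence value to its ceiling and render the required line."""
    ceiling = CEILINGS[claim_type]  # raises KeyError for an unknown claim type
    capped = min(max(value, 0.0), ceiling)
    return f"Confidence: {capped:.2f} (ceiling: {claim_type} {ceiling:.2f})"


def mcp_tags(session: str, project: str) -> dict[str, str]:
    """Build the WHO/WHY/namespace tags for a single run (hypothetical helper)."""
    return {
        "WHO": f"gemini-media-{session}",
        "WHY": "skill-execution",
        "namespace": f"skills/platforms/gemini-media/{project}",
    }


if __name__ == "__main__":
    print(format_confidence(0.90, "inference"))
    print(mcp_tags("s1", "demo"))
```

Keeping the ceilings in one table means an over-confident value is capped rather than rejected, which keeps the output line well-formed even when the caller passes a raw score.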

Execution Framework

Intent & Constraints — clarify task goal, inputs, success criteria, and risk limits; extract hard/soft/inferred constraints explicitly.
Plan & Docs — outline steps, needed examples/tests, and data contracts; confirm platform-specific policies.
Build & Optimize — apply platform playbook below; keep iterative checkpoints and diffs.
Validate — run adversarial tests, measure KPIs, and record evidence with ceilings.
Deliver & Hand off — summarize decisions, artifacts, and next actions; capture learnings for reuse.

Platform Playbook

Workflow patterns:
- Collect and validate media inputs with content safety filters
- Structure multimodal prompts with captions and constraints
- Post-process outputs with watermarking or metadata
Anti-patterns to avoid:
- Using media without safety filters
- Mixing unlabeled modalities in prompts
- Returning generated media without provenance notes
Example executions:
- Describe and tag uploaded images with safety review
- Generate storyboard frames with content policy checks
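
The workflow patterns above (validated inputs, labeled modalities with captions, provenance on outputs) can be sketched in plain Python. This is a sketch under stated assumptions: it builds a text prompt and a metadata record only and does not call any Gemini API. `MediaInput`, `build_prompt`, `provenance_note`, and the `safety_checked` flag are hypothetical names, and the flag stands in for whatever content safety filter the pipeline actually uses.

```python
from dataclasses import dataclass

ALLOWED_MODALITIES = {"image", "audio", "video"}


@dataclass
class MediaInput:
    modality: str                 # one of ALLOWED_MODALITIES
    uri: str                      # location of the media file
    caption: str                  # short human-written description
    safety_checked: bool = False  # set by an upstream safety filter (assumed)


def build_prompt(instruction: str, media: list[MediaInput], constraints: list[str]) -> str:
    """Assemble a prompt in which every media item is labeled and captioned."""
    lines = [f"Instruction: {instruction}"]
    for index, item in enumerate(media, start=1):
        if item.modality not in ALLOWED_MODALITIES:
            raise ValueError(f"unlabeled or unknown modality: {item.modality!r}")
        if not item.safety_checked:
            raise ValueError(f"media not safety-checked: {item.uri}")
        if not item.caption.strip():
            raise ValueError(f"missing caption: {item.uri}")
        lines.append(f"[{item.modality} {index}] {item.uri} | caption: {item.caption}")
    lines.extend(f"Constraint: {text}" for text in constraints)
    return "\n".join(lines)


def provenance_note(output_uri: str, model: str, sources: list[str]) -> dict[str, object]:
    """Attach a minimal provenance record to a generated media file."""
    return {"output": output_uri, "generated_by": model, "sources": list(sources)}


if __name__ == "__main__":
    image = MediaInput("image", "uploads/cat.png", "A cat on a sofa", safety_checked=True)
    print(build_prompt("Describe and tag the image", [image], ["No personal data"]))
    print(provenance_note("out/frame1.png", "gemini", ["uploads/cat.png"]))
```

Raising on unchecked or unlabeled media makes each listed anti-pattern a hard failure instead of a silent omission.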

Documentation & Artifacts

SKILL.md (this file) is canonical; keep quick-reference notes in README.md if present.
examples/ should hold runnable or narrative examples; tests/ should include validation steps or checklists.
resources/ stores helper scripts/templates; references/ stores background links or research.
Update metadata.json version if behavior meaningfully changes.
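
The artifact layout described above, together with the structure-first guardrail, could be checked with a short script. This is a minimal sketch, assuming the skill lives in a single directory. The `ensure_artifacts` name and the `TODO.md` placeholder file are illustrative choices, not something the skill prescribes.

```python
from pathlib import Path

# Artifacts the skill text says must always be populated.
REQUIRED = ["SKILL.md", "examples", "tests"]


def ensure_artifacts(skill_dir: Path) -> list[str]:
    """Return the missing artifacts and stub each one with a placeholder."""
    missing = []
    for name in REQUIRED:
        path = skill_dir / name
        if path.exists():
            continue
        missing.append(name)
        if name.endswith(".md"):
            # Missing file: write a one-line placeholder.
            path.write_text("TODO: placeholder\n", encoding="utf-8")
        else:
            # Missing directory: create it with a TODO marker inside.
            path.mkdir(parents=True)
            (path / "TODO.md").write_text("TODO: add content\n", encoding="utf-8")
    return missing


if __name__ == "__main__":
    for name in ensure_artifacts(Path(".")):
        print(f"missing artifact stubbed: {name}")
```

Returning the list of stubbed names gives the caller something to log, which matches the instruction to log any missing artifact before proceeding.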

Verification Checklist

Trigger matched and reroute considered
Examples/tests present or stubbed with TODOs
Constraints captured and confidence ceiling stated
Validation evidence captured (boundary, failure, COV)
MCP tags applied for this run

Confidence: 0.70 (ceiling: inference 0.70) - Standardized platform skill rewrite aligned with skill-forge + prompt-architect guardrails.