gemini-media
Gemini media and multimodal workflows across image, audio, and video.
allowed_tools: Read, Write, Edit, Bash, Glob, Grep, Task, TodoWrite
model: sonnet
$ Install
git clone https://github.com/DNYoussef/context-cascade /tmp/context-cascade && cp -r /tmp/context-cascade/skills/platforms/gemini-media ~/.claude/skills/context-cascade/
// tip: Run this command in your terminal to install the skill
SKILL.md
name: gemini-media
description: Gemini media and multimodal workflows across image, audio, and video.
allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Task, TodoWrite
model: sonnet
x-version: 3.2.0
x-category: platforms
x-vcl-compliance: v3.1.1
x-cognitive-frames:
- HON
- MOR
- COM
- CLS
- EVD
- ASP
- SPC
Purpose
Design multimodal prompts and pipelines that combine text with media inputs/outputs safely.
Trigger Conditions
- Use this skill when: you need Gemini to analyze or generate images, audio, or video with grounded instructions.
- Reroute when: only text reasoning is required; use gemini-search or gemini-megacontext instead.
Guardrails (Inherited from Skill-Forge + Prompt-Architect)
- Structure-first: every platform skill keeps `SKILL.md`, `examples/`, and `tests/` populated; create `resources/` and `references/` as needed. Log any missing artifact and fill a placeholder before proceeding.
- Confidence ceilings are mandatory in outputs: inference/report 0.70, research 0.85, observation/definition 0.95. State as `Confidence: X.XX (ceiling: TYPE Y.YY)`.
- English-only user-facing text; keep VCL markers internal. Do not leak internal notation.
- Adversarial validation is required before sign-off: boundary, failure, and COV checks with notes.
- MCP tagging for runs: `WHO=gemini-media-{session}`, `WHY=skill-execution`, namespace `skills/platforms/gemini-media/{project}`.
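The confidence-ceiling and MCP-tagging guardrails are mechanical enough to sketch in code. The following is a minimal illustration, not part of the skill itself: the names `CEILINGS`, `format_confidence`, and `mcp_tags` are hypothetical, and clamping a value down to its ceiling is an assumed interpretation of "mandatory". Only the ceiling values and the tag templates come from the guardrails above.

```python
# Ceiling values are copied from the guardrail text; every name in this
# sketch is hypothetical and chosen only for illustration.
CEILINGS = {
    "inference": 0.70,
    "report": 0.70,
    "research": 0.85,
    "observation": 0.95,
    "definition": 0.95,
}


def format_confidence(value: float, claim_type: str) -> str:
    """Clamp a confidence value to its ceiling and render the required line."""
    if claim_type not in CEILINGS:
        raise ValueError(f"unknown claim type: {claim_type!r}")
    if not 0.0 <= value <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    ceiling = CEILINGS[claim_type]
    capped = min(value, ceiling)  # assumption: values above the ceiling are clamped
    return f"Confidence: {capped:.2f} (ceiling: {claim_type} {ceiling:.2f})"


def mcp_tags(session: str, project: str) -> dict:
    """Fill the WHO / WHY / namespace templates for one run."""
    return {
        "WHO": f"gemini-media-{session}",
        "WHY": "skill-execution",
        "namespace": f"skills/platforms/gemini-media/{project}",
    }


if __name__ == "__main__":
    print(format_confidence(0.92, "inference"))
    print(mcp_tags("s01", "demo"))
```

Keeping the ceilings in one table means a report cannot state a confidence above its claim type by accident; whether to clamp or to reject an over-ceiling value is a design choice the guardrail text leaves open.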
Execution Framework
- Intent & Constraints — clarify task goal, inputs, success criteria, and risk limits; extract hard/soft/inferred constraints explicitly.
- Plan & Docs — outline steps, needed examples/tests, and data contracts; confirm platform-specific policies.
- Build & Optimize — apply platform playbook below; keep iterative checkpoints and diffs.
- Validate — run adversarial tests, measure KPIs, and record evidence with ceilings.
- Deliver & Hand off — summarize decisions, artifacts, and next actions; capture learnings for reuse.
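One way to make the Intent & Constraints step concrete is to record its outputs in a small structure and check for gaps before planning. The sketch below assumes a hypothetical `TaskBrief` class; the field names mirror the wording of that step, but nothing in the skill prescribes this shape.

```python
from dataclasses import dataclass, field


@dataclass
class TaskBrief:
    """Hypothetical record of what the Intent & Constraints step should capture."""
    goal: str = ""
    inputs: list = field(default_factory=list)
    success_criteria: list = field(default_factory=list)
    risk_limits: list = field(default_factory=list)
    hard: list = field(default_factory=list)      # constraints that must hold
    soft: list = field(default_factory=list)      # preferences that can be traded off
    inferred: list = field(default_factory=list)  # assumptions still to be confirmed

    def missing(self) -> list:
        """Return the names of required fields that are still empty."""
        required = ("goal", "inputs", "success_criteria", "risk_limits")
        return [name for name in required if not getattr(self, name)]


if __name__ == "__main__":
    brief = TaskBrief(
        goal="Describe and tag uploaded images",
        inputs=["photo.png"],
        hard=["run a safety review before returning tags"],
    )
    print(brief.missing())
```

Listing inferred constraints separately keeps unconfirmed assumptions visible, so they can be checked with the requester instead of being silently treated as requirements.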
Platform Playbook
- Workflow patterns:
  - Collect and validate media inputs with content safety filters
  - Structure multimodal prompts with captions and constraints
  - Post-process outputs with watermarking or metadata
- Anti-patterns to avoid:
  - Using media without safety filters
  - Mixing unlabeled modalities in prompts
  - Returning generated media without provenance notes
- Example executions:
  - Describe and tag uploaded images with safety review
  - Generate storyboard frames with content policy checks
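The workflow patterns above can be sketched without committing to any particular Gemini client. The example below uses only the Python standard library and makes several assumptions: the part dictionaries are an illustrative shape rather than the Gemini request format, `is_safe` is a hypothetical hook where a real content safety filter would be plugged in, and `"example-model"` is a placeholder name.

```python
import mimetypes
from datetime import datetime, timezone

# Assumed allow-list for this sketch; the real set depends on model and policy.
ALLOWED_PREFIXES = ("image/", "audio/", "video/")


def validate_media(path: str) -> str:
    """Return the guessed MIME type, or raise if it is not image, audio, or video."""
    mime, _ = mimetypes.guess_type(path)
    if mime is None or not mime.startswith(ALLOWED_PREFIXES):
        raise ValueError(f"unsupported or unknown media type: {path}")
    return mime


def build_prompt(instruction, media, constraints, is_safe):
    """Assemble an ordered prompt in which every media part is labeled and captioned.

    `media` is a list of (path, caption) pairs; `is_safe` is a caller-supplied
    callable standing in for a real content safety filter.
    """
    parts = [{"type": "text", "text": instruction}]
    for path, caption in media:
        mime = validate_media(path)
        if not is_safe(path):
            raise ValueError(f"media rejected by safety filter: {path}")
        parts.append({
            "type": "media",
            "modality": mime.split("/")[0],  # image, audio, or video
            "mime_type": mime,
            "path": path,
            "caption": caption,
        })
    parts.append({"type": "text", "text": "Constraints: " + "; ".join(constraints)})
    return parts


def provenance_note(model: str, source_paths) -> dict:
    """Build metadata to attach to generated output."""
    return {
        "generated_by": model,
        "sources": list(source_paths),
        "created_at": datetime.now(timezone.utc).isoformat(),
    }


if __name__ == "__main__":
    prompt = build_prompt(
        "Describe and tag each image.",
        [("photo.png", "Uploaded product photo")],
        ["no personal data in tags"],
        is_safe=lambda path: True,  # stand-in only; not a real safety filter
    )
    print(prompt)
    print(provenance_note("example-model", ["photo.png"]))
```

Rejecting unknown media types and requiring a caption for each part addresses the first two anti-patterns at assembly time; the provenance note covers the third, though watermarking the media itself would need a separate tool.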
Documentation & Artifacts
- `SKILL.md` (this file) is canonical; keep quick-reference notes in `README.md` if present.
- `examples/` should hold runnable or narrative examples; `tests/` should include validation steps or checklists.
- `resources/` stores helper scripts/templates; `references/` stores background links or research.
- Update `metadata.json` version if behavior meaningfully changes.
Verification Checklist
- Trigger matched and reroute considered
- Examples/tests present or stubbed with TODOs
- Constraints captured and confidence ceiling stated
- Validation evidence captured (boundary, failure, COV)
- MCP tags applied for this run
Confidence: 0.70 (ceiling: inference 0.70) - Standardized platform skill rewrite aligned with skill-forge + prompt-architect guardrails.
Repository: DNYoussef/context-cascade/skills/platforms/gemini-media
Author: DNYoussef