Marketplace

testing-quality

Coordinate research-focused testing and quality validation with clear constraints, gates, and confidence ceilings.

allowed_tools: Read, Write, Edit, Bash, Glob, Grep, Task, TodoWrite

model: sonnet

$ Installer

git clone https://github.com/DNYoussef/context-cascade /tmp/context-cascade && cp -r /tmp/context-cascade/skills/research/testing-quality ~/.claude/skills/context-cascade

// tip: Run this command in your terminal to install the skill

SKILL.md

View on GitHub →

name: testing-quality description: Coordinate research-focused testing and quality validation with clear constraints, gates, and confidence ceilings. allowed-tools: Read, Write, Edit, Bash, Glob, Grep, Task, TodoWrite model: sonnet x-version: 3.2.0 x-category: research x-vcl-compliance: v3.1.1 x-cognitive-frames:

HON
MOR
COM
CLS
EVD
ASP
SPC

STANDARD OPERATING PROCEDURE

Purpose

Provide a hub for research testing skills (code review, functionality audit, verification, style audit, theater detection).
Enforce constraint hygiene, quality gates, and explicit confidence ceilings on test results.
Maintain structure-first documentation for all sub-skills.

Trigger Conditions

Positive: any request to validate research code, results, or quality using the contained sub-skills.
Negative: non-research production testing (route to quality/operations categories).

Guardrails

Capture HARD / SOFT / INFERRED testing constraints (coverage targets, latency, risk tolerance, compliance).
Two-pass refinement: choose appropriate sub-skill(s) → validate findings with evidence and ceilings.
Route tasks to sub-skills: code-review-assistant, functionality-audit, verification-quality, style-audit, theater-detection.

Inputs

Artifact to test (code, models, data, results) and context.
Constraints and success criteria; environment details if applicable.

Workflow

Scope & Route: Record constraints and select relevant sub-skills; confirm INFERRED risks.
Plan Checks: Define coverage, tools, and expected outputs per sub-skill.
Execute Tests: Run selected skills, gather evidence, and log configurations.
Validate & Synthesize: Aggregate findings, mark severity, and apply confidence ceilings.
Deliver & Store: Provide consolidated report and next actions; update references/examples.

Validation & Quality Gates

Sub-skill selection justified and constraints mapped.
Evidence attached to findings; ceilings aligned to evidence type.
Recommendations include remediation steps and owners.

Response Template

**Scope & Constraints**
- HARD / SOFT / INFERRED.

**Selected Checks**
- Sub-skill → rationale.

**Findings**
- Issue → evidence → severity → confidence ceiling.

**Actions**
- Remediation + owner + due date.

Confidence: 0.81 (ceiling: inference 0.70) - based on executed checks and evidence.

Confidence: 0.81 (ceiling: inference 0.70) - reflects validated testing coverage and findings.