🔒

Testing & Security

Testing frameworks, security tools, and best practices

9063 skills in this category

reviewing-changes

Android-specific code review workflow additions for Bitwarden Android. Provides change type refinements, checklist loading, and reference material organization. Complements bitwarden-code-reviewer agent's base review standards.

bitwarden/android
8.1k
908
Actualizado 4d ago

web-e2e

Run, create, and debug Playwright e2e tests for the web app. ALWAYS invoke this skill using the SlashCommand tool (i.e., `/web-e2e`) BEFORE attempting to run any e2e tests, playwright tests, anvil tests, or debug test failures. DO NOT run `bun playwright test` or other e2e commands directly - you must invoke this skill first to learn the correct commands and test architecture.

Uniswap/interface
5.4k
5.4k
Actualizado 4d ago

evaluation

Marketplace

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.

muratcankoylan/Agent-Skills-for-Context-Engineering
5.4k
424
Actualizado 4d ago

frontend-developer

Build user interfaces using Redpanda UI Registry components with React, TypeScript, and Vitest testing. Use when user requests UI components, pages, forms, or mentions 'build UI', 'create component', 'design system', 'frontend', or 'registry'.

redpanda-data/console
4.2k
409
Actualizado 4d ago

e2e-tester

Write and run Playwright E2E tests for Redpanda Console using testcontainers. Analyzes test failures, adds missing testids, and improves test stability. Use when user requests E2E tests, Playwright tests, integration tests, test failures, missing testids, or mentions 'test workflow', 'browser testing', 'end-to-end', or 'testcontainers'.

redpanda-data/console
4.2k
409
Actualizado 4d ago

dspy-ruby

Marketplace

This skill should be used when working with DSPy.rb, a Ruby framework for building type-safe, composable LLM applications. Use this when implementing predictable AI features, creating LLM signatures and modules, configuring language model providers (OpenAI, Anthropic, Gemini, Ollama), building agent systems with tools, optimizing prompts, or testing LLM-powered functionality in Ruby applications.

EveryInc/compound-engineering-plugin
3.8k
323
Actualizado 4d ago

monorepo

Monorepo script commands and conventions for this codebase. Use when running builds, tests, formatting, linting, or type checking.

EpicenterHQ/epicenter
3.8k
251
Actualizado 4d ago

Creating Financial Models

This skill provides an advanced financial modeling suite with DCF analysis, sensitivity testing, Monte Carlo simulations, and scenario planning for investment decisions

modelscope/ms-agent
3.6k
412
Actualizado 4d ago

my-first-skill

Example skill demonstrating Anthropic SKILL.md format. Load when learning to create skills or testing the OpenSkills loader.

numman-ali/openskills
3.4k
248
Actualizado 4d ago

test-with-spanner

Run unit tests that require the Spanner emulator. Use this skill when the user wants to run tests in packages like satellite/metabase, satellite/metainfo, or any other tests that interact with Spanner. Automatically handles checking for and configuring the Spanner emulator environment.

storj/storj
3.2k
421
Actualizado 4d ago

pydeseq2

Marketplace

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

statsmodels

Marketplace

Statistical models library for Python. Use when you need specific model classes (OLS, GLM, mixed models, ARIMA) with detailed diagnostics, residuals, and inference. Best for econometrics, time series, rigorous inference with coefficient tables. For guided statistical test selection with APA reporting use statistical-analysis.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

adaptyv

Marketplace

Cloud laboratory platform for automated protein testing and validation. Use when designing proteins and needing experimental validation including binding assays, expression testing, thermostability measurements, enzyme activity assays, or protein sequence optimization. Also use for submitting experiments via API, tracking experiment status, downloading results, optimizing protein sequences for better expression using computational tools (NetSolP, SoluProt, SolubleMPNN, ESM), or managing protein design workflows with wet-lab validation.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

networkx

Marketplace

Comprehensive toolkit for creating, analyzing, and visualizing complex networks and graphs in Python. Use when working with network/graph data structures, analyzing relationships between entities, computing graph algorithms (shortest paths, centrality, clustering), detecting communities, generating synthetic networks, or visualizing network topologies. Applicable to social networks, biological networks, transportation systems, citation networks, and any domain involving pairwise relationships.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

hypothesis-generation

Marketplace

Generate testable hypotheses. Formulate from observations, design experiments, explore competing explanations, develop predictions, propose mechanisms, for scientific inquiry across domains.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

statistical-analysis

Marketplace

Guided statistical analysis with test selection and reporting. Use when you need help choosing appropriate tests for your data, assumption checking, power analysis, and APA-formatted results. Best for academic research reporting, test selection guidance. For implementing specific models programmatically use statsmodels.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

cosmic-database

Marketplace

Access COSMIC cancer mutation database. Query somatic mutations, Cancer Gene Census, mutational signatures, gene fusions, for cancer research and precision oncology. Requires authentication.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

hypogenic

Marketplace

Automated LLM-driven hypothesis generation and testing on tabular datasets. Use when you want to systematically explore hypotheses about patterns in empirical data (e.g., deception detection, content analysis). Combines literature insights with data-driven hypothesis testing. For manual hypothesis formulation use hypothesis-generation; for creative ideation use scientific-brainstorming.

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

pyhealth

Marketplace

Comprehensive healthcare AI toolkit for developing, testing, and deploying machine learning models with clinical data. This skill should be used when working with electronic health records (EHR), clinical prediction tasks (mortality, readmission, drug recommendation), medical coding systems (ICD, NDC, ATC), physiological signals (EEG, ECG), healthcare datasets (MIMIC-III/IV, eICU, OMOP), or implementing deep learning models for healthcare applications (RETAIN, SafeDrug, Transformer, GNN).

K-Dense-AI/claude-scientific-skills
3.0k
334
Actualizado 4d ago

end-to-end-tests

after making changes, run end-to-end tests to ensure that the product still works

git-town/git-town
3.0k
132
Actualizado 4d ago