
LLM & Agents

6763 skills in Data & AI > LLM & Agents

llm-evaluation

Marketplace

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or establishing evaluation frameworks.

HermeticOrmus/after-the-third-cup
0
0
Updated 1w ago

langchain-patterns

Marketplace

LangChain implementation patterns with templates, scripts, and examples for RAG pipelines

vanman2024/ai-dev-marketplace
0
0
Updated 1w ago

skill-manager

Manage your installed Claude Code skills - install, update, rename, uninstall, and list skills from GitHub URLs. Use when the user wants to install a skill, update a skill, list installed skills, rename a skill, remove/delete/uninstall a skill, or provides a GitHub URL to a skills directory.

majiayu000/claude-skill-registry
0
0
Updated 1w ago

rl-environments

Gym/gymnasium API - custom environments, spaces, wrappers, vectorization, debugging

tachyon-beep/hamlet
0
0
Updated 1w ago

ui-design-agent

Creates UI/UX design patterns, components, and user experience guidelines

Unicorn/Radium
0
0
Updated 1w ago

ds-star

Multi-agent data science framework using DS-STAR (Data Science - Structured Thought and Action) architecture. Automates data analysis through collaborative AI agents with multi-model support (Haiku, Sonnet, Opus). Use for exploratory data analysis, automated insights, and iterative data science workflows.

majiayu000/claude-skill-registry
0
0
Updated 1w ago

tauri-ipc-developer

Specialized agent for implementing type-safe IPC communication between React frontend and Rust backend in Tauri v2 applications. Use when adding new Tauri commands, implementing bidirectional events, debugging IPC serialization issues, or optimizing command performance.

iammarkps/eqapo-gui
0
0
Updated 1w ago

goap-agent

Invoke for complex multi-step tasks requiring intelligent planning and multi-agent coordination. Use when tasks need decomposition, dependency mapping, parallel/sequential/swarm/iterative execution strategies, or coordination of multiple specialized agents with quality gates and dynamic optimization.

d-oit/do-novelist-ai
0
0
Updated 1w ago

nexus-prompt-engineer

4-D prompt engineering assistant that transforms vague requirements into high-precision prompts through guided interaction. Trigger when users need to: (1) craft high-quality system prompts, (2) optimize existing prompts, (3) use '/fast' for quick generation or '/audit' for prompt review. Applicable to any scenario requiring carefully designed prompts.

pianzhu/my-claude-skills
0
0
Updated 1w ago

create-new-skills

Creates new Agent Skills for Claude Code following best practices and documentation. Use when the user wants to create a new skill, extend Claude's capabilities, or package domain expertise into a reusable skill.

GolferGeek/orchestrator-ai
0
0
Updated 1w ago

reviewing-with-claude

Runs a quick review within the current Claude session. Evaluates code quality, security, and performance immediately while preserving context.

sekka/dotfiles
0
0
Updated 1w ago

context-store

Marketplace

Context Store - Document management system for storing, querying, and retrieving documents across Claude Code sessions. Use this to maintain knowledge bases and share documents between agents. Whenever you encounter a <document id=*> in a session, use this skill to retrieve its content.

rawe/claude-agent-orchestrator
0
0
Updated 1w ago

langgraph

Expert guidance for building stateful, multi-actor AI agents with LangGraph - graphs, nodes, edges, state management, and agent architectures.

majiayu000/claude-skill-registry
0
0
Updated 1w ago

pont-de-londres

Integration pattern for linking a domain graph (structured, derived from a CSV) to a lexical graph (extracted automatically from unstructured documents via an LLM). Use this skill when Claude needs to build a hybrid Knowledge Graph combining structured data and automatic extraction, notably with neo4j-graphrag and SimpleKGPipeline. Use cases: GraphRAG, ingesting PDFs with metadata, building Knowledge Graphs from heterogeneous sources.

majiayu000/claude-skill-registry
0
0
Updated 1w ago

observability

Real-time monitoring dashboard for PAI multi-agent activity. USE WHEN user says 'start observability', 'stop dashboard', 'restart observability', 'monitor agents', 'show agent activity', or needs to debug multi-agent workflows.

mikegil/MaxAI
0
0
Updated 1w ago

revision-agent

Specialized agent for systematic prose revision using the 3-column method and house-rulebook enforcement. Reviews structure, style, and mechanics top-down. Use when the user asks to "revise", "edit", or "improve prose", or explicitly invokes the revision agent.

majiayu000/claude-skill-registry
0
0
Updated 1w ago

ai-evaluation-suite

Comprehensive AI/LLM evaluation toolkit for production AI systems. Covers LLM output quality, prompt engineering, RAG evaluation, agent performance, hallucination detection, bias assessment, cost/token optimization, latency metrics, model comparison, and fine-tuning evaluation. Includes BLEU/ROUGE metrics, perplexity, F1 scores, LLM-as-judge patterns, and benchmarks like MMLU and HumanEval.

majiayu000/claude-skill-registry
0
0
Updated 1w ago

claude-skill-bash

Apply comprehensive bash scripting standards including main function pattern, usage documentation, argument parsing, dependency checking, and error handling. Triggers when creating/editing .sh files, bash scripts, or discussing shell scripting, deployment scripts, automation tasks, or bash conventions.

majiayu000/claude-skill-registry
0
0
Updated 1w ago

claude-mcp-expert

Expert on Model Context Protocol (MCP) integration, MCP servers, installation, configuration, and authentication. Triggers when user mentions MCP, MCP servers, installing MCP, connecting tools, MCP resources, MCP prompts, or remote/local MCP servers.

MEDICALCOR/medicalcor-core
0
0
Updated 1w ago

unify

Validate spec-implementation-test alignment and convergence. Checks spec completeness, implementation conformance, test coverage, and contract consistency. Use after implementation and tests are complete.

matthew-plusprogramming/monorepo
0
0
Updated 1w ago