🤖

LLM 與 Agent

6763 skills in 數據與 AI > LLM 與 Agent

model-pruning

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

phoenix-observability

Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running evaluations on datasets, or monitoring production AI systems with real-time insights.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

qdrant-vector-search

High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

crewai-multi-agent

Multi-agent orchestration framework for autonomous AI collaboration. Use when building teams of specialized agents working together on complex tasks, when you need role-based agent collaboration with memory, or for production workflows requiring sequential/hierarchical execution. Built without LangChain dependencies for lean, fast execution.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

awq-quantization

Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited GPU memory, when you need faster inference than GPTQ with better accuracy preservation, or for instruction-tuned and multimodal models. MLSys 2024 Best Paper Award winner.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

dspy

Build complex AI systems with declarative programming, optimize prompts automatically, create modular RAG systems and agents with DSPy - Stanford NLP's framework for systematic LM programming

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

autogpt-agents

Autonomous AI agent platform for building and deploying continuous agents. Use when creating visual workflow agents, deploying persistent autonomous agents, or building complex multi-step AI automation systems.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

knowledge-distillation

Compress large language models using knowledge distillation from teacher to student models. Use when deploying smaller models with retained performance, transferring GPT-4 capabilities to open-source models, or reducing inference costs. Covers temperature scaling, soft targets, reverse KLD, logit distillation, and MiniLLM training strategies.

zechenzhangAGI/AI-research-SKILLs

更新於 5d ago

github-assistant

GitHub workflow automation using `gh` CLI. Use when creating PRs, reviewing code, managing issues, auto-replying to fixed issues, adding comments, assigning users, managing labels, merging PRs, or checking CI status. Includes auto-reply script and Vietnamese templates.

khaphanspace/gonhanh.org

更新於 5d ago

notion-spec-to-implementation

Turns product or tech specs into concrete Notion tasks that Claude code can implement. Breaks down spec pages into detailed implementation plans with clear tasks, acceptance criteria, and progress tracking to guide development from requirements to completion.

Prat011/awesome-llm-skills

更新於 5d ago

openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

steipete/clawdis

更新於 5d ago

qmd

Local search/indexing CLI (BM25 + vectors + rerank) with MCP mode.

steipete/clawdis

更新於 5d ago

clawdis-notify

Send system notifications to specific Clawdis nodes (macOS computers) via the Gateway and CLI. Use when you need to alert a person or confirm a remote action on a particular machine, or when an agent must push a notification to another computer.

steipete/clawdis

更新於 5d ago

openai-image-gen

Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.

steipete/clawdis

更新於 5d ago

clawdis-nodes

Discover, interpret, and target Clawdis nodes (paired devices) via the Gateway/CLI. Use when an agent must find available nodes, choose the best target machine, or reason about presence vs node availability (Tailnet/Tailscale optional).

steipete/clawdis

更新於 5d ago

oracle

Best practices for using the oracle CLI (prompt + file bundling, engines, sessions, and file attachment patterns).

steipete/clawdis

更新於 5d ago

subagent-driven-development

Use when executing implementation plans with independent tasks in the current session

Trevoke/org-gtd.el

更新於 5d ago

ai-prompting

Effective communication strategies for AI-assisted development. Learn context-first prompting, phased interactions, iterative refinement, and validation techniques to get better results from Claude and other AI coding assistants.

jasonkneen/kiro

更新於 5d ago

web-browser

Allows to interact with web pages by performing actions such as clicking buttons, filling out forms, and navigating links. It works by remote controlling Google Chrome or Chromium browsers using the Chrome DevTools Protocol (CDP). When Claude needs to browse the web, it can use this skill to do so.

nicksp/dotfiles

更新於 5d ago