數據工程
525 skills in 數據與 AI > 數據工程
data-engineer
Expert in data pipelines, ETL processes, and data infrastructure
docker-version-validation
Validates that version strings in Dockerfiles match package.json dependencies and constraints. Use when updating Docker base images, Node.js versions, pnpm versions, turbo versions, or other tool versions in Dockerfiles. Ensures consistency between Dockerfile versions and package.json engines, devDependencies, and packageManager fields. Automatically runs on pre-commit for Dockerfile changes and in CI pipeline. Trigger keywords: Dockerfile, Docker version, Node version, pnpm version, turbo version, version mismatch, version sync, Docker update.
preprocessing-data-with-automated-pipelines
This skill empowers Claude to preprocess and clean data using automated pipelines. It is designed to streamline data preparation for machine learning tasks, implementing best practices for data validation, transformation, and error handling. Claude should use this skill when the user requests data preprocessing, data cleaning, ETL tasks, or mentions the need for automated pipelines for data preparation. Trigger terms include "preprocess data", "clean data", "ETL pipeline", "data transformation", and "data validation". The skill ensures data quality and prepares it for effective analysis and model training.
managing-database-testing
This skill manages database testing by generating test data, wrapping tests in transactions, and validating database schemas. It is used to create robust and reliable database interactions. Claude uses this skill when the user requests database testing utilities, including test data generation, transaction management, schema validation, or migration testing. Trigger this skill by mentioning "database testing," "test data factories," "transaction rollback," "schema validation," or using the `/db-test` or `/dbt` commands.
correlation-methylation-epiFeatures
This skill provides a complete pipeline for integrating CpG methylation data with chromatin features such as ATAC-seq signal, H3K27ac, H3K4me3, or other histone marks/TF signals.
building-gitops-workflows
This skill enables Claude to construct GitOps workflows using ArgoCD and Flux. It is designed to generate production-ready configurations, implement best practices, and ensure a security-first approach for Kubernetes deployments. Use this skill when the user explicitly requests "GitOps workflow", "ArgoCD", "Flux", or asks for help with setting up a continuous delivery pipeline using GitOps principles. The skill will generate the necessary configuration files and setup code based on the user's specific requirements and infrastructure.
managing-database-testing
This skill manages database testing by generating test data, wrapping tests in transactions, and validating database schemas. It is used to create robust and reliable database interactions. Claude uses this skill when the user requests database testing utilities, including test data generation, transaction management, schema validation, or migration testing. Trigger this skill by mentioning "database testing," "test data factories," "transaction rollback," "schema validation," or using the `/db-test` or `/dbt` commands.
nodejs-stream-processing
Node.jsにおけるストリーム処理とバックプレッシャー管理の専門知識。大容量ファイルの効率的処理、メモリ使用量の最適化、Readable/Writable/Transform/Duplexストリームの適切な活用方法を提供。Anchors:• Node.js Streams API Documentation / 適用: ストリームAPI全般 / 目的: 公式APIの正確な使用• Backpressuring in Streams (Node.js official) / 適用: バックプレッシャー管理 / 目的: メモリ効率最適化• The Pragmatic Programmer (Hunt and Thomas) / 適用: 実装品質 / 目的: 保守性とテスタビリティTrigger:Use when implementing stream processing in Node.js, handling large files, managing backpressure, or building data transformation pipelines.nodejs stream, stream processing, backpressure, readable stream, writable stream, transform stream, duplex stream, pipeline, large file
event-driven-architecture
イベント駆動アーキテクチャの設計・実装スキル。メッセージングパターン、Event Sourcing、CQRS、Sagaを活用し、スケーラブルで疎結合なシステムを構築する。Anchors:• Enterprise Integration Patterns (Gregor Hohpe) / 適用: メッセージングパターン / 目的: 疎結合な統合設計• Designing Event-Driven Systems (Ben Stopford) / 適用: EDAアーキテクチャ / 目的: スケーラブルな非同期処理• Domain-Driven Design (Eric Evans) / 適用: ドメインイベント / 目的: ビジネスイベントの表現Trigger:Use when designing event-driven systems, implementing event sourcing, CQRS, message brokers, saga patterns, or asynchronous service integration.event-driven, messaging, pub/sub, event sourcing, cqrs, saga, kafka, rabbitmq, async
managing-container-registries
This skill enables Claude to manage container registries, including ECR, GCR, and Harbor. It should be used when the user needs to create, configure, or manage container image registries. It helps generate production-ready configurations, implement best practices, and ensure a security-first approach. Use this skill when the user mentions terms like "container registry," "ECR," "GCR," "Harbor," "image repository," or requests assistance with managing container images. It's also helpful for generating configuration code for DevOps pipelines related to container registries.
nixtla-prod-pipeline-generator
Transform experiment workflows into production-ready inference pipelines with orchestration
devops-engineering
Domain specialist for infrastructure, CI/CD, containers, observability, and DevOps operations. Scope: CI/CD pipelines, containerization, infrastructure as code, monitoring and observability, container security, release strategies, infrastructure reliability patterns. Excludes: code-level security, application architecture, database design, API design, performance profiling. Triggers: "CI/CD", "Docker", "Kubernetes", "K8s", "deployment", "pipeline", "monitoring", "observability", "Terraform", "Ansible", "infrastructure".
Unnamed Skill
Create and execute image workflows using floimg. Use when the user needs to generate AI images, transform existing images (resize, caption, filters), or create multi-step pipelines. Also handles charts, diagrams, QR codes, and screenshots. Trigger words: image, picture, photo, illustration, hero, thumbnail, resize, transform, caption, watermark, generate, AI, DALL-E, chart, diagram, QR, screenshot.
github-actions-security
GitHub Actionsワークフローのセキュリティ強化スキル。Repository/Environment Secretsの安全管理、ログマスキング、品質ゲート統合、CI/CDパイプラインの脅威対策を行う。Anchors:• OWASP Top 10 CI/CD Security Risks / 適用: リスク評価・脅威特定 / 目的: 業界標準に基づくリスク優先度決定• GitHub Actions Security Hardening Guide / 適用: ワークフロー実装 / 目的: 公式ベストプラクティス準拠• Threat Modeling (Adam Shostack) / 適用: STRIDE脅威分析 / 目的: 体系的なセキュリティ設計Trigger:Use when securing GitHub Actions workflows, configuring Environment/Repository Secrets, implementing log masking, adding quality gates to CI/CD pipelines, or performing threat modeling on workflows.github actions security, secrets management, log masking, quality gate, CI/CD security, threat modeling
building-cicd-pipelines
This skill enables Claude to generate CI/CD pipeline configurations for various platforms, including GitHub Actions, GitLab CI, and Jenkins. It is used when a user requests the creation of a CI/CD pipeline, specifies a platform (e.g., "GitHub Actions"), or mentions specific pipeline stages like "test," "build," "security," or "deploy." This skill is also useful when the user needs to automate software delivery, integrate security scanning, or set up multi-environment deployments. The skill is triggered by terms such as "CI/CD pipeline," "GitHub Actions pipeline," "GitLab CI configuration," or "Jenkins pipeline."
github-actions-ci
GitHub Actions CI/CD guide for the dotfiles repository. Use when creating or modifying GitHub Actions workflows, adding CI/CD pipelines, setting up build/test automation, configuring matrix builds, working with artifacts, implementing E2E tests in containers, or troubleshooting workflow issues. Covers workflow patterns, security best practices, caching strategies, and multi-platform testing.
azure-expert
Comprehensive Azure cloud expertise covering all major services (App Service, Functions, Container Apps, AKS, databases, storage, monitoring). Use when working with Azure infrastructure, deployments, troubleshooting, cost optimization, IaC (Bicep/ARM), CI/CD pipelines, or any Azure-related development tasks. Provides scripts, templates, and best practices for production-ready Azure solutions.
command-advanced-patterns
高度なコマンド設計パターンを整理し、パイプライン/メタコマンド/インタラクティブ設計を支援するスキル。パターン選定、テンプレート適用、検証手順を一貫して整理する。Anchors:• The Pragmatic Programmer (Andrew Hunt, David Thomas) / 適用: 実践パターン選定 / 目的: 再現性のある設計• Design Patterns (GoF) / 適用: パターン適用判断 / 目的: 拡張性の確保• User-Centered Design / 適用: インタラクティブ設計 / 目的: 誤操作の抑制Trigger:Use when designing pipeline commands, meta-commands, or interactive command flows.command patterns, pipeline command, meta command, interactive command
coverport-integration
Integrate coverport into Go repositories with Tekton pipelines to enable e2e test coverage collection and upload to Codecov. Use this skill when users ask to integrate coverport, add e2e coverage tracking, or set up coverage instrumentation for Go projects.
docker-containerization
This skill should be used when containerizing applications with Docker, creating Dockerfiles, docker-compose configurations, or deploying containers to various platforms. Ideal for Next.js, React, Node.js applications requiring containerization for development, production, or CI/CD pipelines. Use this skill when users need Docker configurations, multi-stage builds, container orchestration, or deployment to Kubernetes, ECS, Cloud Run, etc.