synthetic-data-generation

Generate realistic synthetic data with Faker and Spark, including non-linear distributions and integrity constraints, and save the results to Databricks Volumes. Use when creating test data, demo datasets, or synthetic tables.
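
The two ideas named above, non-linear distributions and integrity constraints, can be illustrated with a minimal sketch. This is not the skill's own code: it uses only Python's standard `random` module as a stand-in for Faker and Spark, and the function names, column names, and distribution parameters are hypothetical choices made for illustration.

```python
import random


def generate_customers(n, seed=0):
    """Build n parent rows with a weighted (non-uniform) tier column."""
    rng = random.Random(seed)
    tiers = ["free", "pro", "enterprise"]
    return [
        # Weighted choice: most customers are on the free tier.
        {"customer_id": i, "tier": rng.choices(tiers, weights=[70, 25, 5])[0]}
        for i in range(1, n + 1)
    ]


def generate_orders(customers, n, seed=1):
    """Build n child rows whose customer_id always refers to an existing customer."""
    rng = random.Random(seed)
    orders = []
    for order_id in range(1, n + 1):
        # Integrity constraint: draw the foreign key from the parent rows only.
        customer = rng.choice(customers)
        # Non-linear distribution: log-normal amounts give a long right tail.
        amount = round(rng.lognormvariate(3.0, 1.0), 2)
        orders.append(
            {
                "order_id": order_id,
                "customer_id": customer["customer_id"],
                "amount": amount,
            }
        )
    return orders


customers = generate_customers(100)
orders = generate_orders(customers, 1000)
```

Generating the parent table first and sampling foreign keys from it keeps every order tied to a real customer. In a Spark setting the same pattern would presumably apply to DataFrames before writing the output to a Volume path.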

$ Install

git clone https://github.com/databricks-solutions/ai-dev-kit /tmp/ai-dev-kit && cp -r /tmp/ai-dev-kit/databricks-skills/synthetic-data-generation ~/.claude/skills/ai-dev-kit

// tip: Run this command in your terminal to install the skill