All skills

Dataset Versioning

/version-datasetNEW
Data & Study Design

What it does

Dataset version control for reproducibility. Builds a deterministic content-hash manifest (file SHA-256 + schema + per-column value hashes), verifies later copies to detect drift, and diffs two manifests — proof an analysis ran on the intended data.

Highlights

  • Content-hash manifest (SHA-256 + schema)
  • Drift detection: schema / rows / values
  • Reproducibility-lock datasets & demos

Install this skill

git clone https://github.com/aperivue/medsci-skills.git
cp -r medsci-skills/skills/version-dataset ~/.claude/skills/
Full documentationView source on GitHub

Related skills