Data Quality Auditor

by Alireza Rezvani·MIT·14922 on GitHub·updated 2026-05-15
claude plugin marketplace add alirezarezvani/claude-skills

Description

You are an expert data quality engineer. Your goal is to systematically assess dataset health, surface hidden issues that corrupt downstream analysis, and prescribe prioritized fixes. You move fast, think in impact, and never let "good enough" data quietly poison a model or dashboard.

What this skill does

  1. Profiles a dataset's shape, types, null rates, cardinality, and value distributions, then assigns a 0-100 Data Quality Score across completeness, consistency, validity, uniqueness, and timeliness.
  2. Classifies missingness patterns as MCAR, MAR, or MNAR and recommends imputation strategies per column based on null percentage thresholds.
  3. Detects outliers using IQR, Z-score, and modified Z-score methods, then flags whether each anomaly is likely a data error or a legitimate extreme.
  4. Runs cross-column checks for referential integrity, duplicate keys, silent nulls encoded as zeros or strings, and distribution shifts from baseline.
  5. Produces a prioritized remediation plan ordered by severity times breadth of impact, with specific transforms per issue.
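
The three outlier methods named in step 3 can be sketched in a few lines of Python. This is an illustrative sketch only, not the skill's actual implementation: the function names, the sample `readings` list, and the cutoffs (1.5 for the IQR fence, 3.0 for the Z-score, 3.5 with the 0.6745 constant for the modified Z-score) are assumptions based on common conventions, not values taken from the listing.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR] (k=1.5 is an assumed default)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

def zscore_outliers(values, threshold=3.0):
    """Flag values more than `threshold` population standard deviations from the mean."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    if sd == 0:
        return []  # constant column: nothing can be an outlier
    return [v for v in values if abs(v - mean) / sd > threshold]

def modified_zscore_outliers(values, threshold=3.5):
    """Flag values using the median and MAD, which are less distorted by extremes."""
    median = statistics.median(values)
    mad = statistics.median([abs(v - median) for v in values])
    if mad == 0:
        return []  # more than half the values are identical; the score is undefined
    return [v for v in values if 0.6745 * abs(v - median) / mad > threshold]

# Hypothetical sample column with one suspicious value.
readings = [10, 12, 11, 13, 12, 11, 10, 12, 13, 11, 500]

for detect in (iqr_outliers, zscore_outliers, modified_zscore_outliers):
    print(detect.__name__, detect(readings))
```

On small samples the plain Z-score is the weakest of the three, because a single extreme value inflates both the mean and the standard deviation it is measured against. The sketch does not decide whether a flagged value is a data error or a legitimate extreme; that judgment needs domain context.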

When to use

  • A new dataset needs to be assessed before feeding into a model, dashboard, or downstream pipeline.
  • A specific column or metric is producing suspicious results and you need to trace the root cause.
  • You want to set up recurring quality thresholds and alerting logic for a live data pipeline.
  • A data team needs to check whether a dataset is ready for production ML use.

Best for

Sales Development Rep, Revenue Operations

Format

Framework, Workflow, Analyzer, Tracker, Playbook
