Data quality, in 90 seconds. No code.
Connect Postgres. Get a six-dimension scorecard. Fix the obvious stuff automatically. The platform Collibra wishes it could ship.
Built on real algorithms, not vibes
Peer-reviewed methods. Published sources. No black boxes.
HyperLogLog cardinality
Flajolet et al., 2007
Estimate distinct values in billions of rows with <2% error.
t-digest quantiles
Dunning & Ertl, 2019
Accurate median and percentile estimates with bounded memory.
Mahalanobis outlier detection
Mahalanobis, 1936
Catch multivariate outliers accounting for correlations.
Kolmogorov-Smirnov drift
Kolmogorov, 1933
Detect distribution drift between profiling runs.
MinHash + Jaccard similarity
Broder, 1997
Find near-duplicate columns across your entire warehouse.
Benford's law accuracy
Benford, 1938
Flag financial columns that deviate from natural digit distributions.
Isolation Forest anomalies
Liu et al., 2008
Unsupervised tabular anomaly detection without labeled data.
Six dimensions, one scorecard
Each dimension scored 0–100. One overall DQ score for your exec dashboard.
Completeness
Null rates, missing values, required fields.
Uniqueness
Duplicate detection via HyperLogLog sketches.
Validity
Format checks, type enforcement, regex patterns.
Consistency
Cross-column and cross-table rule validation.
Integrity
Referential and relational constraint checks.
Accuracy
Statistical outliers and Benford's law analysis.
Two minds in one product
Executive mode for the room. Analyst mode for you.
Stata/SAS-grade plots: histograms, Q-Q, correlation matrices, Mahalanobis ellipses.
Tune it like a Jetta
Bring your own algorithm. Bring your own LLM. Bring your own rules.
Sandboxed Python plugins. BYOK AI. Permission-gated.
Simple, honest pricing
Pro
- 100M rows/mo
- 25 connections
- Unlimited rules
- Hourly profile
14-day trial included
Start trialHow DQ compares
| vs | DQ wins on | |
|---|---|---|
| Great Expectations | 0 lines of YAML to start | Full comparison → |
| Collibra | $99 vs $50,000+ | Full comparison → |
| Informatica | Connect in minutes, not quarters | Full comparison → |
| Atacama | Same auto-discovery, modern stack | Full comparison → |
| Soda | Includes catalog + lineage + remediation | Full comparison → |
| Monte Carlo | Prevention + remediation, not just alerts | Full comparison → |