About Our Data

Our Commitment to Truth

Every statistic on CompareMOTHistory.com is derived from real UK MOT test data. We publish our methodology, acknowledge our limitations, and invite scrutiny. Truth > Clicks.


The Dataset

Source

What We Have

✅ Test date, result, mileage, vehicle age
✅ Make, model, fuel type, engine capacity
✅ Test type (initial vs retest)
✅ Postcode area (anonymized for privacy)

What We Don't Have

❌ Specific failure reasons (not included in this dataset)
❌ Advisory details
❌ Individual vehicle identifiers
❌ Owner information

Important: Our analyses focus on pass/fail patterns, mileage norms, and retest rates. We cannot analyze specific component failures with this dataset.


Data Quality

Completeness

Validation

Every record passes through:

  1. Schema validation (Pydantic models enforce data types)
  2. Range checks (e.g., mileage 0-500,000, age 0-50 years)
  3. Consistency checks (test date > first use date)

Records failing validation are excluded from analysis (< 0.1% of raw data).


Methodology

How We Calculate Pass Rates

Pass Rate = (Tests with result 'P' or 'PRS') / Total Tests × 100

Sample Size Confidence

We apply strict thresholds for publishing statistics:

Sample Size Confidence We Publish?
< 50 tests Low ❌ No
50-500 tests Medium ⚠️ With caveat
500+ tests High ✅ Yes
2,000+ tests Very High ✅ Yes, featured

Example: Our Ford Focus guide is based on 7,616 tests - very high confidence.

Age Cohorts

Vehicles are grouped by age at time of MOT test:

Pass rates are calculated separately for each cohort.

Mileage Benchmarks

For each model/age combination, we report:

This allows you to see where a specific car sits relative to its peers.


Limitations & Caveats

1. Single-Month Snapshot

Our current analysis covers January 2024 only. This is a limitation. Seasonal patterns may exist (e.g., more retests after winter salt corrosion).

Mitigation: We will expand to full-year 2024 data when available and update all statistics.

2. Selection Bias

MOT tests are required at 3 years, then annually. Newer cars (<3 years) are not represented. Our data skews towards older vehicles.

3. Geographic Variance

Postcode-level pass rates may be influenced by:

We report geographic trends but urge caution in interpreting causality.

4. No Causal Inference

Correlation ≠ Causation. If Model A has a higher pass rate than Model B, we cannot definitively say Model A is "better engineered." It could reflect owner demographics, maintenance habits, or usage patterns.

We present data; we invite you to draw informed conclusions.


For Researchers & Data Journalists

Reproducibility

All our visualizations and statistics are generated from code. Our process:

  1. Raw data ingestion (data_ingest.py)
  2. Cleaning & validation (clean_transform.py)
  3. Feature engineering (features.py)
  4. Statistical analysis (analytics.py)
  5. Visualization (viz.py)

Code: Available on request for verification purposes.

Downloadable Data

In the spirit of transparency, we offer our aggregated, anonymized dataset for download:

⬇️ Download Aggregated Data (CSV)

Pass rates by make/model/age, mileage benchmarks, retest rates
Anonymized, aggregated only - no individual vehicle data

What's included:

What's NOT included:

Citation

If you use our data or insights, please cite:

CompareMOTHistory.com (2025). UK MOT Pass Rates Analysis: January 2024 Dataset. Retrieved from https://comparemothistory.com/about-data/


Our Commitment

We will:

We won't:

The Honest Mechanic doesn't just show you the data - we show you how we got it.


Questions?

Technical questions: [email protected]
Data requests: [email protected]
Press inquiries: [email protected]


Ready to Check a Specific Car?

Our cohort data shows the big picture. For the specific vehicle you're considering, check its individual MOT history:

Check Individual MOT History →

Free: MOT history, advisories, mileage
Premium (£9.49): Finance, write-off, stolen checks