NHTSA Health
Operator dashboard for the NHTSA harvesting subsystem. Read-only Phase 0 scaffold; live data wires in Phases 1–5.
Data Types in Catalog
21
9 flat files · 8 JSON APIs · 4 S3 PDF listings
Rolling Window
2017–2026
10 model years (WINDOW_YEARS=10)
Open Root Causes
5
Of 5 tracked Stage 1.7 root causes
Last Audit
—
Audit freshness pending Phase 1
Root Causes (Stage 1.7)
URL Drift (stale flat-file URLs)
6 confirmed drift cases as of Stage 1.5. Decodo HEAD/GET parity pending Phase 2.
Akamai Stale ETag (HEAD blocked)
Datacenter IP block confirmed; residential proxy bypass validated. Wire-up pending Phase 2.
Circuit Breaker Stuck Open
Per-year breakers + resumable runs pending Phase 4.
Distributor Never Run
ekis-zero/source/nhtsa/ + ekis-one/source/nhtsa/ are empty. Distributor cron pending Phase 3.
Operator Visibility Gap
This dashboard is the Phase 0 fix. Live data wires in Phases 1–5.
10-Year Rolling Window Enforcement
Source-of-truth window from @ekis/config. Vehicles outside this window are out of scope for harvest, ingest, and MILA retrieval.
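As a rough illustration of the window check, the sketch below derives the 2017–2026 range from WINDOW_YEARS=10. It assumes the window is anchored on its newest model year. The local `WINDOW_YEARS` constant and the names `rollingWindow` and `isInWindow` are hypothetical stand-ins, not the actual @ekis/config exports.

```typescript
// Hypothetical stand-in for the value that @ekis/config provides.
const WINDOW_YEARS = 10;

interface ModelYearWindow {
  start: number; // first in-scope model year (inclusive)
  end: number;   // last in-scope model year (inclusive)
}

// Assumption: the window ends at the newest model year and extends
// backwards so that it covers exactly `windowYears` model years.
function rollingWindow(
  newestModelYear: number,
  windowYears: number = WINDOW_YEARS,
): ModelYearWindow {
  if (!Number.isInteger(windowYears) || windowYears < 1) {
    throw new RangeError("windowYears must be a positive integer");
  }
  return { start: newestModelYear - windowYears + 1, end: newestModelYear };
}

// True when a vehicle's model year falls inside the window (bounds inclusive).
function isInWindow(modelYear: number, win: ModelYearWindow): boolean {
  return modelYear >= win.start && modelYear <= win.end;
}

const currentWindow = rollingWindow(2026);
console.log(currentWindow);                   // { start: 2017, end: 2026 }
console.log(isInWindow(2016, currentWindow)); // false
```

Keeping both bounds inclusive matches the "10 model years" count shown for 2017–2026.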
URL Drift
Decodo HEAD/GET parity sweep + URL canonicalization — pending Phase 2.
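One possible shape for the parity check is a pure comparison of what a HEAD and a GET returned for the same URL. This is a sketch only: the `ResponseSnapshot` type, the choice of compared fields, and the sample values are assumptions for illustration, not the planned Phase 2 implementation.

```typescript
// Hypothetical snapshot of the response fields a parity sweep might compare.
interface ResponseSnapshot {
  status: number;
  etag: string | null;
  contentLength: number | null;
}

type ParityField = "status" | "etag" | "contentLength";

// Returns the fields on which the HEAD and GET responses disagree.
// An empty array means the two requests are in parity.
function parityMismatches(
  headResp: ResponseSnapshot,
  getResp: ResponseSnapshot,
): ParityField[] {
  const fields: ParityField[] = ["status", "etag", "contentLength"];
  return fields.filter((f) => headResp[f] !== getResp[f]);
}

// Illustrative values only: a HEAD that is refused while the GET succeeds.
const headSample: ResponseSnapshot = { status: 403, etag: null, contentLength: null };
const getSample: ResponseSnapshot = { status: 200, etag: '"abc"', contentLength: 1024 };
console.log(parityMismatches(headSample, getSample));
// [ 'status', 'etag', 'contentLength' ]
```

Separating the comparison from the network calls keeps the check testable without a proxy in the loop.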
Akamai ETag Freshness
Residential-proxy ETag refresh job — pending Phase 2.
Source Library Drift
R2 prefix counts across the three tiers — pending Phase 3 distributor unfreeze.
Per-Data-Type Circuit Breakers
IR-6 circuit breaker state per data type — pending Phase 4.
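To make "state per data type" concrete, here is a minimal sketch of a keyed circuit breaker in which each key trips independently. It is not the IR-6 design. The class name, the threshold and cooldown parameters, and the key strings are all hypothetical.

```typescript
type BreakerState = "closed" | "open" | "half-open";

interface Breaker {
  state: BreakerState;
  failures: number;  // consecutive failures since the last success
  openedAt: number;  // ms timestamp of the last trip, 0 if never tripped
}

// One independent breaker per key (for example, one per data type).
class BreakerRegistry {
  private readonly breakers = new Map<string, Breaker>();
  private readonly failureThreshold: number;
  private readonly cooldownMs: number;

  constructor(failureThreshold: number, cooldownMs: number) {
    this.failureThreshold = failureThreshold;
    this.cooldownMs = cooldownMs;
  }

  private entry(key: string): Breaker {
    let b = this.breakers.get(key);
    if (!b) {
      b = { state: "closed", failures: 0, openedAt: 0 };
      this.breakers.set(key, b);
    }
    return b;
  }

  // Reports the state for one key, moving open -> half-open after the cooldown
  // so that a breaker cannot stay open indefinitely.
  stateOf(key: string, now: number): BreakerState {
    const b = this.entry(key);
    if (b.state === "open" && now - b.openedAt >= this.cooldownMs) {
      b.state = "half-open";
    }
    return b.state;
  }

  // Trips the breaker at the threshold, or immediately if a half-open probe fails.
  recordFailure(key: string, now: number): void {
    const b = this.entry(key);
    b.failures += 1;
    if (b.state === "half-open" || b.failures >= this.failureThreshold) {
      b.state = "open";
      b.openedAt = now;
    }
  }

  // Any success closes the breaker and resets its failure count.
  recordSuccess(key: string): void {
    const b = this.entry(key);
    b.state = "closed";
    b.failures = 0;
  }
}

const registry = new BreakerRegistry(3, 60_000);
for (let i = 0; i < 3; i++) registry.recordFailure("flat-file:recalls", 0);
console.log(registry.stateOf("flat-file:recalls", 1_000)); // open
console.log(registry.stateOf("json-api:recalls", 1_000));  // closed
```

Passing `now` in explicitly, rather than reading the clock inside the class, makes the cooldown behavior straightforward to test.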
Audit Truncation
Detect bimodal audit sizes (4,039-byte truncated vs. ~19 KB full). Pending Phase 1.
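A size-based check along these lines could flag truncated audits. The two reference sizes come from the figures above. The 25% tolerance, the 1,024-byte reading of "KB", and the function name are assumptions made for this sketch.

```typescript
const TRUNCATED_AUDIT_BYTES = 4039;        // truncated size reported on this dashboard
const FULL_AUDIT_BYTES_APPROX = 19 * 1024; // "~19KB"; 1 KB = 1024 bytes is an assumption

type AuditSizeClass = "truncated" | "full" | "unknown";

// Classifies an audit file by size alone.
// `tolerance` is a hypothetical margin below the approximate full size.
function classifyAuditSize(bytes: number, tolerance: number = 0.25): AuditSizeClass {
  if (bytes === TRUNCATED_AUDIT_BYTES) return "truncated";
  const fullLowerBound = FULL_AUDIT_BYTES_APPROX * (1 - tolerance);
  if (bytes >= fullLowerBound) return "full";
  return "unknown"; // neither mode; worth a manual look
}

console.log(classifyAuditSize(4039));  // truncated
console.log(classifyAuditSize(19456)); // full
console.log(classifyAuditSize(9000));  // unknown
```

Returning "unknown" for sizes that fit neither mode avoids silently treating a new failure shape as a healthy audit.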
Live Pipeline Heartbeat
PL-3 real-time progress feed for active pipeline runs — pending Phase 1.
Data Type Contract Matrix (Stage 1.8)
| Channel | Data Type | Contract | Rows / Files | Last Validated |
|---|---|---|---|---|
| flat-file | Recalls (FLAT_RCL.zip) | Pending data | — | — |
| flat-file | Complaints (FLAT_CMPL.zip) | Pending data | — | — |
| flat-file | Investigations (FLAT_INV.zip) | Pending data | — | — |
| flat-file | TSBs / Mfr Communications (chunked archives) | Pending data | — | — |
| flat-file | Manufacturer Communications bulk | Pending data | — | — |
| flat-file | Recall Completion Rates | Pending data | — | — |
| flat-file | Data Dictionaries | Pending data | — | — |
| flat-file | vPIC Vehicle List (Lite) | Pending data | — | — |
| flat-file | Recall Communications bulk PDF | Pending data | — | — |
| json-api | Recalls by Vehicle | Pending data | — | — |
| json-api | Complaints by Vehicle | Pending data | — | — |
| json-api | Investigations API | Pending data | — | — |
| json-api | Safety Ratings (NCAP) | Pending data | — | — |
| json-api | Manufacturer Communications | Pending data | — | — |
| json-api | CSSI Station (Car Seat Inspection) | Pending data | — | — |
| json-api | vPIC Vehicle Catalog | Pending data | — | — |
| json-api | Recalls by Campaign Number | Pending data | — | — |
| s3-pdf-listing | Recall PDFs (S3 listing) | Pending data | — | — |
| s3-pdf-listing | TSB PDFs (S3 listing) | Pending data | — | — |
| s3-pdf-listing | Investigation PDFs (S3 listing) | Pending data | — | — |
| s3-pdf-listing | Mfr Comm PDFs (S3 listing) | Pending data | — | — |
Per-type contracts (NHV-1..7 validation gates) are written to docs/harvesting/nhtsa/data-types/ during Phases 2–5.