Three-engine separation
Surface Data Engine: ingestion + multi-API aggregation. Listing URL parsing (Zillow __NEXT_DATA__ JSON, MLS Bridge in Phase 3), property records (ATTOM-first; Estated has been absorbed into ATTOM, so the ATTOM-first choice stays forward-compatible), comp pulls.
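A minimal sketch of the __NEXT_DATA__ extraction step, assuming the listing page follows the usual Next.js convention of embedding page state in a script element with id "__NEXT_DATA__". The names extract_next_data and _NextDataParser are hypothetical. The sketch only returns the decoded JSON and makes no assumption about which keys Zillow places inside it, since that layout is not documented here and may change.

```python
import json
from html.parser import HTMLParser


class _NextDataParser(HTMLParser):
    """Collects the text inside <script id="__NEXT_DATA__">."""

    def __init__(self):
        super().__init__()
        self._inside = False
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("id") == "__NEXT_DATA__":
            self._inside = True

    def handle_endtag(self, tag):
        if tag == "script":
            self._inside = False

    def handle_data(self, data):
        # Script bodies can arrive in several pieces, so accumulate them.
        if self._inside:
            self.chunks.append(data)


def extract_next_data(html):
    """Return the decoded __NEXT_DATA__ payload, or None if it is absent."""
    parser = _NextDataParser()
    parser.feed(html)
    parser.close()
    raw = "".join(parser.chunks).strip()
    return json.loads(raw) if raw else None
```

Returning None rather than raising when the script element is missing lets the caller fall back to another source (for example the property-records API) without special error handling.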
Diagnostic Engine: CV + NLP condition inference. Listing-photo analysis for build / design quality and interior / exterior style, listing-text NLP for condition signals, lead paint compliance flags for pre-1978 listings (new EPA standard effective Jan 12, 2026).
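The pre-1978 lead paint flag is the one fully rule-based piece of this engine, so it can be sketched directly. The function name, the three return values, and the choice to report a missing build year as "unknown" rather than "clear" are all assumptions made for illustration.

```python
from typing import Optional

# Listings built before this year get the lead paint compliance flag.
LEAD_PAINT_CUTOFF_YEAR = 1978


def lead_paint_flag(year_built: Optional[int]) -> str:
    """Return "flag", "clear", or "unknown" for a listing's build year."""
    if year_built is None:
        # Missing records are surfaced explicitly instead of assumed safe.
        return "unknown"
    if year_built < LEAD_PAINT_CUTOFF_YEAR:
        return "flag"
    return "clear"
```

Keeping "unknown" as a distinct state means the risk flag composition downstream can decide how to treat listings with no recorded build year.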
Deterministic Engine: cross-analysis synthesis + scoring. ARV (3-6 comps within 0.5 mi, sold in the last 90-180 days, weighted top 30-40%, drop high / low as outliers, never adjust a comp by more than 25% of its price), rehab band (Light $15-25/sqft, Medium $50-75/sqft, Heavy $90-135+/sqft; 15% contingency standard), profit math, risk flag composition, headline verdict.
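The ARV rules and rehab bands above can be sketched as follows. This is an illustration under stated assumptions, not the engine's actual implementation: the Comp fields and function names are hypothetical, 180 days is taken as the outer limit of the 90-180 day window, the nearest comps are preferred when more than six qualify, and the "weighted top 30-40%" rule is left out because its exact definition is not given here (a plain mean of the trimmed comps stands in for it).

```python
from dataclasses import dataclass

MAX_DISTANCE_MI = 0.5
MAX_AGE_DAYS = 180            # assumed outer limit of the 90-180d window
MIN_COMPS, MAX_COMPS = 3, 6
MAX_ADJUSTMENT_SHARE = 0.25   # never adjust a comp by more than 25%
CONTINGENCY = 0.15
REHAB_BANDS = {"light": (15, 25), "medium": (50, 75), "heavy": (90, 135)}  # $/sqft


@dataclass
class Comp:
    sale_price: float
    distance_mi: float
    days_since_sale: int
    adjustment: float = 0.0   # dollar adjustment vs. the subject property


def adjusted_price(comp):
    """Apply the comp's adjustment, clamped to +/-25% of its sale price."""
    cap = MAX_ADJUSTMENT_SHARE * comp.sale_price
    return comp.sale_price + max(-cap, min(cap, comp.adjustment))


def estimate_arv(comps):
    """Filter comps, drop the high and low adjusted prices, average the rest."""
    eligible = [c for c in comps
                if c.distance_mi <= MAX_DISTANCE_MI
                and c.days_since_sale <= MAX_AGE_DAYS]
    if len(eligible) < MIN_COMPS:
        raise ValueError("need at least 3 qualifying comps")
    eligible.sort(key=lambda c: c.distance_mi)   # assumption: prefer nearest
    prices = sorted(adjusted_price(c) for c in eligible[:MAX_COMPS])
    trimmed = prices[1:-1]                       # drop high / low outliers
    return sum(trimmed) / len(trimmed)


def rehab_range(sqft, band):
    """Return (low, high) rehab cost in dollars, including contingency."""
    low, high = REHAB_BANDS[band]
    factor = 1 + CONTINGENCY
    return (low * sqft * factor, high * sqft * factor)
```

With exactly three qualifying comps, trimming leaves only the middle price, so the estimate reduces to the median. The Heavy band is open-ended in the spec ($135+), so the upper figure returned for "heavy" should be read as a floor on the high end rather than a cap.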