BigQuery Data Matching Analysis

Archetype vs Attom Tax Assessor Data Comparison

⚠️ PRELIMINARY RESULTS - VALIDATION REQUIRED

Executive Summary

Total Matched Addresses
660,108
Unique Properties
Match Rate
64.6%
Of Archetype Addresses
Full Name Matches
264,489
Highest Confidence
Total Records Processed
3.75M
Combined Dataset

Match Distribution Analysis

Full Name + Address
264,489
40.1%
Avg: 2.45 records/address
Last Name + Address
67,220
10.2%
Avg: 2.25 records/address
Address Only
328,399
49.7%
Avg: 1.98 records/address

Match Quality Distribution

Archetype Dataset Overview

Total Records
2.05M
Unique Addresses
1.02M
Avg Owners/Address
2.0
Max Owners
20

Attom Dataset Overview

Total Records
1.71M
Unique Properties
1.12M
Unique Addresses
923K
Avg Owners/Property
1.53

Data Quality Insights

Attom Properties with Owner Data
99.7%
Attom Properties without Owners
3,357
Multi-Owner Properties (Archetype)
551,392
Multi-Owner Properties (Attom)
482,736
⚠️ Data Quality Alert: Potential Duplicate Properties in Attom

Attom dataset shows 1,118,062 unique property IDs (ATTOMID) but only 922,969 unique addresses. This discrepancy of ~195,000 records suggests potential duplicate property records or multiple units sharing the same standardized address. Further investigation recommended to identify root cause.

Dataset Coverage Comparison