Key Differentiators
1. Very Clean Data
Our addresses are standardized to USPS format and geotagged with coordinates. We cross-reference against multiple authoritative datasets:
- National Address Dataset from the US Census
- Open Address dataset
- Simple Maps
- ESRI
2. Comprehensive Contractor Data
Our contractor data is derived directly from permit metadata, including contractor grouping that links contractors under the same parent organization. This helps you understand relationships between:
- Local branches and regional offices
- DBA names and parent companies
- Related business entities
3. Modern Data Infrastructure
We use AI and machine learning with a documented 98% accuracy rate validated by construction industry experts. Our rigorous data labeling process involves:
- Multiple independent annotators
- Manual review for divergent responses
- Golden datasets for benchmarking
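As a rough illustration of how these three steps can fit together, here is a minimal Python sketch. It is not our production pipeline: the function names `resolve_label` and `accuracy_against_golden`, the rule that only unanimous labels are accepted automatically, and the sample labels are all assumptions made for the example.

```python
from collections import Counter


def resolve_label(annotations):
    """Combine labels from independent annotators (hypothetical helper).

    Returns (label, needs_review). A label is accepted only when every
    annotator agrees; any divergence is flagged for manual review.
    The unanimity rule is an assumption for this sketch.
    """
    if not annotations:
        raise ValueError("at least one annotation is required")
    counts = Counter(annotations)
    label, votes = counts.most_common(1)[0]
    if votes == len(annotations):
        return label, False
    return None, True


def accuracy_against_golden(predictions, golden):
    """Share of golden-dataset items whose predicted label matches.

    Both arguments map an item ID to a label; items missing from
    `predictions` count as misses.
    """
    if not golden:
        raise ValueError("golden dataset is empty")
    matched = sum(1 for key, label in golden.items() if predictions.get(key) == label)
    return matched / len(golden)
```

In this sketch, three annotators who all answer "roofing" produce an accepted label, while a two-to-one split returns no label and sets the review flag, which corresponds to the manual review step for divergent responses.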
4. Sophisticated Deduplication
Our system maintains a unique identifier for each permit and updates records as permit statuses change, rather than creating duplicate entries. This means:
- Consistent permit IDs throughout the lifecycle
- No duplicate records from status updates
- Reliable tracking across data refreshes
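The update-in-place behavior described above is commonly called an upsert. The following Python sketch shows the general idea under simplifying assumptions: the in-memory dictionary `store`, the function name `upsert_permit`, and the field names `permit_id` and `status` are hypothetical and chosen only for illustration, not taken from our schema.

```python
def upsert_permit(store, record):
    """Insert a permit or update the existing one, keyed by permit ID.

    `store` is a plain dict standing in for a database table
    (an assumption for this sketch). A status change overwrites fields
    on the existing record instead of adding a second entry.
    """
    permit_id = record["permit_id"]
    existing = store.get(permit_id)
    if existing is None:
        # First time this permit is seen: create the record.
        store[permit_id] = dict(record)
    else:
        # Later refresh: merge new field values into the same record.
        existing.update(record)
    return store[permit_id]


store = {}
upsert_permit(store, {"permit_id": "P-001", "status": "applied"})
upsert_permit(store, {"permit_id": "P-001", "status": "issued"})
```

After both calls, `store` still holds a single entry for `P-001`, and its status reflects the most recent refresh.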
Learn more about our data quality approach in our data labeling process article.
Data Sources
We source data directly from jurisdictions through:
- Direct relationships with local governments
- Integration with online permitting portals
- Public records requests where needed
