Data

County-Direct vs Aggregated Property Data: Why Your Data Source Matters

PropIntel Team·PropIntel Research·April 14, 2026·8 min read

The property data industry has a supply chain that most investors never think about. Data moves from county assessors to wholesale aggregators to platform companies to end users, with each step introducing latency, field loss, and normalization artifacts. For most research tasks, this is invisible and inconsequential. For serious investors relying on data to drive acquisitions, it is a meaningful source of edge or, depending on which platform you use, a meaningful source of risk.

How Property Data Actually Moves

Start at the source: county assessors and appraisers. Every county in the United States maintains a public database of property characteristics — owner of record, legal description, assessed value, building attributes (year built, square footage, bedroom and bathroom count), land use classification, and usually a lot more, depending on the county's CAMA (Computer Assisted Mass Appraisal) system.

Counties update their data on various cadences. Urban counties with large staff often refresh daily or weekly. Rural counties may update quarterly or on the annual assessment cycle. But the data, when it leaves the assessor's office, is as current as the county's own systems.

Step 1 — Aggregator: Companies like ATTOM, CoreLogic, and Black Knight license this data from counties directly, or in some cases from state-level agencies that aggregate county data. These companies normalize the raw county data into a standardized national schema. The normalization is necessary for a nationwide product — you cannot ship 3,100 different county schemas to enterprise customers. But normalization has a cost.

Step 2 — Platform: PropStream, Privy, DealMachine, and other investor-facing platforms license data from ATTOM, CoreLogic, or similar aggregators. They build search UIs, list management, and skip-tracing tools on top of this licensed data. The data they display is already one step removed from the county source, and now it is two.

Step 3 — Investor: You search a platform, pull a list, and make decisions. How old is the data you are looking at? How many fields were dropped in aggregator normalization? How many addresses were altered by standardization?

What Gets Lost at Each Step

Field Stripping

Aggregators normalize to a standard national schema with a fixed set of fields. If a county CAMA system carries 140 distinct fields — and many do — the aggregator maps the 40 or so that fit their schema and discards the rest.

What gets discarded varies, but common casualties include:

Building permit history: When the county assessor's system tracks permits, that data rarely survives aggregation. Knowing that a property pulled a permit for an addition in 2019 is relevant to value estimation and may indicate an unreported room count.
Code enforcement and violation records: Counties with integrated code enforcement data do not typically push violations into parcel feeds. Aggregators do not acquire this data separately in most cases.
Environmental and special assessment liens: Special assessments, PACE liens, and environmental notices of violation are recorded at the county level but are almost never represented in aggregated parcel data.
Specific CAMA fields: Effective age, condition grade, functional depreciation, economic depreciation — the fields that a trained appraiser uses — are typically not in aggregated schemas. Only some counties populate these, but when they do, they are worth having.
Probate and estate indicators: Some county clerk systems flag properties where the owner of record is a decedent estate. Aggregators do not carry this.

Latency

Aggregators do not update in real time. Most major aggregators refresh their county data quarterly or, in some cases, less frequently. For owner-of-record information, this means a property that sold six months ago may still show the prior owner in an aggregated database.

This creates specific failure modes for investors:

You pull a list of absentee owners and contact a property. The current owner lives there and has no interest in selling — they bought it from the absentee owner eight months ago.
You identify a distressed property based on delinquent tax data that was current six months ago. The owner paid the taxes three months ago — the distress signal is stale.

County-direct data does not eliminate these problems entirely — counties have their own update cadences — but it eliminates the aggregator's additional lag on top of the county's lag.

Normalization Artifacts

Address standardization is one of the less-discussed costs of aggregation. Aggregators run raw county address strings through standardization engines (USPS validation, CASS certification, and similar). This is appropriate for building a national mailing database. But it introduces changes.

"SW 1/4 OF NE 1/4 OF LOT 3 BLK 4" becomes a standardized parcel address. "CO RD 441" becomes "County Road 441." Some of these changes are harmless. Some introduce errors, particularly for rural addresses, fractional legal descriptions, and multi-parcel assemblages.

When you are doing precise due diligence on a specific parcel — cross-referencing against recorded documents, title chains, or plat maps — working with county-original address strings reduces the risk of a normalization artifact sending you in the wrong direction.

What County-Direct Data Means in Practice

County-direct means building an adapter for each county's actual data endpoint: their ArcGIS REST API, their bulk download portal, their FTP server, or their direct database export. This is not a small undertaking — there are over 3,100 counties in the United States, each with its own system, schema, and access method. It is, however, the only way to guarantee that the data you are working with is what the county actually has, not a filtered and time-delayed derivative.

County-direct access gives you:

Full field sets: Whatever the county CAMA system tracks, you can request.
Source-native freshness: Data is as current as the county's last update, with no additional aggregator lag.
Direct schema access: No guessing about what a normalized field name maps to in the original county system.
Access to non-parcel county data: Code enforcement, building permits, official records (liens, lis pendens, foreclosure filings) are available from county clerk and code enforcement systems, separately from the parcel database.

Fields Available County-Direct That Aggregators Do Not Carry

The following categories of data are generally available from county systems but are not represented in aggregated national property databases:

Code enforcement violations: Dozens of counties and municipalities publish their code enforcement citation databases through Socrata open data portals or their own GIS download pages. Miami-Dade, Orlando, Baltimore City, and Montgomery County (Maryland) all do this. These records tell you whether a property has open violations, the nature of the violation (structural, electrical, occupancy), and the current status of remediation.

Lis pendens and foreclosure filings: County clerks of court maintain these records in their official records indices. This data is available for direct query or bulk download in many jurisdictions, with case numbers, plaintiff (lender) identity, and filing dates.

Tax delinquency: County tax collectors maintain delinquent rolls that are public record. Aggregators do not typically carry this data — it requires a separate data acquisition relationship with each county tax collector.

Environmental liens: Notices of environmental violation, LUST (leaking underground storage tank) disclosures, and Brownfields designations are available from state environmental agencies and EPA databases. They are rarely in aggregated parcel products.

Building permits: Many counties make permit history available through their building department portals. This is structurally valuable data — it tells you what improvements were made and when, often revealing square footage or bedroom additions that the assessor has not yet reflected in the official record.

Data Freshness: A Concrete Example

Consider a property in Hillsborough County, Florida that falls two years delinquent on taxes in March 2026. The county tax collector's delinquent roll reflects this immediately. If you are pulling data county-direct, you see this in the current roll.

An aggregated platform that refreshes quarterly will not reflect this delinquency until July or later, depending on their update cycle. For competitive lead generation, that is a four-month window during which a county-direct investor can reach the owner before the delinquency surfaces on aggregated platforms and competitors start marketing to the same list.

The PropIntel Approach

PropIntel maintains individual county adapters for every data source in the platform — not a single license from ATTOM or any other aggregator. Each adapter pulls directly from the county's native endpoint: ArcGIS REST, FTP bulk download, direct HTTP, or county portal API. Records carry timestamps from the source system, so you can see exactly when the county last updated each record.

This extends to distress data. Lis pendens records are pulled from county clerk official records systems. Tax delinquency is pulled from tax collector portals. Code enforcement violations are pulled from county and municipal code enforcement databases. None of this data passes through an aggregator.

For a direct comparison of what this means relative to specific aggregated platforms, see the PropStream comparison and the ATTOM comparison pages. For a broader overview of how PropIntel handles data across the 300+ county adapters currently in production, see the Regrid comparison.

What This Means for Investors

For investors pulling lists for direct mail or cold outreach, the difference between quarterly- refreshed and weekly-refreshed data may be material or may be negligible, depending on how competitive your market is and how precisely you are targeting distress signals.

For investors doing due diligence on specific properties before making an offer, the difference between aggregated field sets and county-native field sets can be significant. Building permit history, code enforcement status, and environmental liens are the kinds of fields that surface problems before closing, not after.

The argument for county-direct data is not that aggregated platforms are useless — they are not. It is that for investors who want the highest-fidelity picture of a property and its owner's situation, working with county-original data eliminates the distortions introduced by a supply chain that was not designed for the kind of granular, multi-signal analysis that serious acquisitions require.