Data Sources

Hi3 Water aggregates water asset data from federal agencies, state databases, and community directories. Here's exactly where our data comes from and how we process it.

USGS National Water Information System (NWIS)

Federal Agency
Visit source

The USGS NWIS is the nation's principal repository of water resources data. We ingest site metadata for springs (site type SP) and wells (site type GW) including location, elevation, aquifer codes, and site status. Over 24,000 sites currently indexed.

Data Types
  • Springs
  • Wells
  • Streamflow gauges
  • Groundwater monitoring sites
Coverage

All 50 US states

Update Frequency

Real-time monitoring, site data updated quarterly

Fields Ingested
  • Site name & ID
  • Latitude / Longitude
  • Elevation
  • Aquifer code
  • Site type
  • State & County

USGS Principal Aquifers Map

Federal Dataset
Visit source

The Principal Aquifers dataset provides polygon boundaries for all major aquifer systems in the United States. We store over 4,600 boundary polygons with rock type classification (sandstone, carbonate, igneous/metamorphic, sand & gravel), enabling the Aquifer Boundaries map overlay.

Data Types
  • Aquifer boundary polygons
  • Rock type classification
  • Geological formation data
Coverage

Continental US, Hawaii, Puerto Rico, US Virgin Islands

Update Frequency

Static dataset (updated 2023)

Fields Ingested
  • Aquifer name & code
  • Rock type
  • Boundary polygon (MultiPolygon)
  • Centroid coordinates

EPA Safe Drinking Water Information System (SDWIS)

Federal Agency
Visit source

The EPA SDWIS database tracks over 160,000 public water systems nationwide. We ingest system locations, water source classifications (groundwater vs surface water), population served counts, and link to violation records. Coordinates are geocoded from city/state data using the OpenStreetMap Nominatim service.

Data Types
  • Public water systems
  • Water source type
  • Population served
  • Violations
Coverage

All US states and territories

Update Frequency

Quarterly updates

Fields Ingested
  • System name & PWSID
  • Water source type
  • Population served
  • City / State / County
  • Geocoded coordinates

FindASpring.org

Community Directory
Visit source

FindASpring is a community-maintained directory of natural springs. We scrape listings with full robots.txt compliance and a 2-second rate limit between requests. Coordinates are extracted from listing pages using multiple strategies (embedded maps, text parsing, geocoding). All scraped data is validated and deduplicated against existing records.

Data Types
  • Natural springs
  • Community water sources
  • Spring descriptions
Coverage

North America (community-contributed)

Update Frequency

Ongoing community submissions

Fields Ingested
  • Spring name
  • Description
  • Coordinates
  • State / Region
  • Community notes

Ingestion Pipeline

Every data source goes through our 5-step pipeline before appearing on the platform.

1. Fetch

Raw data retrieved from source API or scraped from directory

2. Validate

Coordinates, names, and data fields normalized and checked

3. Deduplicate

Haversine distance + fuzzy name matching to prevent duplicates

4. Score

Water Intelligence Score (WIS) computed across 5 dimensions

5. Store

Asset saved to PostGIS database with full metadata

Data Quality Tiers

Every asset is assigned a data quality tier based on the depth and verification of available data.

Gold

Field verified + lab tested water quality

Silver

Reported data with partial verification

Bronze

User-reported or community-sourced

Unverified

Pending review and verification

Know a data source we should integrate? Have corrections for existing data?