Data validation example
Background
This page provides a simple example that visualizes the results of a GitHub Actions workflow when downloading daily chlorophyll-a (Chla) data from ERDDAP within territorial waters of US states bordering the Gulf of Mexico. Data validation is performed across pixels of the raster product within these waters, as well as for the average values per state. Validation results are summarized in reports based on user-defined conditional statements.
Given the lag in data availability for the nesdisVHNSQchlaDaily Chla product (9 days), the data accessed in this example will be accessed from 2 weeks prior (i.e., 2025-09-17).
This example process involves:
- Approximating territorial waters by buffering a spatial layer of state boundaries
- Access the Chla data for the region of interest
- Masking the Chla data to territorial state waters in GoM
- Calculating mean Chla values per state
- Validating pixel-level and state-level Chla data to check for issues
- Generate reports from data validation
Report output
Report for pixel-level data
Report for state-level data
Pointblank Validation | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
[2025-10-01|18:53:45]
data frame Gulf state means for daily Chl-a dataWARN 0.20 STOP 0.25 NOTIFY 0.20
|
|||||||||||||
STEP | COLUMNS | VALUES | TBL | EVAL | UNITS | PASS | FAIL | W | S | N | EXT | ||
1 | col_vals_not_null()
|
|
— | ✓ | 5 |
5 1 |
0 0 |
○ | ○ | ○ | — | ||
2 | col_vals_between()
|
|
|
✓ | 5 |
5 1 |
0 0 |
○ | ○ | ○ | — | ||
3 | col_vals_lt()
|
|
|
✓ | 5 |
5 1 |
0 0 |
○ | ○ | ○ | — | ||
2025-10-01 18:53:45 EDT < 1 s 2025-10-01 18:53:46 EDT |