Intercomparison of six national empirical models for PM2.5 air pollution in the contiguous US

12 December 2022, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Empirical models, previously called land-use regression (LUR), are used to understand and predict spatial variability in levels of outdoor air pollution at unmeasured locations, for example, to conduct health risk assessment, environmental epidemiology, or environmental justice analysis. Many methods are used to generate empirical models, yet almost no research compares models generated by separate research groups. We intercompare six national-scale empirical models for year-2010 concentrations of PM2.5 in the US, each generated by a different research group. Despite substantial differences in the statistical methods and input data used to build the models, our main finding is a relatively high degree of agreement among model predictions. For example, in pairwise intercomparisons, the average Pearson correlation coefficient is 0.87 (range: 0.84 to 0.92); the RMSD (root-mean-square-difference; units: μg/m3) is 1.1 on average (range: 0.8 to 1.4), or ~12% of the average concentration; and many best-fit lines are near the 1:1 line. The underlying reason for this agreement is likely that, while the methods and the independent variables differ among the models, in all cases the models are built using, and are calibrated to, the same information: publicly available measurement at US EPA regulatory monitoring stations. Findings here suggest that future improvements to national empirical models will come not from further refinements to the methods (e.g., more-advanced models) but from employing a fundamentally different set of observations, in addition to regulatory monitoring data.

Keywords

Land use regression
exposure assessment
air quality models
empirical model comparison
point-based models
gridded models

Supplementary materials

Title
Description
Actions
Title
Intercomparison of six national empirical models for PM2.5 air pollution in the contiguous US
Description
The supporting information include: Figures. S1 to S9
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.
Comment number 1, Tianjun Lu: Nov 18, 2023, 16:41

The final version has been published in the journal "Findings" and can be downloaded from this link: https://doi.org/10.32866/001c.89423. Please cite the final paper using "Bechle, Matthew J., Michelle L. Bell, Daniel L. Goldberg, Steve Hankey, Tianjun Lu, Albert A. Presto, Allen L. Robinson, et al. 2023. “Intercomparison of Six National Empirical Models for PM2.5 Air Pollution in the Contiguous US.” Findings, November. https://doi.org/10.32866/001c.89423."