全球变化知识资源中心

Knowledge Resources Center for Global Change


Email:
Passwd
验证:
	换一张
Have you forgotten your password? Stay signed in Log In

globalchange > 气候变化与战略

DOI:	10.1016/j.atmosenv.2019.117130
论文题名:	A comparison of statistical and machine learning methods for creating national daily maps of ambient PM2.5 concentration
作者:	Berrocal V.J.; Guan Y.; Muyskens A.; Wang H.; Reich B.J.; Mulholland J.A.; Chang H.H.
刊名:	Atmospheric Environment
ISSN:	1352-2310
出版年:	2020
卷:	222
语种:	英语
英文关键词:	Air quality ; Correlation methods ; Decision trees ; Information use ; Interpolation ; Land use ; Learning algorithms ; Least squares approximations ; Machine learning ; Mean square error ; Regression analysis ; Statistics ; Inverse distance weighting ; Linear regression models ; Machine learning methods ; Machine learning techniques ; Mean absolute deviations ; Root mean squared errors ; Spatial and temporal scale ; Support vector regression (SVR) ; Inverse problems ; air quality ; ambient air ; atmospheric pollution ; comparative study ; concentration (composition) ; epidemiology ; GIS ; machine learning ; methodology ; particulate matter ; pollution exposure ; statistical analysis ; air pollution ; air quality ; article ; artificial neural network ; geographic information system ; gravity model ; kriging ; land use ; least square analysis ; linear regression analysis ; prediction ; random forest ; season ; support vector machine
中文摘要:	A typical challenge in air pollution epidemiology is to perform detailed exposure assessment for individuals for which health data are available. To address this problem, in the last few years, substantial research efforts have been placed in developing statistical methods or machine learning techniques to generate estimates of air pollution at fine spatial and temporal scales (daily, usually) with complete coverage. However, it is not clear how much the predicted exposures yielded by the various methods differ, and which method generates more reliable estimates. In this paper, we aim to address this gap by evaluating a variety of exposure modeling approaches, comparing their predictive performance. Using PM2.5 in year 2011 over the continental U.S. as a case study, we generate national maps of ambient PM2.5 concentration using: (i) ordinary least squares and inverse distance weighting; (ii) kriging; (iii) statistical downscaling models, that is, spatial statistical models that use the information contained in air quality model outputs; (iv) land use regression, that is, linear regression modeling approaches that leverage the information in Geographical Information System (GIS) covariates; and (v) machine learning methods, such as neural networks, random forests and support vector regression. We examine the various methods’ predictive performance via cross-validation using Root Mean Squared Error, Mean Absolute Deviation, Pearson correlation, and Mean Spatial Pearson Correlation. Additionally, we evaluated whether factors such as, season, urbanicity, and levels of PM2.5 concentration (low, medium or high) affected the performance of the different methods. Overall, statistical methods that explicitly modeled the spatial correlation, e.g. universal kriging and the downscaler model, outperform all the other exposure assessment approaches regardless of season, urbanicity and PM2.5 concentration level. We posit that the better predictive performance of spatial statistical models over machine learning methods is due to the fact that they explicitly account for spatial dependence, thus borrowing information from neighboring observations. In light of our findings, we suggest that future exposure assessment methods for regional PM2.5 incorporate information from neighboring sites when deriving predictions at unsampled locations or attempt to account for spatial dependence. © 2019 Elsevier Ltd
Citation statistics:
资源类型:	期刊论文
标识符:	http://119.78.100.158/handle/2HF3EXSE/160598
Appears in Collections:	气候变化与战略

Files in This Item:

There are no files associated with this item.

作者单位:

University of California - Irvine, Department of Statistics, Irvine, CA, United States; University of Nebraska, Department of Statistics, Lincoln, NE, United States; Lawrence Livermore National Laboratory, Livermore, CA, United States; SAS, CaryNC, United States; Georgia Institute of Technology, Atlanta, United States; Emory University, Department of Biostatistics and Bioinformatics, Atlanta, United States

Recommended Citation:

Berrocal V.J.,Guan Y.,Muyskens A.,et al. A comparison of statistical and machine learning methods for creating national daily maps of ambient PM2.5 concentration[J]. Atmospheric Environment,2020-01-01,222

Service
	Recommend this item
	Sava as my favorate item
	Show this item's statistics
	Export Endnote File
Google Scholar
	Similar articles in Google Scholar
	[Berrocal V.J.]'s Articles
	[Guan Y.]'s Articles
	[Muyskens A.]'s Articles
百度学术
	Similar articles in Baidu Scholar
	[Berrocal V.J.]'s Articles
	[Guan Y.]'s Articles
	[Muyskens A.]'s Articles
CSDL cross search
	Similar articles in CSDL Cross Search
	[Berrocal V.J.]‘s Articles
	[Guan Y.]‘s Articles
	[Muyskens A.]‘s Articles
Related Copyright Policies
Null
收藏/分享

所有评论 (0)

[发表评论/异议/意见]

暂无评论

评论
权益异议
反馈意见

评注功能仅针对注册用户开放，请您登录

您对该条目有什么异议，请填写以下表单，管理员会尽快联系您。
内容：
Email：	*
单位：
验证码：	刷新

您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标题：	*
内容：
Email：	*
验证码：	刷新

全球变化知识资源中心

用户服务

站点统计

相关链接

联系我们