PM2.5 Concentration Prediction for Air Pollution using Machine Learning Algorithms

Document Type : Original Article

Authors

1 Computer Science and Engineering Department, Faculty of Electronic Engineering in Menouf, Menoufia University

2 Computer Science and Engineering Department Faculty of Electronic Engineering Menouf, Egypt

Abstract

Air pollution is a phenomenon harmful to both human being existence as well as the ecological system. It is caused by the excess of some substances above a particular concentration in the atmosphere. Atmospheric particulate matter (APM) – or PM for short – threatens life because of its tiny size – diameter is up to 10 micrometers. Their danger comes from their ability to penetrate deeper inside the human respiratory system. (PM2.5) particulates are less than 2.5 micrometers and are more hazardous when compared to (PM10) coarse particles–10 micrometers in size. Hence, environmental agencies and governments seek to explore new methods to predict future air pollution. These endeavors mainly focus on mitigating environmental pollution and predicting pollutants concentrations to take enough precautions for citizens protection.  This paper presents various machine learning algorithms that predict PM2.5 concentration for the next hour. These algorithms are Support Vector Regression (SVR), Long Short-Term Memory (LSTM), Random Forest, and Extra Trees. Their performance is measured in Root Mean Square Error (RMSE), coefficient of determination R2, and duration in seconds. Extra Trees shows the least RMSE and the highest coefficient of determination R2.

Keywords


[1]          A. C. Kemp, B. P. Horton, J. P. Donnelly, M. E. Mann, M. Vermeer, and S. Rahmstorf, “Climate related sea-level variations over the past two millennia,” Proceedings of the National Academy of Sciences, vol. 108, no. 27, pp. 11017–11022, Jul. 2011.
[2]          A. Kumar and P. Goyal, “Forecasting of daily air quality index in Delhi,” Science of the Total Environment, vol. 409, no. 24, pp. 5517–5523, 2011.
[3]          J. Wang, H. Jiang, Q. Zhou, J. Wu, and S. Qin, “China’s natural gas production and consumption analysis based on the multicycle Hubbert model and rolling Grey model,” Renewable and Sustainable Energy Reviews, vol. 53, pp. 1149–1167, Jan. 2016.
[4]          Y. Song, S. Qin, J. Qu, and F. Liu, “The forecasting research of early warning systems for atmospheric pollutants: A case in Yangtze River Delta region,” Atmospheric Environment, vol. 118, pp. 58–69, Oct. 2015.
[5]          T. Chahine et al., “Particulate Air Pollution, Oxidative Stress Genes, and Heart Rate Variability in an Elderly Cohort,” Environmental Health Perspectives, vol. 115, no. 11, pp. 1617–1622, Nov. 2007.
[6]          J. Lelieveld, J. S. Evans, M. Fnais, D. Giannadaki, and A. Pozzer, “The contribution of outdoor air pollution sources to premature mortality on a global scale,” Nature, vol. 525, no. 7569, pp. 367–371, Sep. 2015.
[7]          Wolrd Health Organization (WHO), “Ambient air pollution: A global assessment of exposure and burden of disease,” 2016. [Online]. Available: https://apps.who.int/iris/bitstream/handle/10665/250141/9789241511353-eng.pdf. [Accessed: 26-Jul-2019].
[8]          P. Eastwood and T. Gupta, Air Pollution and Control. Singapore: Springer Singapore, 2018.
[9]          “Ministry of Environment - EEAA > Topics > Air > Air Quality > Air Quality Forecast.” [Online]. Available: http://www.eeaa.gov.eg/en-us/topics/air/airquality/airqualityforecast.aspx. [Accessed: 17-Feb-2019].
[10]        C. Li, N. C. Hsu, and S.-C. Tsay, “A study on the potential applications of satellite data in air quality monitoring and forecasting,” Atmospheric Environment, vol. 45, no. 22, pp. 3663–3675, Jul. 2011.
[11]        B. C. Liu, A. Binaykia, P. C. Chang, M. K. Tiwari, and C. C. Tsao, “Urban air quality forecasting based on multidimensional collaborative Support Vector Regression (SVR): A case study of Beijing-Tianjin-Shijiazhuang,” PLoS ONE, vol. 12, no. 7, pp. 1–17, 2017.
[12]        E. Kalapanidas and N. Avouris, “Short-term air quality prediction using a case-based classifier,” Environmental Modelling and Software, vol. 16, no. 3, pp. 263–272, 2001.
[13]        W. Z. Lu and W. J. Wang, “Potential assessment of the ‘support vector machine’ method in forecasting ambient air pollutant trends,” Chemosphere, vol. 59, no. 5, pp. 693–701, 2005.
[14]        S. Deleawe, J. Kusznir, B. Lamb, and D. J. Cook, “Predicting air quality in smart environments,” Journal of Ambient Intelligence and Smart Environments, vol. 2, no. 2, pp. 145–154, 2010.
[15]        C. M. Vong, W. F. Ip, P. K. Wong, and C. C. Chiu, “Predicting minority class for suspended particulate matters level by extreme learning machine,” Neurocomputing, vol. 128, pp. 136–144, 2014.
[16]        X. Y. Ni, H. Huang, and W. P. Du, “Relevance analysis and short-term prediction of PM 2.5 concentrations in Beijing based on multi-source data,” Atmospheric Environment, vol. 150, no. February 2017, pp. 146–161, 2017.
[17]        X. Hu et al., “Estimating PM 2.5 Concentrations in the Conterminous United States Using the Random Forest Approach,” Environmental Science & Technology, vol. 51, no. 12, pp. 6936–6944, Jun. 2017.
[18]        K.-R. Müller, A. J. Smola, G. Rätsch, B. Schölkopf, J. Kohlmorgen, and V. Vapnik, “Predicting time series with support vector machines,” in Proceeding of International Conference on Artificial Neural Networks, 2005, pp. 999–1004.
[19]        S. Hochreiter and J. Schmidhuber, “Long Short-Term Memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, Nov. 1997.
[20]        A. Azzouni and G. Pujolle, “A Long Short-Term Memory Recurrent Neural Network Framework for Network Traffic Matrix Prediction,” Networking and Internet Architecture, pp. 1–6, 2017.
[21]        X. Li, L. Peng, Y. Hu, J. Shao, and T. Chi, “Deep learning architecture for air quality predictions,” Environmental Science and Pollution Research, vol. 23, no. 22, pp. 22408–22417, Nov. 2016.
[22]        C. Olah, “Understanding LSTM Networks,” 2015. [Online]. Available: http://colah.github.io/posts/2015-08-Understanding-LSTMs/. [Accessed: 01-Jul-2019].
[23]        X. Li et al., “Long short-term memory neural network for air pollutant concentration predictions: Method development and evaluation,” Environmental Pollution, vol. 231, pp. 997–1004, Dec. 2017.
[24]        L. Breiman, “Random forests,” Machine learning, vol. 45, no. 1, pp. 5–32, 2001.
[25]        A. Liaw, Wiener, and Mathe, “Classification and regression by randomForest,” Journal of Dental Research, vol. 83, no. 5, pp. 434–438, 2002.
[26]        V. John, Z. Liu, C. Guo, S. Mita, Kidono, and Kiyosumi, “Real-Time Lane Estimation Using Deep Features and Extra Trees Regression,” Springer International Publishing, vol. 9431, pp. 721–733, 2016.
[27]        M. W. Ahmad, J. Reynolds, and Y. Rezgui, “Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees,” Journal of Cleaner Production, vol. 203, pp. 810–821, 2018.
[28]        Y. Rybarczyk and R. Zalakeviciute, “Machine Learning Approaches for Outdoor Air Quality Modelling: A Systematic Review,” Applied Sciences, vol. 8, no. 12, p. 2570, Dec. 2018.
[29]        X. Liang et al., “Assessing Beijing’s PM 2.5 pollution: severity, weather impact, APEC and winter heating,” Proceedings of the Royal Society A: Mathematical, Physical and Engineering Science, vol. 471, no. 2182, p. 20150257, Oct. 2015.
[30]        J. Brownlee, “How to Convert a Time Series to a Supervised Learning Problem in Python,” 2017. [Online]. Available: https://machinelearningmastery.com/convert-time-series-supervised-learning-problem-python/. [Accessed: 14-Jul-2019].
[31]        V. Reddy, P. Yedavalli, S. Mohanty, and U. Nakhat, “Deep Air: Forecasting Air Pollution in Beijing, China,” 2017. [Online]. Available: https://www.ischool.berkeley.edu/sites/default/files/sproject_attachments/deep-air-forecasting_final.pdf.
[32]        Z. Wen, J. Shi, Q. Li, B. He, and J. Chen, “{ThunderSVM}: A Fast {SVM} Library on {GPUs} and {CPUs},” Journal of Machine Learning Research, vol. 19, pp. 1–5, 2018.
Volume 28, ICEEM2019-Special Issue
ICEEM2019-Special Issue: 1st International Conference on Electronic Eng., Faculty of Electronic Eng., Menouf, Egypt, 7-8 Dec.
2019
Pages 349-354