Loading…

A random forest partition model for predicting NO2 concentrations from traffic flow and meteorological conditions

High concentrations of nitrogen dioxide in the air, particularly in heavily urbanised areas, have an adverse effect on many aspects of residents' health (short-term and long-term damage, unpleasant odour and other). A method is proposed for modelling atmospheric NO2 concentrations in a conurbat...

Full description

Saved in:
Bibliographic Details
Published in:The Science of the total environment 2019-02, Vol.651, p.475-483
Main Author: Kamińska, Joanna A.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:High concentrations of nitrogen dioxide in the air, particularly in heavily urbanised areas, have an adverse effect on many aspects of residents' health (short-term and long-term damage, unpleasant odour and other). A method is proposed for modelling atmospheric NO2 concentrations in a conurbation, using a partition model M consisting of two separate models: ML for lower concentration values and MU for upper values. An advanced data mining technique, that of random forests, is used. This is a method based on machine learning, involving the simultaneous compilation of information from multiple random trees. Using the example of data recorded in Wrocław (Poland) in 2015–2017, an iterative method was applied to determine the boundary concentration y˜ for which the mean absolute deviation error for the partition model attained its lowest value. The resulting model had an R2 value of 0.82, compared with 0.60 for a classical random forest model. The importances of the variables in the model ML, similarly as in the classical case, indicate that the greatest influence on NO2 concentrations comes from traffic flow, followed by meteorological factors, in particular the wind direction and speed. In the model MU the importances of the variables are significantly different: while traffic flow still has the greatest impact, the effects of temperature and relative humidity are almost as great. This confirms the justifiability of constructing separate models for low and high pollution concentrations. [Display omitted] •NO2 pollution is caused mainly by road transport and modified by weather factors.•A random forest describes well the dependence of NO2 on the explanatory factors.•A new RF-based partition model improves the description of NO2 concentrations.•The value of R2 is increased from 0.60 to 0.82.•Traffic flow has a greater impact on NO2 in low concentration ranges.
ISSN:0048-9697
1879-1026
DOI:10.1016/j.scitotenv.2018.09.196