Loading…

Data analysis and preprocessing techniques for air quality prediction: a survey

Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the...

Full description

Saved in:
Bibliographic Details
Published in:Stochastic environmental research and risk assessment 2024-06, Vol.38 (6), p.2095-2117
Main Authors: Yu, Chengqing, Tan, Jing, Cheng, Yihan, Mi, Xiwei
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793
container_end_page 2117
container_issue 6
container_start_page 2095
container_title Stochastic environmental research and risk assessment
container_volume 38
creator Yu, Chengqing
Tan, Jing
Cheng, Yihan
Mi, Xiwei
description Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research.
doi_str_mv 10.1007/s00477-024-02693-4
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3059654952</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3059654952</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793</originalsourceid><addsrcrecordid>eNp9UMtOwzAQtBBIlMIPcLLEObB-x9xQeUqVeoGz5SR2cVWS1k6Q8ve4BMGNw2rnMDM7OwhdErgmAOomAXClCqA8j9Ss4EdoRjiTBaNCH_9iDqfoLKUNAFFK8Bla3dveYtva7ZhCyqDBu-h2satdSqFd497V723YDy5h30VsQ8T7wW5DPx6ITaj70LW32OI0xE83nqMTb7fJXfzsOXp7fHhdPBfL1dPL4m5Z1FRBX2jQQlNeQ-U9VyXJwUsPja6a0gsuvZVCSiVkJZkCrZytSuF4XRHnqK-UZnN0NfnmqIdwvdl0Q8xvJMNAaCm4FjSz6MSqY5dSdN7sYviwcTQEzKE4MxVn8n3zXZzhWcQmUcrkdu3in_U_qi_PKnDP</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3059654952</pqid></control><display><type>article</type><title>Data analysis and preprocessing techniques for air quality prediction: a survey</title><source>Springer Link</source><creator>Yu, Chengqing ; Tan, Jing ; Cheng, Yihan ; Mi, Xiwei</creator><creatorcontrib>Yu, Chengqing ; Tan, Jing ; Cheng, Yihan ; Mi, Xiwei</creatorcontrib><description>Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research.</description><identifier>ISSN: 1436-3240</identifier><identifier>EISSN: 1436-3259</identifier><identifier>DOI: 10.1007/s00477-024-02693-4</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Air pollution ; Air quality ; Algorithms ; Aquatic Pollution ; Chemistry and Earth Sciences ; Cognition &amp; reasoning ; Computational Intelligence ; Computer Science ; Data analysis ; Data processing ; Decomposition ; Earth and Environmental Science ; Earth Sciences ; Emergency communications systems ; Environment ; Environmental governance ; Environmental research ; Feature selection ; Forecasting ; Interpolation ; Math. Appl. in Environmental Science ; Missing data ; Nitrogen dioxide ; Nonlinear systems ; Outdoor air quality ; Outliers (statistics) ; Physics ; Pollutants ; Preprocessing ; Probability Theory and Stochastic Processes ; Review Paper ; Risk assessment ; Spatial data ; Statistics for Engineering ; Waste Water Technology ; Water Management ; Water Pollution Control</subject><ispartof>Stochastic environmental research and risk assessment, 2024-06, Vol.38 (6), p.2095-2117</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Yu, Chengqing</creatorcontrib><creatorcontrib>Tan, Jing</creatorcontrib><creatorcontrib>Cheng, Yihan</creatorcontrib><creatorcontrib>Mi, Xiwei</creatorcontrib><title>Data analysis and preprocessing techniques for air quality prediction: a survey</title><title>Stochastic environmental research and risk assessment</title><addtitle>Stoch Environ Res Risk Assess</addtitle><description>Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research.</description><subject>Air pollution</subject><subject>Air quality</subject><subject>Algorithms</subject><subject>Aquatic Pollution</subject><subject>Chemistry and Earth Sciences</subject><subject>Cognition &amp; reasoning</subject><subject>Computational Intelligence</subject><subject>Computer Science</subject><subject>Data analysis</subject><subject>Data processing</subject><subject>Decomposition</subject><subject>Earth and Environmental Science</subject><subject>Earth Sciences</subject><subject>Emergency communications systems</subject><subject>Environment</subject><subject>Environmental governance</subject><subject>Environmental research</subject><subject>Feature selection</subject><subject>Forecasting</subject><subject>Interpolation</subject><subject>Math. Appl. in Environmental Science</subject><subject>Missing data</subject><subject>Nitrogen dioxide</subject><subject>Nonlinear systems</subject><subject>Outdoor air quality</subject><subject>Outliers (statistics)</subject><subject>Physics</subject><subject>Pollutants</subject><subject>Preprocessing</subject><subject>Probability Theory and Stochastic Processes</subject><subject>Review Paper</subject><subject>Risk assessment</subject><subject>Spatial data</subject><subject>Statistics for Engineering</subject><subject>Waste Water Technology</subject><subject>Water Management</subject><subject>Water Pollution Control</subject><issn>1436-3240</issn><issn>1436-3259</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9UMtOwzAQtBBIlMIPcLLEObB-x9xQeUqVeoGz5SR2cVWS1k6Q8ve4BMGNw2rnMDM7OwhdErgmAOomAXClCqA8j9Ss4EdoRjiTBaNCH_9iDqfoLKUNAFFK8Bla3dveYtva7ZhCyqDBu-h2satdSqFd497V723YDy5h30VsQ8T7wW5DPx6ITaj70LW32OI0xE83nqMTb7fJXfzsOXp7fHhdPBfL1dPL4m5Z1FRBX2jQQlNeQ-U9VyXJwUsPja6a0gsuvZVCSiVkJZkCrZytSuF4XRHnqK-UZnN0NfnmqIdwvdl0Q8xvJMNAaCm4FjSz6MSqY5dSdN7sYviwcTQEzKE4MxVn8n3zXZzhWcQmUcrkdu3in_U_qi_PKnDP</recordid><startdate>20240601</startdate><enddate>20240601</enddate><creator>Yu, Chengqing</creator><creator>Tan, Jing</creator><creator>Cheng, Yihan</creator><creator>Mi, Xiwei</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7ST</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>KR7</scope><scope>SOI</scope></search><sort><creationdate>20240601</creationdate><title>Data analysis and preprocessing techniques for air quality prediction: a survey</title><author>Yu, Chengqing ; Tan, Jing ; Cheng, Yihan ; Mi, Xiwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Air pollution</topic><topic>Air quality</topic><topic>Algorithms</topic><topic>Aquatic Pollution</topic><topic>Chemistry and Earth Sciences</topic><topic>Cognition &amp; reasoning</topic><topic>Computational Intelligence</topic><topic>Computer Science</topic><topic>Data analysis</topic><topic>Data processing</topic><topic>Decomposition</topic><topic>Earth and Environmental Science</topic><topic>Earth Sciences</topic><topic>Emergency communications systems</topic><topic>Environment</topic><topic>Environmental governance</topic><topic>Environmental research</topic><topic>Feature selection</topic><topic>Forecasting</topic><topic>Interpolation</topic><topic>Math. Appl. in Environmental Science</topic><topic>Missing data</topic><topic>Nitrogen dioxide</topic><topic>Nonlinear systems</topic><topic>Outdoor air quality</topic><topic>Outliers (statistics)</topic><topic>Physics</topic><topic>Pollutants</topic><topic>Preprocessing</topic><topic>Probability Theory and Stochastic Processes</topic><topic>Review Paper</topic><topic>Risk assessment</topic><topic>Spatial data</topic><topic>Statistics for Engineering</topic><topic>Waste Water Technology</topic><topic>Water Management</topic><topic>Water Pollution Control</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Chengqing</creatorcontrib><creatorcontrib>Tan, Jing</creatorcontrib><creatorcontrib>Cheng, Yihan</creatorcontrib><creatorcontrib>Mi, Xiwei</creatorcontrib><collection>CrossRef</collection><collection>Environment Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><collection>Environment Abstracts</collection><jtitle>Stochastic environmental research and risk assessment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Chengqing</au><au>Tan, Jing</au><au>Cheng, Yihan</au><au>Mi, Xiwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data analysis and preprocessing techniques for air quality prediction: a survey</atitle><jtitle>Stochastic environmental research and risk assessment</jtitle><stitle>Stoch Environ Res Risk Assess</stitle><date>2024-06-01</date><risdate>2024</risdate><volume>38</volume><issue>6</issue><spage>2095</spage><epage>2117</epage><pages>2095-2117</pages><issn>1436-3240</issn><eissn>1436-3259</eissn><abstract>Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00477-024-02693-4</doi><tpages>23</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1436-3240
ispartof Stochastic environmental research and risk assessment, 2024-06, Vol.38 (6), p.2095-2117
issn 1436-3240
1436-3259
language eng
recordid cdi_proquest_journals_3059654952
source Springer Link
subjects Air pollution
Air quality
Algorithms
Aquatic Pollution
Chemistry and Earth Sciences
Cognition & reasoning
Computational Intelligence
Computer Science
Data analysis
Data processing
Decomposition
Earth and Environmental Science
Earth Sciences
Emergency communications systems
Environment
Environmental governance
Environmental research
Feature selection
Forecasting
Interpolation
Math. Appl. in Environmental Science
Missing data
Nitrogen dioxide
Nonlinear systems
Outdoor air quality
Outliers (statistics)
Physics
Pollutants
Preprocessing
Probability Theory and Stochastic Processes
Review Paper
Risk assessment
Spatial data
Statistics for Engineering
Waste Water Technology
Water Management
Water Pollution Control
title Data analysis and preprocessing techniques for air quality prediction: a survey
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T00%3A23%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data%20analysis%20and%20preprocessing%20techniques%20for%20air%20quality%20prediction:%20a%20survey&rft.jtitle=Stochastic%20environmental%20research%20and%20risk%20assessment&rft.au=Yu,%20Chengqing&rft.date=2024-06-01&rft.volume=38&rft.issue=6&rft.spage=2095&rft.epage=2117&rft.pages=2095-2117&rft.issn=1436-3240&rft.eissn=1436-3259&rft_id=info:doi/10.1007/s00477-024-02693-4&rft_dat=%3Cproquest_cross%3E3059654952%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3059654952&rft_id=info:pmid/&rfr_iscdi=true