Loading…
Data analysis and preprocessing techniques for air quality prediction: a survey
Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the...
Saved in:
Published in: | Stochastic environmental research and risk assessment 2024-06, Vol.38 (6), p.2095-2117 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793 |
container_end_page | 2117 |
container_issue | 6 |
container_start_page | 2095 |
container_title | Stochastic environmental research and risk assessment |
container_volume | 38 |
creator | Yu, Chengqing Tan, Jing Cheng, Yihan Mi, Xiwei |
description | Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research. |
doi_str_mv | 10.1007/s00477-024-02693-4 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3059654952</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3059654952</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793</originalsourceid><addsrcrecordid>eNp9UMtOwzAQtBBIlMIPcLLEObB-x9xQeUqVeoGz5SR2cVWS1k6Q8ve4BMGNw2rnMDM7OwhdErgmAOomAXClCqA8j9Ss4EdoRjiTBaNCH_9iDqfoLKUNAFFK8Bla3dveYtva7ZhCyqDBu-h2satdSqFd497V723YDy5h30VsQ8T7wW5DPx6ITaj70LW32OI0xE83nqMTb7fJXfzsOXp7fHhdPBfL1dPL4m5Z1FRBX2jQQlNeQ-U9VyXJwUsPja6a0gsuvZVCSiVkJZkCrZytSuF4XRHnqK-UZnN0NfnmqIdwvdl0Q8xvJMNAaCm4FjSz6MSqY5dSdN7sYviwcTQEzKE4MxVn8n3zXZzhWcQmUcrkdu3in_U_qi_PKnDP</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3059654952</pqid></control><display><type>article</type><title>Data analysis and preprocessing techniques for air quality prediction: a survey</title><source>Springer Link</source><creator>Yu, Chengqing ; Tan, Jing ; Cheng, Yihan ; Mi, Xiwei</creator><creatorcontrib>Yu, Chengqing ; Tan, Jing ; Cheng, Yihan ; Mi, Xiwei</creatorcontrib><description>Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research.</description><identifier>ISSN: 1436-3240</identifier><identifier>EISSN: 1436-3259</identifier><identifier>DOI: 10.1007/s00477-024-02693-4</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Air pollution ; Air quality ; Algorithms ; Aquatic Pollution ; Chemistry and Earth Sciences ; Cognition & reasoning ; Computational Intelligence ; Computer Science ; Data analysis ; Data processing ; Decomposition ; Earth and Environmental Science ; Earth Sciences ; Emergency communications systems ; Environment ; Environmental governance ; Environmental research ; Feature selection ; Forecasting ; Interpolation ; Math. Appl. in Environmental Science ; Missing data ; Nitrogen dioxide ; Nonlinear systems ; Outdoor air quality ; Outliers (statistics) ; Physics ; Pollutants ; Preprocessing ; Probability Theory and Stochastic Processes ; Review Paper ; Risk assessment ; Spatial data ; Statistics for Engineering ; Waste Water Technology ; Water Management ; Water Pollution Control</subject><ispartof>Stochastic environmental research and risk assessment, 2024-06, Vol.38 (6), p.2095-2117</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Yu, Chengqing</creatorcontrib><creatorcontrib>Tan, Jing</creatorcontrib><creatorcontrib>Cheng, Yihan</creatorcontrib><creatorcontrib>Mi, Xiwei</creatorcontrib><title>Data analysis and preprocessing techniques for air quality prediction: a survey</title><title>Stochastic environmental research and risk assessment</title><addtitle>Stoch Environ Res Risk Assess</addtitle><description>Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research.</description><subject>Air pollution</subject><subject>Air quality</subject><subject>Algorithms</subject><subject>Aquatic Pollution</subject><subject>Chemistry and Earth Sciences</subject><subject>Cognition & reasoning</subject><subject>Computational Intelligence</subject><subject>Computer Science</subject><subject>Data analysis</subject><subject>Data processing</subject><subject>Decomposition</subject><subject>Earth and Environmental Science</subject><subject>Earth Sciences</subject><subject>Emergency communications systems</subject><subject>Environment</subject><subject>Environmental governance</subject><subject>Environmental research</subject><subject>Feature selection</subject><subject>Forecasting</subject><subject>Interpolation</subject><subject>Math. Appl. in Environmental Science</subject><subject>Missing data</subject><subject>Nitrogen dioxide</subject><subject>Nonlinear systems</subject><subject>Outdoor air quality</subject><subject>Outliers (statistics)</subject><subject>Physics</subject><subject>Pollutants</subject><subject>Preprocessing</subject><subject>Probability Theory and Stochastic Processes</subject><subject>Review Paper</subject><subject>Risk assessment</subject><subject>Spatial data</subject><subject>Statistics for Engineering</subject><subject>Waste Water Technology</subject><subject>Water Management</subject><subject>Water Pollution Control</subject><issn>1436-3240</issn><issn>1436-3259</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9UMtOwzAQtBBIlMIPcLLEObB-x9xQeUqVeoGz5SR2cVWS1k6Q8ve4BMGNw2rnMDM7OwhdErgmAOomAXClCqA8j9Ss4EdoRjiTBaNCH_9iDqfoLKUNAFFK8Bla3dveYtva7ZhCyqDBu-h2satdSqFd497V723YDy5h30VsQ8T7wW5DPx6ITaj70LW32OI0xE83nqMTb7fJXfzsOXp7fHhdPBfL1dPL4m5Z1FRBX2jQQlNeQ-U9VyXJwUsPja6a0gsuvZVCSiVkJZkCrZytSuF4XRHnqK-UZnN0NfnmqIdwvdl0Q8xvJMNAaCm4FjSz6MSqY5dSdN7sYviwcTQEzKE4MxVn8n3zXZzhWcQmUcrkdu3in_U_qi_PKnDP</recordid><startdate>20240601</startdate><enddate>20240601</enddate><creator>Yu, Chengqing</creator><creator>Tan, Jing</creator><creator>Cheng, Yihan</creator><creator>Mi, Xiwei</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7ST</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>KR7</scope><scope>SOI</scope></search><sort><creationdate>20240601</creationdate><title>Data analysis and preprocessing techniques for air quality prediction: a survey</title><author>Yu, Chengqing ; Tan, Jing ; Cheng, Yihan ; Mi, Xiwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Air pollution</topic><topic>Air quality</topic><topic>Algorithms</topic><topic>Aquatic Pollution</topic><topic>Chemistry and Earth Sciences</topic><topic>Cognition & reasoning</topic><topic>Computational Intelligence</topic><topic>Computer Science</topic><topic>Data analysis</topic><topic>Data processing</topic><topic>Decomposition</topic><topic>Earth and Environmental Science</topic><topic>Earth Sciences</topic><topic>Emergency communications systems</topic><topic>Environment</topic><topic>Environmental governance</topic><topic>Environmental research</topic><topic>Feature selection</topic><topic>Forecasting</topic><topic>Interpolation</topic><topic>Math. Appl. in Environmental Science</topic><topic>Missing data</topic><topic>Nitrogen dioxide</topic><topic>Nonlinear systems</topic><topic>Outdoor air quality</topic><topic>Outliers (statistics)</topic><topic>Physics</topic><topic>Pollutants</topic><topic>Preprocessing</topic><topic>Probability Theory and Stochastic Processes</topic><topic>Review Paper</topic><topic>Risk assessment</topic><topic>Spatial data</topic><topic>Statistics for Engineering</topic><topic>Waste Water Technology</topic><topic>Water Management</topic><topic>Water Pollution Control</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Chengqing</creatorcontrib><creatorcontrib>Tan, Jing</creatorcontrib><creatorcontrib>Cheng, Yihan</creatorcontrib><creatorcontrib>Mi, Xiwei</creatorcontrib><collection>CrossRef</collection><collection>Environment Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><collection>Environment Abstracts</collection><jtitle>Stochastic environmental research and risk assessment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Chengqing</au><au>Tan, Jing</au><au>Cheng, Yihan</au><au>Mi, Xiwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data analysis and preprocessing techniques for air quality prediction: a survey</atitle><jtitle>Stochastic environmental research and risk assessment</jtitle><stitle>Stoch Environ Res Risk Assess</stitle><date>2024-06-01</date><risdate>2024</risdate><volume>38</volume><issue>6</issue><spage>2095</spage><epage>2117</epage><pages>2095-2117</pages><issn>1436-3240</issn><eissn>1436-3259</eissn><abstract>Air quality prediction technology can provide effective technical means for environmental governance. In recent years, due to the strong nonlinearity of data, there has been extensive research on data analysis and preprocessing techniques. This paper aims to comprehensively summarize and analyze the methods used in air quality forecasting, specifically focusing on four categories: data decomposition, dimensionality reduction, data correction, and spatial interpolation. Each method's purpose, characteristics, improvements, and implementation details are described in detail. The evaluation of data preprocessing methods is based on popularity, accuracy improvements, time consumption, maturity, and implementation difficulty. Among the existing methods, data decomposition and feature selection are commonly used and well-developed. However, outlier detection and spatial interpolation have limited applications and require further research. Furthermore, this paper discusses current challenges in applying these methods and future development trends, providing a valuable reference for future research.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00477-024-02693-4</doi><tpages>23</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1436-3240 |
ispartof | Stochastic environmental research and risk assessment, 2024-06, Vol.38 (6), p.2095-2117 |
issn | 1436-3240 1436-3259 |
language | eng |
recordid | cdi_proquest_journals_3059654952 |
source | Springer Link |
subjects | Air pollution Air quality Algorithms Aquatic Pollution Chemistry and Earth Sciences Cognition & reasoning Computational Intelligence Computer Science Data analysis Data processing Decomposition Earth and Environmental Science Earth Sciences Emergency communications systems Environment Environmental governance Environmental research Feature selection Forecasting Interpolation Math. Appl. in Environmental Science Missing data Nitrogen dioxide Nonlinear systems Outdoor air quality Outliers (statistics) Physics Pollutants Preprocessing Probability Theory and Stochastic Processes Review Paper Risk assessment Spatial data Statistics for Engineering Waste Water Technology Water Management Water Pollution Control |
title | Data analysis and preprocessing techniques for air quality prediction: a survey |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T00%3A23%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data%20analysis%20and%20preprocessing%20techniques%20for%20air%20quality%20prediction:%20a%20survey&rft.jtitle=Stochastic%20environmental%20research%20and%20risk%20assessment&rft.au=Yu,%20Chengqing&rft.date=2024-06-01&rft.volume=38&rft.issue=6&rft.spage=2095&rft.epage=2117&rft.pages=2095-2117&rft.issn=1436-3240&rft.eissn=1436-3259&rft_id=info:doi/10.1007/s00477-024-02693-4&rft_dat=%3Cproquest_cross%3E3059654952%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c270t-9095924c0bff47810248f0d9bd8f546fa6566756b637097eab85e4cb1ee2fb793%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3059654952&rft_id=info:pmid/&rfr_iscdi=true |