Loading…

Incident duration prediction using a bi-level machine learning framework with outlier removal and intra–extra joint optimisation

Predicting the duration of traffic incidents is a challenging task due to the stochastic nature of events. The ability to accurately predict how long accidents will last can provide significant benefits to both end-users in their route choice and traffic operation managers in handling of non-recurre...

Full description

Saved in:
Bibliographic Details
Published in:Transportation research. Part C, Emerging technologies Emerging technologies, 2022-08, Vol.141, p.103721, Article 103721
Main Authors: Grigorev, Artur, Mihaita, Adriana-Simona, Lee, Seunghyeon, Chen, Fang
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c297t-2b9b8bcf044204a13936cb71561a4c0f429e3dabf6244ceced270d0534e29d6f3
cites cdi_FETCH-LOGICAL-c297t-2b9b8bcf044204a13936cb71561a4c0f429e3dabf6244ceced270d0534e29d6f3
container_end_page
container_issue
container_start_page 103721
container_title Transportation research. Part C, Emerging technologies
container_volume 141
creator Grigorev, Artur
Mihaita, Adriana-Simona
Lee, Seunghyeon
Chen, Fang
description Predicting the duration of traffic incidents is a challenging task due to the stochastic nature of events. The ability to accurately predict how long accidents will last can provide significant benefits to both end-users in their route choice and traffic operation managers in handling of non-recurrent traffic congestion. This paper presents a novel bi-level machine learning framework enhanced with outlier removal and intra–extra joint optimisation for predicting the incident duration on three heterogeneous data sets collected for both arterial roads and motorways from Sydney, Australia and San-Francisco, U.S.A. Firstly, we use incident data logs to develop a binary classification prediction approach, which allows us to classify traffic incidents as short-term or long-term. We find the optimal threshold between short-term versus long-term traffic incident duration, targeting both class balance and prediction performance while also comparing the binary versus multi-class classification approaches using quantiled duration groups and varying threshold split. Secondly, for more granularity of the incident duration prediction to the minute level, we propose a new intra–extra Joint Optimisation algorithm (IEO-ML) which extends multiple baseline ML models tested against several regression scenarios across the data sets. Final results indicate that: (a) 40–45 min is the best split threshold for identifying short versus long-term incidents and that these incidents should be modelled separately, (b) our proposed IEO-ML approach significantly outperforms baseline ML models in 66% of all cases showcasing its great potential for accurate incident duration prediction. Lastly, we evaluate the feature importance and show that time, location, incident type, incident reporting source and weather at among the top 10 critical factors which influence how long incidents will last. •We propose a novel bi-level framework for predicting the incident durations.•We predict incident duration on three data sets with different road networks.•Short-term and long-term traffic accidents should be modelled separately.•Different incident duration extrapolation scenarios analysed.•Our proposed IEO-ML approach outperformed baseline ML models in 66% of cases.
doi_str_mv 10.1016/j.trc.2022.103721
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_trc_2022_103721</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0968090X22001589</els_id><sourcerecordid>S0968090X22001589</sourcerecordid><originalsourceid>FETCH-LOGICAL-c297t-2b9b8bcf044204a13936cb71561a4c0f429e3dabf6244ceced270d0534e29d6f3</originalsourceid><addsrcrecordid>eNp9kE1OwzAQhS0EEqVwAHa-QIrtOEktVqjip1IlNiCxsxx7Qh0Su7LdFnaIK3BDTkLasmY18zR6b2Y-hC4pmVBCy6t2koKeMMLYoPOK0SM0otNKZCwvxDEaEVFOMyLIyyk6i7ElhFBRVCP0NXfaGnAJm3VQyXqHVwGM1ft2Ha17xQrXNutgAx3ulV5aB7gDFdxu1gTVw9aHN7y1aYn9OnUWAg7Q-43qsHIGW5eC-vn8hveh4tYPGvtVsr2N-4Xn6KRRXYSLvzpGz3e3T7OHbPF4P5_dLDLNRJUyVot6WuuGcM4IVzQXeanrihYlVVyThjMBuVF1UzLONWgwrCKGFDkHJkzZ5GNED7k6-BgDNHIVbK_Ch6RE7iDKVg4Q5Q6iPEAcPNcHDwyHbYbPZNQW3BBuA-gkjbf_uH8BPEB_Dw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Incident duration prediction using a bi-level machine learning framework with outlier removal and intra–extra joint optimisation</title><source>Elsevier</source><creator>Grigorev, Artur ; Mihaita, Adriana-Simona ; Lee, Seunghyeon ; Chen, Fang</creator><creatorcontrib>Grigorev, Artur ; Mihaita, Adriana-Simona ; Lee, Seunghyeon ; Chen, Fang</creatorcontrib><description>Predicting the duration of traffic incidents is a challenging task due to the stochastic nature of events. The ability to accurately predict how long accidents will last can provide significant benefits to both end-users in their route choice and traffic operation managers in handling of non-recurrent traffic congestion. This paper presents a novel bi-level machine learning framework enhanced with outlier removal and intra–extra joint optimisation for predicting the incident duration on three heterogeneous data sets collected for both arterial roads and motorways from Sydney, Australia and San-Francisco, U.S.A. Firstly, we use incident data logs to develop a binary classification prediction approach, which allows us to classify traffic incidents as short-term or long-term. We find the optimal threshold between short-term versus long-term traffic incident duration, targeting both class balance and prediction performance while also comparing the binary versus multi-class classification approaches using quantiled duration groups and varying threshold split. Secondly, for more granularity of the incident duration prediction to the minute level, we propose a new intra–extra Joint Optimisation algorithm (IEO-ML) which extends multiple baseline ML models tested against several regression scenarios across the data sets. Final results indicate that: (a) 40–45 min is the best split threshold for identifying short versus long-term incidents and that these incidents should be modelled separately, (b) our proposed IEO-ML approach significantly outperforms baseline ML models in 66% of all cases showcasing its great potential for accurate incident duration prediction. Lastly, we evaluate the feature importance and show that time, location, incident type, incident reporting source and weather at among the top 10 critical factors which influence how long incidents will last. •We propose a novel bi-level framework for predicting the incident durations.•We predict incident duration on three data sets with different road networks.•Short-term and long-term traffic accidents should be modelled separately.•Different incident duration extrapolation scenarios analysed.•Our proposed IEO-ML approach outperformed baseline ML models in 66% of cases.</description><identifier>ISSN: 0968-090X</identifier><identifier>EISSN: 1879-2359</identifier><identifier>DOI: 10.1016/j.trc.2022.103721</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Arterial road versus motorways incident management ; Classification ; Extreme-boosted decision-trees ; Incident duration prediction ; Intra–extra joint optimisation ; Light gradient boosting modelling ; Machine learning ; Regression</subject><ispartof>Transportation research. Part C, Emerging technologies, 2022-08, Vol.141, p.103721, Article 103721</ispartof><rights>2022 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c297t-2b9b8bcf044204a13936cb71561a4c0f429e3dabf6244ceced270d0534e29d6f3</citedby><cites>FETCH-LOGICAL-c297t-2b9b8bcf044204a13936cb71561a4c0f429e3dabf6244ceced270d0534e29d6f3</cites><orcidid>0000-0001-6875-3568</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Grigorev, Artur</creatorcontrib><creatorcontrib>Mihaita, Adriana-Simona</creatorcontrib><creatorcontrib>Lee, Seunghyeon</creatorcontrib><creatorcontrib>Chen, Fang</creatorcontrib><title>Incident duration prediction using a bi-level machine learning framework with outlier removal and intra–extra joint optimisation</title><title>Transportation research. Part C, Emerging technologies</title><description>Predicting the duration of traffic incidents is a challenging task due to the stochastic nature of events. The ability to accurately predict how long accidents will last can provide significant benefits to both end-users in their route choice and traffic operation managers in handling of non-recurrent traffic congestion. This paper presents a novel bi-level machine learning framework enhanced with outlier removal and intra–extra joint optimisation for predicting the incident duration on three heterogeneous data sets collected for both arterial roads and motorways from Sydney, Australia and San-Francisco, U.S.A. Firstly, we use incident data logs to develop a binary classification prediction approach, which allows us to classify traffic incidents as short-term or long-term. We find the optimal threshold between short-term versus long-term traffic incident duration, targeting both class balance and prediction performance while also comparing the binary versus multi-class classification approaches using quantiled duration groups and varying threshold split. Secondly, for more granularity of the incident duration prediction to the minute level, we propose a new intra–extra Joint Optimisation algorithm (IEO-ML) which extends multiple baseline ML models tested against several regression scenarios across the data sets. Final results indicate that: (a) 40–45 min is the best split threshold for identifying short versus long-term incidents and that these incidents should be modelled separately, (b) our proposed IEO-ML approach significantly outperforms baseline ML models in 66% of all cases showcasing its great potential for accurate incident duration prediction. Lastly, we evaluate the feature importance and show that time, location, incident type, incident reporting source and weather at among the top 10 critical factors which influence how long incidents will last. •We propose a novel bi-level framework for predicting the incident durations.•We predict incident duration on three data sets with different road networks.•Short-term and long-term traffic accidents should be modelled separately.•Different incident duration extrapolation scenarios analysed.•Our proposed IEO-ML approach outperformed baseline ML models in 66% of cases.</description><subject>Arterial road versus motorways incident management</subject><subject>Classification</subject><subject>Extreme-boosted decision-trees</subject><subject>Incident duration prediction</subject><subject>Intra–extra joint optimisation</subject><subject>Light gradient boosting modelling</subject><subject>Machine learning</subject><subject>Regression</subject><issn>0968-090X</issn><issn>1879-2359</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kE1OwzAQhS0EEqVwAHa-QIrtOEktVqjip1IlNiCxsxx7Qh0Su7LdFnaIK3BDTkLasmY18zR6b2Y-hC4pmVBCy6t2koKeMMLYoPOK0SM0otNKZCwvxDEaEVFOMyLIyyk6i7ElhFBRVCP0NXfaGnAJm3VQyXqHVwGM1ft2Ha17xQrXNutgAx3ulV5aB7gDFdxu1gTVw9aHN7y1aYn9OnUWAg7Q-43qsHIGW5eC-vn8hveh4tYPGvtVsr2N-4Xn6KRRXYSLvzpGz3e3T7OHbPF4P5_dLDLNRJUyVot6WuuGcM4IVzQXeanrihYlVVyThjMBuVF1UzLONWgwrCKGFDkHJkzZ5GNED7k6-BgDNHIVbK_Ch6RE7iDKVg4Q5Q6iPEAcPNcHDwyHbYbPZNQW3BBuA-gkjbf_uH8BPEB_Dw</recordid><startdate>202208</startdate><enddate>202208</enddate><creator>Grigorev, Artur</creator><creator>Mihaita, Adriana-Simona</creator><creator>Lee, Seunghyeon</creator><creator>Chen, Fang</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-6875-3568</orcidid></search><sort><creationdate>202208</creationdate><title>Incident duration prediction using a bi-level machine learning framework with outlier removal and intra–extra joint optimisation</title><author>Grigorev, Artur ; Mihaita, Adriana-Simona ; Lee, Seunghyeon ; Chen, Fang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c297t-2b9b8bcf044204a13936cb71561a4c0f429e3dabf6244ceced270d0534e29d6f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Arterial road versus motorways incident management</topic><topic>Classification</topic><topic>Extreme-boosted decision-trees</topic><topic>Incident duration prediction</topic><topic>Intra–extra joint optimisation</topic><topic>Light gradient boosting modelling</topic><topic>Machine learning</topic><topic>Regression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Grigorev, Artur</creatorcontrib><creatorcontrib>Mihaita, Adriana-Simona</creatorcontrib><creatorcontrib>Lee, Seunghyeon</creatorcontrib><creatorcontrib>Chen, Fang</creatorcontrib><collection>CrossRef</collection><jtitle>Transportation research. Part C, Emerging technologies</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Grigorev, Artur</au><au>Mihaita, Adriana-Simona</au><au>Lee, Seunghyeon</au><au>Chen, Fang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Incident duration prediction using a bi-level machine learning framework with outlier removal and intra–extra joint optimisation</atitle><jtitle>Transportation research. Part C, Emerging technologies</jtitle><date>2022-08</date><risdate>2022</risdate><volume>141</volume><spage>103721</spage><pages>103721-</pages><artnum>103721</artnum><issn>0968-090X</issn><eissn>1879-2359</eissn><abstract>Predicting the duration of traffic incidents is a challenging task due to the stochastic nature of events. The ability to accurately predict how long accidents will last can provide significant benefits to both end-users in their route choice and traffic operation managers in handling of non-recurrent traffic congestion. This paper presents a novel bi-level machine learning framework enhanced with outlier removal and intra–extra joint optimisation for predicting the incident duration on three heterogeneous data sets collected for both arterial roads and motorways from Sydney, Australia and San-Francisco, U.S.A. Firstly, we use incident data logs to develop a binary classification prediction approach, which allows us to classify traffic incidents as short-term or long-term. We find the optimal threshold between short-term versus long-term traffic incident duration, targeting both class balance and prediction performance while also comparing the binary versus multi-class classification approaches using quantiled duration groups and varying threshold split. Secondly, for more granularity of the incident duration prediction to the minute level, we propose a new intra–extra Joint Optimisation algorithm (IEO-ML) which extends multiple baseline ML models tested against several regression scenarios across the data sets. Final results indicate that: (a) 40–45 min is the best split threshold for identifying short versus long-term incidents and that these incidents should be modelled separately, (b) our proposed IEO-ML approach significantly outperforms baseline ML models in 66% of all cases showcasing its great potential for accurate incident duration prediction. Lastly, we evaluate the feature importance and show that time, location, incident type, incident reporting source and weather at among the top 10 critical factors which influence how long incidents will last. •We propose a novel bi-level framework for predicting the incident durations.•We predict incident duration on three data sets with different road networks.•Short-term and long-term traffic accidents should be modelled separately.•Different incident duration extrapolation scenarios analysed.•Our proposed IEO-ML approach outperformed baseline ML models in 66% of cases.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.trc.2022.103721</doi><orcidid>https://orcid.org/0000-0001-6875-3568</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0968-090X
ispartof Transportation research. Part C, Emerging technologies, 2022-08, Vol.141, p.103721, Article 103721
issn 0968-090X
1879-2359
language eng
recordid cdi_crossref_primary_10_1016_j_trc_2022_103721
source Elsevier
subjects Arterial road versus motorways incident management
Classification
Extreme-boosted decision-trees
Incident duration prediction
Intra–extra joint optimisation
Light gradient boosting modelling
Machine learning
Regression
title Incident duration prediction using a bi-level machine learning framework with outlier removal and intra–extra joint optimisation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T08%3A03%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Incident%20duration%20prediction%20using%20a%20bi-level%20machine%20learning%20framework%20with%20outlier%20removal%20and%20intra%E2%80%93extra%20joint%20optimisation&rft.jtitle=Transportation%20research.%20Part%20C,%20Emerging%20technologies&rft.au=Grigorev,%20Artur&rft.date=2022-08&rft.volume=141&rft.spage=103721&rft.pages=103721-&rft.artnum=103721&rft.issn=0968-090X&rft.eissn=1879-2359&rft_id=info:doi/10.1016/j.trc.2022.103721&rft_dat=%3Celsevier_cross%3ES0968090X22001589%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c297t-2b9b8bcf044204a13936cb71561a4c0f429e3dabf6244ceced270d0534e29d6f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true