Loading…
Predictive analytics on open big data for supporting smart transportation services
In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data...
Saved in:
Published in: | Procedia computer science 2020, Vol.176, p.3009-3018 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3 |
---|---|
cites | cdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3 |
container_end_page | 3018 |
container_issue | |
container_start_page | 3009 |
container_title | Procedia computer science |
container_volume | 176 |
creator | F. Balbin, Paul Patrick Barker, Jackson C.R. Leung, Carson K. Tran, Marvin Wall, Riley P. Cuzzocrea, Alfredo |
description | In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services. |
doi_str_mv | 10.1016/j.procs.2020.09.202 |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7531986</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1877050920321049</els_id><sourcerecordid>2450012102</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</originalsourceid><addsrcrecordid>eNp9UU1LAzEUDKJYUX-Blxy9tOajye4eFET8AkERPYc0eVtTtsmapIX-e7O2iF7MZULezLyXNwidUTKhhMqLxaSPwaQJI4xMSDPgHjqidVWNiSDN_q_7CJ2mtCDl8LpuaHWIRpyTKeNUHqHXlwjWmezWgLXX3SY7k3DwOPTg8czNsdVZ4zZEnFZ9H2J2fo7TUseMc9Q-DU86u6JIENfOQDpBB63uEpzu8Bi9392-3TyMn57vH2-un8aGC8rGUlQgeAtSajsDKq2xVEgrWiOk5KxtZV0xqJqqZYVEiAE7pWComdJ6Bgb4Mbra-var2RKsAV8G6lQfXZluo4J26m_Fuw81D2tVCU6bWhaD851BDJ8rSFktXTLQddpDWCXFpoIQyihhhcq3VBNDShHanzaUqCEQtVDfgaghEEWaAYvqcquCsoa1g6iSceDLT1wEk5UN7l_9F7YJlpw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2450012102</pqid></control><display><type>article</type><title>Predictive analytics on open big data for supporting smart transportation services</title><source>BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS</source><source>ScienceDirect®</source><creator>F. Balbin, Paul Patrick ; Barker, Jackson C.R. ; Leung, Carson K. ; Tran, Marvin ; Wall, Riley P. ; Cuzzocrea, Alfredo</creator><creatorcontrib>F. Balbin, Paul Patrick ; Barker, Jackson C.R. ; Leung, Carson K. ; Tran, Marvin ; Wall, Riley P. ; Cuzzocrea, Alfredo</creatorcontrib><description>In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.</description><identifier>ISSN: 1877-0509</identifier><identifier>EISSN: 1877-0509</identifier><identifier>DOI: 10.1016/j.procs.2020.09.202</identifier><identifier>PMID: 33042316</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>big data ; frequent patterns ; large-scale systems ; on-time performance ; open data ; Predictive analytics ; software engineering ; transportation data ; Winnipeg open data</subject><ispartof>Procedia computer science, 2020, Vol.176, p.3009-3018</ispartof><rights>2020</rights><rights>2020 The Author(s). Published by Elsevier B.V. 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</citedby><cites>FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1877050920321049$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>230,314,780,784,885,3549,4024,27923,27924,27925,45780</link.rule.ids></links><search><creatorcontrib>F. Balbin, Paul Patrick</creatorcontrib><creatorcontrib>Barker, Jackson C.R.</creatorcontrib><creatorcontrib>Leung, Carson K.</creatorcontrib><creatorcontrib>Tran, Marvin</creatorcontrib><creatorcontrib>Wall, Riley P.</creatorcontrib><creatorcontrib>Cuzzocrea, Alfredo</creatorcontrib><title>Predictive analytics on open big data for supporting smart transportation services</title><title>Procedia computer science</title><description>In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.</description><subject>big data</subject><subject>frequent patterns</subject><subject>large-scale systems</subject><subject>on-time performance</subject><subject>open data</subject><subject>Predictive analytics</subject><subject>software engineering</subject><subject>transportation data</subject><subject>Winnipeg open data</subject><issn>1877-0509</issn><issn>1877-0509</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9UU1LAzEUDKJYUX-Blxy9tOajye4eFET8AkERPYc0eVtTtsmapIX-e7O2iF7MZULezLyXNwidUTKhhMqLxaSPwaQJI4xMSDPgHjqidVWNiSDN_q_7CJ2mtCDl8LpuaHWIRpyTKeNUHqHXlwjWmezWgLXX3SY7k3DwOPTg8czNsdVZ4zZEnFZ9H2J2fo7TUseMc9Q-DU86u6JIENfOQDpBB63uEpzu8Bi9392-3TyMn57vH2-un8aGC8rGUlQgeAtSajsDKq2xVEgrWiOk5KxtZV0xqJqqZYVEiAE7pWComdJ6Bgb4Mbra-var2RKsAV8G6lQfXZluo4J26m_Fuw81D2tVCU6bWhaD851BDJ8rSFktXTLQddpDWCXFpoIQyihhhcq3VBNDShHanzaUqCEQtVDfgaghEEWaAYvqcquCsoa1g6iSceDLT1wEk5UN7l_9F7YJlpw</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>F. Balbin, Paul Patrick</creator><creator>Barker, Jackson C.R.</creator><creator>Leung, Carson K.</creator><creator>Tran, Marvin</creator><creator>Wall, Riley P.</creator><creator>Cuzzocrea, Alfredo</creator><general>Elsevier B.V</general><general>The Author(s). Published by Elsevier B.V</general><scope>6I.</scope><scope>AAFTH</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>2020</creationdate><title>Predictive analytics on open big data for supporting smart transportation services</title><author>F. Balbin, Paul Patrick ; Barker, Jackson C.R. ; Leung, Carson K. ; Tran, Marvin ; Wall, Riley P. ; Cuzzocrea, Alfredo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>big data</topic><topic>frequent patterns</topic><topic>large-scale systems</topic><topic>on-time performance</topic><topic>open data</topic><topic>Predictive analytics</topic><topic>software engineering</topic><topic>transportation data</topic><topic>Winnipeg open data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>F. Balbin, Paul Patrick</creatorcontrib><creatorcontrib>Barker, Jackson C.R.</creatorcontrib><creatorcontrib>Leung, Carson K.</creatorcontrib><creatorcontrib>Tran, Marvin</creatorcontrib><creatorcontrib>Wall, Riley P.</creatorcontrib><creatorcontrib>Cuzzocrea, Alfredo</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Procedia computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>F. Balbin, Paul Patrick</au><au>Barker, Jackson C.R.</au><au>Leung, Carson K.</au><au>Tran, Marvin</au><au>Wall, Riley P.</au><au>Cuzzocrea, Alfredo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Predictive analytics on open big data for supporting smart transportation services</atitle><jtitle>Procedia computer science</jtitle><date>2020</date><risdate>2020</risdate><volume>176</volume><spage>3009</spage><epage>3018</epage><pages>3009-3018</pages><issn>1877-0509</issn><eissn>1877-0509</eissn><abstract>In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.</abstract><pub>Elsevier B.V</pub><pmid>33042316</pmid><doi>10.1016/j.procs.2020.09.202</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1877-0509 |
ispartof | Procedia computer science, 2020, Vol.176, p.3009-3018 |
issn | 1877-0509 1877-0509 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7531986 |
source | BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS; ScienceDirect® |
subjects | big data frequent patterns large-scale systems on-time performance open data Predictive analytics software engineering transportation data Winnipeg open data |
title | Predictive analytics on open big data for supporting smart transportation services |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T14%3A24%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Predictive%20analytics%20on%20open%20big%20data%20for%20supporting%20smart%20transportation%20services&rft.jtitle=Procedia%20computer%20science&rft.au=F.%20Balbin,%20Paul%20Patrick&rft.date=2020&rft.volume=176&rft.spage=3009&rft.epage=3018&rft.pages=3009-3018&rft.issn=1877-0509&rft.eissn=1877-0509&rft_id=info:doi/10.1016/j.procs.2020.09.202&rft_dat=%3Cproquest_pubme%3E2450012102%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2450012102&rft_id=info:pmid/33042316&rfr_iscdi=true |