Loading…

Predictive analytics on open big data for supporting smart transportation services

In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data...

Full description

Saved in:
Bibliographic Details
Published in:Procedia computer science 2020, Vol.176, p.3009-3018
Main Authors: F. Balbin, Paul Patrick, Barker, Jackson C.R., Leung, Carson K., Tran, Marvin, Wall, Riley P., Cuzzocrea, Alfredo
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3
cites cdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3
container_end_page 3018
container_issue
container_start_page 3009
container_title Procedia computer science
container_volume 176
creator F. Balbin, Paul Patrick
Barker, Jackson C.R.
Leung, Carson K.
Tran, Marvin
Wall, Riley P.
Cuzzocrea, Alfredo
description In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.
doi_str_mv 10.1016/j.procs.2020.09.202
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7531986</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1877050920321049</els_id><sourcerecordid>2450012102</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</originalsourceid><addsrcrecordid>eNp9UU1LAzEUDKJYUX-Blxy9tOajye4eFET8AkERPYc0eVtTtsmapIX-e7O2iF7MZULezLyXNwidUTKhhMqLxaSPwaQJI4xMSDPgHjqidVWNiSDN_q_7CJ2mtCDl8LpuaHWIRpyTKeNUHqHXlwjWmezWgLXX3SY7k3DwOPTg8czNsdVZ4zZEnFZ9H2J2fo7TUseMc9Q-DU86u6JIENfOQDpBB63uEpzu8Bi9392-3TyMn57vH2-un8aGC8rGUlQgeAtSajsDKq2xVEgrWiOk5KxtZV0xqJqqZYVEiAE7pWComdJ6Bgb4Mbra-var2RKsAV8G6lQfXZluo4J26m_Fuw81D2tVCU6bWhaD851BDJ8rSFktXTLQddpDWCXFpoIQyihhhcq3VBNDShHanzaUqCEQtVDfgaghEEWaAYvqcquCsoa1g6iSceDLT1wEk5UN7l_9F7YJlpw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2450012102</pqid></control><display><type>article</type><title>Predictive analytics on open big data for supporting smart transportation services</title><source>BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS</source><source>ScienceDirect®</source><creator>F. Balbin, Paul Patrick ; Barker, Jackson C.R. ; Leung, Carson K. ; Tran, Marvin ; Wall, Riley P. ; Cuzzocrea, Alfredo</creator><creatorcontrib>F. Balbin, Paul Patrick ; Barker, Jackson C.R. ; Leung, Carson K. ; Tran, Marvin ; Wall, Riley P. ; Cuzzocrea, Alfredo</creatorcontrib><description>In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.</description><identifier>ISSN: 1877-0509</identifier><identifier>EISSN: 1877-0509</identifier><identifier>DOI: 10.1016/j.procs.2020.09.202</identifier><identifier>PMID: 33042316</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>big data ; frequent patterns ; large-scale systems ; on-time performance ; open data ; Predictive analytics ; software engineering ; transportation data ; Winnipeg open data</subject><ispartof>Procedia computer science, 2020, Vol.176, p.3009-3018</ispartof><rights>2020</rights><rights>2020 The Author(s). Published by Elsevier B.V. 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</citedby><cites>FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1877050920321049$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>230,314,780,784,885,3549,4024,27923,27924,27925,45780</link.rule.ids></links><search><creatorcontrib>F. Balbin, Paul Patrick</creatorcontrib><creatorcontrib>Barker, Jackson C.R.</creatorcontrib><creatorcontrib>Leung, Carson K.</creatorcontrib><creatorcontrib>Tran, Marvin</creatorcontrib><creatorcontrib>Wall, Riley P.</creatorcontrib><creatorcontrib>Cuzzocrea, Alfredo</creatorcontrib><title>Predictive analytics on open big data for supporting smart transportation services</title><title>Procedia computer science</title><description>In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.</description><subject>big data</subject><subject>frequent patterns</subject><subject>large-scale systems</subject><subject>on-time performance</subject><subject>open data</subject><subject>Predictive analytics</subject><subject>software engineering</subject><subject>transportation data</subject><subject>Winnipeg open data</subject><issn>1877-0509</issn><issn>1877-0509</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9UU1LAzEUDKJYUX-Blxy9tOajye4eFET8AkERPYc0eVtTtsmapIX-e7O2iF7MZULezLyXNwidUTKhhMqLxaSPwaQJI4xMSDPgHjqidVWNiSDN_q_7CJ2mtCDl8LpuaHWIRpyTKeNUHqHXlwjWmezWgLXX3SY7k3DwOPTg8czNsdVZ4zZEnFZ9H2J2fo7TUseMc9Q-DU86u6JIENfOQDpBB63uEpzu8Bi9392-3TyMn57vH2-un8aGC8rGUlQgeAtSajsDKq2xVEgrWiOk5KxtZV0xqJqqZYVEiAE7pWComdJ6Bgb4Mbra-var2RKsAV8G6lQfXZluo4J26m_Fuw81D2tVCU6bWhaD851BDJ8rSFktXTLQddpDWCXFpoIQyihhhcq3VBNDShHanzaUqCEQtVDfgaghEEWaAYvqcquCsoa1g6iSceDLT1wEk5UN7l_9F7YJlpw</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>F. Balbin, Paul Patrick</creator><creator>Barker, Jackson C.R.</creator><creator>Leung, Carson K.</creator><creator>Tran, Marvin</creator><creator>Wall, Riley P.</creator><creator>Cuzzocrea, Alfredo</creator><general>Elsevier B.V</general><general>The Author(s). Published by Elsevier B.V</general><scope>6I.</scope><scope>AAFTH</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>2020</creationdate><title>Predictive analytics on open big data for supporting smart transportation services</title><author>F. Balbin, Paul Patrick ; Barker, Jackson C.R. ; Leung, Carson K. ; Tran, Marvin ; Wall, Riley P. ; Cuzzocrea, Alfredo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>big data</topic><topic>frequent patterns</topic><topic>large-scale systems</topic><topic>on-time performance</topic><topic>open data</topic><topic>Predictive analytics</topic><topic>software engineering</topic><topic>transportation data</topic><topic>Winnipeg open data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>F. Balbin, Paul Patrick</creatorcontrib><creatorcontrib>Barker, Jackson C.R.</creatorcontrib><creatorcontrib>Leung, Carson K.</creatorcontrib><creatorcontrib>Tran, Marvin</creatorcontrib><creatorcontrib>Wall, Riley P.</creatorcontrib><creatorcontrib>Cuzzocrea, Alfredo</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Procedia computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>F. Balbin, Paul Patrick</au><au>Barker, Jackson C.R.</au><au>Leung, Carson K.</au><au>Tran, Marvin</au><au>Wall, Riley P.</au><au>Cuzzocrea, Alfredo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Predictive analytics on open big data for supporting smart transportation services</atitle><jtitle>Procedia computer science</jtitle><date>2020</date><risdate>2020</risdate><volume>176</volume><spage>3009</spage><epage>3018</epage><pages>3009-3018</pages><issn>1877-0509</issn><eissn>1877-0509</eissn><abstract>In the current era of big data, huge quantities of valuable data, which may be of different levels of veracity, are being generated at a rapid rate. Embedded into these big data are implicit, previously unknown and potentially useful information and valuable knowledge that can be discovered by data science solutions, which apply techniques like data mining. There has been a trend that more and more collections of these big data have been made openly available in science, government and non-profit organizations so that people could collaboratively study and analysis these open big data. In this article, we focus on open big data for public transit because public transit (e.g., bus) as a means of transportation is a vital part of many people’s lives. As time is a precious resource, bus delays could negatively affect commuters’ plans. Unfortunately, they are inevitable. Hence, many existing works focused on predicting bus delays. However, predicting on-time or early buses is also important. For instance, commuters who come to a bus stop on time may still miss their buses if the buses leave early. So, in this article, we examine open big data about bus performance (e.g., early, on-time, and late stops). We analyze the data with frequent pattern mining and make predictions with decision-tree based classification. For illustration, we perform predictive analytics on real-life open big data available on Winnipeg Open Data Portal, about bus performance from Winnipeg Transit. It shows the benefits of predictive analytics on open big data for supporting smart transportation services.</abstract><pub>Elsevier B.V</pub><pmid>33042316</pmid><doi>10.1016/j.procs.2020.09.202</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1877-0509
ispartof Procedia computer science, 2020, Vol.176, p.3009-3018
issn 1877-0509
1877-0509
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7531986
source BACON - Elsevier - GLOBAL_SCIENCEDIRECT-OPENACCESS; ScienceDirect®
subjects big data
frequent patterns
large-scale systems
on-time performance
open data
Predictive analytics
software engineering
transportation data
Winnipeg open data
title Predictive analytics on open big data for supporting smart transportation services
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T14%3A24%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Predictive%20analytics%20on%20open%20big%20data%20for%20supporting%20smart%20transportation%20services&rft.jtitle=Procedia%20computer%20science&rft.au=F.%20Balbin,%20Paul%20Patrick&rft.date=2020&rft.volume=176&rft.spage=3009&rft.epage=3018&rft.pages=3009-3018&rft.issn=1877-0509&rft.eissn=1877-0509&rft_id=info:doi/10.1016/j.procs.2020.09.202&rft_dat=%3Cproquest_pubme%3E2450012102%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3512-657e53fe66adbe16dcd156d5fc56632ff6872e797f2fe600ced41ec1c418bece3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2450012102&rft_id=info:pmid/33042316&rfr_iscdi=true