Loading…
A Resource Aware Parallelized Back Propagation Neural Network in Enabling Efficient Large-Scale Digital Health Data Processing
Along with the development of digital health, efficient machine learning is anxiously needed to handle the growing health data. Among various machine learning algorithms, back propagation neural network (BPNN) shows great effectiveness in both academia and industrial fields. However, it is frequentl...
Saved in:
Published in: | IEEE access 2019, Vol.7, p.114700-114713 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c408t-1d4eb599c0c179a997648f78fa4ab3f395f9f9fbe83b1752f294caeaf24945e43 |
---|---|
cites | cdi_FETCH-LOGICAL-c408t-1d4eb599c0c179a997648f78fa4ab3f395f9f9fbe83b1752f294caeaf24945e43 |
container_end_page | 114713 |
container_issue | |
container_start_page | 114700 |
container_title | IEEE access |
container_volume | 7 |
creator | Liu, Yang Chen, Xianbang Xu, Lixiong Li, Huaqiang Li, Maozhen |
description | Along with the development of digital health, efficient machine learning is anxiously needed to handle the growing health data. Among various machine learning algorithms, back propagation neural network (BPNN) shows great effectiveness in both academia and industrial fields. However, it is frequently reported that the conventional BPNN algorithm encounters low efficiency issue in dealing with large-scale digital health data. Therefore this paper presents a Hadoop based parallelized BPNN algorithm which is able to process the large-scale data efficiently. In order to complement the potential accuracy loss issue for the parallelized data processing, ensemble learning techniques are also involved. Additionally although Hadoop supplies a number of default schedulers, the heterogeneous distributed computing environment may still impact the efficiency of the parallelized BPNN. Consequently, this paper also presents a gene expression programming (GEP) algorithm based load balancing approach, which enables the computing resource awareness and the optimal scheduling of the parallelized BPNN. The experiments employ the classification task as the underlying testing basis. Two types of the experiments are carried out, in which the first one focuses on evaluating the accuracy of the presented algorithm with classifying the benchmark dataset; the second one focuses on evaluating the efficiency of the presented algorithm with classifying the large-scale dataset. The experimental results show the effectiveness of the presented resource aware parallelized BPNN algorithm. |
doi_str_mv | 10.1109/ACCESS.2019.2935691 |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_a5429ee12d294f40bcc9371e9220db59</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8801835</ieee_id><doaj_id>oai_doaj_org_article_a5429ee12d294f40bcc9371e9220db59</doaj_id><sourcerecordid>2455599485</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-1d4eb599c0c179a997648f78fa4ab3f395f9f9fbe83b1752f294caeaf24945e43</originalsourceid><addsrcrecordid>eNpNkU9vEzEQxVcIpFaln6AXS5w3-G_WPoY00EoRVISerVnveHG6rIO9UUUP_ew4bFVhH8Yaz3t-8q-qrhhdMEbNx9V6vdntFpwys-BGqKVhb6pzzpamFkos3_53Pqsuc97TsnRpqea8el6R75jjMTkkq0dISO4gwTDgEJ6wI5_APZC7FA_QwxTiSL7isVyXMj3G9EDCSDYjtEMYe7LxPriA40S2kHqsdw4GJNehD1NR3CAM009yDROcDB3mXETvq3cehoyXL_Wiuv-8-bG-qbffvtyuV9vaSaqnmnUSW2WMo441BoxpllL7RnuQ0AovjPKm7Ba1aFmjuOdGOkDwXBqpUIqL6nb27SLs7SGFX5D-2AjB_mvE1FtIU3ADWlCSG0TGu2LiJW2dM6JhaDinXQlRvD7MXocUfx8xT3Zf_m8s8S2XSpWYUqsyJeYpl2LOCf3rq4zaEzc7c7MnbvaFW1FdzaqAiK8KrSnThd9fnOKUIg</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2455599485</pqid></control><display><type>article</type><title>A Resource Aware Parallelized Back Propagation Neural Network in Enabling Efficient Large-Scale Digital Health Data Processing</title><source>IEEE Xplore Open Access Journals</source><creator>Liu, Yang ; Chen, Xianbang ; Xu, Lixiong ; Li, Huaqiang ; Li, Maozhen</creator><creatorcontrib>Liu, Yang ; Chen, Xianbang ; Xu, Lixiong ; Li, Huaqiang ; Li, Maozhen</creatorcontrib><description>Along with the development of digital health, efficient machine learning is anxiously needed to handle the growing health data. Among various machine learning algorithms, back propagation neural network (BPNN) shows great effectiveness in both academia and industrial fields. However, it is frequently reported that the conventional BPNN algorithm encounters low efficiency issue in dealing with large-scale digital health data. Therefore this paper presents a Hadoop based parallelized BPNN algorithm which is able to process the large-scale data efficiently. In order to complement the potential accuracy loss issue for the parallelized data processing, ensemble learning techniques are also involved. Additionally although Hadoop supplies a number of default schedulers, the heterogeneous distributed computing environment may still impact the efficiency of the parallelized BPNN. Consequently, this paper also presents a gene expression programming (GEP) algorithm based load balancing approach, which enables the computing resource awareness and the optimal scheduling of the parallelized BPNN. The experiments employ the classification task as the underlying testing basis. Two types of the experiments are carried out, in which the first one focuses on evaluating the accuracy of the presented algorithm with classifying the benchmark dataset; the second one focuses on evaluating the efficiency of the presented algorithm with classifying the large-scale dataset. The experimental results show the effectiveness of the presented resource aware parallelized BPNN algorithm.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2019.2935691</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Algorithms ; Artificial neural networks ; Back propagation ; Back propagation networks ; Back propagation neural network ; Backpropagation ; Classification ; Computer networks ; Data processing ; Datasets ; Distributed computing ; Distributed processing ; Efficiency ; Environmental impact ; Evaluation ; Gene expression ; gene expression programming ; Hadoop ; load balancing ; Load management ; Machine learning ; Machine learning algorithms ; Neural networks ; Parallel processing ; parallelization ; Task scheduling ; Training</subject><ispartof>IEEE access, 2019, Vol.7, p.114700-114713</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2019</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-1d4eb599c0c179a997648f78fa4ab3f395f9f9fbe83b1752f294caeaf24945e43</citedby><cites>FETCH-LOGICAL-c408t-1d4eb599c0c179a997648f78fa4ab3f395f9f9fbe83b1752f294caeaf24945e43</cites><orcidid>0000-0002-7935-7146</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8801835$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,4010,27610,27900,27901,27902,54908</link.rule.ids></links><search><creatorcontrib>Liu, Yang</creatorcontrib><creatorcontrib>Chen, Xianbang</creatorcontrib><creatorcontrib>Xu, Lixiong</creatorcontrib><creatorcontrib>Li, Huaqiang</creatorcontrib><creatorcontrib>Li, Maozhen</creatorcontrib><title>A Resource Aware Parallelized Back Propagation Neural Network in Enabling Efficient Large-Scale Digital Health Data Processing</title><title>IEEE access</title><addtitle>Access</addtitle><description>Along with the development of digital health, efficient machine learning is anxiously needed to handle the growing health data. Among various machine learning algorithms, back propagation neural network (BPNN) shows great effectiveness in both academia and industrial fields. However, it is frequently reported that the conventional BPNN algorithm encounters low efficiency issue in dealing with large-scale digital health data. Therefore this paper presents a Hadoop based parallelized BPNN algorithm which is able to process the large-scale data efficiently. In order to complement the potential accuracy loss issue for the parallelized data processing, ensemble learning techniques are also involved. Additionally although Hadoop supplies a number of default schedulers, the heterogeneous distributed computing environment may still impact the efficiency of the parallelized BPNN. Consequently, this paper also presents a gene expression programming (GEP) algorithm based load balancing approach, which enables the computing resource awareness and the optimal scheduling of the parallelized BPNN. The experiments employ the classification task as the underlying testing basis. Two types of the experiments are carried out, in which the first one focuses on evaluating the accuracy of the presented algorithm with classifying the benchmark dataset; the second one focuses on evaluating the efficiency of the presented algorithm with classifying the large-scale dataset. The experimental results show the effectiveness of the presented resource aware parallelized BPNN algorithm.</description><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>Back propagation</subject><subject>Back propagation networks</subject><subject>Back propagation neural network</subject><subject>Backpropagation</subject><subject>Classification</subject><subject>Computer networks</subject><subject>Data processing</subject><subject>Datasets</subject><subject>Distributed computing</subject><subject>Distributed processing</subject><subject>Efficiency</subject><subject>Environmental impact</subject><subject>Evaluation</subject><subject>Gene expression</subject><subject>gene expression programming</subject><subject>Hadoop</subject><subject>load balancing</subject><subject>Load management</subject><subject>Machine learning</subject><subject>Machine learning algorithms</subject><subject>Neural networks</subject><subject>Parallel processing</subject><subject>parallelization</subject><subject>Task scheduling</subject><subject>Training</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>DOA</sourceid><recordid>eNpNkU9vEzEQxVcIpFaln6AXS5w3-G_WPoY00EoRVISerVnveHG6rIO9UUUP_ew4bFVhH8Yaz3t-8q-qrhhdMEbNx9V6vdntFpwys-BGqKVhb6pzzpamFkos3_53Pqsuc97TsnRpqea8el6R75jjMTkkq0dISO4gwTDgEJ6wI5_APZC7FA_QwxTiSL7isVyXMj3G9EDCSDYjtEMYe7LxPriA40S2kHqsdw4GJNehD1NR3CAM009yDROcDB3mXETvq3cehoyXL_Wiuv-8-bG-qbffvtyuV9vaSaqnmnUSW2WMo441BoxpllL7RnuQ0AovjPKm7Ba1aFmjuOdGOkDwXBqpUIqL6nb27SLs7SGFX5D-2AjB_mvE1FtIU3ADWlCSG0TGu2LiJW2dM6JhaDinXQlRvD7MXocUfx8xT3Zf_m8s8S2XSpWYUqsyJeYpl2LOCf3rq4zaEzc7c7MnbvaFW1FdzaqAiK8KrSnThd9fnOKUIg</recordid><startdate>2019</startdate><enddate>2019</enddate><creator>Liu, Yang</creator><creator>Chen, Xianbang</creator><creator>Xu, Lixiong</creator><creator>Li, Huaqiang</creator><creator>Li, Maozhen</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-7935-7146</orcidid></search><sort><creationdate>2019</creationdate><title>A Resource Aware Parallelized Back Propagation Neural Network in Enabling Efficient Large-Scale Digital Health Data Processing</title><author>Liu, Yang ; Chen, Xianbang ; Xu, Lixiong ; Li, Huaqiang ; Li, Maozhen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-1d4eb599c0c179a997648f78fa4ab3f395f9f9fbe83b1752f294caeaf24945e43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>Back propagation</topic><topic>Back propagation networks</topic><topic>Back propagation neural network</topic><topic>Backpropagation</topic><topic>Classification</topic><topic>Computer networks</topic><topic>Data processing</topic><topic>Datasets</topic><topic>Distributed computing</topic><topic>Distributed processing</topic><topic>Efficiency</topic><topic>Environmental impact</topic><topic>Evaluation</topic><topic>Gene expression</topic><topic>gene expression programming</topic><topic>Hadoop</topic><topic>load balancing</topic><topic>Load management</topic><topic>Machine learning</topic><topic>Machine learning algorithms</topic><topic>Neural networks</topic><topic>Parallel processing</topic><topic>parallelization</topic><topic>Task scheduling</topic><topic>Training</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Liu, Yang</creatorcontrib><creatorcontrib>Chen, Xianbang</creatorcontrib><creatorcontrib>Xu, Lixiong</creatorcontrib><creatorcontrib>Li, Huaqiang</creatorcontrib><creatorcontrib>Li, Maozhen</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998–Present</collection><collection>IEEE Explore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Liu, Yang</au><au>Chen, Xianbang</au><au>Xu, Lixiong</au><au>Li, Huaqiang</au><au>Li, Maozhen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Resource Aware Parallelized Back Propagation Neural Network in Enabling Efficient Large-Scale Digital Health Data Processing</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2019</date><risdate>2019</risdate><volume>7</volume><spage>114700</spage><epage>114713</epage><pages>114700-114713</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Along with the development of digital health, efficient machine learning is anxiously needed to handle the growing health data. Among various machine learning algorithms, back propagation neural network (BPNN) shows great effectiveness in both academia and industrial fields. However, it is frequently reported that the conventional BPNN algorithm encounters low efficiency issue in dealing with large-scale digital health data. Therefore this paper presents a Hadoop based parallelized BPNN algorithm which is able to process the large-scale data efficiently. In order to complement the potential accuracy loss issue for the parallelized data processing, ensemble learning techniques are also involved. Additionally although Hadoop supplies a number of default schedulers, the heterogeneous distributed computing environment may still impact the efficiency of the parallelized BPNN. Consequently, this paper also presents a gene expression programming (GEP) algorithm based load balancing approach, which enables the computing resource awareness and the optimal scheduling of the parallelized BPNN. The experiments employ the classification task as the underlying testing basis. Two types of the experiments are carried out, in which the first one focuses on evaluating the accuracy of the presented algorithm with classifying the benchmark dataset; the second one focuses on evaluating the efficiency of the presented algorithm with classifying the large-scale dataset. The experimental results show the effectiveness of the presented resource aware parallelized BPNN algorithm.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2019.2935691</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0002-7935-7146</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2019, Vol.7, p.114700-114713 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_a5429ee12d294f40bcc9371e9220db59 |
source | IEEE Xplore Open Access Journals |
subjects | Algorithms Artificial neural networks Back propagation Back propagation networks Back propagation neural network Backpropagation Classification Computer networks Data processing Datasets Distributed computing Distributed processing Efficiency Environmental impact Evaluation Gene expression gene expression programming Hadoop load balancing Load management Machine learning Machine learning algorithms Neural networks Parallel processing parallelization Task scheduling Training |
title | A Resource Aware Parallelized Back Propagation Neural Network in Enabling Efficient Large-Scale Digital Health Data Processing |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-23T19%3A24%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Resource%20Aware%20Parallelized%20Back%20Propagation%20Neural%20Network%20in%20Enabling%20Efficient%20Large-Scale%20Digital%20Health%20Data%20Processing&rft.jtitle=IEEE%20access&rft.au=Liu,%20Yang&rft.date=2019&rft.volume=7&rft.spage=114700&rft.epage=114713&rft.pages=114700-114713&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2019.2935691&rft_dat=%3Cproquest_doaj_%3E2455599485%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c408t-1d4eb599c0c179a997648f78fa4ab3f395f9f9fbe83b1752f294caeaf24945e43%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2455599485&rft_id=info:pmid/&rft_ieee_id=8801835&rfr_iscdi=true |