Loading…

A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop

The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking,...

Full description

Saved in:
Bibliographic Details
Published in:Turkish journal of computer and mathematics education 2021-04, Vol.12 (2), p.1546-1563
Main Authors: Kumar, A Ravi, Singh, Harsh Pratap, Kumar, G Anil
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 1563
container_issue 2
container_start_page 1546
container_title Turkish journal of computer and mathematics education
container_volume 12
creator Kumar, A Ravi
Singh, Harsh Pratap
Kumar, G Anil
description The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages.
doi_str_mv 10.17762/turcomat.v12i2.1432
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2624697283</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2624697283</sourcerecordid><originalsourceid>FETCH-LOGICAL-c143t-404333f8a76a6ea601a68159f501fd632d8b1607dbcbe97f6a05710cb1d745453</originalsourceid><addsrcrecordid>eNpNkF1LwzAUhoMoOHT_wIuA1535atJezm26wUQRdx3SNpkZbVKTtuK_t24KXp0XznM-eAC4wWiGheDkrutD6RvVzQZMLJlhRskZmGCK8oTxlJ7_y5dgGuMBIYRTwbKMT8D7HL7qwepP6B1c6mj3DipXjXHQtW8b7TroDXzRwfjQKFdquBpU3avOjgNPvtI1HDvw3vpk445MZ8sIl6pTcBet28O1qrxvr8GFUXXU0996BXYPq7fFOtk-P24W821Sjo93CUOMUmoyJbjiWnGEFc9wmpsUYVNxSqqswByJqigLnQvDFUoFRmWBK8FSltIrcHva2wb_0evYyYPvgxtPSsIJ47kgGR0pdqLK4GMM2sg22EaFL4mRPGqVf1rlUav80Uq_Ac8jbZs</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2624697283</pqid></control><display><type>article</type><title>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</title><source>Publicly Available Content Database</source><source>Social Science Premium Collection</source><source>Education Collection</source><creator>Kumar, A Ravi ; Singh, Harsh Pratap ; Kumar, G Anil</creator><creatorcontrib>Kumar, A Ravi ; Singh, Harsh Pratap ; Kumar, G Anil</creatorcontrib><description>The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages.</description><identifier>ISSN: 1309-4653</identifier><identifier>EISSN: 1309-4653</identifier><identifier>DOI: 10.17762/turcomat.v12i2.1432</identifier><language>eng</language><publisher>Gurgaon: Ninety Nine Publication</publisher><subject>Algorithms ; Bioinformatics ; Clustering ; Coordination compounds ; Fractions ; Ligands ; Performance evaluation ; Proteins ; Workflow</subject><ispartof>Turkish journal of computer and mathematics education, 2021-04, Vol.12 (2), p.1546-1563</ispartof><rights>2021. This work is published under https://creativecommons.org/licenses/by/4.0 (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2624697283?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,21378,21394,25753,27924,27925,33611,33877,37012,43733,43880,44590</link.rule.ids></links><search><creatorcontrib>Kumar, A Ravi</creatorcontrib><creatorcontrib>Singh, Harsh Pratap</creatorcontrib><creatorcontrib>Kumar, G Anil</creatorcontrib><title>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</title><title>Turkish journal of computer and mathematics education</title><description>The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages.</description><subject>Algorithms</subject><subject>Bioinformatics</subject><subject>Clustering</subject><subject>Coordination compounds</subject><subject>Fractions</subject><subject>Ligands</subject><subject>Performance evaluation</subject><subject>Proteins</subject><subject>Workflow</subject><issn>1309-4653</issn><issn>1309-4653</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ALSLI</sourceid><sourceid>CJNVE</sourceid><sourceid>M0P</sourceid><sourceid>PIMPY</sourceid><recordid>eNpNkF1LwzAUhoMoOHT_wIuA1535atJezm26wUQRdx3SNpkZbVKTtuK_t24KXp0XznM-eAC4wWiGheDkrutD6RvVzQZMLJlhRskZmGCK8oTxlJ7_y5dgGuMBIYRTwbKMT8D7HL7qwepP6B1c6mj3DipXjXHQtW8b7TroDXzRwfjQKFdquBpU3avOjgNPvtI1HDvw3vpk445MZ8sIl6pTcBet28O1qrxvr8GFUXXU0996BXYPq7fFOtk-P24W821Sjo93CUOMUmoyJbjiWnGEFc9wmpsUYVNxSqqswByJqigLnQvDFUoFRmWBK8FSltIrcHva2wb_0evYyYPvgxtPSsIJ47kgGR0pdqLK4GMM2sg22EaFL4mRPGqVf1rlUav80Uq_Ac8jbZs</recordid><startdate>20210410</startdate><enddate>20210410</enddate><creator>Kumar, A Ravi</creator><creator>Singh, Harsh Pratap</creator><creator>Kumar, G Anil</creator><general>Ninety Nine Publication</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>3V.</scope><scope>7XB</scope><scope>88B</scope><scope>88I</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>DWQXO</scope><scope>EDSIH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M0N</scope><scope>M0P</scope><scope>M2P</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20210410</creationdate><title>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</title><author>Kumar, A Ravi ; Singh, Harsh Pratap ; Kumar, G Anil</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c143t-404333f8a76a6ea601a68159f501fd632d8b1607dbcbe97f6a05710cb1d745453</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Bioinformatics</topic><topic>Clustering</topic><topic>Coordination compounds</topic><topic>Fractions</topic><topic>Ligands</topic><topic>Performance evaluation</topic><topic>Proteins</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kumar, A Ravi</creatorcontrib><creatorcontrib>Singh, Harsh Pratap</creatorcontrib><creatorcontrib>Kumar, G Anil</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection【Remote access available】</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Education Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>ProQuest Central Korea</collection><collection>Turkey Database</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Computing Database</collection><collection>Education Database</collection><collection>Science Database (ProQuest)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Turkish journal of computer and mathematics education</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kumar, A Ravi</au><au>Singh, Harsh Pratap</au><au>Kumar, G Anil</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</atitle><jtitle>Turkish journal of computer and mathematics education</jtitle><date>2021-04-10</date><risdate>2021</risdate><volume>12</volume><issue>2</issue><spage>1546</spage><epage>1563</epage><pages>1546-1563</pages><issn>1309-4653</issn><eissn>1309-4653</eissn><abstract>The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages.</abstract><cop>Gurgaon</cop><pub>Ninety Nine Publication</pub><doi>10.17762/turcomat.v12i2.1432</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1309-4653
ispartof Turkish journal of computer and mathematics education, 2021-04, Vol.12 (2), p.1546-1563
issn 1309-4653
1309-4653
language eng
recordid cdi_proquest_journals_2624697283
source Publicly Available Content Database; Social Science Premium Collection; Education Collection
subjects Algorithms
Bioinformatics
Clustering
Coordination compounds
Fractions
Ligands
Performance evaluation
Proteins
Workflow
title A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T21%3A22%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Review%20on%20Design%20and%20Development%20of%20Performance%20Evaluation%20Model%20for%20Bio-Informatics%20Data%20Using%20Hadoop&rft.jtitle=Turkish%20journal%20of%20computer%20and%20mathematics%20education&rft.au=Kumar,%20A%20Ravi&rft.date=2021-04-10&rft.volume=12&rft.issue=2&rft.spage=1546&rft.epage=1563&rft.pages=1546-1563&rft.issn=1309-4653&rft.eissn=1309-4653&rft_id=info:doi/10.17762/turcomat.v12i2.1432&rft_dat=%3Cproquest_cross%3E2624697283%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c143t-404333f8a76a6ea601a68159f501fd632d8b1607dbcbe97f6a05710cb1d745453%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2624697283&rft_id=info:pmid/&rfr_iscdi=true