Loading…
A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop
The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking,...
Saved in:
Published in: | Turkish journal of computer and mathematics education 2021-04, Vol.12 (2), p.1546-1563 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 1563 |
container_issue | 2 |
container_start_page | 1546 |
container_title | Turkish journal of computer and mathematics education |
container_volume | 12 |
creator | Kumar, A Ravi Singh, Harsh Pratap Kumar, G Anil |
description | The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages. |
doi_str_mv | 10.17762/turcomat.v12i2.1432 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2624697283</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2624697283</sourcerecordid><originalsourceid>FETCH-LOGICAL-c143t-404333f8a76a6ea601a68159f501fd632d8b1607dbcbe97f6a05710cb1d745453</originalsourceid><addsrcrecordid>eNpNkF1LwzAUhoMoOHT_wIuA1535atJezm26wUQRdx3SNpkZbVKTtuK_t24KXp0XznM-eAC4wWiGheDkrutD6RvVzQZMLJlhRskZmGCK8oTxlJ7_y5dgGuMBIYRTwbKMT8D7HL7qwepP6B1c6mj3DipXjXHQtW8b7TroDXzRwfjQKFdquBpU3avOjgNPvtI1HDvw3vpk445MZ8sIl6pTcBet28O1qrxvr8GFUXXU0996BXYPq7fFOtk-P24W821Sjo93CUOMUmoyJbjiWnGEFc9wmpsUYVNxSqqswByJqigLnQvDFUoFRmWBK8FSltIrcHva2wb_0evYyYPvgxtPSsIJ47kgGR0pdqLK4GMM2sg22EaFL4mRPGqVf1rlUav80Uq_Ac8jbZs</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2624697283</pqid></control><display><type>article</type><title>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</title><source>Publicly Available Content Database</source><source>Social Science Premium Collection</source><source>Education Collection</source><creator>Kumar, A Ravi ; Singh, Harsh Pratap ; Kumar, G Anil</creator><creatorcontrib>Kumar, A Ravi ; Singh, Harsh Pratap ; Kumar, G Anil</creatorcontrib><description>The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages.</description><identifier>ISSN: 1309-4653</identifier><identifier>EISSN: 1309-4653</identifier><identifier>DOI: 10.17762/turcomat.v12i2.1432</identifier><language>eng</language><publisher>Gurgaon: Ninety Nine Publication</publisher><subject>Algorithms ; Bioinformatics ; Clustering ; Coordination compounds ; Fractions ; Ligands ; Performance evaluation ; Proteins ; Workflow</subject><ispartof>Turkish journal of computer and mathematics education, 2021-04, Vol.12 (2), p.1546-1563</ispartof><rights>2021. This work is published under https://creativecommons.org/licenses/by/4.0 (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2624697283?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,21378,21394,25753,27924,27925,33611,33877,37012,43733,43880,44590</link.rule.ids></links><search><creatorcontrib>Kumar, A Ravi</creatorcontrib><creatorcontrib>Singh, Harsh Pratap</creatorcontrib><creatorcontrib>Kumar, G Anil</creatorcontrib><title>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</title><title>Turkish journal of computer and mathematics education</title><description>The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages.</description><subject>Algorithms</subject><subject>Bioinformatics</subject><subject>Clustering</subject><subject>Coordination compounds</subject><subject>Fractions</subject><subject>Ligands</subject><subject>Performance evaluation</subject><subject>Proteins</subject><subject>Workflow</subject><issn>1309-4653</issn><issn>1309-4653</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ALSLI</sourceid><sourceid>CJNVE</sourceid><sourceid>M0P</sourceid><sourceid>PIMPY</sourceid><recordid>eNpNkF1LwzAUhoMoOHT_wIuA1535atJezm26wUQRdx3SNpkZbVKTtuK_t24KXp0XznM-eAC4wWiGheDkrutD6RvVzQZMLJlhRskZmGCK8oTxlJ7_y5dgGuMBIYRTwbKMT8D7HL7qwepP6B1c6mj3DipXjXHQtW8b7TroDXzRwfjQKFdquBpU3avOjgNPvtI1HDvw3vpk445MZ8sIl6pTcBet28O1qrxvr8GFUXXU0996BXYPq7fFOtk-P24W821Sjo93CUOMUmoyJbjiWnGEFc9wmpsUYVNxSqqswByJqigLnQvDFUoFRmWBK8FSltIrcHva2wb_0evYyYPvgxtPSsIJ47kgGR0pdqLK4GMM2sg22EaFL4mRPGqVf1rlUav80Uq_Ac8jbZs</recordid><startdate>20210410</startdate><enddate>20210410</enddate><creator>Kumar, A Ravi</creator><creator>Singh, Harsh Pratap</creator><creator>Kumar, G Anil</creator><general>Ninety Nine Publication</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>3V.</scope><scope>7XB</scope><scope>88B</scope><scope>88I</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>DWQXO</scope><scope>EDSIH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M0N</scope><scope>M0P</scope><scope>M2P</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20210410</creationdate><title>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</title><author>Kumar, A Ravi ; Singh, Harsh Pratap ; Kumar, G Anil</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c143t-404333f8a76a6ea601a68159f501fd632d8b1607dbcbe97f6a05710cb1d745453</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Bioinformatics</topic><topic>Clustering</topic><topic>Coordination compounds</topic><topic>Fractions</topic><topic>Ligands</topic><topic>Performance evaluation</topic><topic>Proteins</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kumar, A Ravi</creatorcontrib><creatorcontrib>Singh, Harsh Pratap</creatorcontrib><creatorcontrib>Kumar, G Anil</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection【Remote access available】</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Education Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>ProQuest Central Korea</collection><collection>Turkey Database</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Computing Database</collection><collection>Education Database</collection><collection>Science Database (ProQuest)</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Turkish journal of computer and mathematics education</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kumar, A Ravi</au><au>Singh, Harsh Pratap</au><au>Kumar, G Anil</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop</atitle><jtitle>Turkish journal of computer and mathematics education</jtitle><date>2021-04-10</date><risdate>2021</risdate><volume>12</volume><issue>2</issue><spage>1546</spage><epage>1563</epage><pages>1546-1563</pages><issn>1309-4653</issn><eissn>1309-4653</eissn><abstract>The paper reviews the usage of the platform Hadoop in applications for systemic bioinformatics. Hadoop offers another system for Structural Bioinformatics to break down broad fractions of the Protein Data Bank that is crucial to high-throughput investigations of (for example) protein-ligand docking, protein-ligand complex clustering, and structural alignment. In specific, we review different applications of high-throughput analyses and their scalability in the literature using Hadoop. In comparison to revising the algorithms, we find that these organisations typically use a realized executable called MapReduce. Scalability demonstrates variable behavior in correlation with other batch schedulers, particularly as immediate examinations are usually not accessible on a similar platform. Direct Hadoop examinations with batch schedulers are missing in the literature, but we note that there is some evidence that the scale of MPI executions is better than Hadoop. The dilemma of the interface and structure of an asset to use Hadoop is a significant obstacle to the utilization of the Hadoop biological framework. This will enhance additional time as Hadoop interfaces, such as enhancing Flash, increasing the use of cloud platforms, and normalized approaches, for example, are taken up by Workflow Languages.</abstract><cop>Gurgaon</cop><pub>Ninety Nine Publication</pub><doi>10.17762/turcomat.v12i2.1432</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1309-4653 |
ispartof | Turkish journal of computer and mathematics education, 2021-04, Vol.12 (2), p.1546-1563 |
issn | 1309-4653 1309-4653 |
language | eng |
recordid | cdi_proquest_journals_2624697283 |
source | Publicly Available Content Database; Social Science Premium Collection; Education Collection |
subjects | Algorithms Bioinformatics Clustering Coordination compounds Fractions Ligands Performance evaluation Proteins Workflow |
title | A Review on Design and Development of Performance Evaluation Model for Bio-Informatics Data Using Hadoop |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T21%3A22%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Review%20on%20Design%20and%20Development%20of%20Performance%20Evaluation%20Model%20for%20Bio-Informatics%20Data%20Using%20Hadoop&rft.jtitle=Turkish%20journal%20of%20computer%20and%20mathematics%20education&rft.au=Kumar,%20A%20Ravi&rft.date=2021-04-10&rft.volume=12&rft.issue=2&rft.spage=1546&rft.epage=1563&rft.pages=1546-1563&rft.issn=1309-4653&rft.eissn=1309-4653&rft_id=info:doi/10.17762/turcomat.v12i2.1432&rft_dat=%3Cproquest_cross%3E2624697283%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c143t-404333f8a76a6ea601a68159f501fd632d8b1607dbcbe97f6a05710cb1d745453%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2624697283&rft_id=info:pmid/&rfr_iscdi=true |