Loading…

MapReduce: Review and open challenges

The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a majo...

Full description

Saved in:
Bibliographic Details
Published in:Scientometrics 2016-10, Vol.109 (1), p.389-422
Main Authors: Hashem, Ibrahim Abaker Targio, Anuar, Nor Badrul, Gani, Abdullah, Yaqoob, Ibrar, Xia, Feng, Khan, Samee Ullah
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3
cites cdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3
container_end_page 422
container_issue 1
container_start_page 389
container_title Scientometrics
container_volume 109
creator Hashem, Ibrahim Abaker Targio
Anuar, Nor Badrul
Gani, Abdullah
Yaqoob, Ibrar
Xia, Feng
Khan, Samee Ullah
description The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.
doi_str_mv 10.1007/s11192-016-1945-y
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1880875281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1880875281</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</originalsourceid><addsrcrecordid>eNp1kEFLAzEQhYMoWKs_wNuCeIzOJN3NrDcpVoWKUPQckmyiLXV3TVql_96U9eDF01ze9x7zMXaOcIUA6johYi04YMWxnpR8d8BGWBJxQRUeshGgJF6jhGN2ktIKMiOBRuzyyfQL32ydvykW_mvpvwvTNkXX-7Zw72a99u2bT6fsKJh18me_d8xeZ3cv0wc-f75_nN7OuZNYbbibADpFrnKBQDhRhUBBBolW2ZqkCmWjwMnaBEHKWFnZpgzSGqcaYZW0cswuht4-dp9bnzZ61W1jmyc1EgGpUhDmFA4pF7uUog-6j8sPE3caQe9t6MGGzjb03obeZUYMTMrZ_FL80_wv9AMKAWFk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1880875281</pqid></control><display><type>article</type><title>MapReduce: Review and open challenges</title><source>Library &amp; Information Science Abstracts (LISA)</source><source>Springer Nature</source><creator>Hashem, Ibrahim Abaker Targio ; Anuar, Nor Badrul ; Gani, Abdullah ; Yaqoob, Ibrar ; Xia, Feng ; Khan, Samee Ullah</creator><creatorcontrib>Hashem, Ibrahim Abaker Targio ; Anuar, Nor Badrul ; Gani, Abdullah ; Yaqoob, Ibrar ; Xia, Feng ; Khan, Samee Ullah</creatorcontrib><description>The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.</description><identifier>ISSN: 0138-9130</identifier><identifier>EISSN: 1588-2861</identifier><identifier>DOI: 10.1007/s11192-016-1945-y</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Authoring ; Bibliometrics ; Big Data ; Computation ; Computer applications ; Computer Science ; Data management ; Data processing ; Documents ; Exploration ; Fault tolerance ; Information Storage and Retrieval ; Library Science ; Reviews ; Scientific visualization ; Visualization</subject><ispartof>Scientometrics, 2016-10, Vol.109 (1), p.389-422</ispartof><rights>Akadémiai Kiadó, Budapest, Hungary 2016</rights><rights>Copyright Springer Science &amp; Business Media 2016</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</citedby><cites>FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924,34134</link.rule.ids></links><search><creatorcontrib>Hashem, Ibrahim Abaker Targio</creatorcontrib><creatorcontrib>Anuar, Nor Badrul</creatorcontrib><creatorcontrib>Gani, Abdullah</creatorcontrib><creatorcontrib>Yaqoob, Ibrar</creatorcontrib><creatorcontrib>Xia, Feng</creatorcontrib><creatorcontrib>Khan, Samee Ullah</creatorcontrib><title>MapReduce: Review and open challenges</title><title>Scientometrics</title><addtitle>Scientometrics</addtitle><description>The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.</description><subject>Authoring</subject><subject>Bibliometrics</subject><subject>Big Data</subject><subject>Computation</subject><subject>Computer applications</subject><subject>Computer Science</subject><subject>Data management</subject><subject>Data processing</subject><subject>Documents</subject><subject>Exploration</subject><subject>Fault tolerance</subject><subject>Information Storage and Retrieval</subject><subject>Library Science</subject><subject>Reviews</subject><subject>Scientific visualization</subject><subject>Visualization</subject><issn>0138-9130</issn><issn>1588-2861</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>F2A</sourceid><recordid>eNp1kEFLAzEQhYMoWKs_wNuCeIzOJN3NrDcpVoWKUPQckmyiLXV3TVql_96U9eDF01ze9x7zMXaOcIUA6johYi04YMWxnpR8d8BGWBJxQRUeshGgJF6jhGN2ktIKMiOBRuzyyfQL32ydvykW_mvpvwvTNkXX-7Zw72a99u2bT6fsKJh18me_d8xeZ3cv0wc-f75_nN7OuZNYbbibADpFrnKBQDhRhUBBBolW2ZqkCmWjwMnaBEHKWFnZpgzSGqcaYZW0cswuht4-dp9bnzZ61W1jmyc1EgGpUhDmFA4pF7uUog-6j8sPE3caQe9t6MGGzjb03obeZUYMTMrZ_FL80_wv9AMKAWFk</recordid><startdate>20161001</startdate><enddate>20161001</enddate><creator>Hashem, Ibrahim Abaker Targio</creator><creator>Anuar, Nor Badrul</creator><creator>Gani, Abdullah</creator><creator>Yaqoob, Ibrar</creator><creator>Xia, Feng</creator><creator>Khan, Samee Ullah</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope></search><sort><creationdate>20161001</creationdate><title>MapReduce: Review and open challenges</title><author>Hashem, Ibrahim Abaker Targio ; Anuar, Nor Badrul ; Gani, Abdullah ; Yaqoob, Ibrar ; Xia, Feng ; Khan, Samee Ullah</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Authoring</topic><topic>Bibliometrics</topic><topic>Big Data</topic><topic>Computation</topic><topic>Computer applications</topic><topic>Computer Science</topic><topic>Data management</topic><topic>Data processing</topic><topic>Documents</topic><topic>Exploration</topic><topic>Fault tolerance</topic><topic>Information Storage and Retrieval</topic><topic>Library Science</topic><topic>Reviews</topic><topic>Scientific visualization</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hashem, Ibrahim Abaker Targio</creatorcontrib><creatorcontrib>Anuar, Nor Badrul</creatorcontrib><creatorcontrib>Gani, Abdullah</creatorcontrib><creatorcontrib>Yaqoob, Ibrar</creatorcontrib><creatorcontrib>Xia, Feng</creatorcontrib><creatorcontrib>Khan, Samee Ullah</creatorcontrib><collection>CrossRef</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><jtitle>Scientometrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hashem, Ibrahim Abaker Targio</au><au>Anuar, Nor Badrul</au><au>Gani, Abdullah</au><au>Yaqoob, Ibrar</au><au>Xia, Feng</au><au>Khan, Samee Ullah</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MapReduce: Review and open challenges</atitle><jtitle>Scientometrics</jtitle><stitle>Scientometrics</stitle><date>2016-10-01</date><risdate>2016</risdate><volume>109</volume><issue>1</issue><spage>389</spage><epage>422</epage><pages>389-422</pages><issn>0138-9130</issn><eissn>1588-2861</eissn><abstract>The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><doi>10.1007/s11192-016-1945-y</doi><tpages>34</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0138-9130
ispartof Scientometrics, 2016-10, Vol.109 (1), p.389-422
issn 0138-9130
1588-2861
language eng
recordid cdi_proquest_journals_1880875281
source Library & Information Science Abstracts (LISA); Springer Nature
subjects Authoring
Bibliometrics
Big Data
Computation
Computer applications
Computer Science
Data management
Data processing
Documents
Exploration
Fault tolerance
Information Storage and Retrieval
Library Science
Reviews
Scientific visualization
Visualization
title MapReduce: Review and open challenges
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T22%3A56%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MapReduce:%20Review%20and%20open%20challenges&rft.jtitle=Scientometrics&rft.au=Hashem,%20Ibrahim%20Abaker%20Targio&rft.date=2016-10-01&rft.volume=109&rft.issue=1&rft.spage=389&rft.epage=422&rft.pages=389-422&rft.issn=0138-9130&rft.eissn=1588-2861&rft_id=info:doi/10.1007/s11192-016-1945-y&rft_dat=%3Cproquest_cross%3E1880875281%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1880875281&rft_id=info:pmid/&rfr_iscdi=true