Loading…
MapReduce: Review and open challenges
The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a majo...
Saved in:
Published in: | Scientometrics 2016-10, Vol.109 (1), p.389-422 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3 |
---|---|
cites | cdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3 |
container_end_page | 422 |
container_issue | 1 |
container_start_page | 389 |
container_title | Scientometrics |
container_volume | 109 |
creator | Hashem, Ibrahim Abaker Targio Anuar, Nor Badrul Gani, Abdullah Yaqoob, Ibrar Xia, Feng Khan, Samee Ullah |
description | The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions. |
doi_str_mv | 10.1007/s11192-016-1945-y |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1880875281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1880875281</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</originalsourceid><addsrcrecordid>eNp1kEFLAzEQhYMoWKs_wNuCeIzOJN3NrDcpVoWKUPQckmyiLXV3TVql_96U9eDF01ze9x7zMXaOcIUA6johYi04YMWxnpR8d8BGWBJxQRUeshGgJF6jhGN2ktIKMiOBRuzyyfQL32ydvykW_mvpvwvTNkXX-7Zw72a99u2bT6fsKJh18me_d8xeZ3cv0wc-f75_nN7OuZNYbbibADpFrnKBQDhRhUBBBolW2ZqkCmWjwMnaBEHKWFnZpgzSGqcaYZW0cswuht4-dp9bnzZ61W1jmyc1EgGpUhDmFA4pF7uUog-6j8sPE3caQe9t6MGGzjb03obeZUYMTMrZ_FL80_wv9AMKAWFk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1880875281</pqid></control><display><type>article</type><title>MapReduce: Review and open challenges</title><source>Library & Information Science Abstracts (LISA)</source><source>Springer Nature</source><creator>Hashem, Ibrahim Abaker Targio ; Anuar, Nor Badrul ; Gani, Abdullah ; Yaqoob, Ibrar ; Xia, Feng ; Khan, Samee Ullah</creator><creatorcontrib>Hashem, Ibrahim Abaker Targio ; Anuar, Nor Badrul ; Gani, Abdullah ; Yaqoob, Ibrar ; Xia, Feng ; Khan, Samee Ullah</creatorcontrib><description>The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.</description><identifier>ISSN: 0138-9130</identifier><identifier>EISSN: 1588-2861</identifier><identifier>DOI: 10.1007/s11192-016-1945-y</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Authoring ; Bibliometrics ; Big Data ; Computation ; Computer applications ; Computer Science ; Data management ; Data processing ; Documents ; Exploration ; Fault tolerance ; Information Storage and Retrieval ; Library Science ; Reviews ; Scientific visualization ; Visualization</subject><ispartof>Scientometrics, 2016-10, Vol.109 (1), p.389-422</ispartof><rights>Akadémiai Kiadó, Budapest, Hungary 2016</rights><rights>Copyright Springer Science & Business Media 2016</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</citedby><cites>FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924,34134</link.rule.ids></links><search><creatorcontrib>Hashem, Ibrahim Abaker Targio</creatorcontrib><creatorcontrib>Anuar, Nor Badrul</creatorcontrib><creatorcontrib>Gani, Abdullah</creatorcontrib><creatorcontrib>Yaqoob, Ibrar</creatorcontrib><creatorcontrib>Xia, Feng</creatorcontrib><creatorcontrib>Khan, Samee Ullah</creatorcontrib><title>MapReduce: Review and open challenges</title><title>Scientometrics</title><addtitle>Scientometrics</addtitle><description>The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.</description><subject>Authoring</subject><subject>Bibliometrics</subject><subject>Big Data</subject><subject>Computation</subject><subject>Computer applications</subject><subject>Computer Science</subject><subject>Data management</subject><subject>Data processing</subject><subject>Documents</subject><subject>Exploration</subject><subject>Fault tolerance</subject><subject>Information Storage and Retrieval</subject><subject>Library Science</subject><subject>Reviews</subject><subject>Scientific visualization</subject><subject>Visualization</subject><issn>0138-9130</issn><issn>1588-2861</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>F2A</sourceid><recordid>eNp1kEFLAzEQhYMoWKs_wNuCeIzOJN3NrDcpVoWKUPQckmyiLXV3TVql_96U9eDF01ze9x7zMXaOcIUA6johYi04YMWxnpR8d8BGWBJxQRUeshGgJF6jhGN2ktIKMiOBRuzyyfQL32ydvykW_mvpvwvTNkXX-7Zw72a99u2bT6fsKJh18me_d8xeZ3cv0wc-f75_nN7OuZNYbbibADpFrnKBQDhRhUBBBolW2ZqkCmWjwMnaBEHKWFnZpgzSGqcaYZW0cswuht4-dp9bnzZ61W1jmyc1EgGpUhDmFA4pF7uUog-6j8sPE3caQe9t6MGGzjb03obeZUYMTMrZ_FL80_wv9AMKAWFk</recordid><startdate>20161001</startdate><enddate>20161001</enddate><creator>Hashem, Ibrahim Abaker Targio</creator><creator>Anuar, Nor Badrul</creator><creator>Gani, Abdullah</creator><creator>Yaqoob, Ibrar</creator><creator>Xia, Feng</creator><creator>Khan, Samee Ullah</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope></search><sort><creationdate>20161001</creationdate><title>MapReduce: Review and open challenges</title><author>Hashem, Ibrahim Abaker Targio ; Anuar, Nor Badrul ; Gani, Abdullah ; Yaqoob, Ibrar ; Xia, Feng ; Khan, Samee Ullah</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Authoring</topic><topic>Bibliometrics</topic><topic>Big Data</topic><topic>Computation</topic><topic>Computer applications</topic><topic>Computer Science</topic><topic>Data management</topic><topic>Data processing</topic><topic>Documents</topic><topic>Exploration</topic><topic>Fault tolerance</topic><topic>Information Storage and Retrieval</topic><topic>Library Science</topic><topic>Reviews</topic><topic>Scientific visualization</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hashem, Ibrahim Abaker Targio</creatorcontrib><creatorcontrib>Anuar, Nor Badrul</creatorcontrib><creatorcontrib>Gani, Abdullah</creatorcontrib><creatorcontrib>Yaqoob, Ibrar</creatorcontrib><creatorcontrib>Xia, Feng</creatorcontrib><creatorcontrib>Khan, Samee Ullah</creatorcontrib><collection>CrossRef</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><jtitle>Scientometrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hashem, Ibrahim Abaker Targio</au><au>Anuar, Nor Badrul</au><au>Gani, Abdullah</au><au>Yaqoob, Ibrar</au><au>Xia, Feng</au><au>Khan, Samee Ullah</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MapReduce: Review and open challenges</atitle><jtitle>Scientometrics</jtitle><stitle>Scientometrics</stitle><date>2016-10-01</date><risdate>2016</risdate><volume>109</volume><issue>1</issue><spage>389</spage><epage>422</epage><pages>389-422</pages><issn>0138-9130</issn><eissn>1588-2861</eissn><abstract>The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><doi>10.1007/s11192-016-1945-y</doi><tpages>34</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0138-9130 |
ispartof | Scientometrics, 2016-10, Vol.109 (1), p.389-422 |
issn | 0138-9130 1588-2861 |
language | eng |
recordid | cdi_proquest_journals_1880875281 |
source | Library & Information Science Abstracts (LISA); Springer Nature |
subjects | Authoring Bibliometrics Big Data Computation Computer applications Computer Science Data management Data processing Documents Exploration Fault tolerance Information Storage and Retrieval Library Science Reviews Scientific visualization Visualization |
title | MapReduce: Review and open challenges |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T22%3A56%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MapReduce:%20Review%20and%20open%20challenges&rft.jtitle=Scientometrics&rft.au=Hashem,%20Ibrahim%20Abaker%20Targio&rft.date=2016-10-01&rft.volume=109&rft.issue=1&rft.spage=389&rft.epage=422&rft.pages=389-422&rft.issn=0138-9130&rft.eissn=1588-2861&rft_id=info:doi/10.1007/s11192-016-1945-y&rft_dat=%3Cproquest_cross%3E1880875281%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c316t-c401c78c6cf802c26ff8f3f31b7b9837f5d70c39af287ab36bd5f3bac7d2b73b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1880875281&rft_id=info:pmid/&rfr_iscdi=true |