↓ Skip to main content

MRPack: Multi-Algorithm Execution Using Compute-Intensive Approach in MapReduce

Overview of attention for article published in PLOS ONE, August 2015
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
2 X users

Citations

dimensions_citation
4 Dimensions

Readers on

mendeley
16 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
MRPack: Multi-Algorithm Execution Using Compute-Intensive Approach in MapReduce
Published in
PLOS ONE, August 2015
DOI 10.1371/journal.pone.0136259
Pubmed ID
Authors

Muhammad Idris, Shujaat Hussain, Muhammad Hameed Siddiqi, Waseem Hassan, Hafiz Syed Muhammad Bilal, Sungyoung Lee

Abstract

Large quantities of data have been generated from multiple sources at exponential rates in the last few years. These data are generated at high velocity as real time and streaming data in variety of formats. These characteristics give rise to challenges in its modeling, computation, and processing. Hadoop MapReduce (MR) is a well known data-intensive distributed processing framework using the distributed file system (DFS) for Big Data. Current implementations of MR only support execution of a single algorithm in the entire Hadoop cluster. In this paper, we propose MapReducePack (MRPack), a variation of MR that supports execution of a set of related algorithms in a single MR job. We exploit the computational capability of a cluster by increasing the compute-intensiveness of MapReduce while maintaining its data-intensive approach. It uses the available computing resources by dynamically managing the task assignment and intermediate data. Intermediate data from multiple algorithms are managed using multi-key and skew mitigation strategies. The performance study of the proposed system shows that it is time, I/O, and memory efficient compared to the default MapReduce. The proposed approach reduces the execution time by 200% with an approximate 50% decrease in I/O cost. Complexity and qualitative results analysis shows significant performance improvement.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 16 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 16 100%

Demographic breakdown

Readers by professional status Count As %
Student > Master 4 25%
Researcher 3 19%
Student > Ph. D. Student 3 19%
Student > Doctoral Student 2 13%
Librarian 1 6%
Other 1 6%
Unknown 2 13%
Readers by discipline Count As %
Computer Science 5 31%
Engineering 3 19%
Earth and Planetary Sciences 1 6%
Agricultural and Biological Sciences 1 6%
Decision Sciences 1 6%
Other 1 6%
Unknown 4 25%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 05 September 2015.
All research outputs
#14,822,669
of 22,824,164 outputs
Outputs from PLOS ONE
#123,949
of 194,764 outputs
Outputs of similar age
#148,042
of 267,539 outputs
Outputs of similar age from PLOS ONE
#3,527
of 6,027 outputs
Altmetric has tracked 22,824,164 research outputs across all sources so far. This one is in the 32nd percentile – i.e., 32% of other outputs scored the same or lower than it.
So far Altmetric has tracked 194,764 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 15.1. This one is in the 32nd percentile – i.e., 32% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 267,539 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 41st percentile – i.e., 41% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 6,027 others from the same source and published within six weeks on either side of this one. This one is in the 36th percentile – i.e., 36% of its contemporaries scored the same or lower than it.