Log in

Relevant bibliographies by topics / MapReduce, MRPERF , Capacity Scheduler

Contents

Journal articles
Dissertations / Theses
Book chapters
Conference papers

Academic literature on the topic 'MapReduce, MRPERF , Capacity Scheduler'

Author: Grafiati

Published: 4 June 2021

Last updated: 10 February 2022

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'MapReduce, MRPERF , Capacity Scheduler.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "MapReduce, MRPERF , Capacity Scheduler"

1

Sangani, Sahil. "Scheduling Algorithms in Map Reduce." International Journal on Recent and Innovation Trends in Computing and Communication 7, no. 8 (2019): 01–06. http://dx.doi.org/10.17762/ijritcc.v7i8.5342.

Full text

Abstract:

Data generated in the past few years cannot be efficiently manipulated with the traditional way of storing techniques as it is a large-scale dataset, and it can be structured, semi-structured, or unstructured. To deal with this kind of enormous dataset Hadoop framework is used, which supports the processing of large dataset in a distributed computing environment. Hadoop uses a technique named as MapReduce for processing and generating a large dataset with a parallel distributed algorithm on a cluster. It automatically handles failures and data loss due to its fault-tolerance property. The scheduler is a pluggable component of the MapReduce framework. Hadoop MapReduce framework uses various scheduler as per the requirements of the task. FIFO (First In First Out) is a default algorithm used by Hadoop, in which the jobs are executed in the order of their arrival. This paper will discuss myriad of schedulers such as FIFO, Capacity Scheduler, LATE Scheduler, Fair Scheduler, Delay Scheduler, Deadline Constraint Scheduler, and Resource Aware Scheduler. Besides these schedulers, we also conducted study of comparison of schedulers like Round Robin, Weighted Round Robin, Self-adaptive Reduce Scheduling (SARS), Self-adaptive MapReduce Scheduling (SAMR), Dynamic Priority Scheduling, Learning Scheduling, Classification & Optimization-based Scheduler (COSHH), Network-Aware, Match-matching, and Energy-Aware Scheduler. Hopefully, this study will enhance the understanding of the specific schedulers and stimulate other developers and consumers to make accurate decisions for their specific research interests.

APA, Harvard, Vancouver, ISO, and other styles

2

Gautam, Jyoti V., Harshadkumar B. Prajapati, Vipul K. Dabhi, and Sanjay Chaudhary. "Empirical Study of Job Scheduling Algorithms in Hadoop MapReduce." Cybernetics and Information Technologies 17, no. 1 (2017): 146–63. http://dx.doi.org/10.1515/cait-2017-0012.

Full text

Abstract:

Abstract Several Job scheduling algorithms have been developed for Hadoop-Map Reduce model, which vary widely in design and behavior for handling different issues such as locality of data, user share fairness, and resource awareness. This article focuses on empirically evaluating the performance of three schedulers: First In First Out (FIFO), Fair scheduler, and Capacity scheduler. To carry out the experimental evaluation, we implement our own Hadoop cluster testbed, consisting of four machines, in which one of the machines works as the master node and all four machines work as slave nodes. The experiments include variation in data sizes, use of two different data processing applications, and variation in the number of nodes used in processing. The article analyzes the performance of the job scheduling algorithms based on various relevant performance measures. The results of the experiments are evident of the performance being affected by the job scheduling parameters, the type of applications, the number of nodes in the cluster, and size of the input data.

APA, Harvard, Vancouver, ISO, and other styles

3

Wang, Ling Yan, and Ai Min Liu. "The Study on Cloud Computing Resource Allocation Method." Applied Mechanics and Materials 198-199 (September 2012): 1506–13. http://dx.doi.org/10.4028/www.scientific.net/amm.198-199.1506.

Full text

Abstract:

Resource allocation and scheduling problems in the field of cloud computing can be classified into two major groups. The first one is in the area of MapReduce task scheduling. The default scheduler is the FIFO one. Two other schedulers that are available as plug-in for Hadoop: Fair scheduler and Capacity scheduler. We presented recent research in this area to enhance performance or to better suit a specific application. MapReduce scheduling research involves introducing alternative schedulers, or proposing enhancements for existing schedulers such as streaming and input format specification. The second problem is the provisioning of virtual machines and processes to the physical machines and its different resources. We presented the major cloud hypervisors available today. We described the different methods used to solve the resource allocation problem including optimization, simulation, distributed multi-agent systems and SoA. Finally, we presented the related topic of connecting clouds which uses similar resource provisioning methods. The above two scheduling problems are often mixed up, yet they are related. For example, MapReduce benchmarks can be used to evaluate VM provisioning methods. Enhancing the solution to one problem can affect the other. Similar methods can be used in solving both problems, such as optimization methods. Cloud computing is a platform that hosts applications and services for businesses and users to accesses computing as a service. In this paper, we identify two scheduling and resource allocation problems in cloud computing.

APA, Harvard, Vancouver, ISO, and other styles

4

Hamad, Faten. "An Overview of Hadoop Scheduler Algorithms." Modern Applied Science 12, no. 8 (2018): 69. http://dx.doi.org/10.5539/mas.v12n8p69.

Full text

Abstract:

Hadoop is a cloud computing open source system, used in large-scale data processing. It became the basic computing platforms for many internet companies. With Hadoop platform users can develop the cloud computing application and then submit the task to the platform. Hadoop has a strong fault tolerance, and can easily increase the number of cluster nodes, using linear expansion of the cluster size, so that clusters can process larger datasets. However Hadoop has some shortcomings, especially in the actual use of the process of exposure to the MapReduce scheduler, which calls for more researches on Hadoop scheduling algorithms.This survey provides an overview of the default Hadoop scheduler algorithms and the problem they have. It also compare between five Hadoop framework scheduling algorithms in term of the default scheduler algorithm to be enhanced, the proposed scheduler algorithm, type of cluster applied either heterogeneous or homogeneous, methodology, and clusters classification based on performance evaluation. Finally, a new algorithm based on capacity scheduling and use of perspective resource utilization to enhance Hadoop scheduling is proposed.

APA, Harvard, Vancouver, ISO, and other styles

Dissertations / Theses on the topic "MapReduce, MRPERF , Capacity Scheduler"

1

"Simulation and Performance Evaluation of Hadoop Capacity Scheduler." Thesis, 2013. http://hdl.handle.net/10388/ETD-2013-06-1172.

Full text

Abstract:

MapReduce is a parallel programming paradigm used for processing huge datasets on certain classes of distributable problems using a cluster. Budgetary constraints and the need for better usage of resources in a MapReduce cluster often make organizations rent or share hardware resources for their main data processing and analysis tasks. Thus, there may be many competing jobs from different clients performing simultaneous requests to the MapReduce framework on a particular cluster. Schedulers like Fair Share and Capacity have been specially designed for such purposes. Administrators and users run into performance problems, however, because they do not know the exact meaning of different task scheduler settings and what impact they can have with respect to the resource allocation scheme across organizations for a shared MapReduce cluster. In this work, Capacity Scheduler is integrated into an existing MRPERF simulator to predict the performance of MapReduce jobs in a shared cluster under different settings for Capacity Scheduler. A few case studies on the behaviour of Capacity Scheduler across different job patterns etc. using integrated simulator are also conducted.

APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "MapReduce, MRPERF , Capacity Scheduler"

1

Sree Lakshmi, Adepu, N. Subhash Chandra, and M. BalRaju. "Optimized Capacity Scheduler for MapReduce Applications in Cloud Environments." In Data Management, Analytics and Innovation. Springer Singapore, 2018. http://dx.doi.org/10.1007/978-981-13-1402-5_12.

Full text

APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "MapReduce, MRPERF , Capacity Scheduler"

1

Chauhan, Jagmohan, Dwight Makaroff, and Winfried Grassmann. "The Impact of Capacity Scheduler Configuration Settings on MapReduce Jobs." In 2012 International Conference on Cloud and Green Computing (CGC). IEEE, 2012. http://dx.doi.org/10.1109/cgc.2012.96.

Full text

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!