Hadoop Mapreduce

Hadoop MapReduce (Hadoop Map/Reduce) is a software framework for distributed processing of large data sets on compute clusters of commodity hardware. It is a sub-project of the Apache Hadoop project. The framework takes care of scheduling tasks, monitoring them and re-executing any failed tasks.

According to The Apache Software Foundation, the primary objective of Map/Reduce is to split the input data set into independent chunks that are processed in a completely parallel manner. The Hadoop MapReduce framework sorts the outputs of the maps, which are then input to the reduce tasks. Typically, both the input and the output of the job are stored in a file system.

Also see Apache Hadoop and Hadoop Distributed File System (HDFS).

Top 5 Hadoop Related Questions

1. What is Apache Hadoop?
2. What is Hadoop MapReduce?
3. What is HortonWorks?
4. What is Hadoop Distributed File System?
5. What is unstructured data?

Vangie Beal
Vangie Beal
Vangie Beal is a freelance business and technology writer covering Internet technologies and online business since the late '90s.

Related Articles

PPT

One of Microsoft Office’s core products, PowerPoint – abbreviated to PPT based on its file extension “.ppt” – is a software program used to...

ETL

ETL is the acronym for "extract, transform, and load." These three database functions are combined into one tool to pull raw data from one...

Remote Work

Remote work is the habit of someone performing their job from home or another location that isn't owned or managed by the company for...

Direct Digital Marketing

Direct digital marketing is a method of marketing handled primarily through direct digital channels like email and Web. It makes use of addressable mediums....

HighLevel CRM

HighLevel is a sales and marketing customer relationship management (CRM) solution designed by...

Unified Endpoint Management (UEM)

As enterprise networks become increasingly distributed with growing numbers of remote workers, unified...

Decision Intelligence

Decision intelligence combines business intelligence (BI) and artificial intelligence (AI) models to improve...