Webopedia on Google+Webopedia on TwitterWebopedia on FacebookTech Bytes Blog

Search Results: "hadoop"

Apache Hadoop

Formally called Apache Hadoop, Hadoop is an Apache Software Foundation project and open source software platform for scalable, distributed computing.

Hadoop MapReduce

Hadoop MapReduce (Hadoop Map/Reduce) is a software framework for distributed processing of large data sets on compute clusters of commodity hardware. It is a sub-project of the Apache Hadoop project.

Top Hadoop Myths to Know

There's a lot of confusion about Hadoop and where it fits into the overall Big Data landscape.


Hadoop Distributed File System - HDFS

The Hadoop Distributed File System (HDFS) is a sub-project of the Apache Hadoop project. This Apache Software Foundation project is designed to provide a fault-tolerant file system designed to run on commodity hardware.

IBM PureData System for Hadoop

IBM PureData System for Hadoop is part of the IBM PureSystems family of solutions, designed to help organizations to embrace big data, cloud computing and mobile computing.

GridGain Big Data

An open source Java-based tool for real-time big data processing. GridGrain is an alternative to Hadoop's MapReduce that is compatible with the Hadoop Distributed File System. 


An enterprise software firm that specializes in open source Apache Hadoop development and support.

Apache Hive

Apache Hive is a data warehouse system for the open source Apache Hadoop project.

Apache HBase

Apache HBase (HBase) is the Hadoop database. It is a distributed, scalable, big data store. HBase is a sub-project of the Apache Hadoop project and is used to provide real-time read and write access to your big data.

Apache Pig

Apache Pig is a high-level procedural language platform developed to simplify querying large data sets in Apache Hadoop and MapReduce.