Click here

Hadoop

Formally called Apache Hadoop, it is an Apache Software Foundation project and is an open source software platform for scalable, distributed computing.  Hadoop can provide fast and reliable analysis of both structured data and unstructured data . Given its capabilities to handle large data sets, it is often  associated with the  phrase Big Data.

The Apache Hadoop software library is essentially a framework that allows for the distributed processing of large datasets across clusters of computers using a simple programming model.  Hadoop can scale up from single servers to thousands of machines, each offering local computation and storage.

The Apache Hadoop software library can detect and handle failures at the application layer, so it can deliver a highly-available service on top of a cluster of computers, each of which may be prone to failures.

As listed on the The Apache Hadoop project Web site, the Apache Hadoop project includes a number of subprojects such as the following:



Top Terms
  • 1

    Consumerization of IT

    Consumerization of IT is a phrase used to describe the cycle of information technology (IT) emerging in the consumer market then spreading to...

    Read more »

  • 2

    ERP - Enterprise Resource Planning

    Short for enterprise resource planning, ERP is business management software that allows an organization to use a system of integrated applications...

    Read more »

  • Click Here!

Connect with Webopedia

Did You Know? Archive »

  • Quick Reference Archive »