Click here

Apache Pig

Apache Pig is a high-level procedural language platform developed to simplify querying large data sets in Apache Hadoop and MapReduce. Apache Pig features a “Pig Latin” language layer that enables SQL-like queries to be performed on distributed datasets within Hadoop applications.

Pig originated as a Yahoo Research initiative for creating and executing map-reduce jobs on very large data sets.  In 2007 Pig became an open source project of the Apache Software Foundation.



Top Terms

Connect with Webopedia

  • What is 250 GB Data Usage?

    What is 250 GB (250 gigabytes) and why is this phrase so popular? Webopedia explains what the phrase 250 GB means in reference to data storage...

    Read More »

Did You Know? Archive »

  • Quick Reference Archive »