Home / Definitions / Apache Pig

Apache Pig

Forrest Stroud
Last Updated May 24, 2021 7:35 am

Apache Pig is a high-level procedural language platform developed to simplify querying large data sets in Apache Hadoop and MapReduce. Apache Pig features a Pig Latin language layer that enables SQL-like queries to be performed on distributed datasets within Hadoop applications.

Pig originated as a Yahoo Research initiative for creating and executing map-reduce jobs on very large data sets. In 2007 Pig became an open source project of the Apache Software Foundation.