Pig

Apache originagted from Yahoo Research in 2006. It was moved to Apache in 2007.

Logo

../_images/apache_pig-small.png

Website

http://pig.apache.org/

Repository

https://github.com/apache/pig

Byline

Apache Pig is a platform to create programs on top of Apache Hadoop.

License

Apache 2.0

Project age

15 years 1 months

Backers

Apache (Governed by), Yahoo Research (Creator)

Size score (1 to 10, higher is better)

4.0

Trend score (1 to 10, higher is better)

5.0

Education Resources

URL

Resource Type

Description

https://pig.apache.org/docs/latest/

Documentation

Official project documentation.

Git Commit Statistics

Statistics computed using Git data through November 30, 2022.

Statistic

Lifetime

Last 12 Months

Commits

6,714

32

Lines committed

3,436,188

2,289

Unique committers

43

2

Core committers

14

2

../_images/apache_pig-monthly-commits.png

Similar Projects

Project

Size Score

Trend Score

Byline

Dask

6.75

4.75

Parallel computing with task scheduling.

HPCC

6.0

6.25

HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for big data processing and analytics.

Mars

6.75

4.5

Mars is a tensor-based unified framework for large-scale data computation which scales Numpy, Pandas and Scikit-learn.

Modin

5.0

7.5

Speed up your Pandas workflows by changing a single line of code

RayDP

2.0

5.75

Distributed data processing library on Ray by running popular big data frameworks like Apache Spark on Ray. RayDP seamlessly integrates with other Ray libraries to make it simple to build E2E data analytics and AI pipeline.