Beam

Apache Beam originated from Google’s Dataflow Model in 2014. In 2016, Google donated Dataflow Model. Later with other community members’ contribution and improvement, it became Apache Beam.

Logo

../_images/apache_beam-small.png

Website

https://beam.apache.org/

Repository

https://github.com/apache/beam

Byline

Apache Beam is a unified model for defining both batch and streaming data-parallel processing pipelines, as well as a set of language-specific SDKs for constructing pipelines and Runners for executing them on distributed processing backends, including Apache Flink, Apache Spark, Google Cloud Dataflow and Hazelcast Jet.

License

Apache 2.0

Project age

6 years 11 months

Backers

Apache (Governed by), Google (Creator)

Size score (1 to 10, higher is better)

9.0

Trend score (1 to 10, higher is better)

7.25

Education Resources

URL

Resource Type

Description

https://beam.apache.org/documentation/

Documentation

Official project documentation.

Git Commit Statistics

Statistics computed using Git data through May 31, 2021.

Statistic

Lifetime

Last 12 Months

Commits

44,152

16,858

Lines committed

28,557,812

9,923,968

Unique committers

1,142

343

Core committers

14

14

../_images/apache_beam-monthly-commits.png

Similar Projects

Project

Size Score

Trend Score

Byline

Flink

9.25

7.5

Apache Flink is an open source stream processing framework with powerful stream- and batch-processing capabilities.

NiFi

6.75

7.5

Apache NiFi supports highly configurable directed graphs of data routing, transformation, and system mediation logic.

Storm

6.25

2.5

Storm is a distributed realtime computation system.