Analytics Zoo

Integration layer between ML frameworks and big data infrastructure

Logo

../_images/intel-analytics_analytics-zoo-small.png

Website

https://analytics-zoo.github.io/master/

Repository

https://github.com/intel-analytics/analytics-zoo

Byline

Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

License

Apache 2.0

Project age

5 years 7 months

Backers

Intel (Creator)

Latest News (2021-05-25)

Analytics Zoo 0.10.0 Release: improved documentation website, including quickstarts for Orca, RayOnSpark, Zouwu and BigDL; Orca library: …

Size score (1 to 10, higher is better)

4.25

Trend score (1 to 10, higher is better)

1.75

Education Resources

| URL | Resource Type | Description |
| --- | --- | --- |
| https://analytics-zoo.readthedocs.io/en/v0.10.0/doc/UserGuide/develop.html | Documentation | Official project documentation. |

Git Commit Statistics

Statistics computed using Git data through November 30, 2022.

| Statistic | Lifetime | Last 12 Months |
| --- | --- | --- |
| Commits | 48,634 | 202 |
| Lines committed | 13,131,027 | 9,271 |
| Unique committers | 122 | 11 |
| Core committers | 24 | 8 |

Figure: Monthly commits (../_images/intel-analytics_analytics-zoo-monthly-commits.png)

Similar Projects

| Project | Size Score | Trend Score | Byline |
| --- | --- | --- | --- |
| Dask | 6.75 | 4.75 | Parallel computing with task scheduling. |
| Horovod | 6.0 | 4.75 | Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. |
| Mars | 6.75 | 4.5 | Mars is a tensor-based unified framework for large-scale data computation which scales NumPy, pandas and scikit-learn. |
| Pig | 4.0 | 5.0 | Apache Pig is a platform to create programs on top of Apache Hadoop. |
| RayDP | 2.0 | 5.75 | Distributed data processing library on Ray that runs popular big data frameworks such as Apache Spark on Ray. RayDP integrates with other Ray libraries to make it simple to build an end-to-end data analytics and AI pipeline. |