Modin

Distributed version of Pandas API which uses Ray or Dask as its backend

Logo

../_images/modin-project_modin-small.png

Website

https://modin.readthedocs.io/en/latest/

Repository

https://github.com/modin-project/modin

Byline

Speed up your Pandas workflows by changing a single line of code

License

Apache 2.0

Project age

2 years 11 months

Backers

UC Berkeley RISE Lab (Creator)

Lastest News (2021-03-04)

Modin release 0.9.0 New functionality: spreadsheet interface, XGBoost support improvement, and read multiple CSV files at once more

Size score (1 to 10, higher is better)

4.5

Trend score (1 to 10, higher is better)

6.25

Education Resources

URL

Resource Type

Description

https://modin.readthedocs.io/en/latest/index.html

Documentation

Official project documentation.

Git Commit Statistics

Statistics computed using Git data through May 31, 2021.

Statistic

Lifetime

Last 12 Months

Commits

25,405

13,335

Lines committed

4,567,899

2,250,015

Unique committers

83

46

Core committers

14

11

../_images/modin-project_modin-monthly-commits.png

Similar Projects

Project

Size Score

Trend Score

Byline

Analytics Zoo

5.0

8.25

Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray

Dask

6.0

6.25

Parallel computing with task scheduling.

Flashlight

5.5

7.5

A C++ standalone library for machine learning

HPCC

5.5

7.0

HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for big data processing and analytics.

Mars

6.75

6.25

Mars is a tensor-based unified framework for large-scale data computation which scales Numpy, Pandas and Scikit-learn.