Dask
Dask is a Python-focused distributed programming environment that integrates with NumPy, pandas, and scikit-learn. It is lighter weight than Hadoop-ecosystem tools such as Spark.
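As a rough illustration of the NumPy integration described above, the sketch below wraps a NumPy array in a chunked Dask array and checks a lazily computed result against plain NumPy. It assumes the `dask` and `numpy` packages are installed; the array length and chunk size are arbitrary values chosen for the example and do not come from this page.

```python
import numpy as np
import dask.array as da

# Plain NumPy data (size chosen arbitrarily for illustration).
data = np.arange(1_000_000, dtype=np.float64)

# Wrap the NumPy array as a Dask array split into four equal chunks.
lazy = da.from_array(data, chunks=250_000)

# This only builds a task graph; no arithmetic has run yet.
lazy_mean = (lazy ** 2).mean()

# compute() hands the graph to a scheduler and returns a concrete value.
dask_result = lazy_mean.compute()
numpy_result = (data ** 2).mean()

print("chunks per axis:", lazy.numblocks)
print("matches NumPy:", bool(np.isclose(dask_result, numpy_result)))
```

Each chunk becomes one or more tasks, so the same expression can run in parallel on one machine or, with a distributed scheduler configured, across several.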
| Field | Value |
|---|---|
| Website | |
| Repository | |
| Byline | Parallel computing with task scheduling. |
| License | BSD 3-clause |
| Project age | 7 years 11 months |
| Backers | Anaconda Inc. (Commercial support), Chan Zuckerberg Initiative (Grant), NumFocus (Grant), Quansight (Commercial support) |
| Size score (1 to 10, higher is better) | 6.75 |
| Trend score (1 to 10, higher is better) | 4.75 |
Education Resources
| URL | Resource Type | Description |
|---|---|---|
| | Documentation | Official project documentation. |
Git Commit Statistics
Statistics computed using Git data through November 30, 2022.
| Statistic | Lifetime | Last 12 Months |
|---|---|---|
| Commits | 60,524 | 6,910 |
| Lines committed | 5,964,409 | 792,858 |
| Unique committers | 596 | 123 |
| Core committers | 20 | 14 |
Similar Projects
| Project | Size Score | Trend Score | Byline |
|---|---|---|---|
| | 6.0 | 6.25 | HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for big data processing and analytics. |
| | 6.75 | 4.5 | Mars is a tensor-based unified framework for large-scale data computation which scales Numpy, Pandas and Scikit-learn. |
| | 5.0 | 7.5 | Speed up your Pandas workflows by changing a single line of code |
| | 4.0 | 5.0 | Apache Pig is a platform to create programs on top of Apache Hadoop. |
| | 9.25 | 4.5 | A unified analytics engine for large-scale data processing. |