Data Hut™ — Open Source Project Directory
We provide insights on the most popular data science and data engineering projects. By combining machine learning techniques with expert knowledge, we help you understand the open source landscape and pick the best software for your needs.
Data Hut News (April 08, 2021): Welcome to Data Hut! We would love to have your feedback on how we can improve the site. Please use this form or contact us directly by email.
We are expanding our project coverage. For site and project updates, follow us on Twitter: @datahutai
Projects by Category
| Category | Description | Projects |
|---|---|---|
| | Tools for transforming and analyzing the largest data sets. | 16 |
| | Tools for statistical analysis and machine learning. | 51 |
| | Data repositories | 36 |
| | Processing data as networks of interconnected nodes. | 23 |

To jump directly to a project or to search by keyword, use the search page or the search box above.
Popular Communities and Project Backers
| Community | Website | Description |
|---|---|---|
| | | Apache is the world’s largest open source foundation with over 300 top-level projects. |
| | | As the world’s largest social network, Facebook has created and sponsored a wide range of open source projects. |
| | | As a multinational technology company, Google has created and sponsored over 2,000 open source projects in a wide range of areas, from programming languages to UI frameworks to machine learning. |
| | | NumFOCUS is a 501(c)(3) public charity founded in 2012 to provide a fiscal umbrella for many open source software projects that have become essential for science and research. NumFOCUS-sponsored projects benefit from a range of services, including fiscal, legal, and operational support. |

Latest News
| Date | Topic | Description |
|---|---|---|
| 2021-03-04 | | We are excited to announce the availability of PyTorch 1.8. This release is composed of more than 3,000 commits … |
| 2021-02-24 | | AllenNLP 2.1 has been released. |
| 2021-02-12 | | Ray release 1.2 is out. New features and bug fixes in Autoscaler, RLlib, Tune, SGD, and Serve. Ray Client and C++ … |
| 2021-02-01 | | spaCy 3.0 release: spaCy v3.0 features all-new transformer-based pipelines that bring spaCy’s accuracy right up to … |
| 2020-12-22 | | We are pleased to announce the release of scikit-learn 0.24! Many bug fixes and improvements were added, as well as … |
| 2020-12-14 | | TF 2.4 is here! With increased support for distributed training and mixed precision, new NumPy frontend and tools … |