Data Hut™ — Open Source Project Directory
Curated insights on the most popular data science and data engineering projects. By combining machine learning techniques with expert knowledge, we help you to understand the open source landscape and to pick the best software for your needs.
Data Hut News (June 13, 2022): Happy Summer! The site has been updated with the latest data for June. Enjoy!
For site and project updates, follow us on Twitter: @datahutai
Projects by Category¶
Category |
Description |
Projects |
---|---|---|
Tools for transforming and analyzing the largest data sets. |
23 |
|
Tools for statistical analysis and machine learning. |
76 |
|
Data repositories |
39 |
|
Processing data as networks of interconnected nodes. |
25 |
To jump to a project directly or find by keyword, use the search page or the search box above.
Popular Communities and Project Backers¶
Community |
Website |
Description |
---|---|---|
Apache is the world’s largest open source foundation with over 300 top-level projects. |
||
As the world’s largest social network, FaceBook has created and sponsored a wide range of open source projects. |
||
As a multinational technology company, Google has created and sponsored over 2,000 open source projects in a wide range of areas, from programming languages to UI frameworks to machine learning. |
||
NumFocus is a 501(3)c public charity founded in 2012 to provide a fiscal umbrella for many open source software projects that have become essential for science and research. NumFocus sponsored projects benefit from a range of services including fiscal, legal, and operational. |
Latest News¶
Date |
Topic |
Description |
---|---|---|
2022-06-09 |
Ray-1.13.0 Highlights: Python 3.10 support is now in alpha; Ray usage stats collection is now on by default (guarded … more |
|
2022-06-08 |
Modin 0.15.0 This release includes updated support for pandas 1.4.2, new Batch and Logging APIs, and a plethora of … more |
|
2022-06-08 |
AgensGraph v2.12 We are pleased to announce the release of AgensGraph v2.12. AgensGraph is a multi-model graph … more |
|
2022-06-07 |
OpenCV 4.6.0 OpenCV 4.6.0 Is Now Available! Release highlights: OpenCV project infrastructure is migrating to the … more |
|
2022-06-03 |
PyMC 4.0.0 We, the PyMC core development team, are incredibly excited to announce the release of a major rewrite of … more |
|
2022-06-01 |
v1.2.0 What’s changed: Add intermediate API’s to all models; Fix forecasting bugs when return_prev=True. more |
|
2022-05-24 |
2.7.0 (2022-05-24) This release adds major new features since the 2.6.1 release. We deem it moderate priority for … more |
|
2022-05-18 |
v0.7.0 This release introduces two new methods, a GradientSimilarity explainer and a ProtoSelect data summarisation … more |
|
2022-05-13 |
TensorFlow 2.9.0 TensorFlow 2.9 has been released! Highlights include performance improvements with oneDNN, and the … more |
|
2022-05-13 |
Keras Release 2.9.0 See the release notes for TensorFlow 2.9. more |
|
2022-05-13 |
Release 3.32.0 more |
|
2022-05-12 |
scikit-learn 1.1.0 We are pleased to announce the release of scikit-learn 1.1! Many bug fixes and improvements were … more |