Data Hut™ — Open Source Project Directory

Curated insights on the most popular data science and data engineering projects. By combining machine learning techniques with expert knowledge, we help you to understand the open source landscape and to pick the best software for your needs.

Data Hut News (July 18, 2022): The July update is here! In addition to the latest project statistics, we have a new project, Shap, a key library for explainability in machine learning.

For site and project updates, follow us on Twitter:

Projects by Category




Big Data

Tools for transforming and analyzing the largest data sets.


Data Science

Tools for statistical analysis and machine learning.


Data Stores

Data repositories


Graph Analytics

Processing data as networks of interconnected nodes.


To jump to a project directly or find by keyword, use the search page or the search box above.