Data Workspaces

Data Workspaces is an open-source framework for maintaining the state of a data science project, including data sets, intermediate data, results, and code. It supports reproducibility through snapshotting and lineage models, and collaboration through a push/pull model inspired by source control systems like Git.
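
To make the snapshotting idea concrete, here is a minimal sketch of what a content-based snapshot could look like. It is not the Data Workspaces API: the function names `snapshot` and `changed_files` are hypothetical, and the approach (hashing every file under a directory with SHA-256) is only an assumption about one simple way to capture a project's state.

```python
from __future__ import annotations

import hashlib
from pathlib import Path


def snapshot(root: Path) -> dict[str, str]:
    """Map each file under root (by relative path) to the SHA-256 of its contents.

    Hypothetical helper for illustration; not part of Data Workspaces.
    """
    return {
        p.relative_to(root).as_posix(): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def changed_files(old: dict[str, str], new: dict[str, str]) -> list[str]:
    """Return the paths that were added, removed, or modified between two snapshots."""
    return sorted(k for k in old.keys() | new.keys() if old.get(k) != new.get(k))
```

Comparing two such snapshots shows which data sets or results differ between two states of a project, which is the kind of information a reproducibility tool needs to record.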

Website: https://dataworkspaces.ai
Repository: https://github.com/data-workspaces/data-workspaces-core
Byline: Easy management of source data, intermediate data, and results for data science projects.
License: Apache 2.0
Project age: 2 years 9 months
Backers: Benedat LLC (Creator and maintainer), Max Planck Institute for Software Systems (Creator and maintainer)
Size score (1 to 10, higher is better): 1.75
Trend score (1 to 10, higher is better): 3.25

Education Resources

| URL | Resource Type | Description |
| --- | --- | --- |
| https://data-workspaces-core.readthedocs.io/en/latest/ | Documentation | Official project documentation. |
| https://www.dataworkspaces.ai/quick-start/ | Documentation | A quick-start guide to help users kick-start their projects. |
| https://youtu.be/VjU5gGSvGsY | Video | First part of a demo video. |
| https://youtu.be/TIPEH6jlqtA | Video | Second part of a demo video. |

Git Commit Statistics

Statistics computed using Git data through May 31, 2021.

| Statistic | Lifetime | Last 12 Months |
| --- | --- | --- |
| Commits | 409 | 14 |
| Lines committed | 63,259 | 180 |
| Unique committers | 6 | 3 |
| Core committers | 2 | 1 |

Figure: Monthly commits.

Similar Projects

| Project | Size Score | Trend Score | Byline |
| --- | --- | --- | --- |
| Flambe | 1.25 | 1.25 | Flambé is a machine learning experimentation framework built to accelerate the entire research life cycle. Flambé’s main objective is to provide a unified interface for prototyping models, running experiments containing complex pipelines, monitoring those experiments in real-time, reporting results, and deploying a final model for inference. |
| MLflow | 7.0 | 4.25 | An open source platform for the machine learning lifecycle |
| PyCaret | 7.25 | 6.75 | An open-source, low-code machine learning library in Python. |