Snakemake

Snakemake was created by and for the bioinformatics community. However, it has a general purpose design and can be used for for any data analysis pipeline. Most other workflow engines are control-flow oriented. Snakemake instead builds a file dependence graph and uses file modifications to drive the execution, like the original “make” tool.

Logo

../_images/snakemake_snakemake-small.png

Website

https://snakemake.github.io/

Repository

https://github.com/snakemake/snakemake

Byline

The Snakemake workflow management system is a tool to create reproducible and scalable data analyses

License

MIT

Project age

11 years 1 months

Backers

Kösterlab at the University of Duisburg-Essen (Creator and maintainer)

Size score (1 to 10, higher is better)

7.5

Trend score (1 to 10, higher is better)

6.5

Education Resources

URL

Resource Type

Description

https://snakemake.readthedocs.io/en/stable/

Documentation

Official project documentation.

https://github.com/snakemake/snakemake-wrappers

Example Code

The Snakemake Wrapper Repository is a collection of reusable wrappers that allow to quickly use popular command line tools from Snakemake rules and workflows.

https://youtu.be/_dG9b3a9zkk

Video

Reproducible Data Analysis with Snakemake - presentation from the project creator, June 2020.

Git Commit Statistics

Statistics computed using Git data through November 30, 2022.

Statistic

Lifetime

Last 12 Months

Commits

39,692

4,197

Lines committed

21,591,210

1,919,734

Unique committers

293

69

Core committers

6

2

../_images/snakemake_snakemake-monthly-commits.png

Similar Projects

Project

Size Score

Trend Score

Byline

Airflow

8.25

8.5

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Cromwell

5.25

4.25

Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments

Luigi

5.0

3.75

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.