Snakemake is a beautiful project and evolves and improves so fast. Years ago I r...

bsmith89 · on July 15, 2023

I too owe a lot of my PhD and postdoc productivity to Snakemake. It's my bioinformatics super-power, allowing me to run a complex analysis, including downloading containers (Singularity/Apptainer) and other dependencies (conda), with one command.

Great for reproducibility. Great for development. Great for scaling analyses.

Snakemake is vital infrastructure for my work.

tetris11 · on July 15, 2023

Its fantastic but it doesn't scale laterelly particularly well, compared to just Make.

ta988 · on July 15, 2023

What dimension are you referring to?

tetris11 · on July 15, 2023

Large scale reproducibility was a problem a few years back for one. Conda and containers were a constant problem for us back then, especially if you had multiple NGS tools running in different environments. This has probably been solved by now, but we went with another workflow system

ta988 · on July 16, 2023

Agreed, Conda has always been a nightmare to maintain and redeploy, whatever you put it in.

bafe · on July 15, 2023

Nextflow seems to scale very well

tetris11 · on July 16, 2023

We went with Nextflow and Galaxy

bafe · on July 17, 2023

You can't go wrong with nextflow. I heard a lot of scientists complaining that it's too hard to understand, but honestly the DSL and the scheduling model (flow based) is just great

matthew_stone · on July 15, 2023

100% agree, and it's wonderful to see Snakemake on the top of HN.

Snakemake is an invaluable tool in bioinformatics analysis. It's a testament to Johannes' talent and dedication that, even with the relatively limited resources of an academic developer, Snakemake has remained broadly useful and popular.

Super nice guy too, he's always been remarkably responsive and helpful. I saw him present on Snakemake back when he was a postdoc, and it really changed my approach to pipeline development.