why would I use snakemake when I can simply write a python script to write 51 job scripts and then write a for loop in bash to submit them?
November 11, 2025 at 9:07 PM
why would I use snakemake when I can simply write a python script to write 51 job scripts and then write a for loop in bash to submit them?
In #GENETICS, Romain Durand, Alicia Pageau, and @christianlandry.bsky.social present a new Snakemake-based workflow, gyōza, to analyze deep-mutational scanning data. This tool requires little programming knowledge and can automatically generate HTML reports. buff.ly/4dJJdva
November 11, 2025 at 9:02 PM
In #GENETICS, Romain Durand, Alicia Pageau, and @christianlandry.bsky.social present a new Snakemake-based workflow, gyōza, to analyze deep-mutational scanning data. This tool requires little programming knowledge and can automatically generate HTML reports. buff.ly/4dJJdva
This took a while. After the new version of the Snakemake paper (a rolling paper on F1000) came out, the DOI now is "working" 🥳 :
https://doi.org/10.12688/f1000research.29032.3
From my point of view, it particularly describes the working with various #hpc batch systems. And: Development did […]
https://doi.org/10.12688/f1000research.29032.3
From my point of view, it particularly describes the working with various #hpc batch systems. And: Development did […]
Original post on fediscience.org
fediscience.org
November 10, 2025 at 11:40 AM
This took a while. After the new version of the Snakemake paper (a rolling paper on F1000) came out, the DOI now is "working" 🥳 :
https://doi.org/10.12688/f1000research.29032.3
From my point of view, it particularly describes the working with various #hpc batch systems. And: Development did […]
https://doi.org/10.12688/f1000research.29032.3
From my point of view, it particularly describes the working with various #hpc batch systems. And: Development did […]
I tried snakemake once, not bad
November 7, 2025 at 5:25 PM
I tried snakemake once, not bad
Reproducible workflow with scripts, data and results (which was a nice opportunity to try out how SnakeMake plays with both R and Python; and the answer is "great") available here:
doi.org/10.5281/zeno...
doi.org/10.5281/zeno...
Scripts, data, and replication materials for "A study of gender and regional differences in scientific mobility and immobility among researchers identified as potentially talented"
This repository includes replication materials including data, Python and R scripts to replicate the analysis and figures of the article with following metadata. The repository also includes additional results that were not included in the manuscript and its Appendix due to space limitations but are provided here, e.g., using alternative gender assignments or more granular classification for geographical regions. Script authors/maintainers: Aliakbar Akbaritabar Contact: akbaritabar@demogr.mpg.de Article title: A study of gender and regional differences in scientific mobility and immobility among researchers identified as potentially talented Manuscript authors: Aliakbar Akbaritabar, Robin Haunschild and Lutz Bornmann Journal: Journal of Informetrics Article DOI: https://doi.org/10.1016/j.joi.2025.101744 Replication package on Zenodo: https://doi.org/10.5281/zenodo.17495895 Abstract: Identifying talented academics worldwide using publication data has been proven to be successful with other performance measures based on citations and funding data in previous studies. In this study, we investigate the scientific mobility and immobility among academics as an additional performance measure. We reconstruct the mobility trajectory of potentially talented researchers throughout their scientific careers to study whether they have a different propensity to be mobile or non-mobile than other researchers in the group for comparison. Since the researchers’ gender may play an important role in scientific careers, we delve into gender differences. Our results indicate that potentially talented researchers have a higher propensity to be mobile than other researchers in the group for comparison – more so among male than female talented researchers. Women are overrepresented among non-mobile researchers in the other researchers group. We conclude – based on our findings – that the proposed method for identifying potentially talented individuals seems to select researchers who are more successful in their academic careers than the researchers in the group for comparison. The results agree with the findings of the previous studies based on citation and funding data. In the interpretation of our study results, one should consider yet that higher mobility is a privilege (that may be independent of talent). Specific groups, such as those with fewer caring responsibilities and visa restrictions, could have better access to this privilege. Further research is necessary thus on the trade-off between higher mobility's potential advantages and disadvantages as a strategy to build a successful academic career and unequal access to mobility. Keywords: talented academics, bibliometric indicators, scientific mobility and immobility, gendered mobility How to replicate the analysis The scripts, data, and the reproducible pipeline that creates final data for our manuscript, and runs the analysis and visualizations using R and Python are provided. The pipeline includes the preparation of data, statistical analysis, visualizations (see figures under results folder). The pipeline uses scripts that are developed in Python. The reproducible workflow (described below) uses SnakeMake workflow management to ensure full replicability. There are also R scripts for results in the manuscript (see them in workflow\scripts folder) and figures in the manuscript and Appendix (see them under results folder). Below, we describe the requirements to install before analysis can be replicated. Python requirements For the reproducible pipeline to recreate the paper's replication data and analysis, SnakeMake version 8 or above should be installed. Creating a conda environment with Python 3 (3.11.9 was used here) and the following libraries (for instance by running conda env create -f requirements.yml and after copying the following yml code into a file: "requirements.yml") enables reproducing the pipeline by opening CLI, activating the conda environment i.e., conda activate talents, and running a dry-run with snakemake -np all or a full reproduction with snakemake --cores 4 all. An HTML report shows a directed acyclic graph (DAG) of the steps' dependency in the pipeline (rules in SnakeMake lingua) which is accessible in https://akbaritabar.github.io/Replication-package-for-gender-and-regional-differences-in-scientific-mobility-and-immobility . Some rules require Scopus data at the individual level. Since the data are licensed, they cannot be shared publicly. Hence, these output files are commented out from the main Snakefile and rule all. However, the aggregated data to prepare the analysis presented in the manuscript and statistical models to recreate all figures are included in this repository. The repository completely complies with the license terms of the data provider. name: talents channels: - conda-forge - bioconda # to prevent using default anaconda channels - nodefaults dependencies: - python 3.* - pandas - pyarrow - plotnine - argparse - numpy - mizani # (is installed with plotnine) - duckdb - tabulate R packages To replicate the statistical analysis, following packages should be installed. The scripts and the data used in them are in the folder workflow\scripts. install.packages(c("tidyverse", "RColorBrewer", "argparser", "nnet", "broom", "countrycode", "nanoparquet", "stargazer", "texreg", "GGally", "ggeffects", "lmtest", "DescTools")) Publication figures Figures 1-5 are in the main manuscript. All figures, including these figures and the Appendix figures, are available in the results folder.
doi.org
November 6, 2025 at 4:46 PM
Reproducible workflow with scripts, data and results (which was a nice opportunity to try out how SnakeMake plays with both R and Python; and the answer is "great") available here:
doi.org/10.5281/zeno...
doi.org/10.5281/zeno...
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs) www.biorxiv.org/content/10.1... #jcampubs
November 5, 2025 at 4:28 PM
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs) www.biorxiv.org/content/10.1... #jcampubs
I want to invest my time learning snakemake or nextflow but I never sure which is more future-proof. Or probably it is irrelevant as there will be always new stuff anyways 🤪
November 5, 2025 at 12:41 PM
I want to invest my time learning snakemake or nextflow but I never sure which is more future-proof. Or probably it is irrelevant as there will be always new stuff anyways 🤪
This version dependency going on between conda and snakemake is madness. Syntax issue as well between versions; crazy python stuffs. This volatility cannot be the future of Bioinformatics. What does people use to make pipeline nowadays? How have people in maths/physics/chemistry been doing it?
November 5, 2025 at 12:39 PM
This version dependency going on between conda and snakemake is madness. Syntax issue as well between versions; crazy python stuffs. This volatility cannot be the future of Bioinformatics. What does people use to make pipeline nowadays? How have people in maths/physics/chemistry been doing it?
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs)
doi.org/10.1101/2025...
doi.org/10.1101/2025...
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs)
Omic technologies revolutionised research in microbiomics and helped decipher the complex community structure of gut microbiota, each comprising hundreds of species of bacteria, archaea, and protists. Even as sequencing costs decrease, understanding these communities remains a challenge. Here, we introduce annoSnake, a Snakemake workflow that facilitates taxonomic and functional annotation of metagenomes and metagenomic-assembled genomes (MAGs). This user-friendly and highly flexible workflow integrates state-of-the-art software packages, taking raw sequencing reads, assembling them into contigs, assigning taxonomies and annotating them using various popular protein databases. The workflow handles paired-end as well as interleaved read data, and provides publication-ready figures and tables for downstream analysis. annoSnake, as all Snakemake workflows, is highly scalable, reproducible, and portable, enabling execution across diverse high-performance computing environments with remarkable efficiency. ### Competing Interest Statement The authors have declared no competing interest.
doi.org
November 5, 2025 at 5:39 AM
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs)
doi.org/10.1101/2025...
doi.org/10.1101/2025...
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs) https://www.biorxiv.org/content/10.1101/2025.11.03.686227v1
November 4, 2025 at 7:49 PM
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs) https://www.biorxiv.org/content/10.1101/2025.11.03.686227v1
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs) https://www.biorxiv.org/content/10.1101/2025.11.03.686227v1
November 4, 2025 at 7:49 PM
annoSnake: a Snakemake workflow for taxonomic and functional annotation of metagenomes and metagenome-assembled genomes (MAGs) https://www.biorxiv.org/content/10.1101/2025.11.03.686227v1
Snakemake is essential for reproducible research—it organizes code, and acts as my second brain about code and data relations.
A downside is its maintenance cost. It can be easily convoluted as a project progresses. This tool might resolve that.
arxiv.org/html/2505.02...
A downside is its maintenance cost. It can be easily convoluted as a project progresses. This tool might resolve that.
arxiv.org/html/2505.02...
Snakemaker: Seamlessly transforming ad-hoc analyses into sustainable Snakemake workflows with generative AI
arxiv.org
November 2, 2025 at 10:02 PM
Snakemake is essential for reproducible research—it organizes code, and acts as my second brain about code and data relations.
A downside is its maintenance cost. It can be easily convoluted as a project progresses. This tool might resolve that.
arxiv.org/html/2505.02...
A downside is its maintenance cost. It can be easily convoluted as a project progresses. This tool might resolve that.
arxiv.org/html/2505.02...
Just submitted a talk for FOSDEM (been invited). They asked to attach an icon-image for the talk. So I drew one. The compute racks are difficult to identify as such, but this is as far as my aquarelle skills go.
#snakemake #reproduciblecomputing #hpc
#snakemake #reproduciblecomputing #hpc
November 2, 2025 at 7:58 PM
Just submitted a talk for FOSDEM (been invited). They asked to attach an icon-image for the talk. So I drew one. The compute racks are difficult to identify as such, but this is as far as my aquarelle skills go.
#snakemake #reproduciblecomputing #hpc
#snakemake #reproduciblecomputing #hpc
9/
Bioinformaticians have always thrived by adopting the next tool.
E.g., from many broken Bash scripts to Snakemake or nextflow.
Now, it’s time to adopt AI.
Bioinformaticians have always thrived by adopting the next tool.
E.g., from many broken Bash scripts to Snakemake or nextflow.
Now, it’s time to adopt AI.
November 1, 2025 at 1:45 PM
9/
Bioinformaticians have always thrived by adopting the next tool.
E.g., from many broken Bash scripts to Snakemake or nextflow.
Now, it’s time to adopt AI.
Bioinformaticians have always thrived by adopting the next tool.
E.g., from many broken Bash scripts to Snakemake or nextflow.
Now, it’s time to adopt AI.
Just getting back from two day teaching #snakemake programming on #hpc clusters. And I have several observations:
- the education level everywhere else seems way better, than on my home cluster. It is still a good idea, to assume the worst.
- it is never a good idea to accept […]
- the education level everywhere else seems way better, than on my home cluster. It is still a good idea, to assume the worst.
- it is never a good idea to accept […]
Original post on fediscience.org
fediscience.org
October 29, 2025 at 6:45 PM
Just getting back from two day teaching #snakemake programming on #hpc clusters. And I have several observations:
- the education level everywhere else seems way better, than on my home cluster. It is still a good idea, to assume the worst.
- it is never a good idea to accept […]
- the education level everywhere else seems way better, than on my home cluster. It is still a good idea, to assume the worst.
- it is never a good idea to accept […]
Finally finished improving the #snakemake workflow for a refutation paper.
Now what's the "law" governing how much time is spent on a false claim vs. how much on its verifiable refutation?
I have no experience with submitting papers disproving others so this is gonna be … interesting.
Now what's the "law" governing how much time is spent on a false claim vs. how much on its verifiable refutation?
I have no experience with submitting papers disproving others so this is gonna be … interesting.
October 29, 2025 at 10:53 AM
Finally finished improving the #snakemake workflow for a refutation paper.
Now what's the "law" governing how much time is spent on a false claim vs. how much on its verifiable refutation?
I have no experience with submitting papers disproving others so this is gonna be … interesting.
Now what's the "law" governing how much time is spent on a false claim vs. how much on its verifiable refutation?
I have no experience with submitting papers disproving others so this is gonna be … interesting.
snakemake-executor-plugin-slurm 1.9.1 A Snakemake executor plugin for submitting jobs to a SLURM cluster.
Interest | Match | Feed
Interest | Match | Feed
Origin
pypi.org
October 28, 2025 at 1:04 AM
snakemake-executor-plugin-slurm 1.9.0 A Snakemake executor plugin for submitting jobs to a SLURM cluster.
Interest | Match | Feed
Interest | Match | Feed
Origin
pypi.org
October 28, 2025 at 12:15 AM
🚀 The Autumn School in #Bioinformatics has started!
This week we’ll cover Genome Assembly & Annotation, RNA-seq, Phylogenomics, Docker, Singularity, Nextflow, Snakemake and much more.. Excited for a week of hands-on learning and bioinformatics fun with @passdan.bsky.social & Christoph Hahn! 🧬💻
This week we’ll cover Genome Assembly & Annotation, RNA-seq, Phylogenomics, Docker, Singularity, Nextflow, Snakemake and much more.. Excited for a week of hands-on learning and bioinformatics fun with @passdan.bsky.social & Christoph Hahn! 🧬💻
October 20, 2025 at 3:16 PM
🚀 The Autumn School in #Bioinformatics has started!
This week we’ll cover Genome Assembly & Annotation, RNA-seq, Phylogenomics, Docker, Singularity, Nextflow, Snakemake and much more.. Excited for a week of hands-on learning and bioinformatics fun with @passdan.bsky.social & Christoph Hahn! 🧬💻
This week we’ll cover Genome Assembly & Annotation, RNA-seq, Phylogenomics, Docker, Singularity, Nextflow, Snakemake and much more.. Excited for a week of hands-on learning and bioinformatics fun with @passdan.bsky.social & Christoph Hahn! 🧬💻
I have indeed plans to rewrite performance critical inner parts of snakemake in rust. Might be also something for the hackathon.
October 19, 2025 at 10:00 PM
I have indeed plans to rewrite performance critical inner parts of snakemake in rust. Might be also something for the hackathon.
Wow this is really good. I just finished a GOlang stand alone server concurrent and faster routes, just have to add a templating engine. github.com/omicscode/do... finished rekease this week. We should write the snakemake in either RUST 🦀 or GOlang.
GitHub - omicscode/doseqGO: a go based concurrent sequence information platform
a go based concurrent sequence information platform - GitHub - omicscode/doseqGO: a go based concurrent sequence information platform
github.com
October 19, 2025 at 9:56 PM
Wow this is really good. I just finished a GOlang stand alone server concurrent and faster routes, just have to add a templating engine. github.com/omicscode/do... finished rekease this week. We should write the snakemake in either RUST 🦀 or GOlang.
The #Snakemake wrapper repository and #Snakedeploy have received a major update: each wrapper now reports the latest version with changes, and #Snakedeploy can automatically upgrade all wrappers in your workflow to those versions. snakemake-wrappers.readthedocs.io
The Snakemake Wrappers repository | Snakemake wrappers
snakemake-wrappers.readthedocs.io
October 19, 2025 at 9:15 PM
The #Snakemake wrapper repository and #Snakedeploy have received a major update: each wrapper now reports the latest version with changes, and #Snakedeploy can automatically upgrade all wrappers in your workflow to those versions. snakemake-wrappers.readthedocs.io
#Snakemake has a novel feature for generalizing rules and modules: pathvars! snakemake.readthedocs.io/en/stable/sn...
Pathvars are particularly useful to make modules parameterizable and more generic. Further, they reduce boilerplate code commonly used for defining output folders.
Pathvars are particularly useful to make modules parameterizable and more generic. Further, they reduce boilerplate code commonly used for defining output folders.
Snakefiles and Rules | Snakemake 9.13.3 documentation
snakemake.readthedocs.io
October 19, 2025 at 9:12 PM
#Snakemake has a novel feature for generalizing rules and modules: pathvars! snakemake.readthedocs.io/en/stable/sn...
Pathvars are particularly useful to make modules parameterizable and more generic. Further, they reduce boilerplate code commonly used for defining output folders.
Pathvars are particularly useful to make modules parameterizable and more generic. Further, they reduce boilerplate code commonly used for defining output folders.
Sure, i will register and will help also. I am not sure whether at my new job place they use snakemake or nextflow but it is always a good idea to contribute and good to be learning together. If they use snakemake then so good and if not then also good. So it always a good learning experience.
October 17, 2025 at 6:22 PM
Sure, i will register and will help also. I am not sure whether at my new job place they use snakemake or nextflow but it is always a good idea to contribute and good to be learning together. If they use snakemake then so good and if not then also good. So it always a good learning experience.