Sol Shenker
Sol Shenker
@seqcmd.bsky.social
Scientist, Bioinformatics, cancer
Does it help to have them "check each other's work"? Eg, run it with two different tools, if you get the same result there's a higher chance it's right? Or there's something about how the PDF is typeset that makes some sequences particularly hard to extract correctly?
May 21, 2025 at 1:28 AM
If you're willing to try nextflow instead of snakemake I put together a generic module for split-apply-combine that can be used to easily parallelize single-threaded FASTQ processing programs. You just need to write a little wrapper process, and specify fan-out number github.com/shenkers/nf-...
GitHub - shenkers/nf-scatter-gather: A modular nextflow construct for parallelizing simple FASTQ processing tasks with map/reduce
A modular nextflow construct for parallelizing simple FASTQ processing tasks with map/reduce - shenkers/nf-scatter-gather
github.com
December 8, 2024 at 9:28 PM
Bouillon Bilk, it was very cool!
September 29, 2023 at 3:15 AM
Impressive speedup! It's the same fundamental data-structures, it's just faster because it's written in Rust? Haven't used rust for anything yet, but I would have expected performance to be pretty similar to bedtool's cpp.
September 26, 2023 at 1:54 AM
Why do you think that is? Are we measuring the wrong thing? We are measuring the right thing, but we don't know how to interpret it? Something else?
September 19, 2023 at 8:14 PM
Is that an individual knockout performed in adult mice? Is it a failure to develop testes vs failure of germ stem cells to differentiate? If you subsequently turn it back on is fertility restored?
September 15, 2023 at 12:04 PM
@chrisamiller.bsky.social @lwpembleton.bsky.social I'm trying to bootstrap a platform exactly along these lines, hoping to help small strappy companies or research labs. Would love show and hear more about your pain points if you have time.
September 14, 2023 at 8:24 PM