Bioinformatics runs on plain text:
FASTA. FASTQ. SAM. BED. GTF.
They look messy.
Unix commands clean them like magic.
Bioinformatics runs on plain text:
FASTA. FASTQ. SAM. BED. GTF.
They look messy.
Unix commands clean them like magic.
https://www.newsbeep.com/il/107694/
Sequence results The raw FastQ sequences upon quality check revealed the dataset comprised 5.5 million to 11.6 million…
https://www.newsbeep.com/il/107694/
Sequence results The raw FastQ sequences upon quality check revealed the dataset comprised 5.5 million to 11.6 million…
https://www.europesays.com/ie/155078/
Sequence results The raw FastQ sequences upon quality check revealed the dataset comprised 5.5 million to 11.6 million…
https://www.europesays.com/ie/155078/
Sequence results The raw FastQ sequences upon quality check revealed the dataset comprised 5.5 million to 11.6 million…
Action items:
Always check how your data were generated.
When in doubt, start from FASTQ and build your own consistent pipeline.
Use GEO or CellxGene metadata—it’s messy, but it’s your map.
Action items:
Always check how your data were generated.
When in doubt, start from FASTQ and build your own consistent pipeline.
Use GEO or CellxGene metadata—it’s messy, but it’s your map.
☹️
I would have liked to compare timings with FASTQ-DENTAL.
☹️
I would have liked to compare timings with FASTQ-DENTAL.
Okay so got 12k reads (bug seems very hard to crack) but the kind folks at @plasmidsaurus.bsky.social offered to rerun. In the meantime, I'm getting some very strong blast hits for a recently discovered species, Edaphobacter flagellatus.
FASTQ DATA:
drive.google.com/drive/folder...
Okay so got 12k reads (bug seems very hard to crack) but the kind folks at @plasmidsaurus.bsky.social offered to rerun. In the meantime, I'm getting some very strong blast hits for a recently discovered species, Edaphobacter flagellatus.
FASTQ DATA:
drive.google.com/drive/folder...
Unix head, but FASTQ-aware 🧬
✅ Get first N reads (not just lines!)
✅ Works with paired-end reads
✅ Can limit by bases OR reads
✅ Blazingly fast (it's Rust 🦀)
cat huge.fq | fasten_head -r 1000
Stop doing math with -n. Let the tool understand your data.
Unix head, but FASTQ-aware 🧬
✅ Get first N reads (not just lines!)
✅ Works with paired-end reads
✅ Can limit by bases OR reads
✅ Blazingly fast (it's Rust 🦀)
cat huge.fq | fasten_head -r 1000
Stop doing math with -n. Let the tool understand your data.
simpleaf quant --reads1 L1_R1.fq,L2_R1.fq,L3_R1.fq \
--reads2 L1_R2.fq,L2_R2.fq,L3_R2.fq
simpleaf quant --reads1 L1_R1.fq,L2_R1.fq,L3_R1.fq \
--reads2 L1_R2.fq,L2_R2.fq,L3_R2.fq
🔗 academic.oup.com/bioinformati... @giulio_formenti @erichjarvis
🔗 academic.oup.com/bioinformati... @giulio_formenti @erichjarvis
SDR-seq makes it possible; an experimental method connecting coding and non-coding variants to expression in single cells.
Proud to have contributed a custom FASTQ-processing tool to this great collaborative project.
Known as SDR-seq, the tool has unprecedented scale, precision, and speed to characterise variants and link them to functions and disease.
www.embl.org/news/science...
SDR-seq makes it possible; an experimental method connecting coding and non-coding variants to expression in single cells.
Proud to have contributed a custom FASTQ-processing tool to this great collaborative project.
summit.nextflow.io/2025/virtual...
summit.nextflow.io/2025/virtual...
Go from FASTQ files to interactive 3D genomic insights — fast, simple, and affordable.
👉 Sign up to try the test dataset or book a run-through:
🔗 portal.cantatabio.co...
🔗 cantatabio.com/dovet...
Go from FASTQ files to interactive 3D genomic insights — fast, simple, and affordable.
👉 Sign up to try the test dataset or book a run-through:
🔗 portal.cantatabio.co...
🔗 cantatabio.com/dovet...
Although POD5 requires more space than FASTQ, high capacity HDDs are cheaper than graphic cards.
Although POD5 requires more space than FASTQ, high capacity HDDs are cheaper than graphic cards.