Noah Legall, Ph. D.
@noahlegall.bsky.social
AppSci @ Dotmatics | Microbial Bioinformatics | Deep Learning & Explainability | Nextflow Ambassador | Author of 'The Microbialist' Substack | Thoughts are my own personal opinions and do not represent a third party
'The Microbialist' No. 3 💻🦠🧬

I wanted to sit down and see what the cutting edge was with using phage and bacteria genomics to predict interactions 🧫 In this newsletter, we delve into just this topic!

themicrobialist.substack.com/p/understand...
Understanding Bacteriophage Specificity
A quick synopsis of an interesting interaction experiment
themicrobialist.substack.com
February 23, 2025 at 1:27 PM
Even though I said 'The Microbialist' would be a monthly newsletter, my original post looked a bit lonely! In this post, I talk about some analysis approaches to better understand what microbes might be present in a genome assembly.
Data mining of genome assembly contigs
How can one see what types of microbes could be present in an assembly?
open.substack.com
January 17, 2025 at 6:00 PM
I'd like to announce my monthly newsletter 'The Microbialist' to stay up to date on things happening in bioinformatics/microbiology/machine learning.
My first posting will be an op-ed on how to enter and excel in the field - please let me know what you think!
The Bioinformatics Plunge
An attempt at making a broad question much more narrow
themicrobialist.substack.com
January 2, 2025 at 3:27 AM
Today is my last official day in academia - I started out as a researcher when researchers at UNC Chapel Hill took a chance on a freshman who discovered a love of computers from an intro programming class. That was 10 years ago. Excited for the future, but also bittersweet feelings about this
December 9, 2024 at 3:20 PM
Reposted by Noah Legall, Ph. D.
Dr. Brian Druker is a giant of #oncology, and one of the #cancerresearch physician-scientist leaders I admire most. I was fortunate to meet him very early in my career and hear sage advice on navigating life and science. I appreciate his stand for integrity.

www.oregonlive.com/health/2024/...
Dr. Brian Druker, head of OHSU’s Knight Cancer Institute, steps down: ‘We have ... forgotten our mission’
Dr. Brian Druker is known for his pioneering cancer research and development of leukemia drug Gleevec.
www.oregonlive.com
December 4, 2024 at 8:04 PM
Reposted by Noah Legall, Ph. D.
I genuinely know little about what’s going on in South Korea, but this video is pretty awesome.
Lee Jae-myung, Leader of South Korea's Democratic Party, live-streamed himself scaling the walls of the National Assembly to bypass military barricades so that he could vote to overturn the President's martial law.
December 3, 2024 at 5:47 PM
I'm interested to dig into this - hopefully it can make feature attribution a bit more scalable to larger datasets (a claim made in the preprint).
Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution
Many tasks in explainable machine learning, such as data valuation and feature attribution, perform expensive computation for each data point and are intractable for large datasets. These methods requ...
arxiv.org
December 2, 2024 at 11:12 PM
Reposted by Noah Legall, Ph. D.
Accelerating whole-genome alignment in the age of complete genome assemblies https://www.biorxiv.org/content/10.1101/2024.11.25.625328v1 🧬🖥️🧪 https://github.com/at-cg/mm2-plus
December 2, 2024 at 4:30 PM
Reposted by Noah Legall, Ph. D.
just sustained multiple fractures in a stampede after trying to buy a coffee maker for my eight week old child
November 29, 2024 at 1:35 PM
Reposted by Noah Legall, Ph. D.
Denver gave people experiencing homelessness $1k/month. A year later, nearly half had housing.

They also had fewer ER visits, nights spent in a hospital, and jail stays.

The report estimates that this reduction in public service use SAVED the city $589k.
www.businessinsider.com/denver-basic...
Denver gave people experiencing homelessness $1,000 a month. A year later, nearly half of participants said they had housing.
Participants in Denver's basic-income program reported having more-secure housing, though results were similar in the trial and control groups.
www.businessinsider.com
November 26, 2024 at 12:47 AM
An idea I've been toying with in my spare time: finding out which observations in a training dataset are the most influential on a model's predictions. Can we not do better than Leave-One-Out?
November 25, 2024 at 5:15 PM
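For context, here is a minimal sketch of the Leave-One-Out baseline that post refers to. It assumes a toy one-dimensional least-squares model; the function names (`fit_line`, `loo_influence`) and the data are hypothetical and chosen only for illustration. It is not the approach the post is looking for, just the thing it hopes to improve on.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def predict(model, x):
    a, b = model
    return a + b * x


def loo_influence(xs, ys, x_query):
    """Change in the prediction at x_query when each training point is dropped.

    One full refit per training point, which is why Leave-One-Out
    gets expensive for large datasets or slow-to-train models.
    """
    full = predict(fit_line(xs, ys), x_query)
    scores = []
    for i in range(len(xs)):
        xs_i = xs[:i] + xs[i + 1:]
        ys_i = ys[:i] + ys[i + 1:]
        scores.append(full - predict(fit_line(xs_i, ys_i), x_query))
    return scores


# Hypothetical toy data: the last observation is an outlier.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [0.0, 1.0, 2.0, 3.0, 10.0]
scores = loo_influence(xs, ys, x_query=5.0)
most_influential = max(range(len(xs)), key=lambda i: abs(scores[i]))
print(most_influential, scores)
```

The cost is one retraining run per observation, so the question in the post amounts to asking for influence estimates that avoid those refits.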
Reposted by Noah Legall, Ph. D.
Real science:
1. Is not correlated with the amount of funding
2. Progresses slowly, usually taking many years
3. Leads to more questions than it answers
4. Advances when we disengage & in improvisational discussions
5. Is too important a thing to be done in a non-playful way
November 24, 2024 at 11:37 PM
Reposted by Noah Legall, Ph. D.
Hi, can you help me? I want to develop a model that makes risk predictions.

Use logistic regression.

Can I use some more modern techniques, like AI?

Use a neural network with a single non-hidden feed-forward layer that outputs to a single dimension using a sigmoid activation function.
November 21, 2024 at 1:18 PM
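The joke in that repost can be checked directly. Below is a small sketch, assuming made-up weights and inputs and hypothetical function names, showing that a network with one dense layer, one output unit, and a sigmoid activation computes the same thing as logistic regression.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def logistic_regression(x, weights, bias):
    """Predicted probability from a logistic regression model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def one_layer_network(x, weight_matrix, biases):
    """Feed-forward pass through a single dense layer with sigmoid activation.

    weight_matrix has one row per output unit; there are no hidden layers.
    """
    return [
        sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weight_matrix, biases)
    ]


# Hypothetical parameters and input, chosen only for illustration.
weights, bias = [0.8, -1.5, 0.3], 0.2
x = [1.0, 0.5, -2.0]

p_logreg = logistic_regression(x, weights, bias)
# The "network" has a single output unit, so its weight matrix is one row.
p_network = one_layer_network(x, [weights], [bias])[0]
print(p_logreg, p_network)
```

With the same weights and bias the two outputs match, since both are a sigmoid applied to the same linear combination of the inputs. Any difference in practice would come from how the parameters are fitted, not from the model form.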
Reposted by Noah Legall, Ph. D.
The new version of the Nextflow VS Code extension with the language server is awesome!

One of my favorite little things you can do is preview the workflow DAG while you are writing and developing your workflow 🚀 🤩! (Showcased below using the nf-core/demo pipeline: nf-co.re/demo/1.0.1). #Nextflow
November 21, 2024 at 2:37 PM
Reposted by Noah Legall, Ph. D.
Bakta: rapid & standardized annotation of bacterial genomes, MAGs & plasmids
🦠🧬💻 Bakta v1.10 - largest update so far:

Highlights:
- user-provided HMMs: --hmms
- output file recovery from JSON files: bakta_io
- export of inference metrics: inference.tsv
- bypass overlap filters: --skip-filter
- improved genome plots

github.com/oschwengers/...

👇 (1/11)
Release v1.10 - Novel in & novel out · oschwengers/bakta
This is the tenth minor release (v1.10) introducing user-provided HMMs, output file recovery, feature inference scores, and various improvements. Compatible database scheme version: 5 Important Si...
github.com
November 18, 2024 at 11:57 AM