Uri Keich
urikeich.bsky.social
Uri Keich
@urikeich.bsky.social
Reposted by Uri Keich

How can you transfer peptide IDs between runs and still control your false discovery rate? Till now, the short answer is, you couldn't. Now you can, with PIP-ECHO.

www.biorxiv.org/content/10.1...
Improved detection of differentially abundant proteins through FDR-control of peptide-identity-propagation
Quantitative analysis of proteomics data frequently employs peptide-identity-propagation (PIP) — also known as match-between-runs (MBR) — to increase the number of peptides quantified in a given LC-MS...
www.biorxiv.org
December 2, 2024 at 5:10 PM
Reposted by Uri Keich
Re-posting our new preprint on match between runs. This multi-lab effort (Keich, Noble, Payne & Smith) led by Alex Solivais should be of interest to anyone doing LFQ. We describe here how to control FDR in LFQ and provide the open source software to do it.
www.biorxiv.org/content/10.1...
Improved detection of differentially abundant proteins through FDR-control of peptide-identity-propagation
Quantitative analysis of proteomics data frequently employs peptide-identity-propagation (PIP) — also known as match-between-runs (MBR) — to increase the number of peptides quantified in a given LC-MS/MS experiment. PIP can routinely account for up to 40% of all quantitative results, with that proportion rising as high as 75% in single-cell proteomics. Therefore, a significant concern for any PIP method is the possibility of false discoveries: errors that result in peptides being quantified incorrectly. Although several tools for label-free quantification (LFQ) claim to control the false discovery rate (FDR) of PIP, these claims cannot be validated as there is currently no accepted method to assess the accuracy of the stated FDR. We present a method for FDR control of PIP, called “PIP-ECHO” (PIP Error Control via Hybrid cOmpetition) and devise a rigorous protocol for evaluating FDR control of any PIP method. Using three different datasets, we evaluate PIP-ECHO alongside the PIP procedures implemented by FlashLFQ, IonQuant, and MaxQuant. These analyses show that PIP-ECHO can accurately control the FDR of PIP at 1% across multiple datasets. Only PIP-ECHO was able to control the FDR in data with injected sample size equivalent to a single-cell dataset. The three other methods fail to control the FDR at 1%, yielding false discovery proportions ranging from 2–6%. We demonstrate the practical implications of this work by performing differential expression analyses on spike-in datasets, where different known amounts of yeast or E. coli peptides are added to a constant background of HeLa cell lysate peptides. In this setting, PIP-ECHO increases both the accuracy and sensitivity of differential expression analysis: our implementation of PIP-ECHO within FlashLFQ enables the detection of 53% more differentially abundant proteins than MaxQuant and 146% more than IonQuant in the spike-in dataset. ### Competing Interest Statement The authors have declared no competing interest.
www.biorxiv.org
December 2, 2024 at 5:05 PM
Reposted by Uri Keich
BLAST is a fantastic tool that has enabled sequence-driven discovery for over 30 years. But, alas, the E-value that it reports turns out to have some serious problems. Here we propose a fix. It's more computationally expensive, but computers are a bit faster than they were in 1990.
bit.ly/3ZDgYt8
A BLAST from the past: revisiting blastp’s E-value
AbstractMotivation. The Basic Local Alignment Search Tool, BLAST, is an indispensable tool for genomic research. BLAST established itself as the canonical
academic.oup.com
December 6, 2024 at 7:24 PM