JOURNAL ARTICLE

An algorithm for decoy-free false discovery rate estimation in XL-MS/MS proteomics

Yisu PengShantanu JainPredrag Radivojac

Year: 2024 Journal:   Bioinformatics Vol: 40 (Supplement_1)Pages: i428-i436   Publisher: Oxford University Press

Abstract

Abstract Motivation Cross-linking tandem mass spectrometry (XL-MS/MS) is an established analytical platform used to determine distance constraints between residues within a protein or from physically interacting proteins, thus improving our understanding of protein structure and function. To aid biological discovery with XL-MS/MS, it is essential that pairs of chemically linked peptides be accurately identified, a process that requires: (i) database search, that creates a ranked list of candidate peptide pairs for each experimental spectrum and (ii) false discovery rate (FDR) estimation, that determines the probability of a false match in a group of top-ranked peptide pairs with scores above a given threshold. Currently, the only available FDR estimation mechanism in XL-MS/MS is the target-decoy approach (TDA). However, despite its simplicity, TDA has both theoretical and practical limitations that impact the estimation accuracy and increase run time over potential decoy-free approaches (DFAs). Results We introduce a novel decoy-free framework for FDR estimation in XL-MS/MS. Our approach relies on multi-sample mixtures of skew normal distributions, where the latent components correspond to the scores of correct peptide pairs (both peptides identified correctly), partially incorrect peptide pairs (one peptide identified correctly, the other incorrectly), and incorrect peptide pairs (both peptides identified incorrectly). To learn these components, we exploit the score distributions of first- and second-ranked peptide-spectrum matches for each experimental spectrum and subsequently estimate FDR using a novel expectation-maximization algorithm with constraints. We evaluate the method on ten datasets and provide evidence that the proposed DFA is theoretically sound and a viable alternative to TDA owing to its good performance in terms of accuracy, variance of estimation, and run time. Availability and implementation https://github.com/shawn-peng/xlms

Keywords:
Decoy False discovery rate Computer science Algorithm Function (biology) Peptide Data mining Computational biology Chemistry Biology

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
54
Refs
0.07
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Proteomics Techniques and Applications
Physical Sciences →  Chemistry →  Spectroscopy
Mass Spectrometry Techniques and Applications
Physical Sciences →  Chemistry →  Spectroscopy
Pesticide Residue Analysis and Safety
Life Sciences →  Agricultural and Biological Sciences →  Food Science

Related Documents

JOURNAL ARTICLE

Decoy-free protein-level false discovery rate estimation

Ben TengTing HuangZengyou He

Journal:   Bioinformatics Year: 2013 Vol: 30 (5)Pages: 675-681
JOURNAL ARTICLE

Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics

Dominik MadejLong WuHenry Lam

Journal:   Journal of Proteome Research Year: 2022 Vol: 21 (2)Pages: 339-348
JOURNAL ARTICLE

False Discovery Rate Estimation in Proteomics

Suruchi AggarwalAmit Kumar Yadav

Journal:   Methods in molecular biology Year: 2015 Vol: 1362 Pages: 119-128
JOURNAL ARTICLE

Target‐decoy false discovery rate estimation using Crema

Andy LinDonavan SeeWilliam E. FondrieUri KeichWilliam Stafford Noble

Journal:   PROTEOMICS Year: 2024 Vol: 24 (8)Pages: e2300084-e2300084
© 2026 ScienceGate Book Chapters — All rights reserved.