Challenges in Lipidomics Biomarker Identification: Avoiding the Pitfalls and Improving Reproducibility

Bibliographic Details
Title: Challenges in Lipidomics Biomarker Identification: Avoiding the Pitfalls and Improving Reproducibility
Authors: Johanna von Gerichten, Kyle Saunders, Melanie J. Bailey, Lee A. Gethings, Anthony Onoja, Nophar Geifman, Matt Spick
Source: Metabolites, Vol 14, Iss 8, p 461 (2024)
Publisher Information: MDPI AG, 2024.
Publication Year: 2024
Collection: LCC:Microbiology
Subject Terms: lipidomics, separation science, mass spectrometry, bioinformatics, machine learning, retention time, Microbiology, QR1-502
More Details: Identification of features with high levels of confidence in liquid chromatography–mass spectrometry (LC–MS) lipidomics research is an essential part of biomarker discovery, but existing software platforms can give inconsistent results, even from identical spectral data. This poses a clear challenge for reproducibility in biomarker identification. In this work, we illustrate the reproducibility gap for two open-access lipidomics platforms, MS DIAL and Lipostar, finding just 14.0% identification agreement when analyzing identical LC–MS spectra using default settings. Whilst the software platforms performed more consistently using fragmentation data, agreement was still only 36.1% for MS2 spectra. This highlights the critical importance of validation across positive and negative LC–MS modes, as well as the manual curation of spectra and lipidomics software outputs, in order to reduce identification errors caused by closely related lipids and co-elution issues. This curation process can be supplemented by data-driven outlier detection in assessing spectral outputs, which is demonstrated here using a novel machine learning approach based on support vector machine regression combined with leave-one-out cross-validation. These steps are essential to reduce the frequency of false positive identifications and close the reproducibility gap, including between software platforms, which, for downstream users such as bioinformaticians and clinicians, can be an underappreciated source of biomarker identification errors.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2218-1989
Relation: https://www.mdpi.com/2218-1989/14/8/461; https://doaj.org/toc/2218-1989
DOI: 10.3390/metabo14080461
Access URL: https://doaj.org/article/f69f122041d04a2091a6a3e4389abb2e
Accession Number: edsdoj.f69f122041d04a2091a6a3e4389abb2e
Database: Directory of Open Access Journals
More Details
ISSN:22181989
DOI:10.3390/metabo14080461
Published in:Metabolites
Language:English