Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

Research output: Journal contributionsJournal articlesResearchpeer-review

Authors

High-resolution mass spectrometry is widely used in many research fields allowing for accurate mass determinations. In this context, it is pretty standard that high-resolution profile mode mass spectra are reduced to centroided data, which many data processing routines rely on for further evaluation. Yet information on the peak profile quality is not conserved in those approaches; i.e., describing results reliability is almost impossible. Therefore, we overcome this limitation by developing a new statistical parameter called data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm with the vendor-specific algorithm implemented in Proteowizard’s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.

Original languageEnglish
JournalAnalytical and Bioanalytical Chemistry
Volume414
Issue number22
Pages (from-to)6635-6645
Number of pages11
ISSN1618-2642
DOIs
Publication statusPublished - 09.2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022, The Author(s).

    Research areas

  • Centroiding, Data processing, Data quality, HRMS
  • Chemistry

Recently viewed

Publications

  1. Partitioned beta diversity patterns of plants across sharp and distinct boundaries of quartz habitat islands
  2. What can conservation strategies learn from the ecosystem services approach?
  3. An Adaptive and Optimized Switching Observer for Sensorless Control of an Electromagnetic Valve Actuator in Camless Internal Combustion Engines
  4. ASSESS — automatic self-assessment using linked data
  5. Wavelet functions for rejecting spurious values
  6. Preventive Diagnostics for cardiovascular diseases based on probabilistic methods and description logic
  7. Knowledge-Enhanced Language Models Are Not Bias-Proof
  8. An analytical approach to evaluating monotonic functions of fuzzy numbers
  9. Self-regulation in error management training: emotion control and metacognition as mediators of performance effects
  10. Spaces for challenging experiences, indeterminacy, and experimentation
  11. Commitment to grand challenges in fluid forms of organizing
  12. A structural property of the wavelet packet transform method to localise incoherency of a signal
  13. Quantum Computing and the Analog/Digital Distinction
  14. A Multimethod Latent State-Trait Model for Structurally Different and Interchangeable Methods
  15. Factor structure and measurement invariance of the Students’ Self-report Checklist of Social and Learning Behaviour (SSL)
  16. Mechanism of dynamic recrystallization and evolution of texture in the hot working domains of the processing map for Mg-4Al-2Ba-2Ca Alloy
  17. A comparison of ML, WLSMV and Bayesian methods for multilevel structural equation models in small samples: A simulation study
  18. AGDISTIS - Graph-based disambiguation of named entities using linked data
  19. Species constancy depends on plot size - A problem for vegetation classification and how it can be solved
  20. A Cross-Classified CFA-MTMM Model for Structurally Different and Nonindependent Interchangeable Methods
  21. Using Conjoint Analysis to Elicit Preferences for Occupational Health Services in Small and Microenterprises
  22. Combining multiple investigative approaches to unravel functional responses to global change in the understorey of temperate forests