Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

High-resolution mass spectrometry is widely used in many research fields because it allows accurate mass determination. High-resolution profile-mode mass spectra are routinely reduced to centroided data, on which many data processing routines rely for further evaluation. However, these approaches do not preserve information on peak profile quality, so the reliability of the results is almost impossible to describe. To overcome this limitation, we developed a new statistical parameter called the data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm against the vendor-specific algorithm implemented in ProteoWizard's msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings, as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in the Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.
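The general idea of fitting a peak profile by regression and propagating the coefficient uncertainties to the centroid can be sketched as follows. This is only an illustration under stated assumptions, not the published DQS algorithm: it assumes a Gaussian peak shape, fits a parabola to the log-transformed intensities by ordinary least squares, and uses first-order error propagation. The function name `fit_centroid` and the synthetic peak are hypothetical.

```python
import numpy as np


def fit_centroid(mz, intensity):
    """Return (centroid, standard_error) of one profile peak.

    Assumption: the peak is Gaussian, so ln(intensity) is a parabola in m/z:
    ln(I) = b0 + b1*x + b2*x**2, with its maximum at x = -b1 / (2*b2).
    """
    mz = np.asarray(mz, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    if mz.size < 4:
        raise ValueError("need at least 4 points to estimate uncertainties")

    # Centre and scale the m/z axis so the design matrix is well conditioned.
    shift, scale = mz.mean(), mz.std()
    x = (mz - shift) / scale
    y = np.log(intensity)

    # Ordinary least-squares fit of the log-parabola.
    X = np.column_stack([np.ones_like(x), x, x**2])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    b1, b2 = beta[1], beta[2]
    if b2 >= 0:
        raise ValueError("fitted parabola has no maximum; not a peak")

    # Covariance of the regression coefficients from the residual variance.
    resid = y - X @ beta
    s2 = resid @ resid / (x.size - 3)
    cov = s2 * np.linalg.inv(X.T @ X)

    # First-order error propagation: gradient of -b1/(2*b2) w.r.t. (b0, b1, b2).
    grad = np.array([0.0, -1.0 / (2.0 * b2), b1 / (2.0 * b2**2)])
    var = max(float(grad @ cov @ grad), 0.0)

    centroid = shift + scale * (-b1 / (2.0 * b2))
    return centroid, scale * np.sqrt(var)


# Synthetic example peak (hypothetical values, for illustration only).
true_mz, width = 200.0005, 0.002
grid = np.linspace(199.995, 200.006, 9)
clean = 1e5 * np.exp(-((grid - true_mz) ** 2) / (2 * width**2))
rng = np.random.default_rng(0)
noisy = clean * np.exp(rng.normal(0.0, 0.01, grid.size))

clean_centroid, clean_se = fit_centroid(grid, clean)
noisy_centroid, noisy_se = fit_centroid(grid, noisy)
print(clean_centroid, clean_se)
print(noisy_centroid, noisy_se)
```

In this sketch, a distorted or noisy profile gives a larger standard error on the centroid, which is the kind of information that plain centroiding discards. How the published DQS combines such uncertainties into a single score is not specified in the abstract and is not reproduced here.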

Original language: English
Journal: Analytical and Bioanalytical Chemistry
Volume: 414
Issue number: 22
Pages (from-to): 6635-6645
Number of pages: 11
ISSN: 1618-2642
DOIs
Publication status: Published - 09.2022
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2022, The Author(s).

    Research areas

  • Centroiding, Data processing, Data quality, HRMS
  • Chemistry
