Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

Research output: Journal contributions > Journal articles > Research > peer-review

High-resolution mass spectrometry is widely used across research fields because it allows accurate mass determination. High-resolution profile-mode mass spectra are routinely reduced to centroided data, on which many data processing routines rely for further evaluation. These approaches, however, do not preserve information on the quality of the peak profile, which makes it almost impossible to describe the reliability of the results. We overcome this limitation with a new statistical parameter, the data quality score (DQS). To calculate the DQS, we performed a fast and robust regression analysis of the individual high-resolution peak profiles and used error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm against the vendor-specific algorithm implemented in ProteoWizard’s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screening, where the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia and supports multi- and cross-platform downstream data handling.
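
The abstract does not specify the regression model, so the following is only an illustrative sketch of the general idea (regression on a single peak profile, then error propagation to the centroid), not the published DQS algorithm. It assumes a Gaussian peak shape, fits a quadratic to the log-transformed intensities by ordinary least squares, and propagates the coefficient covariance to the centroid to first order. The function name `fit_gaussian_centroid` and the synthetic peak are hypothetical, and no DQS value is computed.

```python
import numpy as np


def fit_gaussian_centroid(mz, intensity):
    """Estimate a peak centroid and its standard uncertainty.

    Assumption (not from the source): the profile is Gaussian, so
    ln(I) is quadratic in m/z and can be fitted by linear least squares.
    Requires at least 4 points with strictly positive intensities.
    """
    mz = np.asarray(mz, dtype=float)
    intensity = np.asarray(intensity, dtype=float)

    # Center and scale m/z so the design matrix is well conditioned.
    shift, scale = mz.mean(), mz.std()
    x = (mz - shift) / scale
    y = np.log(intensity)

    # Model: y = b0 + b1*x + b2*x^2
    X = np.column_stack([np.ones_like(x), x, x**2])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    _, b1, b2 = beta

    # Covariance of the coefficients from the residual variance.
    resid = y - X @ beta
    dof = x.size - 3
    cov = (resid @ resid / dof) * np.linalg.inv(X.T @ X)

    # Vertex of the parabola in scaled units, then first-order
    # error propagation using the gradient with respect to (b0, b1, b2).
    vertex = -b1 / (2.0 * b2)
    grad = np.array([0.0, -1.0 / (2.0 * b2), b1 / (2.0 * b2**2)])
    vertex_var = grad @ cov @ grad

    centroid = shift + scale * vertex
    uncertainty = scale * np.sqrt(vertex_var)
    return centroid, uncertainty


# Synthetic example: a Gaussian peak with 1 % multiplicative noise.
rng = np.random.default_rng(0)
mz = 200.1234 + np.linspace(-0.004, 0.005, 10)
profile = 1e5 * np.exp(-((mz - 200.1234) ** 2) / (2 * 0.002**2))
profile *= 1 + 0.01 * rng.standard_normal(mz.size)
centroid, uncertainty = fit_gaussian_centroid(mz, profile)
print(f"centroid = {centroid:.6f}, uncertainty = {uncertainty:.2e}")
```

In this sketch a distorted profile, for example one broadened by a non-resolved isobaric ion, would fit the quadratic poorly and yield a larger propagated uncertainty, which is the kind of information a plain centroid list discards.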

Original language: English
Journal: Analytical and Bioanalytical Chemistry
Volume: 414
Issue number: 22
Pages (from-to): 6635-6645
Number of pages: 11
ISSN: 1618-2642
Publication status: Published - 09.2022
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2022, The Author(s).

    Research areas

  • Centroiding, Data processing, Data quality, HRMS
  • Chemistry
