Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

Research output: Journal contributions › Journal articles › Research › peer-review

High-resolution mass spectrometry is widely used in many research fields because it allows accurate mass determination. In this context, high-resolution profile-mode mass spectra are routinely reduced to centroided data, on which many data processing routines rely for further evaluation. Yet these approaches do not conserve information on the quality of the peak profile, so the reliability of the results can hardly be described. We overcome this limitation by developing a new statistical parameter called the data quality score (DQS). To calculate the DQS, we performed a fast and robust regression analysis of the individual high-resolution peak profiles and used error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm against the vendor-specific algorithm implemented in ProteoWizard’s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screening, where the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in the Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.
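The regression-plus-error-propagation idea can be illustrated with a minimal sketch. This is not the authors' published implementation and it does not compute the DQS itself, whose definition is not given in this record. It assumes a Gaussian peak shape, so that the logarithm of the intensity is a quadratic in m/z, fits that quadratic by ordinary least squares, and propagates the coefficient covariance to a standard error for the centroid. The function name `fit_centroid` and all numerical values are hypothetical.

```python
import numpy as np


def fit_centroid(mz, intensity):
    """Estimate a peak centroid and its standard error from profile points.

    Assumption (not from the source): the peak is Gaussian, so
    ln(intensity) = b0 + b1*x + b2*x**2 with b2 < 0.
    """
    mz = np.asarray(mz, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    if mz.size < 4:
        raise ValueError("need at least 4 points to estimate uncertainties")
    if np.any(intensity <= 0):
        raise ValueError("intensities must be positive for the log transform")

    # Center and scale m/z so the design matrix is well conditioned.
    x0 = mz.mean()
    scale = np.ptp(mz) / 2.0
    x = (mz - x0) / scale

    # Ordinary least squares on the log-transformed intensities.
    X = np.column_stack([np.ones_like(x), x, x**2])
    y = np.log(intensity)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)

    # Covariance of the coefficients from the residual variance.
    resid = y - X @ beta
    sigma2 = (resid @ resid) / (mz.size - 3)
    cov = sigma2 * np.linalg.inv(X.T @ X)

    b1, b2 = beta[1], beta[2]
    if b2 >= 0:
        raise ValueError("fitted parabola opens upward; not a peak")

    # Vertex of the parabola, mapped back to the m/z axis.
    centroid = x0 - scale * b1 / (2.0 * b2)

    # First-order error propagation: gradient of -b1/(2*b2) w.r.t. (b1, b2).
    grad = np.array([-1.0 / (2.0 * b2), b1 / (2.0 * b2**2)])
    var = max(float(grad @ cov[1:, 1:] @ grad), 0.0)
    return centroid, scale * np.sqrt(var)


# Synthetic, made-up profile peak with 2 % multiplicative noise.
rng = np.random.default_rng(0)
mz = 200.0 + np.linspace(-0.004, 0.004, 9)
clean = 1e5 * np.exp(-0.5 * ((mz - 200.0005) / 0.002) ** 2)
noisy = clean * np.exp(rng.normal(0.0, 0.02, mz.size))
centroid, centroid_se = fit_centroid(mz, noisy)
print(f"centroid = {centroid:.6f}, standard error = {centroid_se:.2e}")
```

A small standard error indicates that the points follow the assumed peak shape closely. A distorted profile, for example from a non-resolved isobaric ion, would inflate the residual variance and therefore the propagated uncertainty.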

Original language: English
Journal: Analytical and Bioanalytical Chemistry
Volume: 414
Issue number: 22
Pages (from-to): 6635-6645
Number of pages: 11
ISSN: 1618-2642
Publication status: Published - 09.2022
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2022, The Author(s).

    Research areas

  • Centroiding, Data processing, Data quality, HRMS
  • Chemistry
