Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

Research output: Journal contributionsJournal articlesResearchpeer-review

Standard

Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra. / Reuschenbach, Max; Hohrenk-Danzouma, Lotta L.; Schmidt, Torsten C. et al.
In: Analytical and Bioanalytical Chemistry, Vol. 414, No. 22, 09.2022, p. 6635-6645.

Research output: Journal contributionsJournal articlesResearchpeer-review

Harvard

APA

Vancouver

Bibtex

@article{aee39fc4efe149d8818ad67ed6f336e3,
title = "Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra",
abstract = "High-resolution mass spectrometry is widely used in many research fields allowing for accurate mass determinations. In this context, it is pretty standard that high-resolution profile mode mass spectra are reduced to centroided data, which many data processing routines rely on for further evaluation. Yet information on the peak profile quality is not conserved in those approaches; i.e., describing results reliability is almost impossible. Therefore, we overcome this limitation by developing a new statistical parameter called data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm with the vendor-specific algorithm implemented in Proteowizard{\textquoteright}s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.",
keywords = "Centroiding, Data processing, Data quality, HRMS, Chemistry",
author = "Max Reuschenbach and Hohrenk-Danzouma, {Lotta L.} and Schmidt, {Torsten C.} and Gerrit Renner",
note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",
year = "2022",
month = sep,
doi = "10.1007/s00216-022-04224-y",
language = "English",
volume = "414",
pages = "6635--6645",
journal = "Analytical and Bioanalytical Chemistry",
issn = "1618-2642",
publisher = "Springer Science and Business Media Deutschland",
number = "22",

}

RIS

TY - JOUR

T1 - Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

AU - Reuschenbach, Max

AU - Hohrenk-Danzouma, Lotta L.

AU - Schmidt, Torsten C.

AU - Renner, Gerrit

N1 - Publisher Copyright: © 2022, The Author(s).

PY - 2022/9

Y1 - 2022/9

N2 - High-resolution mass spectrometry is widely used in many research fields allowing for accurate mass determinations. In this context, it is pretty standard that high-resolution profile mode mass spectra are reduced to centroided data, which many data processing routines rely on for further evaluation. Yet information on the peak profile quality is not conserved in those approaches; i.e., describing results reliability is almost impossible. Therefore, we overcome this limitation by developing a new statistical parameter called data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm with the vendor-specific algorithm implemented in Proteowizard’s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.

AB - High-resolution mass spectrometry is widely used in many research fields allowing for accurate mass determinations. In this context, it is pretty standard that high-resolution profile mode mass spectra are reduced to centroided data, which many data processing routines rely on for further evaluation. Yet information on the peak profile quality is not conserved in those approaches; i.e., describing results reliability is almost impossible. Therefore, we overcome this limitation by developing a new statistical parameter called data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm with the vendor-specific algorithm implemented in Proteowizard’s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.

KW - Centroiding

KW - Data processing

KW - Data quality

KW - HRMS

KW - Chemistry

UR - http://www.scopus.com/inward/record.url?scp=85134558702&partnerID=8YFLogxK

U2 - 10.1007/s00216-022-04224-y

DO - 10.1007/s00216-022-04224-y

M3 - Journal articles

C2 - 35871703

AN - SCOPUS:85134558702

VL - 414

SP - 6635

EP - 6645

JO - Analytical and Bioanalytical Chemistry

JF - Analytical and Bioanalytical Chemistry

SN - 1618-2642

IS - 22

ER -

Recently viewed

Publications

  1. Soft Skills for Hard Constraints
  2. Application of feedforward artificial neural network in Muskingum flood routing
  3. Temporal processes in prime–mask interaction
  4. Diffusion patterns in small vs. large capital markets-the case of value-based management
  5. A MODEL FOR QUANTIFICATION OF SOFTWARE COMPLEXITY
  6. Challenges and boundaries in implementing social return on investment
  7. Should learners use their hands for learning? Results from an eye-tracking study
  8. Duration of Organizational Decision Processes in Organizations in View of Simulation Calculations
  9. Structural Synthesis of Parallel Robots with Unguided Linear Actuators
  10. Influence of Process Parameters and Die Design on the Microstructure and Texture Development of Direct Extruded Magnesium Flat Products
  11. Introduction Mobile Digital Practices. Situating People, Things, and Data
  12. Relationships between language-related variations in text tasks, reading comprehension, and students’ motivation and emotions: A systematic review
  13. Species composition and forest structure explain the temperature sensitivity patterns of productivity in temperate forests
  14. From Open Access to Open Science
  15. Dynamically adjusting the k-values of the ATCS rule in a flexible flow shop scenario with reinforcement learning
  16. Mathematical relation between extended connectivity and eigenvector coefficients.
  17. Technical concept and evaluation design of the state subsidized project [Level-Q]
  18. Modelling, Simulation and Experimental Analysis of a Metal-Polymer Hybrid Fibre based Microstrip Resonator for High Frequency Characterisation
  19. How generative drawing affects the learning process
  20. Introduction to ‘Exploring the frontiers: unveiling new horizons in carbon efficient biomass utilization’
  21. Taking notes as a strategy for solving reality-based tasks in mathematics
  22. Combining multiple investigative approaches to unravel functional responses to global change in the understorey of temperate forests
  23. From entity to process
  24. Understanding the socio-technical aspects of low-code adoption for software development
  25. Intraspecific trait variation patterns along a precipitation gradient in Mongolian rangelands
  26. Using Wikipedia for Cross-Language Named Entity Recognition
  27. Closed-form Solution for the Direct Kinematics Problem of the Planar 3-RPR Parallel Mechanism
  28. Artificial Intelligence in Foreign Language Learning and Teaching