Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

Research output: Journal contributionsJournal articlesResearchpeer-review

Standard

Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra. / Reuschenbach, Max; Hohrenk-Danzouma, Lotta L.; Schmidt, Torsten C. et al.
In: Analytical and Bioanalytical Chemistry, Vol. 414, No. 22, 09.2022, p. 6635-6645.

Research output: Journal contributionsJournal articlesResearchpeer-review

Harvard

APA

Vancouver

Bibtex

@article{aee39fc4efe149d8818ad67ed6f336e3,
title = "Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra",
abstract = "High-resolution mass spectrometry is widely used in many research fields allowing for accurate mass determinations. In this context, it is pretty standard that high-resolution profile mode mass spectra are reduced to centroided data, which many data processing routines rely on for further evaluation. Yet information on the peak profile quality is not conserved in those approaches; i.e., describing results reliability is almost impossible. Therefore, we overcome this limitation by developing a new statistical parameter called data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm with the vendor-specific algorithm implemented in Proteowizard{\textquoteright}s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.",
keywords = "Centroiding, Data processing, Data quality, HRMS, Chemistry",
author = "Max Reuschenbach and Hohrenk-Danzouma, {Lotta L.} and Schmidt, {Torsten C.} and Gerrit Renner",
note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",
year = "2022",
month = sep,
doi = "10.1007/s00216-022-04224-y",
language = "English",
volume = "414",
pages = "6635--6645",
journal = "Analytical and Bioanalytical Chemistry",
issn = "1618-2642",
publisher = "Springer Science and Business Media Deutschland",
number = "22",

}

RIS

TY - JOUR

T1 - Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra

AU - Reuschenbach, Max

AU - Hohrenk-Danzouma, Lotta L.

AU - Schmidt, Torsten C.

AU - Renner, Gerrit

N1 - Publisher Copyright: © 2022, The Author(s).

PY - 2022/9

Y1 - 2022/9

N2 - High-resolution mass spectrometry is widely used in many research fields allowing for accurate mass determinations. In this context, it is pretty standard that high-resolution profile mode mass spectra are reduced to centroided data, which many data processing routines rely on for further evaluation. Yet information on the peak profile quality is not conserved in those approaches; i.e., describing results reliability is almost impossible. Therefore, we overcome this limitation by developing a new statistical parameter called data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm with the vendor-specific algorithm implemented in Proteowizard’s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.

AB - High-resolution mass spectrometry is widely used in many research fields allowing for accurate mass determinations. In this context, it is pretty standard that high-resolution profile mode mass spectra are reduced to centroided data, which many data processing routines rely on for further evaluation. Yet information on the peak profile quality is not conserved in those approaches; i.e., describing results reliability is almost impossible. Therefore, we overcome this limitation by developing a new statistical parameter called data quality score (DQS). For the DQS calculations, we performed a very fast and robust regression analysis of the individual high-resolution peak profiles and considered error propagation to estimate the uncertainties of the regression coefficients. We successfully validated the new algorithm with the vendor-specific algorithm implemented in Proteowizard’s msConvert. Moreover, we show that the DQS is a sum parameter associated with centroid accuracy and precision. We also demonstrate the benefit of the new algorithm in nontarget screenings as the DQS prioritizes signals that are not influenced by non-resolved isobaric ions or isotopic fine structures. The algorithm is implemented in Python, R, and Julia programming languages and supports multi- and cross-platform downstream data handling.

KW - Centroiding

KW - Data processing

KW - Data quality

KW - HRMS

KW - Chemistry

UR - http://www.scopus.com/inward/record.url?scp=85134558702&partnerID=8YFLogxK

U2 - 10.1007/s00216-022-04224-y

DO - 10.1007/s00216-022-04224-y

M3 - Journal articles

C2 - 35871703

AN - SCOPUS:85134558702

VL - 414

SP - 6635

EP - 6645

JO - Analytical and Bioanalytical Chemistry

JF - Analytical and Bioanalytical Chemistry

SN - 1618-2642

IS - 22

ER -

Recently viewed

Publications

  1. Combining multiple investigative approaches to unravel functional responses to global change in the understorey of temperate forests
  2. TARGET SETTING FOR OPERATIONAL PERFORMANCE IMPROVEMENTS - STUDY CASE -
  3. Employing A-B tests for optimizing prices levels in e-commerce applications
  4. Distributable Modular Software Framework for Manufacturing Systems
  5. Incorporating ecosystem services into ecosystem-based management to deal with complexity
  6. Switching Dispatching Rules with Gaussian Processes
  7. How does telework modify informal workplace learning and how can supervisors provide support?
  8. Pushing the Envelope: Creating Public Value in the Labor Market
  9. Learning in the "Third Space"
  10. Digital Seriality as Structure and Process
  11. Evaluating a Bayesian Student Model of Decimal Misconceptions
  12. On Software, or the Persistence of Visual Knowledge.
  13. Parameterized Synthetic Image Data Set for Fisheye Lens
  14. Gaining deep leverage? Reflecting and shaping real-world lab impacts through leverage points
  15. Discrete Lyapunov Controllers for an Actuator in Camless Engines
  16. An Exploration of humans‘ ability to recognize emotions displayed by robots
  17. Toward a lifespan metric of reading fluency
  18. Anonymized firm data under test: evidence from a replication study
  19. Contested Promises
  20. Development and validation of the Later Life Work Index for successful management of an aging workforce
  21. Comparing Web-Based and Blended Training for Coping With Challenges of Flexible Work Designs
  22. Thanking and responding to thanks in American English: Language patterning and contextual appropriateness
  23. Exploring the implications of the value concept for performance assessment of sustainable business models
  24. Extraction of information from invoices - challenges in the extraction pipeline
  25. Developing robust field survey protocols in landscape ecology
  26. Second-order SMC with disturbance compensation for robust tracking control in PMSM applications
  27. Sprachliche Muster
  28. Development and criterion validity of differentiated and elevated vocational interests in adolescence
  29. Agile Portfolio Management Patterns
  30. Threshold Level
  31. Collaborative decision making in sustainable flood risk management
  32. Temporal and thermodynamic irreversibility in production theory
  33. Adding the “e-” to Learning for Sustainable Development
  34. Safer Spaces
  35. Microtomography on biomaterials using the harwi-2 beamline at desy
  36. Tormentil for active ulcerative colitis
  37. The relationship between acculturation strategies and depressive and anxiety disorders in Turkish migrants in the Netherlands
  38. Improving efficiency in budgeting
  39. The State and Healthcare
  40. Hot workability analysis with processing map and texture characteristics of as-cast TX32 magnesium alloy
  41. Resettlement as a temporal border