y-Randomization and its variants in QSPR/QSAR

Publication: Contributions to journals > Journal articles > Research > Peer-reviewed

Authors

y-Randomization is a tool used in validation of QSPR/QSAR models, whereby the performance of the original model in data description (r²) is compared to that of models built for permuted (randomly shuffled) response, based on the original descriptor pool and the original model building procedure. We compared y-randomization and several variants thereof, using original response, permuted response, or random number pseudoresponse and original descriptors or random number pseudodescriptors, in the typical setting of multilinear regression (MLR) with descriptor selection. For each combination of number of observations (compounds), number of descriptors in the final model, and number of descriptors in the pool to select from, computer experiments using the same descriptor selection method result in two different mean highest random r² values. A lower one is produced by y-randomization or a variant likewise based on the original descriptors, while a higher one is obtained from variants that use random number pseudodescriptors. The difference is due to the intercorrelation of real descriptors in the pool. We propose to compare an original model's r² to both of these whenever possible. The meaning of the three possible outcomes of such a double test is discussed. Often y-randomization is not available to a potential user of a model, because the values of all descriptors in the pool for all compounds are not published. In such cases random number experiments as proposed here are still possible. The test was applied to several recently published MLR QSAR equations, and cases of failure were identified. Some progress is also reported toward the aim of obtaining the mean highest r² of random pseudomodels by calculation rather than by tedious multiple simulations on random number variables.
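The double test described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' procedure: the data are synthetic, greedy forward selection stands in for "the original model building procedure", and the names `r_squared`, `forward_select_r2`, and `mean_random_r2` are hypothetical helpers. It estimates the mean highest random r² in two ways, by permuting the response against the original descriptor pool (y-randomization) and by replacing the pool with random number pseudodescriptors, so that both can be compared with the original model's r².

```python
import numpy as np

def r_squared(X, y):
    """r^2 of an ordinary least-squares fit of y on the columns of X, with intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - (resid @ resid) / ss_tot

def forward_select_r2(pool, y, n_final):
    """Greedy forward selection of n_final descriptors from the pool.

    Stand-in for the model building procedure (an assumption of this sketch).
    Returns the r^2 of the final selected model.
    """
    chosen, best = [], 0.0
    for _ in range(n_final):
        candidates = [j for j in range(pool.shape[1]) if j not in chosen]
        scores = [r_squared(pool[:, chosen + [j]], y) for j in candidates]
        k = int(np.argmax(scores))
        chosen.append(candidates[k])
        best = scores[k]
    return best

def mean_random_r2(pool, y, n_final, n_trials, rng, mode):
    """Mean of the highest r^2 reached in n_trials random experiments."""
    values = []
    for _ in range(n_trials):
        if mode == "y_randomization":
            # Permuted response, original descriptor pool.
            values.append(forward_select_r2(pool, rng.permutation(y), n_final))
        elif mode == "pseudodescriptors":
            # Original response, random number pseudodescriptors of the same shape.
            fake_pool = rng.standard_normal(pool.shape)
            values.append(forward_select_r2(fake_pool, y, n_final))
        else:
            raise ValueError(f"unknown mode: {mode}")
    return float(np.mean(values))

# Synthetic example (assumed sizes): 30 compounds, pool of 20 intercorrelated
# descriptors built from 3 latent factors, final model with 3 descriptors.
rng = np.random.default_rng(0)
n_obs, n_pool, n_final = 30, 20, 3
latent = rng.standard_normal((n_obs, 3))
pool = latent @ rng.standard_normal((3, n_pool)) + 0.5 * rng.standard_normal((n_obs, n_pool))
y = 2.0 * pool[:, 0] - 1.5 * pool[:, 1] + 0.3 * rng.standard_normal(n_obs)

r2_original = forward_select_r2(pool, y, n_final)
r2_yrand = mean_random_r2(pool, y, n_final, 100, rng, "y_randomization")
r2_pseudo = mean_random_r2(pool, y, n_final, 100, rng, "pseudodescriptors")
print(r2_original, r2_yrand, r2_pseudo)
```

In this sketch the original r² is compared with both random means. When the full descriptor pool of a published model is unavailable, only the pseudodescriptor branch can be run, which needs just the three numbers named in the abstract (observations, descriptors in the final model, descriptors in the pool) plus the response.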

Translated title: y-Randomisierung in QSPR/QSAR
Original language: English
Journal: Journal of Chemical Information and Modeling
Volume: 47
Issue number: 6
Pages (from - to): 2345-2357
Number of pages: 13
ISSN: 1549-9596
Publication status: Published - November 2007
Externally published: Yes
