y-Randomization and its variants in QSPR/QSAR

Christoph Rücker; G. Rücker; M. Meringer

doi:10.1021/ci700157b

y-Randomization and its variants in QSPR/QSAR

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

Christoph Rücker
G. Rücker
M. Meringer

y-Randomization is a tool used in validation of QSPR/QSAR models, whereby the performance of the original model in data description (r ²) is compared to that of models built for permuted (randomly shuffled) response, based on the original descriptor pool and the original model building procedure. We compared y-randomization and several variants thereof, using original response, permuted response, or random number pseudoresponse and original descriptors or random number pseudodescriptors, in the typical setting of multilinear regression (MLR) with descriptor selection. For each combination of number of observations (compounds), number of descriptors in the final model, and number of descriptors in the pool to select from, computer experiments using the same descriptor selection method result in two different mean highest random r ² values. A lower one is produced by y-randomization or a variant likewise based on the original descriptors, while a higher one is obtained from variants that use random number pseudodescriptors. The difference is due to the intercorrelation of real descriptors in the pool. We propose to compare an original model's r ² to both of these whenever possible. The meaning of the three possible outcomes of such a double test is discussed. Often y-randomization is not available to a potential user of a model, due to the values of all descriptors in the pool for all compounds not being published. In such cases random number experiments as proposed here are still possible. The test was applied to several recently published MLR QSAR equations, and cases of failure were identified. Some progress also is reported toward the aim of obtaining the mean highest r ² of random pseudomodels by calculation rather than by tedious multiple simulations on random number variables.

Translated title of the contribution	y-Randomisierung in QSPR/QSAR
Original language	English
Journal	Journal of Chemical Information and Modeling
Volume	47
Issue number	6
Pages (from-to)	2345-2357
Number of pages	13
ISSN	1549-9596
DOIs	https://doi.org/10.1021/ci700157b
Publication status	Published - 11.2007
Externally published	Yes

ASJC Scopus Subject Areas

Research areas

Chemistry

Comment on "nomenclature, Chemical Abstracts Service Numbers, Isomer Enumeration, Ring Strain, and Stereochemistry: What Does Any of This Have to Do with an International Chemical Disarmament and Nonproliferation Treaty?"

Rücker, C., Meringer, M. & Wassermann, A., 13.04.2021, In: Journal of Chemical Education. 98, 4, p. 1465-1467 3 p.

Research output: Journal contributions › Comments / Debate / Reports › Research

Octanol-Water Partition Coefficient Measurement by a Simple ¹H NMR Method

Cumming, H. & Rücker, C., 30.09.2017, In: ACS Omega. 2, 9, p. 6244-6249 6 p.

Research output: Journal contributions › Journal articles › Research › peer-review

DOI

https://doi.org/10.1021/ci700157b
Final published version

y-Randomization and its variants in QSPR/QSAR

Authors

ASJC Scopus Subject Areas

Research areas

Related by journal

Exploring the limits of graph invariant- and spectrum-based discrimination of (sub)structures.

Molecules in silico: A graph description of chemical reactions

Organic Synthesis – Art or Science?

QSPR Using MOLGEN-QSPR: The challenge of fluoroalkane boiling points

Substructure, subgraph, and walk counts as measures of the complexity of graphs and molecules.

Other publications by the same author(s)

Are Si-C bonds cleaved by microorganisms? A critical review on biodegradation of methylsiloxanes

Are Si–C bonds formed in the environment and/or in technical microbiological systems?

REACH und QSAR: ein Leitfaden für kleine und mittlere Unternehmen

Comment on "nomenclature, Chemical Abstracts Service Numbers, Isomer Enumeration, Ring Strain, and Stereochemistry: What Does Any of This Have to Do with an International Chemical Disarmament and Nonproliferation Treaty?"

Octanol-Water Partition Coefficient Measurement by a Simple ¹H NMR Method

DOI

Recently viewed

Activities

Publications

Press / Media