Using conditional inference trees and random forests to predict the bioaccumulation potential of organic chemicals

Research output: Journal contributionsJournal articlesResearchpeer-review

Standard

Using conditional inference trees and random forests to predict the bioaccumulation potential of organic chemicals. / Strempel, Sebastian; Nendza, Monika; Scheringer, Martin et al.
In: Environmental toxicology and chemistry / SETAC, Vol. 32, No. 5, 05.2013, p. 1187-1195.

Research output: Journal contributionsJournal articlesResearchpeer-review

Harvard

APA

Vancouver

Bibtex

@article{1fe007ae0669419ca523375073077ca9,
title = "Using conditional inference trees and random forests to predict the bioaccumulation potential of organic chemicals",
abstract = "The present study presents a data-oriented, tiered approach to assessing the bioaccumulation potential of chemicals according to the European chemicals regulation on Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH). The authors compiled data for eight physicochemical descriptors (partition coefficients, degradation half-lives, polarity, and so forth) for a set of 713 organic chemicals for which experimental values of the bioconcentration factor (BCF) are available. The authors employed supervised machine learning methods (conditional inference trees and random forests) to derive relationships between the physicochemical descriptors and the BCF values. In a first tier, the authors established rules for classifying a chemical as bioaccumulative (B) or nonbioaccumulative (non-B). In a second tier, the authors developed a new tool for estimating numerical BCF values. For both cases the optimal set of relevant descriptors was determined; these are biotransformation half-life and octanol-water distribution coefficient (log D) for the classification rules and log D, biotransformation half-life, and topological polar surface area for the BCF estimation tool. The uncertainty of the BCF estimates obtained with the new estimation tool was quantified by comparing the estimated and experimental BCF values of the 713 chemicals. Comparison with existing BCF estimation methods indicates that the performance of this new BCF estimation tool is at least as high as that of existing methods. The authors recommend the present study's classification rules and BCF estimation tool for a consensus application in combination with existing BCF estimation methods.",
keywords = "Chemistry, Biotransformation, Environmental Monitoring, Environmental Pollutants, Environmental Pollution, Half-Life, Organic Chemicals, Trees",
author = "Sebastian Strempel and Monika Nendza and Martin Scheringer and Konrad Hungerb{\"u}hler",
note = "Copyright {\textcopyright} 2013 SETAC.",
year = "2013",
month = may,
doi = "10.1002/etc.2150",
language = "English",
volume = "32",
pages = "1187--1195",
journal = "Environmental toxicology and chemistry / SETAC",
issn = "1552-8618",
publisher = "Oxford University Press",
number = "5",

}

RIS

TY - JOUR

T1 - Using conditional inference trees and random forests to predict the bioaccumulation potential of organic chemicals

AU - Strempel, Sebastian

AU - Nendza, Monika

AU - Scheringer, Martin

AU - Hungerbühler, Konrad

N1 - Copyright © 2013 SETAC.

PY - 2013/5

Y1 - 2013/5

N2 - The present study presents a data-oriented, tiered approach to assessing the bioaccumulation potential of chemicals according to the European chemicals regulation on Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH). The authors compiled data for eight physicochemical descriptors (partition coefficients, degradation half-lives, polarity, and so forth) for a set of 713 organic chemicals for which experimental values of the bioconcentration factor (BCF) are available. The authors employed supervised machine learning methods (conditional inference trees and random forests) to derive relationships between the physicochemical descriptors and the BCF values. In a first tier, the authors established rules for classifying a chemical as bioaccumulative (B) or nonbioaccumulative (non-B). In a second tier, the authors developed a new tool for estimating numerical BCF values. For both cases the optimal set of relevant descriptors was determined; these are biotransformation half-life and octanol-water distribution coefficient (log D) for the classification rules and log D, biotransformation half-life, and topological polar surface area for the BCF estimation tool. The uncertainty of the BCF estimates obtained with the new estimation tool was quantified by comparing the estimated and experimental BCF values of the 713 chemicals. Comparison with existing BCF estimation methods indicates that the performance of this new BCF estimation tool is at least as high as that of existing methods. The authors recommend the present study's classification rules and BCF estimation tool for a consensus application in combination with existing BCF estimation methods.

AB - The present study presents a data-oriented, tiered approach to assessing the bioaccumulation potential of chemicals according to the European chemicals regulation on Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH). The authors compiled data for eight physicochemical descriptors (partition coefficients, degradation half-lives, polarity, and so forth) for a set of 713 organic chemicals for which experimental values of the bioconcentration factor (BCF) are available. The authors employed supervised machine learning methods (conditional inference trees and random forests) to derive relationships between the physicochemical descriptors and the BCF values. In a first tier, the authors established rules for classifying a chemical as bioaccumulative (B) or nonbioaccumulative (non-B). In a second tier, the authors developed a new tool for estimating numerical BCF values. For both cases the optimal set of relevant descriptors was determined; these are biotransformation half-life and octanol-water distribution coefficient (log D) for the classification rules and log D, biotransformation half-life, and topological polar surface area for the BCF estimation tool. The uncertainty of the BCF estimates obtained with the new estimation tool was quantified by comparing the estimated and experimental BCF values of the 713 chemicals. Comparison with existing BCF estimation methods indicates that the performance of this new BCF estimation tool is at least as high as that of existing methods. The authors recommend the present study's classification rules and BCF estimation tool for a consensus application in combination with existing BCF estimation methods.

KW - Chemistry

KW - Biotransformation

KW - Environmental Monitoring

KW - Environmental Pollutants

KW - Environmental Pollution

KW - Half-Life

KW - Organic Chemicals

KW - Trees

UR - http://www.scopus.com/inward/record.url?scp=84876417882&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/8a2d8b7d-fd00-39e5-b703-9cb2adca3cd6/

U2 - 10.1002/etc.2150

DO - 10.1002/etc.2150

M3 - Journal articles

C2 - 23382013

VL - 32

SP - 1187

EP - 1195

JO - Environmental toxicology and chemistry / SETAC

JF - Environmental toxicology and chemistry / SETAC

SN - 1552-8618

IS - 5

ER -

DOI

Recently viewed

Publications

  1. Modelling biodegradability based on OECD 301D data for the design of mineralising ionic liquids
  2. WHICH ESTIMATION SITUATIONS ARE RELEVANT FOR A VALID ASSESSMENT OF MEASUREMENT ESTIMATION SKILLS
  3. Quantum Computing and the Analog/Digital Distinction
  4. A Comparative Study for Fisheye Image Classification
  5. Creating regional (e-)learning networks
  6. Pressure fault recognition and compensation with an adaptive feedforward regulator in a controlled hybrid actuator within engine applications
  7. An analytical approach to evaluating monotonic functions of fuzzy numbers
  8. Masked Autoencoder Pretraining for Event Classification in Elite Soccer
  9. Grounds different from, though equally solid with
  10. The identification of up-And downstream industries using input-output tables and a firm-level application to minority shareholdings
  11. Earnings Less Risk-Free Interest Charge (ERIC) and Stock Returns—A Value-Based Management Perspective on ERIC’s Relative and Incremental Information Content
  12. Understanding the error-structure of Time-driven Activity-based Costing
  13. Intraindividual variability in identity centrality
  14. Optimization of 3D laser scanning speed by use of combined variable step
  15. Some results on output algebraic feedback with applications to mechanical systems
  16. Wavelet functions for rejecting spurious values
  17. German Utilities and distributed PV
  18. Markups and Concentration in the Context of Digitization
  19. Covert and overt automatic imitation are correlated
  20. An observer for sensorless variable valve control in camless internal combustion engines
  21. Using conditional inference trees and random forests to predict the bioaccumulation potential of organic chemicals
  22. Supporting Visual and Verbal Learning Preferences in a Second-Language Multimedia Learning Environment
  23. Writing as a Deeper Form of Concentration
  24. (Re)productivity