Loading…
Predicting the Genotoxicity of Polycyclic Aromatic Compounds from Molecular Structure with Different Classifiers
Classification models were developed to provide accurate prediction of genotoxicity of 277 polycyclic aromatic compounds (PACs) directly from their molecular structures. Numerical descriptors encoding the topological, geometric, electronic, and polar surface area properties of the compounds were cal...
Saved in:
Published in: | Chemical research in toxicology 2003-12, Vol.16 (12), p.1567-1580 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Classification models were developed to provide accurate prediction of genotoxicity of 277 polycyclic aromatic compounds (PACs) directly from their molecular structures. Numerical descriptors encoding the topological, geometric, electronic, and polar surface area properties of the compounds were calculated to represent the structural information. Each compound's genotoxicity was represented with IMAX (maximal SOS induction factor) values measured by the SOS Chromotest in the presence and absence of S9 rat liver homogenate. The compounds' class identity was determined by a cutoff IMAX value of 1.25compounds with IMAX > 1.25 in either test were classified as genotoxic, and the ones with IMAX ≤ 1.25 were nongenotoxic. Several binary classification models were generated to predict genotoxicity: k-nearest neighbor (k-NN), linear discriminant analysis, and probabilistic neural network. The study showed k-NN to provide the highest predictive ability among the three classifiers with a training set classification rate of 93.5%. A consensus model was also developed that incorporated the three classifiers and correctly predicted 81.2% of the 277 compounds. It also provided a higher prediction rate on the genotoxic class than any other single model. |
---|---|
ISSN: | 0893-228X 1520-5010 |
DOI: | 10.1021/tx030032a |