Predicting potent compounds via model-based global optimization

J Chem Inf Model. 2013 Mar 25;53(3):553-9. doi: 10.1021/ci3004682. Epub 2013 Feb 14.

Abstract

Finding potent compounds for a given target in silico can be viewed as a constrained global optimization problem. It involves an objective function whose evaluations may be costly, so the major task is to maximize the function while minimizing the number of evaluation steps. To solve this problem, we propose a machine learning algorithm that first builds a statistical QSAR model of the SAR landscape and then uses the model to identify regions in compound space with a high probability of containing a highly potent compound. For this purpose, we devise the so-called expected potency improvement (EI) criterion, which ranks candidate compounds by their likelihood of exhibiting higher potency than the most active compound in the training data. The approach therefore differs significantly from a classical QSAR model, which is purely prediction-oriented. The method is superior to a nearest-neighbor approach: significantly fewer evaluation steps are needed to identify the most potent compound for the given target.
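
The abstract does not give the formula behind the EI criterion. As a rough illustration only, the sketch below uses the standard expected-improvement expression from model-based global optimization. It assumes the QSAR model returns a Gaussian predictive distribution (mean and standard deviation) for each candidate's potency. The function names, candidate names, and potency values are hypothetical and are not taken from the paper.

```python
import math


def normal_pdf(z):
    """Density of the standard normal distribution at z."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Cumulative distribution function of the standard normal at z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def expected_improvement(mean, std, best_observed):
    """Expected improvement over best_observed for a maximization problem.

    Assumption: the predicted potency is Gaussian with the given mean and
    standard deviation. This is the textbook formula, not necessarily the
    exact criterion used in the paper.
    """
    if std <= 0.0:
        # No predictive uncertainty: the improvement is deterministic.
        return max(mean - best_observed, 0.0)
    z = (mean - best_observed) / std
    return (mean - best_observed) * normal_cdf(z) + std * normal_pdf(z)


def rank_candidates(predictions, best_observed):
    """Return candidate names sorted by descending expected improvement.

    predictions maps a candidate name to a (mean, std) tuple.
    """
    return sorted(
        predictions,
        key=lambda name: expected_improvement(*predictions[name], best_observed),
        reverse=True,
    )


# Hypothetical predicted potencies (for example pIC50) as (mean, std).
predictions = {
    "cand_a": (6.9, 0.1),   # slightly below the best, low uncertainty
    "cand_b": (6.5, 1.5),   # lower mean, high uncertainty
    "cand_c": (7.2, 0.05),  # slightly above the best, low uncertainty
}
best_observed = 7.0  # most potent compound in the (hypothetical) training data

ranking = rank_candidates(predictions, best_observed)
print(ranking)
```

In this toy example the highly uncertain candidate ranks first even though its predicted mean is the lowest, which illustrates how such a criterion differs from ranking by predicted potency alone.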

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Computer Simulation
  • Databases, Chemical
  • Drug Discovery / methods*
  • High-Throughput Screening Assays / methods*
  • Humans
  • Models, Chemical*
  • Normal Distribution
  • Quantitative Structure-Activity Relationship