In:
Molecular Informatics, Wiley, Vol. 35, No. 5 (2016-05), p. 192-198
Abstract:
We present the application of machine learning models to selecting G protein‐coupled receptor (GPCR)‐focused compound libraries. The library design process was realized by ant colony optimization. A proprietary Boehringer‐Ingelheim reference set consisting of 3519 compounds tested in dose‐response assays at 11 GPCR targets served as training data for machine learning and activity prediction. We compared the usability of the proprietary data with a public data set from ChEMBL. Gaussian process models were trained to prioritize compounds from a virtual combinatorial library. We obtained meaningful models for three of the targets (5‐HT2c, MCH, A1), which were experimentally confirmed for 12 of 15 selected and synthesized or purchased compounds. Overall, the models trained on the public data predicted the observed assay results more accurately. The results of this study motivate the use of Gaussian process regression on public data for virtual screening and target‐focused compound library design.
Type of Medium:
Online Resource
ISSN:
1868-1743, 1868-1751
DOI:
10.1002/minf.201501012
Language:
English
Publisher:
Wiley
Publication Date:
2016
ZDB-ID:
2537668-8
SSG:
15,3