Development of Machine Learning-Based QSAR Model for Virtual Screening of Dipeptidyl Peptidase-4 Inhibitors Sudarko,1, a) Salsabila Safa A B S,1, b) Zulfikar,1, c) Anak Agung Istri Ratnadewi 1, d), Wuryanti Handayani 1, e)
1. Departement of Chemistry, Faculty of Mathemathics and Natural Sciences, Universitas Jember, Jl. Kalimantan 37, Jember, 68121, Indonesia
a. Corresponding author: darko[at]unej.ac.id
b. Electronic mail: salsabilasafaabs[at]gmail.com
c. Electronic email: zulfikar[at]unej.ac.id
d. Electronic email: istri_dewi.fmipa[at]unej.ac.id
e. Electronic email: wuriyanti.fmipa[at]unej.ac.id
Abstract
Abstract
Treatment of type 2 diabetes mellitus is mostly done by inhibiting the DPP-4 protein using an inhibitor compound, however it may cause headaches and indigestion as its side effect. This study has been focused on development of the DPP-4 inhibitor as new drug candidates for type 2 diabetes mellitus using the Machine Learning-based Quantitative Structure-Activity Relationship (QSAR) for the virtual screening process. Training dataset has been obtained from the ChEMBL database with DPP-4 as target protein (code ChEMBL284), and it is used to find a model which then applied for virtual screening process of 884 million molecules obtained from the ZINC database. The screening processes based on the predicted activity values above the experimental activity values of the drugs that were already available and it is then screened again according to Lipinski Rule of 5 to find out the compounds that can be absorbed by the body. The compounds that can be absorbed by the body was docked using AutoDockVina software to determine the free energy value and interaction pattern between the compound and protein target to get recommendations for a new DPP-4 inhibitor candidate. Result obtained from best model with R2 test value of 0.69 is then used for virtual screening. The results of the virtual screening were 5 compounds that had the highest pIC50 values and not violating Lipinski^s RO5. These compounds had codes ZINC341837061, ZINC001359979988, ZINC001707862778, ZINC001722886251 and ZINC001726358542.