Cited 5 time in
Exploring chemical space for lead identification by propagating on chemical similarity network
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Yi, Jungseob | - |
| dc.contributor.author | Lee, Sangseon | - |
| dc.contributor.author | Lim, Sangsoo | - |
| dc.contributor.author | Cho, Changyun | - |
| dc.contributor.author | Piao, Yinhua | - |
| dc.contributor.author | Yeo, Marie | - |
| dc.contributor.author | Kim, Dongkyu | - |
| dc.contributor.author | Kim, Sun | - |
| dc.contributor.author | Lee, Sunho | - |
| dc.date.accessioned | 2024-08-08T10:00:39Z | - |
| dc.date.available | 2024-08-08T10:00:39Z | - |
| dc.date.issued | 2023-01 | - |
| dc.identifier.issn | 2001-0370 | - |
| dc.identifier.issn | 2001-0370 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/21057 | - |
| dc.description.abstract | Motivation: Lead identification is a fundamental step to prioritize candidate compounds for downstream drug discovery process. Machine learning (ML) and deep learning (DL) approaches are widely used to identify lead compounds using both chemical property and experimental information. However, ML or DL methods rarely consider compound similarity information directly since ML and DL models use abstract representation of molecules for model construction. Alternatively, data mining approaches are also used to explore chemical space with drug candidates by screening undesirable compounds. A major challenge for data mining approaches is to develop efficient data mining methods that search large chemical space for desirable lead compounds with low false positive rate. Results: In this work, we developed a network propagation (NP) based data mining method for lead identification that performs search on an ensemble of chemical similarity networks. We compiled 14 fingerprint-based similarity networks. Given a target protein of interest, we use a deep learning-based drug target interaction model to narrow down compound candidates and then we use network propagation to prioritize drug candidates that are highly correlated with drug activity score such as IC50. In an extensive experiment with BindingDB, we showed that our approach successfully discovered intentionally unlabeled compounds for given targets. To further demonstrate the prediction power of our approach, we identified 24 candidate leads for CLK1. Two out of five synthesizable candidates were experimentally validated in binding assays. In conclusion, our framework can be very useful for lead identification from very large compound databases such as ZINC. © 2023 The Author(s) | - |
| dc.format.extent | 9 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Elsevier B.V. | - |
| dc.title | Exploring chemical space for lead identification by propagating on chemical similarity network | - |
| dc.type | Article | - |
| dc.publisher.location | 네델란드 | - |
| dc.identifier.doi | 10.1016/j.csbj.2023.08.016 | - |
| dc.identifier.scopusid | 2-s2.0-85172202691 | - |
| dc.identifier.wosid | 001075290300001 | - |
| dc.identifier.bibliographicCitation | Computational and Structural Biotechnology Journal, v.21, pp 4187 - 4195 | - |
| dc.citation.title | Computational and Structural Biotechnology Journal | - |
| dc.citation.volume | 21 | - |
| dc.citation.startPage | 4187 | - |
| dc.citation.endPage | 4195 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Biochemistry & Molecular Biology | - |
| dc.relation.journalResearchArea | Biotechnology & Applied Microbiology | - |
| dc.relation.journalWebOfScienceCategory | Biochemistry & Molecular Biology | - |
| dc.relation.journalWebOfScienceCategory | Biotechnology & Applied Microbiology | - |
| dc.subject.keywordPlus | DRUG DISCOVERY | - |
| dc.subject.keywordPlus | LEARNING APPROACH | - |
| dc.subject.keywordPlus | DATABASE | - |
| dc.subject.keywordPlus | PREDICTION | - |
| dc.subject.keywordPlus | MOLECULES | - |
| dc.subject.keywordPlus | MODEL | - |
| dc.subject.keywordAuthor | Chemical network construction | - |
| dc.subject.keywordAuthor | Data mining | - |
| dc.subject.keywordAuthor | Lead identification | - |
| dc.subject.keywordAuthor | Network propagation | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
