Detailed Information

Cited 12 time in webofscience Cited 15 time in scopus
Metadata Downloads

NPOmix: A machine learning classifier to connect mass spectrometry fragmentation data to biosynthetic gene clustersopen access

Authors
Tiago F. LeãoWang, Mingxunda Silva, RicardoGurevich, AlexeyBauermeister, AnelizeGomes, Paulo Wender P.Brejnrod, AskerGlukhov, EvgeniaAron, Allegra T.Louwen, Joris J. R.Kim, Hyun WooReher, RaphaelFiore, Marli F.van der Hooft, Justin J. J.Gerwick, LenaGerwick, William H.Bandeira, NunoDorrestein, Pieter C.
Issue Date
Nov-2022
Publisher
Oxford University Press
Keywords
genomics; mass spectrometry; machine learning; specialized metabolites; biosynthetic gene clusters
Citation
PNAS Nexus, v.1, no.5, pp 1 - 15
Pages
15
Indexed
SCOPUS
ESCI
Journal Title
PNAS Nexus
Volume
1
Number
5
Start Page
1
End Page
15
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/21806
DOI
10.1093/pnasnexus/pgac257
ISSN
2752-6542
2752-6542
Abstract
Microbial specialized metabolites are an important source of and inspiration for many pharmaceuticals, biotechnological products and play key roles in ecological processes. Untargeted metabolomics using liquid chromatography coupled with tandem mass spectrometry is an efficient technique to access metabolites from fractions and even environmental crude extracts. Nevertheless, metabolomics is limited in predicting structures or bioactivities for cryptic metabolites. Efficiently linking the biosynthetic potential inferred from (meta)genomics to the specialized metabolome would accelerate drug discovery programs by allowing metabolomics to make use of genetic predictions. Here, we present a k-nearest neighbor classifier to systematically connect mass spectrometry fragmentation spectra to their corresponding biosynthetic gene clusters (independent of their chemical class). Our new pattern-based genome mining pipeline links biosynthetic genes to metabolites that they encode for, as detected via mass spectrometry from bacterial cultures or environmental microbiomes. Using paired datasets that include validated genes-mass spectral links from the Paired Omics Data Platform, we demonstrate this approach by automatically linking 18 previously known mass spectra (17 for which the biosynthesis gene clusters can be found at the MIBiG database plus palmyramide A) to their corresponding previously experimentally validated biosynthetic genes (e.g., via nuclear magnetic resonance or genetic engineering). We illustrated a computational example of how to use our Natural Products Mixed Omics (NPOmix) tool for siderophore mining that can be reproduced by the users. We conclude that NPOmix minimizes the need for culturing (it worked well on microbiomes) and facilitates specialized metabolite prioritization based on integrative omics mining.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Pharmacy > Department of Pharmacy > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Hyun Woo photo

Kim, Hyun Woo
College of Pharmacy (Department of Pharmacy)
Read more

Altmetrics

Total Views & Downloads

BROWSE