DeepPI: Alignment-Free Analysis of Flexible Length Proteins Based on Deep Learning and Image Generator

Authors
Ji, Mingeun; Kan, Yejin; Kim, Dongyeon; Lee, Seungmin; Yi, Gangman
Issue Date
Sep-2024
Publisher
SPRINGER HEIDELBERG
Keywords
Machine learning; Deep learning; Global Average Pooling; Protein function
Citation
Interdisciplinary Sciences: Computational Life Sciences, v.16, no.1, pp 1 - 12
Pages
12
Indexed
SCIE
SCOPUS
Journal Title
Interdisciplinary Sciences: Computational Life Sciences
Volume
16
Number
1
Start Page
1
End Page
12
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/21599
DOI
10.1007/s12539-024-00618-x
ISSN
1913-2751
1867-1462
Abstract
With the rapid development of next-generation sequencing (NGS) technology, the number of known protein sequences has increased exponentially. Computational methods have been introduced in protein functional studies because analyzing large numbers of proteins through biological experiments is costly and time-consuming. In recent years, new approaches based on deep learning have been proposed to overcome the limitations of conventional methods. Although deep learning-based methods effectively utilize features of protein function, they are limited to sequences of fixed length and consider only information from adjacent amino acids. Therefore, new protein analysis tools are required that extract functional features from proteins of flexible length and train models on them. We introduce DeepPI, a deep learning-based tool for analyzing proteins in large-scale databases. The proposed model uses Global Average Pooling, so it can be applied to proteins of flexible length and loses less information than existing algorithms that use fixed sizes. The image generator converts a one-dimensional sequence into a distinct two-dimensional structure, from which common parts of various shapes can be extracted. Finally, filtering techniques automatically detect representative data from the entire database and ensure coverage of large protein databases. We demonstrate that DeepPI has been successfully applied to large databases such as the Pfam-A database. Comparative experiments on four types of image generators illustrate the impact of structure on feature extraction. The filtering performance was verified by varying the parameter values and proved applicable to large databases. Compared to existing methods, DeepPI achieves higher family classification accuracy for protein function inference.
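The abstract's claim that Global Average Pooling allows inputs of flexible length can be illustrated with a minimal sketch. This is not the DeepPI implementation. The function name `global_average_pool`, the helper `constant_map`, and the nested-list feature-map layout are assumptions made for illustration only. The sketch shows that averaging each channel over all spatial positions yields a vector whose length depends on the number of channels, not on the input size.

```python
from typing import List

# Hypothetical layout: a feature map indexed as [row][column][channel].
FeatureMap = List[List[List[float]]]


def global_average_pool(feature_map: FeatureMap) -> List[float]:
    """Average each channel over every spatial position of the map."""
    rows = len(feature_map)
    cols = len(feature_map[0])
    channels = len(feature_map[0][0])
    totals = [0.0] * channels
    for row in feature_map:
        for cell in row:
            for c, value in enumerate(cell):
                totals[c] += value
    # One averaged value per channel, regardless of rows and cols.
    return [total / (rows * cols) for total in totals]


def constant_map(rows: int, cols: int, values: List[float]) -> FeatureMap:
    """Build a toy feature map in which every position holds `values`."""
    return [[list(values) for _ in range(cols)] for _ in range(rows)]


# Two toy feature maps of different spatial sizes, both with 2 channels.
small = constant_map(2, 3, [1.0, 2.0])
large = constant_map(5, 7, [1.0, 2.0])

# Both pool to a vector of length 2, so a fixed-size classifier head
# could consume either one.
print(len(global_average_pool(small)), len(global_average_pool(large)))
```

In a convolutional network, the feature-map size typically grows with the input size, so pooling of this kind is one common way to feed variable-size inputs into a fixed-size output layer without cropping or padding to a fixed length.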
Appears in Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Related Researcher

Yi, Gang Man
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)