Detailed Information

Cited 3 time in webofscience Cited 3 time in scopus
Metadata Downloads

RFfiller: a robust and fast statistical algorithm for gap filling in draft genomes

Full metadata record
DC Field Value Language
dc.contributor.authorMidekso, Firaol Dida-
dc.contributor.authorYi, Gangman-
dc.date.accessioned2023-04-27T09:40:24Z-
dc.date.available2023-04-27T09:40:24Z-
dc.date.issued2022-10-
dc.identifier.issn2167-8359-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/2431-
dc.description.abstractNumerous published genomes contain gaps or unknown sequences. Gap filling is a critical final step in de novo genome assembly, particularly for large genomes. While certain computational approaches partially address the problem, others have shortcomings regarding the draft genome’s dependability and correctness (high rates of mis-assembly at gap-closing sites and high error rates). While it is well established that genomic repeats result in gaps, many sequence reads originating from repeat-related gaps are typically missed by existing approaches. A fast and reliable statistical algorithm for closing gaps in a draft genome is presented in this paper. It utilizes the alignment statistics between scaffolds, contigs, and paired-end reads to generate a Markov chain that appropriately assigns contigs or long reads to scaffold gap regions (only corrects candidate regions), resulting in accurate and efficient gap closure. To reconstruct the missing component between the two ends of the same insert, the RFfiller meticulously searches for valid overlaps (in repeat regions) and generates transition tables for similar reads, allowing it to make a statistical guess at the missing sequence. Finally, in our experiments, we show that the RFfiller’s gap-closing accuracy is better than that of other publicly available tools when sequence data from various organisms are used. Assembly benchmarks were used to validate RFfiller. Our findings show that RFfiller efficiently fills gaps and that it is especially effective when the gap length is longer. We also show that the RFfiller outperforms other gap closing tools currently on the market. Copyright 2022 Midekso and Yi.-
dc.format.extent22-
dc.language영어-
dc.language.isoENG-
dc.publisherPeerJ-
dc.titleRFfiller: a robust and fast statistical algorithm for gap filling in draft genomes-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.7717/peerj.14186-
dc.identifier.scopusid2-s2.0-85140327220-
dc.identifier.wosid000891524300002-
dc.identifier.bibliographicCitationPeerJ, v.10, pp 1 - 22-
dc.citation.titlePeerJ-
dc.citation.volume10-
dc.citation.startPage1-
dc.citation.endPage22-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaScience & Technology - Other Topics-
dc.relation.journalWebOfScienceCategoryMultidisciplinary Sciences-
dc.subject.keywordPlusSEQUENCE-ANALYSIS-
dc.subject.keywordPlusSINGLE-CELL-
dc.subject.keywordPlusASSEMBLER-
dc.subject.keywordAuthorDe novo assembly-
dc.subject.keywordAuthorDNA sequencing-
dc.subject.keywordAuthorKeywords Gap filling-
dc.subject.keywordAuthorRead extension-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Yi, Gang Man photo

Yi, Gang Man
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE