GapSense: Similarity Estimation-Based Gap Filler with TGS-Reads for Genome Assemblies
  • Kan, Yejin
  • Kim, Dongyeon
  • Yang, Jinkyung
  • Yi, Gangman
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Advances in next-generation sequencing have led to an explosion in sequencing data, accelerating genome assembly research. However, draft genomes generated after scaffolding still contain unresolved gaps, often caused by repetitive regions and sequencing errors. These gaps may contain biologically meaningful sequences and thus require accurate resolution. However, existing gap-filling tools often exhibit limited reliability, especially when applied to large and complex eukaryotic genomes, due to their insufficient capacity to resolve repetitive regions or their heavy dependence on error-prone long reads. To address this challenge, we present GapSense, a robust gap-filling method that leverages similarity estimation using third-generation sequencing (TGS) reads. By quantifying pairwise similarity among candidate sequences, GapSense prioritizes informative regions and reconstructs gap sequences with higher accuracy. The proposed method introduces a novel similarity scoring mechanism that evaluates the geometric overlap of adjacent subregions to capture local structural variations and reduces noise from low-coverage and error-prone long reads. Experimental results on six representative species and three popular assemblers show that GapSense consistently outperforms existing tools in terms of gap-filling accuracy and contiguity, while maintaining low performance variability across different datasets. These findings demonstrate the effectiveness and generalizability of GapSense for accurate and scalable gap-filling.

키워드

Gap-filling<italic>De novo</italic> assemblyThird-generation sequencingNext-generation sequencingGenome assembly
제목
GapSense: Similarity Estimation-Based Gap Filler with TGS-Reads for Genome Assemblies
저자
Kan, YejinKim, DongyeonYang, JinkyungYi, Gangman
DOI
10.1007/s12539-025-00770-y
발행일
2025-11
유형
Article; Early Access
저널명
Interdisciplinary sciences, computational life sciences