Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

EdgeV-SE: Self-Reflective Fine-Tuning Framework for Edge-Deployable Vision-Language Models

Full metadata record
DC Field Value Language
dc.contributor.authorJeon, Yoonmo-
dc.contributor.authorLee, Seunghun-
dc.contributor.authorKim, Woongsup-
dc.date.accessioned2026-02-02T05:30:22Z-
dc.date.available2026-02-02T05:30:22Z-
dc.date.issued2026-01-
dc.identifier.issn2076-3417-
dc.identifier.issn2076-3417-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/63570-
dc.description.abstractFeatured Application The proposed framework enables the deployment of robust Vision-Language Models on resource-constrained off-the-shelf edge devices, such as the NVIDIA Jetson series. Its primary application is real-time disaster damage assessment using satellite imagery in communication-denied environments, facilitating immediate decision-making for first responders.Abstract The deployment of Vision-Language Models (VLMs) in Satellite IoT scenarios is critical for real-time disaster assessment but is often hindered by the substantial memory and compute requirements of state-of-the-art models. While parameter-efficient fine-tuning (PEFT) enables adaptation, with minimal computational overhead, standard supervised methods often fail to ensure robustness and reliability on resource-constrained edge devices. To address this, we propose EdgeV-SE, a self-reflective fine-tuning framework that significantly enhances the performance of VLM without introducing any inference-time overhead. Our framework incorporates an uncertainty-aware self-reflection mechanism with asymmetric dual pathways: a generative linguistic pathway and an auxiliary discriminative visual pathway. By estimating uncertainty from the linguistic pathway using a log-likelihood margin between class verbalizers, EdgeV-SE identifies ambiguous samples and refines its decision boundaries via consistency regularization and cross-pathway mutual learning. Experimental results on hurricane damage assessment demonstrate that our approach improves image classification accuracy, enhances image-text semantic alignment, and achieves superior caption quality. Notably, our work achieves these gains while maintaining practical deployment on a commercial off-the-shelf edge device such as NVIDIA Jetson Orin Nano, preserving the inference latency and memory footprint. Overall, our work contributes a unified self-reflective fine-tuning framework that improves robustness, calibration, and deployability of VLMs on edge devices.-
dc.format.extent31-
dc.language영어-
dc.language.isoENG-
dc.publisherMDPI-
dc.titleEdgeV-SE: Self-Reflective Fine-Tuning Framework for Edge-Deployable Vision-Language Models-
dc.typeArticle-
dc.publisher.location스위스-
dc.identifier.doi10.3390/app16020818-
dc.identifier.scopusid2-s2.0-105028701707-
dc.identifier.wosid001670111700001-
dc.identifier.bibliographicCitationApplied Sciences, v.16, no.2, pp 1 - 31-
dc.citation.titleApplied Sciences-
dc.citation.volume16-
dc.citation.number2-
dc.citation.startPage1-
dc.citation.endPage31-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaChemistry-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaMaterials Science-
dc.relation.journalResearchAreaPhysics-
dc.relation.journalWebOfScienceCategoryChemistry, Multidisciplinary-
dc.relation.journalWebOfScienceCategoryEngineering, Multidisciplinary-
dc.relation.journalWebOfScienceCategoryMaterials Science, Multidisciplinary-
dc.relation.journalWebOfScienceCategoryPhysics, Applied-
dc.subject.keywordAuthorVision-Language Model (VLM)-
dc.subject.keywordAuthoredge computing-
dc.subject.keywordAuthorself-reflective learning-
dc.subject.keywordAuthorconsistency regularization-
dc.subject.keywordAuthormutual learning-
dc.subject.keywordAuthorsatellite IoT-
dc.subject.keywordAuthorNVIDIA Jetson-
dc.subject.keywordAuthordisaster analysis-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > Department of Information and Communication Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Woong Sup photo

Kim, Woong Sup
College of Engineering (Department of Information and Communication Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE