EdgeV-SE: Self-Reflective Fine-Tuning Framework for Edge-Deployable Vision-Language Models

Jeon, Yoonmo; Lee, Seunghun; Kim, Woongsup

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

EdgeV-SE: Self-Reflective Fine-Tuning Framework for Edge-Deployable Vision-Language Models

Full metadata record

DC Field	Value	Language
dc.contributor.author	Jeon, Yoonmo	-
dc.contributor.author	Lee, Seunghun	-
dc.contributor.author	Kim, Woongsup	-
dc.date.accessioned	2026-02-02T05:30:22Z	-
dc.date.available	2026-02-02T05:30:22Z	-
dc.date.issued	2026-01	-
dc.identifier.issn	2076-3417	-
dc.identifier.issn	2076-3417	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/63570	-
dc.description.abstract	Featured Application The proposed framework enables the deployment of robust Vision-Language Models on resource-constrained off-the-shelf edge devices, such as the NVIDIA Jetson series. Its primary application is real-time disaster damage assessment using satellite imagery in communication-denied environments, facilitating immediate decision-making for first responders.Abstract The deployment of Vision-Language Models (VLMs) in Satellite IoT scenarios is critical for real-time disaster assessment but is often hindered by the substantial memory and compute requirements of state-of-the-art models. While parameter-efficient fine-tuning (PEFT) enables adaptation, with minimal computational overhead, standard supervised methods often fail to ensure robustness and reliability on resource-constrained edge devices. To address this, we propose EdgeV-SE, a self-reflective fine-tuning framework that significantly enhances the performance of VLM without introducing any inference-time overhead. Our framework incorporates an uncertainty-aware self-reflection mechanism with asymmetric dual pathways: a generative linguistic pathway and an auxiliary discriminative visual pathway. By estimating uncertainty from the linguistic pathway using a log-likelihood margin between class verbalizers, EdgeV-SE identifies ambiguous samples and refines its decision boundaries via consistency regularization and cross-pathway mutual learning. Experimental results on hurricane damage assessment demonstrate that our approach improves image classification accuracy, enhances image-text semantic alignment, and achieves superior caption quality. Notably, our work achieves these gains while maintaining practical deployment on a commercial off-the-shelf edge device such as NVIDIA Jetson Orin Nano, preserving the inference latency and memory footprint. Overall, our work contributes a unified self-reflective fine-tuning framework that improves robustness, calibration, and deployability of VLMs on edge devices.	-
dc.format.extent	31	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	MDPI	-
dc.title	EdgeV-SE: Self-Reflective Fine-Tuning Framework for Edge-Deployable Vision-Language Models	-
dc.type	Article	-
dc.publisher.location	스위스	-
dc.identifier.doi	10.3390/app16020818	-
dc.identifier.scopusid	2-s2.0-105028701707	-
dc.identifier.wosid	001670111700001	-
dc.identifier.bibliographicCitation	Applied Sciences, v.16, no.2, pp 1 - 31	-
dc.citation.title	Applied Sciences	-
dc.citation.volume	16	-
dc.citation.number	2	-
dc.citation.startPage	1	-
dc.citation.endPage	31	-
dc.type.docType	Article	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Chemistry	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Materials Science	-
dc.relation.journalResearchArea	Physics	-
dc.relation.journalWebOfScienceCategory	Chemistry, Multidisciplinary	-
dc.relation.journalWebOfScienceCategory	Engineering, Multidisciplinary	-
dc.relation.journalWebOfScienceCategory	Materials Science, Multidisciplinary	-
dc.relation.journalWebOfScienceCategory	Physics, Applied	-
dc.subject.keywordAuthor	Vision-Language Model (VLM)	-
dc.subject.keywordAuthor	edge computing	-
dc.subject.keywordAuthor	self-reflective learning	-
dc.subject.keywordAuthor	consistency regularization	-
dc.subject.keywordAuthor	mutual learning	-
dc.subject.keywordAuthor	satellite IoT	-
dc.subject.keywordAuthor	NVIDIA Jetson	-
dc.subject.keywordAuthor	disaster analysis	-

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Engineering > Department of Information and Communication Engineering > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kim, Woong Sup photo

Kim, Woong Sup: College of Engineering (Department of Information and Communication Engineering)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE