Prediction-based GPU sharing for distributed trainingopen access
- Authors
- Shin, Changyong; Go, Younghun; Yoo, Yeonho; Jeong, Jinwoo; Hwang, Jaehyun; Yang, Gyeongsik; Yoo, Chuck
- Issue Date
- Aug-2026
- Publisher
- ELSEVIER
- Keywords
- Cloud computing; GPU Sharing; Service level agreement; Performance prediction; GPU Scheduling
- Citation
- Future Generation Computer Systems, v.181, pp 1 - 14
- Pages
- 14
- Indexed
- SCIE
SCOPUS
- Journal Title
- Future Generation Computer Systems
- Volume
- 181
- Start Page
- 1
- End Page
- 14
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/63845
- DOI
- 10.1016/j.future.2026.108413
- ISSN
- 0167-739X
1872-7115
- Abstract
- GPU sharing aims to enhance the efficiency of GPU utilization by running distributed deep learning training jobs concurrently. However, GPU sharing poses a significant challenge: the increase in job completion time (JCT) caused by interference between jobs is inconsistent, complicating job scheduling. Our experiments reveal that the degree of JCT increase varies by as much as-3.7x. While previous studies have analyzed this JCT inconsistency problem, none of them have been able to minimize the inconsistency. We propose TensorShare, a proactive GPU sharing technique that leverages a deep learning model to predict the extent of JCT increase. This study defines a new metric, called GPU SLA, which represents the upper threshold of JCT increase. TensorShare then introduces a novel scheduler that proactively identifies which jobs meet GPU SLA while minimizing the JCT increase. Our evaluation shows that TensorShare improves GPU SLA satisfaction rates by 26.1x-47.3x and reduces the JCT increase by 37%-60%. Furthermore, we evaluate TensorShare with large language models that are not included in training TensorShare's prediction model, achieving-7x and-10.3x improvements in GPU SLA satisfaction and JCT inconsistency, respectively.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.