A self-imitation learning approach for scheduling evaporation and encapsulation stages of OLED display manufacturing systems

Lee, Donghun; Park, In-Beom; Kim, Kwanho

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A self-imitation learning approach for scheduling evaporation and encapsulation stages of OLED display manufacturing systems

Authors: Lee, Donghun; Park, In-Beom; Kim, Kwanho

Issue Date: Jun-2025

Publisher: Elsevier Ltd

Keywords: Deep reinforcement learning; Eligibility return; OLED display manufacturing scheduling; Self-imitation learning; Total tardiness

Citation: Robotics and Computer-Integrated Manufacturing, v.93, pp 1 - 14

Pages: 14

Indexed: SCIE
SCOPUS

Journal Title: Robotics and Computer-Integrated Manufacturing

Volume: 93

Start Page: 1

End Page: 14

URI: https://scholarworks.dongguk.edu/handle/sw.dongguk/57817

DOI: 10.1016/j.rcim.2024.102917

ISSN: 0736-5845
1879-2537

Abstract: In modern organic light-emitting diode (OLED) manufacturing systems, scheduling is a key decision-making problem to improve productivity. In particular, the scheduling of evaporation and encapsulation stages has been confronted with complicated constraints such as job-splitting property, preventive maintenance, machine eligibility, family setups, and heterogeneous release time of jobs. To efficiently solve such complicated scheduling problems, reinforcement learning (RL) has drawn increasing attention as an alternative in recent years. Unfortunately, the performance of the RL-based scheduling methods might not be satisfactory since unexpected correlations between actions are caused by machine eligibility restrictions, making it more challenging to address the credit assignment problem. To minimize the total tardiness, this article proposes a self-imitation learningbased scheduling method in which an agent utilizes past good experiences to exploit efficient exploration. Furthermore, a novel return design is introduced to overcome the credit assignment problem by considering machine eligibility restrictions. To prove the effectiveness and efficiency of the proposed method, numerical experiments are carried out by using the datasets that simulated the real-world OLED display manufacturing systems. Experiment results demonstrate that the proposed method outperforms other baselines, including rulebased and meta-heuristics, as well as the other DRL-based method in terms of the total tardiness while reducing computation time compared to meta-heuristics.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Engineering > Department of Industrial and Systems Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, Kwan Ho photo

Kim, Kwan Ho: College of Engineering (Department of Industrial and Systems Engineering)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE