Enhanced reinforcement learning by recursive updating of Q-values for reward propagation

Sung, Y.; Ahn, E.; Cho, K.

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Enhanced reinforcement learning by recursive updating of Q-values for reward propagation

Authors: Sung, Y.; Ahn, E.; Cho, K.

Issue Date: 2013

Keywords: Propagation; Q-learning; Q-value; Terminal reward

Citation: Lecture Notes in Electrical Engineering, v.215 LNEE, pp 1003 - 1008

Pages: 6

Indexed: SCOPUS

Journal Title: Lecture Notes in Electrical Engineering

Volume: 215 LNEE

Start Page: 1003

End Page: 1008

URI: https://scholarworks.dongguk.edu/handle/sw.dongguk/17654

DOI: 10.1007/978-94-007-5860-5_121

ISSN: 1876-1100
1876-1119

Abstract: In this paper, we propose a method to reduce the learning time of Q-learning by combining the method of updating even to Q-values of unexecuted actions with the method of adding a terminal reward to unvisited Q-values. To verify the method, its performance was compared to that of conventional Q-learning. The proposed approach showed the same performance as conventional Q-learning, with only 27 % of the learning episodes required for conventional Q-learning. Accordingly, we verified that the proposed method reduced learning time by updating more Q-values in the early stage of learning and distributing a terminal reward to more Q-values. © 2013 Springer Science+Business Media.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Sung, Yunsick photo

Sung, Yunsick: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE