Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Enhanced reinforcement learning by recursive updating of Q-values for reward propagation

Authors
Sung, Y.Ahn, E.Cho, K.
Issue Date
2013
Keywords
Propagation; Q-learning; Q-value; Terminal reward
Citation
Lecture Notes in Electrical Engineering, v.215 LNEE, pp 1003 - 1008
Pages
6
Indexed
SCOPUS
Journal Title
Lecture Notes in Electrical Engineering
Volume
215 LNEE
Start Page
1003
End Page
1008
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/17654
DOI
10.1007/978-94-007-5860-5_121
ISSN
1876-1100
1876-1119
Abstract
In this paper, we propose a method to reduce the learning time of Q-learning by combining the method of updating even to Q-values of unexecuted actions with the method of adding a terminal reward to unvisited Q-values. To verify the method, its performance was compared to that of conventional Q-learning. The proposed approach showed the same performance as conventional Q-learning, with only 27 % of the learning episodes required for conventional Q-learning. Accordingly, we verified that the proposed method reduced learning time by updating more Q-values in the early stage of learning and distributing a terminal reward to more Q-values. © 2013 Springer Science+Business Media.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Sung, Yunsick photo

Sung, Yunsick
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE