Advanced Double Layered Multi-Agent Systems Based on A3C in Real-Time Path Planning
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Dajeong | - |
| dc.contributor.author | Kim, Junoh | - |
| dc.contributor.author | Cho, Kyungeun | - |
| dc.contributor.author | Sung, Yunsick | - |
| dc.date.accessioned | 2023-04-27T15:40:32Z | - |
| dc.date.available | 2023-04-27T15:40:32Z | - |
| dc.date.issued | 2021-11 | - |
| dc.identifier.issn | 2079-9292 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/4256 | - |
| dc.description.abstract | In this paper, we propose an advanced double layered multi-agent system to reduce learning time, expressing a state space using a 2D grid. This system is based on asynchronous advantage actor-critic systems (A3C) and reduces the state space that agents need to consider by hierarchically expressing a 2D grid space and determining actions. Specifically, the state space is expressed in the upper and lower layers. Based on the learning results using A3C in the lower layer, the upper layer makes decisions without additional learning, and accordingly, the total learning time can be reduced. Our method was verified experimentally using a virtual autonomous surface vehicle simulator. It reduced the learning time required to reach a 90% goal achievement rate by 7.1% compared to the conventional double layered A3C. In addition, the goal achievement by the proposed method was 18.86% higher than that of the traditional double layered A3C over 20,000 learning episodes. | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | MDPI | - |
| dc.title | Advanced Double Layered Multi-Agent Systems Based on A3C in Real-Time Path Planning | - |
| dc.type | Article | - |
| dc.publisher.location | Switzerland | - |
| dc.identifier.doi | 10.3390/electronics10222762 | - |
| dc.identifier.scopusid | 2-s2.0-85118895771 | - |
| dc.identifier.wosid | 000724749900001 | - |
| dc.identifier.bibliographicCitation | ELECTRONICS, v.10, no.22 | - |
| dc.citation.title | ELECTRONICS | - |
| dc.citation.volume | 10 | - |
| dc.citation.number | 22 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Physics | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Physics, Applied | - |
| dc.subject.keywordAuthor | asynchronous advantage actor-critic | - |
| dc.subject.keywordAuthor | multi-agent system | - |
| dc.subject.keywordAuthor | simulation framework | - |
