Cited 0 time in
Generating, retrieving persona and generating responses for long-term open-domain dialogue
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Cha, Dohyun | - |
| dc.contributor.author | Lee, Dawon | - |
| dc.contributor.author | Kim, Jihie | - |
| dc.date.accessioned | 2025-07-28T06:01:18Z | - |
| dc.date.available | 2025-07-28T06:01:18Z | - |
| dc.date.issued | 2025-07 | - |
| dc.identifier.issn | 2376-5992 | - |
| dc.identifier.issn | 2376-5992 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/58784 | - |
| dc.description.abstract | Open-domain dialogue systems have shown remarkable capabilities in generating natural and consistent responses in short-term conversations. However, in long-term conversations such as multi-session chat (MSC), where the dialogue history exceeds the model's maximum input length (i.e., 1024 tokens), existing dialogue generation systems often overlook the information from earlier dialogues, leading to the loss of context. To prevent such loss and generate natural, consistent responses, we propose a GRGPerDialogue framework, consisting of three main stages: generating persona from past dialogues, retrieving persona relevant to the current utterance, and generating responses based on both persona and recent dialogues. In the first stage, we generate the persona of each speaker in real-time with diverse expressions, leveraging Llama 2 In-Context Learning (ICL). Subsequently, we propose a new dataset called Persona-Utterance Pair (PUP) and use it to train Facebook dense passage retrieval (DPR) model for retrieving persona sentences relevant to the current utterance. Finally, we train generative models such as Generative Pre-trained Transformer 2 (GPT-2) and Bidirectional and Auto-Regressive Transformers (BART) to generate responses based on retrieved persona sentences and the recent dialogues. Experimental results on a long-term dialogue dataset demonstrate that the GRGPerDialogue framework outperforms baseline models by approximately 0.6% to 1% in terms of the Rouge-1 metric. Furthermore, human evaluation results supported the effectiveness of GRGPerDialogue. These results indicate that GRGPerDialogue can generate responses that are not only more fluent and consistent, but also more relevant to the dialogue history than baseline models. | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | PEERJ INC | - |
| dc.title | Generating, retrieving persona and generating responses for long-term open-domain dialogue | - |
| dc.type | Article | - |
| dc.publisher.location | 영국 | - |
| dc.identifier.doi | 10.7717/peerj-cs.2979 | - |
| dc.identifier.scopusid | 2-s2.0-105025471737 | - |
| dc.identifier.wosid | 001531885400001 | - |
| dc.identifier.bibliographicCitation | PeerJ Computer Science, v.11 | - |
| dc.citation.title | PeerJ Computer Science | - |
| dc.citation.volume | 11 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.subject.keywordAuthor | Natural language processing | - |
| dc.subject.keywordAuthor | Open-domain dialogue | - |
| dc.subject.keywordAuthor | Dialogue generation systems | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
