DGU-HAU: A Dataset for 3D Human Action Analysis on Utterancesopen access
- Authors
- Park, Jiho; Park, Kwangryeol; Kim, Dongho
- Issue Date
- Dec-2023
- Publisher
- MDPI
- Keywords
- 3D human action analysis; human activity understanding; motion capture; multi-modal dataset; utterance dataset
- Citation
- Electronics, v.12, no.23, pp 1 - 15
- Pages
- 15
- Indexed
- SCIE
SCOPUS
- Journal Title
- Electronics
- Volume
- 12
- Number
- 23
- Start Page
- 1
- End Page
- 15
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/25729
- DOI
- 10.3390/electronics12234793
- ISSN
- 2079-9292
2079-9292
- Abstract
- Constructing diverse and complex multi-modal datasets is crucial for advancing human action analysis research, providing ground truth annotations for training deep learning networks, and enabling the development of robust models across real-world scenarios. Generating natural and contextually appropriate nonverbal gestures is essential for enhancing immersive and effective human-computer interactions in various applications. These applications include video games, embodied virtual assistants, and conversations within a metaverse. However, existing speech-related human datasets are focused on style transfer, so they have limitations that make them unsuitable for 3D human action analysis studies, such as human action recognition and generation. Therefore, we introduce a novel multi-modal dataset, DGU-HAU, a dataset for 3D human action on utterances that commonly occurs during daily life. We validate the dataset using a human action generation model, Action2Motion (A2M), a state-of-the-art 3D human action generation model.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.