Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics

Full metadata record
DC Field Value Language
dc.contributor.authorTse, Tze Ho Elden-
dc.contributor.authorFeng, Runyang-
dc.contributor.authorZheng, Linfang-
dc.contributor.authorPark, Jiho-
dc.contributor.authorGao, Yixing-
dc.contributor.authorKim, Jihie-
dc.contributor.authorLeonardis, Ales-
dc.contributor.authorChang, Hyung Jin-
dc.date.accessioned2025-05-13T05:30:18Z-
dc.date.available2025-05-13T05:30:18Z-
dc.date.issued2025-04-
dc.identifier.issn2159-5399-
dc.identifier.issn2374-3468-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/58328-
dc.description.abstractWith the availability of egocentric 3D hand-object interaction datasets, there is increasing interest in developing unified models for hand-object pose estimation and action recognition. However, existing methods still struggle to recognise seen actions on unseen objects due to the limitations in representing object shape and movement using 3D bounding boxes. Additionally, the reliance on object templates at test time limits their generalisability to unseen objects. To address these challenges, we propose to leverage superquadrics as an alternative 3D object representation to bounding boxes and demonstrate their effectiveness on both template-free object reconstruction and action recognition tasks. Moreover, as we find that pure appearance-based methods can outperform the unified methods, the potential benefits from 3D geometric information remain unclear. Therefore, we study the compositionality of actions by considering a more challenging task where the training combinations of verbs and nouns do not overlap with the testing split. We extend H2O and FPHA datasets with compositional splits and design a novel collaborative learning framework that can explicitly reason about the geometric relations between hands and the manipulated object. Through extensive quantitative and qualitative evaluations, we demonstrate significant improvements over the state-of-the-arts in (compositional) action recognition. Copyright © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.-
dc.format.extent9-
dc.language영어-
dc.language.isoENG-
dc.publisherAssociation for the Advancement of Artificial Intelligence-
dc.titleCollaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1609/aaai.v39i7.32800-
dc.identifier.scopusid2-s2.0-105004065199-
dc.identifier.wosid001478153300083-
dc.identifier.bibliographicCitationProceedings of the AAAI Conference on Artificial Intelligence, v.39, no.7, pp 7437 - 7445-
dc.citation.titleProceedings of the AAAI Conference on Artificial Intelligence-
dc.citation.volume39-
dc.citation.number7-
dc.citation.startPage7437-
dc.citation.endPage7445-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassforeign-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Interdisciplinary Applications-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.subject.keywordAuthorAction Recognition-
dc.subject.keywordAuthorBounding-box-
dc.subject.keywordAuthorCollaborative Learning-
dc.subject.keywordAuthorObject Interactions-
dc.subject.keywordAuthorObject Movements-
dc.subject.keywordAuthorObject Pose-
dc.subject.keywordAuthorObject Reconstruction-
dc.subject.keywordAuthorPose-estimation-
dc.subject.keywordAuthorSuperquadrics-
dc.subject.keywordAuthorUnified Modeling-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Ji Hie photo

Kim, Ji Hie
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE