Detailed Information

Cited 20 time in webofscience Cited 23 time in scopus
Metadata Downloads

Partially collapsed Gibbs sampling for latent Dirichlet allocation

Full metadata record
DC Field Value Language
dc.contributor.authorPark, Hongju-
dc.contributor.authorPark, Taeyoung-
dc.contributor.authorLee, Yung-Seop-
dc.date.accessioned2023-04-28T02:40:40Z-
dc.date.available2023-04-28T02:40:40Z-
dc.date.issued2019-10-01-
dc.identifier.issn0957-4174-
dc.identifier.issn1873-6793-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/7534-
dc.description.abstractA latent Dirichlet allocation (LDA) model is a machine learning technique to identify latent topics from text corpora within a Bayesian hierarchical framework. Current popular inferential methods to fit the LDA model are based on variational Bayesian inference, collapsed Gibbs sampling, or a combination of these. Because these methods assume a unimodal distribution over topics, however, they can suffer from large bias when text corpora consist of various clusters with different topic distributions. This paper proposes an inferential LDA method to efficiently obtain unbiased estimates under flexible modeling for heterogeneous text corpora with the method of partial collapse and the Dirichlet process mixtures. The method is illustrated using a simulation study and an application to a corpus of 1300 documents from neural information processing systems (NIPS) conference articles during the period of 2000-2002 and British Broadcasting Corporation (BBC) news articles during the period of 2004-2005. (C) 2019 Elsevier Ltd. All rights reserved.-
dc.format.extent11-
dc.language영어-
dc.language.isoENG-
dc.publisherPERGAMON-ELSEVIER SCIENCE LTD-
dc.titlePartially collapsed Gibbs sampling for latent Dirichlet allocation-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.1016/j.eswa.2019.04.028-
dc.identifier.scopusid2-s2.0-85064893718-
dc.identifier.wosid000470951200015-
dc.identifier.bibliographicCitationEXPERT SYSTEMS WITH APPLICATIONS, v.131, pp 208 - 218-
dc.citation.titleEXPERT SYSTEMS WITH APPLICATIONS-
dc.citation.volume131-
dc.citation.startPage208-
dc.citation.endPage218-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaOperations Research & Management Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryOperations Research & Management Science-
dc.subject.keywordPlusMODELS-
dc.subject.keywordPlusSAMPLERS-
dc.subject.keywordAuthorBayesian analysis-
dc.subject.keywordAuthorLatent Dirichlet allocation-
dc.subject.keywordAuthorDirichlet process mixture-
dc.subject.keywordAuthorPartial collapse-
dc.subject.keywordAuthorMachine learning-
dc.subject.keywordAuthorNatural language processing-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Natural Science > Department of Statistics > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Yung Seop photo

Lee, Yung Seop
College of Natural Science (Department of Statistics)
Read more

Altmetrics

Total Views & Downloads

BROWSE