Detailed Information

Cited 42 times in Web of Science · Cited 64 times in Scopus

Double random forest

Full metadata record
dc.contributor.author: Han, Sunwoo
dc.contributor.author: Kim, Hyunjoong
dc.contributor.author: Lee, Yung-Seop
dc.date.accessioned: 2023-04-27T22:40:38Z
dc.date.available: 2023-04-27T22:40:38Z
dc.date.issued: 2020-08
dc.identifier.issn: 0885-6125
dc.identifier.issn: 1573-0565
dc.identifier.uri: https://scholarworks.dongguk.edu/handle/sw.dongguk/6373
dc.description.abstract: Random forest (RF) is one of the most popular parallel ensemble methods, using decision trees as classifiers. One of the hyper-parameters to choose from for RF fitting is the nodesize, which determines the individual tree size. In this paper, we begin with the observation that for many data sets (34 out of 58), the best RF prediction accuracy is achieved when the trees are grown fully by minimizing the nodesize parameter. This observation leads to the idea that prediction accuracy could be further improved if we find a way to generate even bigger trees than the ones with a minimum nodesize. In other words, the largest tree created with the minimum nodesize parameter may not be sufficiently large for the best performance of RF. To produce bigger trees than those by RF, we propose a new classification ensemble method called double random forest (DRF). The new method uses bootstrap on each node during the tree creation process, instead of just bootstrapping once on the root node as in RF. This method, in turn, provides an ensemble of more diverse trees, allowing for more accurate predictions. Finally, for data where RF does not produce trees of sufficient size, we have successfully demonstrated that DRF provides more accurate predictions than RF.
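The abstract's key mechanism — drawing a fresh bootstrap sample at every node when choosing a split, rather than only once at the root as in RF — can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`grow_drf_tree`, `best_split`), the exhaustive Gini threshold search, and the stopping rule are all simplified stand-ins; only the node-level resampling step reflects the idea described above.

```python
# Minimal sketch of node-level bootstrapping as described for DRF:
# the split at each node is chosen on a bootstrap resample of the
# observations that reached that node, but the full node data is
# then routed down to the children.
import numpy as np

def gini(y):
    """Gini impurity of a label vector."""
    _, counts = np.unique(y, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def best_split(X, y):
    """Exhaustive single-feature threshold search minimizing weighted Gini."""
    best_j, best_t, best_score = None, None, np.inf
    n = len(y)
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j])[:-1]:   # thresholds between observed values
            left = X[:, j] <= t
            score = (left.sum() * gini(y[left]) +
                     (~left).sum() * gini(y[~left])) / n
            if score < best_score:
                best_j, best_t, best_score = j, t, score
    return best_j, best_t

def grow_drf_tree(X, y, rng, min_node=2):
    """Grow one tree, bootstrapping at *every* node (the DRF idea)."""
    if len(y) < min_node or len(np.unique(y)) == 1:
        vals, counts = np.unique(y, return_counts=True)
        return {"leaf": vals[np.argmax(counts)]}
    # Node-level bootstrap: resample this node's data before split search.
    idx = rng.integers(0, len(y), len(y))
    j, t = best_split(X[idx], y[idx])
    if j is None:  # degenerate bootstrap (no valid threshold)
        vals, counts = np.unique(y, return_counts=True)
        return {"leaf": vals[np.argmax(counts)]}
    left = X[:, j] <= t  # route the *full* node data, not the bootstrap
    return {"feat": j, "thr": t,
            "l": grow_drf_tree(X[left], y[left], rng, min_node),
            "r": grow_drf_tree(X[~left], y[~left], rng, min_node)}

def predict_one(tree, x):
    """Follow the tree to a leaf for a single observation."""
    while "leaf" not in tree:
        tree = tree["l"] if x[tree["feat"]] <= tree["thr"] else tree["r"]
    return tree["leaf"]
```

Because each node resamples independently, two trees grown on the same root bootstrap can still diverge at any depth, which is the source of the extra ensemble diversity the abstract mentions.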
dc.format.extent: 18
dc.language: English
dc.language.iso: ENG
dc.publisher: SPRINGER
dc.title: Double random forest
dc.type: Article
dc.publisher.location: Netherlands
dc.identifier.doi: 10.1007/s10994-020-05889-1
dc.identifier.scopusid: 2-s2.0-85087483727
dc.identifier.wosid: 000545057500001
dc.identifier.bibliographicCitation: MACHINE LEARNING, v.109, no.8, pp. 1569-1586
dc.citation.title: MACHINE LEARNING
dc.citation.volume: 109
dc.citation.number: 8
dc.citation.startPage: 1569
dc.citation.endPage: 1586
dc.type.docType: Article
dc.description.isOpenAccess: Y
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Computer Science
dc.relation.journalWebOfScienceCategory: Computer Science, Artificial Intelligence
dc.subject.keywordPlus: CLASSIFICATION TREES
dc.subject.keywordPlus: ALGORITHMS
dc.subject.keywordPlus: ENSEMBLES
dc.subject.keywordAuthor: Classification
dc.subject.keywordAuthor: Ensemble
dc.subject.keywordAuthor: Random forest
dc.subject.keywordAuthor: Bootstrap
dc.subject.keywordAuthor: Decision tree
Files in This Item
There are no files associated with this item.
Appears in Collections: College of Natural Science > Department of Statistics > 1. Journal Articles


Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher: Lee, Yung Seop, College of Natural Science (Department of Statistics)
