Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture

구건우; Jaemin Lee; 박명관

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture

Full metadata record

DC Field	Value	Language
dc.contributor.author	구건우	-
dc.contributor.author	Jaemin Lee	-
dc.contributor.author	박명관	-
dc.date.accessioned	2024-08-08T09:00:41Z	-
dc.date.available	2024-08-08T09:00:41Z	-
dc.date.issued	2023-12	-
dc.identifier.issn	1225-3871	-
dc.identifier.issn	2765-3773	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/20780	-
dc.description.abstract	This study aims to investigate neural language models in alignment with Chomsky's (1965, 1980) proposition regarding the innate human tendency to acquire syntactic rules based on hierarchical structures rather than linear order. We have examined the architectural and dataset factors influencing the acquisition of a syntactic inductive bias during the pre-training of the T5 model. To achieve this objective, particularly concerning the pre-training dataset, we inquire whether there are differences when using datasets of varying complexity. We use two distinct pre-training datasets: Child-Directed Speech and L2-Textbook dataset. Upon examination, we observe that these datasets exhibit different levels of syntactic complexity. Then, with our models, we employ two syntactic transformation tasks: (i) question formation and (ii) passivization. Our results indicate that model depth (number of layers) exerts a more significant influence on hierarchical generalization compared to other model components. Furthermore, we observed that models learn the hierarchical aspects of language more efficiently when exposed to the simpler complexity of the dataset.	-
dc.format.extent	18	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	한국응용언어학회	-
dc.title	Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture	-
dc.title.alternative	Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture	-
dc.type	Article	-
dc.publisher.location	대한민국	-
dc.identifier.doi	10.17154/kjal.2023.12.39.4.179	-
dc.identifier.bibliographicCitation	응용언어학, v.39, no.4, pp 179 - 196	-
dc.citation.title	응용언어학	-
dc.citation.volume	39	-
dc.citation.number	4	-
dc.citation.startPage	179	-
dc.citation.endPage	196	-
dc.identifier.kciid	ART003035194	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	kci	-
dc.subject.keywordAuthor	T5 model	-
dc.subject.keywordAuthor	a hiarachical inductive bias	-
dc.subject.keywordAuthor	syntactic transformation	-
dc.subject.keywordAuthor	passivization	-
dc.subject.keywordAuthor	question formation	-

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Humanities > Division of English Language & Literature > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Park, Myung Kwan photo

Park, Myung Kwan: College of Humanities (Division of English Language and Literature)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE