Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and ArchitectureHierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture

Other Titles
Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture
Authors
구건우Jaemin Lee박명관
Issue Date
Dec-2023
Publisher
한국응용언어학회
Keywords
T5 model; a hiarachical inductive bias; syntactic transformation; passivization; question formation
Citation
응용언어학, v.39, no.4, pp 179 - 196
Pages
18
Indexed
KCI
Journal Title
응용언어학
Volume
39
Number
4
Start Page
179
End Page
196
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/20780
DOI
10.17154/kjal.2023.12.39.4.179
ISSN
1225-3871
2765-3773
Abstract
This study aims to investigate neural language models in alignment with Chomsky's (1965, 1980) proposition regarding the innate human tendency to acquire syntactic rules based on hierarchical structures rather than linear order. We have examined the architectural and dataset factors influencing the acquisition of a syntactic inductive bias during the pre-training of the T5 model. To achieve this objective, particularly concerning the pre-training dataset, we inquire whether there are differences when using datasets of varying complexity. We use two distinct pre-training datasets: Child-Directed Speech and L2-Textbook dataset. Upon examination, we observe that these datasets exhibit different levels of syntactic complexity. Then, with our models, we employ two syntactic transformation tasks: (i) question formation and (ii) passivization. Our results indicate that model depth (number of layers) exerts a more significant influence on hierarchical generalization compared to other model components. Furthermore, we observed that models learn the hierarchical aspects of language more efficiently when exposed to the simpler complexity of the dataset.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Humanities > Division of English Language & Literature > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Myung Kwan photo

Park, Myung Kwan
College of Humanities (Division of English Language and Literature)
Read more

Altmetrics

Total Views & Downloads

BROWSE