Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | 구건우 | - |
| dc.contributor.author | Jaemin Lee | - |
| dc.contributor.author | 박명관 | - |
| dc.date.accessioned | 2024-08-08T09:00:41Z | - |
| dc.date.available | 2024-08-08T09:00:41Z | - |
| dc.date.issued | 2023-12 | - |
| dc.identifier.issn | 1225-3871 | - |
| dc.identifier.issn | 2765-3773 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/20780 | - |
| dc.description.abstract | This study investigates neural language models in light of Chomsky's (1965, 1980) proposal that humans are innately disposed to acquire syntactic rules based on hierarchical structure rather than linear order. We examine the architectural and dataset factors that influence the acquisition of a syntactic inductive bias during pre-training of the T5 model. On the dataset side, we ask whether pre-training on datasets of differing complexity leads to different outcomes. We use two distinct pre-training datasets, Child-Directed Speech and an L2-Textbook dataset, and find that they exhibit different levels of syntactic complexity. We then evaluate our models on two syntactic transformation tasks: (i) question formation and (ii) passivization. Our results indicate that model depth (number of layers) influences hierarchical generalization more than other model components do. Furthermore, models learn the hierarchical aspects of language more efficiently when pre-trained on the syntactically simpler dataset. | - |
| dc.format.extent | 18 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | 한국응용언어학회 | - |
| dc.title | Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture | - |
| dc.title.alternative | Hierarchical Inductive Bias in the L2 Textbook-T5 and Child-T5 Language Model: A Study of Data and Architecture | - |
| dc.type | Article | - |
| dc.publisher.location | Republic of Korea | - |
| dc.identifier.doi | 10.17154/kjal.2023.12.39.4.179 | - |
| dc.identifier.bibliographicCitation | 응용언어학, v.39, no.4, pp 179 - 196 | - |
| dc.citation.title | 응용언어학 | - |
| dc.citation.volume | 39 | - |
| dc.citation.number | 4 | - |
| dc.citation.startPage | 179 | - |
| dc.citation.endPage | 196 | - |
| dc.identifier.kciid | ART003035194 | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.subject.keywordAuthor | T5 model | - |
| dc.subject.keywordAuthor | a hierarchical inductive bias | - |
| dc.subject.keywordAuthor | syntactic transformation | - |
| dc.subject.keywordAuthor | passivization | - |
| dc.subject.keywordAuthor | question formation | - |
