Evaluating L2 Training Methods in Neural Language Models

이재민 (Lee, Jaemin); 신정아 (Shin, Jeong Ah)

Authors: 이재민 (Lee, Jaemin); 신정아 (Shin, Jeong Ah)

Issue Date: Dec-2024

Publisher: Language Education Institute, Seoul National University (서울대학교 언어교육원)

Keywords: developmentally plausible data; cross-linguistic transfer; second language acquisition; neural language models; L2 language models; catastrophic forgetting

Citation: Language Research (어학연구), vol. 60, no. 3, pp. 323-345

Pages: 23

Indexed: KCI

Journal Title: Language Research (어학연구)

Volume: 60

Number: 3

Start Page: 323

End Page: 345

URI: https://scholarworks.dongguk.edu/handle/sw.dongguk/56755

DOI: 10.30961/lr.2024.60.3.323

ISSN: 0254-4474; 2586-7113

Abstract: Recent advances in language models (LMs) have significantly improved language processing capabilities; however, these models remain less efficient than human learners, especially when trained on developmentally plausible data volumes similar to those encountered by children (Warstadt & Bowman, 2022; Linzen, 2020). The inefficiency is even more pronounced in second language (L2) acquisition contexts, where cross-linguistic transfer is a key phenomenon (Papadimitriou & Jurafsky, 2020; Yadavalli et al., 2023). This study evaluates L2 training methods in neural language models by examining mutual L1-L2 influences during learning with developmentally plausible data volumes. We propose two approaches to mitigate catastrophic forgetting: the One-Stage Training (OST) method, which integrates L1 and L2 learning into a single stage, and the One-Stage Mixed Training (OSMT) method, which refines OST by incorporating L1 data into the L2 stage for a more realistic simulation of bilingual learning. Using continuous syntactic evaluations throughout training, we analyzed how L1 performance changes during L2 acquisition and how cross-linguistic transfer emerges in Korean and English. The results indicate that OST and OSMT effectively mitigated catastrophic forgetting and supported more stable learning than the conventional Two-Stage Training method. OSMT achieved superior integration of L1 and L2 structures while revealing negative transfer effects from Korean (L1) to English (L2). These findings provide insights into both neural model training and human-like L2 acquisition processes.

Appears in Collections: College of Humanities > Division of English Language & Literature > 1. Journal Articles

Related Researcher

Shin, Jeong Ah: College of Humanities (Division of English Language and Literature)
