
Mika Braginsky Thesis Defense: Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development
Description
Mika Braginksy, PhD candidate in Brain and Cognitive Sciences
Advisor: Edward Gibson (MIT)
Abstract: Studying how children learn language gives complementary insights into both children and language. Given that languages have the properties that they do, what does children’s ability to learn them tell us about cognitive development? And given that children have the cognitive capacities that they do, what does the learnability of language tell us about its properties?
I present a series of large‐scale investigations of language learning, using dense datasets and computational models to support generalization across children, over development, among languages, and while distinguishing among theories. In the Study 1, we ask: what makes words harder or easier to learn, and what does that reveal about word learning mechanisms? We examined the factors that contribute to individual words’ learning trajectories, and the consistency of those factors across languages.
For subsequent investigations, I focus on the study of morphology learning. Learning morphology is a particularly interesting problem because it straightforwardly involves both memorizing specific forms and generalizing beyond direct experience. It is thus a fruitful case study for the interplay between mechanisms of memory and inference, which is fundamental both in language and in cognition more generally.
Study 2 applies the datasets and approaches developed in Study 1 to the domain of morphology, investigating how morphology learning relates to age and vocabulary development. Study 3 delves further into morphology learning by applying the abstraction and rigor of computational modeling. Finally, I propose a series of studies to use a novel data collection method to create a dense dataset on morphological development and evaluate theories of morphology learning.
Taken together, these studies synthesize empirical and computational methods to investigate multiple domains of language development at scale. Focusing on lexical and morphological development, they clarify and enhance the empirical and theoretical landscape of language learning.
Join the defense via Zoom https://mit.zoom.us/j/98709363565?pwd=djV5U3Y0SEltNmRBWnRDMlV1VnpQZz09