Language Acquisition in Computers
Megan Belzner, Sean Colin-Ellerin, Jorge H. Roman

TL;DR
This paper presents a computer system that mimics children's language learning, focusing on morphology and syntax to understand and differentiate languages with minimal initial input.
Contribution
It introduces a novel approach combining morphological and syntactic analysis in a computer system inspired by child language acquisition techniques.
Findings
Successfully distinguished English, French, and Spanish texts using frequency thresholds.
Developed methods for understanding simple sentence structures and learning new words.
Implemented morphological analysis with bigrams and recursive syntax analysis in different programming languages.
Abstract
This project explores the nature of language acquisition in computers, guided by techniques similar to those used in children. While existing natural language processing methods are limited in scope and understanding, our system aims to gain an understanding of language from first principles and hence minimal initial input. The first portion of our system was implemented in Java and is focused on understanding the morphology of language using bigrams. We use frequency distributions and differences between them to define and distinguish languages. English and French texts were analyzed to determine a difference threshold of 55 before the texts are considered to be in different languages, and this threshold was verified using Spanish texts. The second portion of our system focuses on gaining an understanding of the syntax of a language using a recursive method. The program uses one of two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Algorithms and Data Compression · Speech and dialogue systems
