Differential contributions of machine learning and statistical analysis to language and cognitive sciences
Kun Sun, Rong Wang

TL;DR
This paper systematically compares machine learning and statistical analysis in language and cognitive sciences, illustrating their distinct techniques, objectives, and insights using the Buckeye Speech Corpus, and clarifying misconceptions to guide researchers.
Contribution
It provides a comprehensive tutorial-like comparison of machine learning and statistical methods applied to the same dataset, highlighting their differences, strengths, and appropriate applications in language research.
Findings
Machine learning excels at pattern recognition and prediction.
Statistical methods offer deeper insights into variable relationships.
Semantic relevance as a new metric enhances understanding of speech timing.
Abstract
Data-driven approaches have revolutionized scientific research, with machine learning and statistical analysis being commonly used methodologies. Despite their widespread use, these approaches differ significantly in their techniques, objectives and implementations. Few studies have systematically applied both methods to identical datasets to highlight potential differences, particularly in language and cognitive sciences. This study employs the Buckeye Speech Corpus to illustrate how machine learning and statistical analysis are applied in data-driven research to obtain distinct insights on language production. We demonstrate the theoretical differences, implementation steps, and unique objectives of each approach through a comprehensive, tutorial-like comparison. Our analysis reveals that while machine learning excels at pattern recognition and prediction, statistical methods provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
