Use and Misuse of Machine Learning in Anthropology
Jeff Calder, Reed Coil, Annie Melton, Peter J. Olver, Gilbert, Tostevin, Katrina Yezzi-Woodley

TL;DR
This paper examines the widespread but often improper use of machine learning in paleoanthropology, highlighting common errors and advocating for better practices, transparency, and interdisciplinary collaboration to improve research reliability.
Contribution
It provides an overview of ML applications in paleoanthropology, identifies prevalent misapplications, and offers recommendations for correct usage and improved research standards.
Findings
Frequent use of outdated ML algorithms in anthropology
Common errors in train/test data splits and sample selection
Lack of data and code sharing hampers reproducibility
Abstract
Machine learning (ML), being now widely accessible to the research community at large, has fostered a proliferation of new and striking applications of these emergent mathematical techniques across a wide range of disciplines. In this paper, we will focus on a particular case study: the field of paleoanthropology, which seeks to understand the evolution of the human species based on biological and cultural evidence. As we will show, the easy availability of ML algorithms and lack of expertise on their proper use among the anthropological research community has led to foundational misapplications that have appeared throughout the literature. The resulting unreliable results not only undermine efforts to legitimately incorporate ML into anthropological research, but produce potentially faulty understandings about our human evolutionary and behavioral past. The aim of this paper is to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Image Processing and 3D Reconstruction · Anomaly Detection Techniques and Applications
