Using Machine Learning to Guide Cognitive Modeling: A Case Study in Moral Reasoning
Mayank Agrawal, Joshua C. Peterson, Thomas L. Griffiths

TL;DR
This paper presents a data-driven, iterative approach for using machine learning to develop interpretable and accurate cognitive models, demonstrated through moral decision-making predictions using the Moral Machine dataset.
Contribution
It introduces a novel procedure that combines machine learning with cognitive modeling to produce interpretable models, specifically applied to moral reasoning.
Findings
Successfully predicted moral decision outcomes in complex conflicts.
Produced a simple, interpretable model explaining human moral judgments.
Demonstrated generalization of principles to real-world moral dilemmas.
Abstract
Large-scale behavioral datasets enable researchers to use complex machine learning algorithms to better predict human behavior, yet this increased predictive power does not always lead to a better understanding of the behavior in question. In this paper, we outline a data-driven, iterative procedure that allows cognitive scientists to use machine learning to generate models that are both interpretable and accurate. We demonstrate this method in the domain of moral decision-making, where standard experimental approaches often identify relevant principles that influence human judgments, but fail to generalize these findings to "real world" situations that place these principles in conflict. The recently released Moral Machine dataset allows us to build a powerful model that can predict the outcomes of these conflicts while remaining simple enough to explain the basis behind human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychology of Moral and Emotional Judgment · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
