Parameter estimation in softmax decision-making models with linear   objective functions

Paul Reverdy; Naomi E. Leonard

arXiv:1502.04635·math.OC·September 1, 2015

Parameter estimation in softmax decision-making models with linear objective functions

Paul Reverdy, Naomi E. Leonard

PDF

TL;DR

This paper develops a systematic approach for estimating parameters in softmax decision-making models with linear objectives, enabling better understanding of human decision behaviors from behavioral data.

Contribution

It introduces conditions for convexity of the likelihood function, ensuring convergence of maximum likelihood estimators, and extends the method to nonlinear models via linearization.

Findings

01

Successfully fit the UCL model to human data

02

Detected significant behavioral differences across tasks

03

Provided a framework for parameter estimation in decision models

Abstract

With an eye towards human-centered automation, we contribute to the development of a systematic means to infer features of human decision-making from behavioral data. Motivated by the common use of softmax selection in models of human decision-making, we study the maximum likelihood parameter estimation problem for softmax decision-making models with linear objective functions. We present conditions under which the likelihood function is convex. These allow us to provide sufficient conditions for convergence of the resulting maximum likelihood estimator and to construct its asymptotic distribution. In the case of models with nonlinear objective functions, we show how the estimator can be applied by linearizing about a nominal parameter value. We apply the estimator to fit the stochastic UCL (Upper Credible Limit) model of human decision-making to human subject data. We show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax