Information search with situation-specific reward functions
Journal Title: Judgment and Decision Making - Year 2012, Vol 7, Issue 2
Abstract
The goal of obtaining information to improve classification accuracy can strongly conflict with the goal of obtaining information for improving payoffs. Two environments with such a conflict were identified through computer optimization. Three subsequent experiments investigated people’s search behavior in these environments. Experiments 1 and 2 used a multiple-cue probabilistic category-learning task to convey environmental probabilities. In a subsequent search task subjects could query only a single feature before making a classification decision. The crucial manipulation concerned the search-task reward structure. The payoffs corresponded either to accuracy, with equal rewards associated with the two categories, or to an asymmetric payoff function, with different rewards associated with each category. In Experiment 1, in which learning-task feedback corresponded to the true category, people later preferentially searched the accuracy-maximizing feature, whether or not this would improve monetary rewards. In Experiment 2, an asymmetric reward structure was used during learning. Subjects searched the reward-maximizing feature when asymmetric payoffs were preserved in the search task. However, if search-task payoffs corresponded to accuracy, subjects preferentially searched a feature that was suboptimal for reward and accuracy alike. Importantly, this feature would have been most useful, under the learning-task payoff structure. Experiment 3 found that, if words and numbers are used to convey environmental probabilities, neither reward nor accuracy consistently predicts search. These findings emphasize the necessity of taking into account people’s goals and search-and-decision processes during learning, thereby challenging current models of information search.
Authors and Affiliations
Björn Meder and Jonathan D. Nelson
Assessing a domain-specific risk-taking construct: A meta-analysis of reliability of the DOSPERT scale
The DOSPERT scale has been used extensively to understand individual differences in risk attitudes across varying decision domains since 2002. The present study reports a reliability generalization meta-analysis to summa...
Public policy for thee, but not for me: Varying the grammatical person of public policy justifications influences their support
Past research has shown that people consistently believe that others are more easily manipulated by external influences than they themselves are—a phenomenon called the “third-person effect” (Davison, 1983). The present...
What does it mean to maximize? “Decision difficulty,” indecisiveness, and the jingle-jangle fallacies in the measurement of maximizing
For two decades, researchers have investigated the correlates and consequences of individual differences in maximizing, the tendency to pursue the goal of making the best possible choice by extensively seeking out and co...
Choosing victims: Human fungibility in moral decision-making
In considering moral dilemmas, people often judge the acceptability of exchanging individuals’ interests, rights, and even lives. Here we investigate the related, but often overlooked, question of how people decide who t...
From foe to friend and back again: The temporal dynamics of intra-party bias in the 2016 U.S. Presidential Election
Political identification is the basis of enduring conflict, suggesting that political attitudes are difficult to change. Here we show that in the 2016 U.S. Presidential Election, political identities underwent modificati...