Computational theories of attention in decision making
For many years, research on attention has been dominated by theories based on the assumption that attention is limited in capacity. These include the limited-capacity channel theories of Welford and Broadbent and capacity or resource theories by Moray, Posner, and Kahneman. This talk challenges those theories and their many descendants by asking why capacity is limited and what role capacity plays in the computations required to perform attention tasks. There are few satisfactory answers in limited-capacity theories of attention. I show that the effects of load on performance, which are commonly interpreted as evidence for limited capacity, can be produced by models that assume unlimited, limited, and fixed capacity. I argue that attention is better construed as selection of information that we need to achieve our goals. Following current research on computational models of attention in associative learning, categorization, perceptual learning, cognitive development, neuroscience, and artificial intelligence, I propose that attention is a process of choice, in which selection implemented as multiplicative gain control and processing is constrained by normalization. This perspective focuses on interactions between representations and decision processes applied to them, explaining many attentional phenomena without assuming attention has limited capacity.
This is an in-person presentation on July 20, 2026 (09:00 ~ 09:20 EDT).
Selective attention is typically treated as a dedicated cognitive mechanism that modulates representations to favor task-relevant information. We challenge this view, presenting evidence from two very different modeling frameworks. We find that attentional effects can arise as a natural consequence of adaptive processing rather than requiring purpose-built attentional machinery. In the first line of work, we analyze monkey spiking data from a task requiring trial-by-trial switching between color and motion judgments. Representations across all recorded cortical areas, including V4, MT, PFC, FEF, LIP, and IT, stretched along the task-relevant dimension, with spike timing carrying critical information. An LSTM deep network trained on identical input sequences and rewards, without any explicit attentional mechanism, displayed the same qualitative stretching as a consequence of minimizing prediction error. In this case, selective attention emerges from error minimization. In the second line of work, we develop the Sampling Emergent Attention (SEA) model, which re-conceptualizes attention through a Bayesian lens as the expected utility of sampling particular information sources. Attentional effects emerge from a cost-sensitive information-gathering process, including flexible, stimulus-specific allocation patterns that mirror human eye-tracking data and go beyond what traditional fixed-weight attention models predict. In this case, selective attention emerges from utility maximization. Despite their fundamental differences, both frameworks converge on the same conclusion: attentional phenomena do not imply attentional mechanisms. Whether optimizing prediction error or expected utility, adaptive systems naturally develop the representational and behavioral signatures of attention, which invites a reappraisal of what attention fundamentally is. References http://dx.doi.org/10.1038/s41467-025-65231-y https://doi.org/10.1037/rev0000287
This is an in-person presentation on July 20, 2026 (09:20 ~ 09:40 EDT).
Dr. Ellen O'Donoghue
Dr. Matthew Broschard
Vladimir Sloutsky
Dr. Ed Wasserman
Dr. Brandon Turner
Attention has been criticised for its lack of theoretical cohesion. We show that a single computational framework - the adaptive representation model – can unify several competing conceptualizations. Specifically, we model attention in a category learning task where the underlying category structure changes. In our model, cognitive agents pay attention to features to weight them to make category decisions. These attention weights are dynamic and updated using gradient descent to maximize performance on every trial. Features that are attended to are stored in memory and used to inform the attentional update. Thus, the same attentional mechanism is used to (i) weight dimensions based on perceived importance that influences the (ii) encoding of relevant information (iii) explain adaptive behavior such as rapid phase transformations during learning and (iv) explain maladaptive behavior like catastrophic transfer and learning traps. We construct a switchboard framework to turn these different mechanisms on and off and show their contribution to explaining behavior. We test our computational account using empirical cross-species experiments involving humans, rats and pigeons. We find evidence for selective attention for humans but not rats and pigeons. Overall, we find that despite its seeming lack of theoretical cohesion, jointly conceptualizing attention is critical to developing a coherent account of learning, memory and decision making.
This is an in-person presentation on July 20, 2026 (09:40 ~ 10:00 EDT).
Dr. Sudeep Bhatia
People tend to choose the options they look at more. Prior work explains this effect by assuming that visual attention increases preference for the attended option. However, it remains unclear why looking at an option increases its preference and when this relationship is weakened. Across three studies using an experimental paradigm that combines eye-tracking and a think-aloud protocol, we provide evidence that visual attention shapes preference by activating attributes associated with the attended option. Critically, this mechanism can be modulated by contextual factors such as stimulus desirability and behavioral goals, and when these factors are incongruent, the influence of visual attention on choice is weakened. To account for these patterns, we propose a new computational model of gaze-driven attribute activation. Our model subsumes existing theories as special cases, and uniquely predicts which attributes are brought to mind by gaze and how they shape downstream decisions. Together, our work offers a mechanistic explanation of a fundamental driver of choice as well as its boundary conditions.
This is an in-person presentation on July 20, 2026 (10:00 ~ 10:20 EDT).
Submitting author
Author