Yuri Imaizumi’s research while affiliated with University of Tsukuba and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (21)


ResponsetoReviwerRev1.pdf
  • Data
  • File available

November 2022

·

7 Reads

Yuri Imaizumi

·

·

·

[...]

·

Download

Cued lottery task and monkeys’ choice behavior
a A Sequence of events in the choice trials. Two pie charts representing available options were presented to the monkeys on the left and right sides of the screen. The monkeys chose either of the targets by fixating on the side where they appeared. b Frequency with which the target on the right side was selected for the expected values of the left and right target options. c Sequence of events in single-cue trials. d AIC values are estimated based on the four standard economic models to describe the monkey’s choice behavior: EV, EU, PT1, and PT2. See the Methods section for details. e Estimated utility functions in the best-fit model PT2. f Estimated probability-weighting functions in the best-fit model PT2. Images in panels a-c were created by the authors and previously published in Neural Population Dynamics Underlying Expected Value Computation. Hiroshi Yamada, et al.²⁸. https://creativecommons.org/licenses/by/4.0/.
Neural coding of probability and magnitude of rewards in the four brain regions
a Illustration of neural recording areas based on coronal magnetic resonance images. b Example activity histogram of a DS neuron modulated by probability and magnitude of rewards with positive regression coefficients during the single-cue task (P + M + type). The activity aligned to the cue onset is represented for three different levels of probability (0.1–0.3, 0.4–0.7, and 0.8–1.0) and magnitude (0.1–0.3 mL, 0.4–0.7 mL, and 0.8–1.0 mL) of rewards. Gray hatched time windows indicate the 1-s time window used to estimate the neural firing rates shown in f and g. Raster grams are shown below. c–e similar to b, but for VS, cOFC, and mOFC neurons. f Plot of the neural firing rates during the 1-s time window in b for ten levels of probability and magnitude of rewards. The firings are normalized by the maximum firing rates. P and M indicate the probability and magnitude of rewards, respectively. g Color map of the neural firing rates during the 1 s time window in b for ten levels of probability and magnitude of rewards. Average smoothing was made between neighboring pixels. h Percentage of neurons modulated by probability and magnitude of rewards in the four core reward brain regions. Gray indicates activity showing positive regression coefficients for probability and magnitude of rewards (P + M + type). Black indicates activity showing the negative regression coefficients for probability and magnitude (P-M- type). Images in panels a were created by the authors and previously published in Neural Population Dynamics Underlying Expected Value Computation. Hiroshi Yamada, et al.²⁸. https://creativecommons.org/licenses/by/4.0/.
Neural models of economic decision theory
Schematic depiction of predicted neuronal responses R defined by the four economic models that represent the expected value (a, EV), expected utility (b, EU), prospect theory one-parameter Prelec (c, PT1), and two-parameter Prelec (d, PT2). Model equations are presented in each plot. R was plotted against the probability (p) and magnitude (m) of the rewards. b, g, α, γ, and δ are the free parameters. g and b are the gain and intercept parameters, respectively. α represents the curvature of u(m). δ and γ represent the probability weighting functions. For these schematic drawings, the following values for the free parameters were used: b, g, α, γ, and δ were 0 spk s⁻¹, 1, 0.6, 2, and 0.5, respectively, for all four Figs. See the Methods section for more details.
Prospect theory best explained neural firing rates in the reward circuitry
a Plot of an example activity of the DS neuron in Fig. 2b against the probability (p) and magnitude (m) of rewards. To draw the 3D curvature (left) and contour lines (right), the neighboring pixels were averaged and smoothed. b AIC values against the proportion of variance explained are plotted in each model for the example neuron in a. c A 3D histogram (left) and contour lines (right) predicted from the best-fit PT2 model in a. The activity of the example neuron in a is shown on the right color map. Contour lines are shown for every 10% change in the fit model. du(m) and w(p) estimated in the best-fit model PT2 for the neural activity in a. e Probability density of the estimated AIC difference of the three models against the EV (simplest) model. The plots display the mean values. n represents the number of neuronal signals that showed both positive and negative regression coefficients for the probability and magnitude of the rewards.
Neuronal clusters categorized by the fitted parameters according to the prospect theory model
a Plots of all five parameters estimated in DS, VS, and cOFC neurons. g, b, α, δ, and γ were plotted. b Cumulative plot of the proportion of variance explained by PCA is shown against principal components PC1–PC5. c Cumulative plot of the percentage of activity categorized into five clusters in each brain region. d Response R (model output) in the first three predominant clusters is plotted. The 3D curvature, contour lines with color maps, u(m), and w(p) are plotted using the mean values of each parameter in each cluster. To draw the 3D curvature (first column) and contour lines (second column), R was normalized to the maximal value.

+1

A neuronal prospect theory model in the brain reward circuitry

October 2022

·

118 Reads

·

10 Citations

Prospect theory, arguably the most prominent theory of choice, is an obvious candidate for neural valuation models. How the activity of individual neurons, a possible computational unit, obeys prospect theory remains unknown. Here, we show, with theoretical accuracy equivalent to that of human neuroimaging studies, that single-neuron activity in four core reward-related cortical and subcortical regions represents the subjective valuation of risky gambles in monkeys. The activity of individual neurons in monkeys passively viewing a lottery reflects the desirability of probabilistic rewards parameterized as a multiplicative combination of utility and probability weighting functions, as in the prospect theory framework. The diverse patterns of valuation signals were not localized but distributed throughout most parts of the reward circuitry. A network model aggregating these signals reconstructed the risk preferences and subjective probability weighting revealed by the animals’ choices. Thus, distributed neural coding explains the computation of subjective valuations under risk.


Fig. 1. Cued lottery task and monkey choice behavior. (a) A sequence of events in choice trials. Two pie charts representing the available options were presented to the monkeys on the left and right sides of the screen. Monkeys chose either target by fixating on the side where it appeared. (b) Payoff matrix. Each magnitude was fully crossed with each probability, resulting in a pool of 100 lotteries from which two were randomly allocated to the left-and right-side target options on each trial. Expected values (EVs) are calculated in mL. (c) The frequency with which the target on the right side was selected for the expected values of the left and right target options. Fig. 1a and c are published in Yamada et al., 2021.
Fig. 3. Value function and its RPE estimated by using the reinforcement learning model. (a) Plots of the V(p,m)t=10,000 against the expected value defined mathematically, i.e., probability time magnitude. (b) r -V(p,m)t=10,000 after reward plotted against the positive component of RPE, i.e., obtained reward magnitude minus the expected values. (c) r -V(p,m)t=10,000 after no-reward (hence, r is zero) plotted against the negative component of RPE, i.e., zero minus expected value. Plots were made for all stimuli, as a function of different learning rates. r is the correlation coefficient.
Dynamic prospect theory - two core decision theories coexist in the gambling behavior of monkeys and humans

August 2022

·

126 Reads

Research in the multidisciplinary field of neuroeconomics has been driven by two influential theories regarding human economic choice: prospect theory, which describes decision-making under risk, and reinforcement learning theory, which describes learning for decision-making. We hypothesized that these two distinct theories guide decision-making in a comprehensive manner. Here, we propose and test a new decision-making theory under uncertainty that combines these highly influential theories. Collecting many gambling decisions from laboratory monkeys allowed for reliable testing of our hybrid model and revealed a systematic violation of prospect theory’s assumption that probability weighting is static. Using the same experimental paradigm in humans, substantial similarities between monkey and human behavior were described by our hybrid model, which incorporates decision-by-decision learning dynamics of prediction errors into static prospect theory. Our new model provides a single unified theoretical framework for exploring the neurobiological model of economic choice in human and nonhuman primates.


Comparison of neural population dynamics in the regression subspace between continuous and categorical task parameters

January 2022

·

70 Reads

·

1 Citation

Neural population dynamics, presumably fundamental computational units in the brain, provide a key framework for understanding information processing in the sensory, cognitive, and motor functions. However, neural population dynamics is not explicitly related to the conventional analytic framework for single-neuron activity, i.e., representational models that analyze neuronal modulations associated with cognitive and motor parameters. In this study, we applied a recently developed state-space analysis to incorporate the representational models into the dynamic model in combination with these parameters. We compared neural population dynamics between continuous and categorical task parameters during two visual recognition tasks, using the datasets originally designed for a single-neuron approach. We successfully extracted neural population dynamics in the regression subspace, which represent modulation dynamics for both continuous and categorical task parameters with reasonable temporal characteristics. Furthermore, we combined the classical optimal-stimulus analysis paradigm for the single-neuron approach (i.e., stimulus identified as maximum neural responses) into the dynamic model, and found that the most prominent modulation dynamics at the lower dimension were derived from these optimal responses. Thus, our approach provides a unified framework for incorporating knowledge acquired with the single-neuron approach into the dynamic model as a standard procedure for describing neural modulation dynamics in the brain.



Figure 1. Cued lottery task, monkeys' choice behavior, and neural coding of
A neuronal prospect theory model in the brain reward circuitry

December 2021

·

63 Reads

Prospect theory, arguably the most prominent theory of choice, is an obvious candidate for neural valuation models. How the activity of individual neurons, a possible computational unit, reflects prospect theory remains unknown. Here, we show with theoretical accuracy equivalent to that of human neuroimaging studies that single-neuron activity in four core reward-related cortical and subcortical regions represents the subjective valuation of risky gambles in monkeys. The activity of individual neurons in monkeys passively viewing a lottery reflects the desirability of probabilistic rewards, parameterized as a multiplicative combination of a utility and probability weighting functions in the prospect theory framework. The diverse patterns of valuation signals were not localized but distributed throughout most parts of the reward circuitry. A network model aggregating these signals reliably reconstructed risk preferences and subjective probability perceptions revealed by the animals' choices. Thus, distributed neural coding explains the computation of subjective valuations under risk.


Figures and Legends
Dynamic prospect theory -two core economic decision theories coexist in the gambling behavior of monkeys

April 2021

·

109 Reads

Research in behavioral economics and reinforcement learning has given rise to two influential theories describing human economic choice under uncertainty. The first, prospect theory, assumes that decision-makers use static mathematical functions, utility and probability weighting, to calculate the values of alternatives. The second, reinforcement learning theory, posits that dynamic mathematical functions update the values of alternatives based on experience through reward prediction error (RPE). To date, these theories have been examined in isolation without reference to one another. Therefore, it remains unclear whether RPE affects a decision-maker’s utility and/or probability weighting functions, or whether these functions are indeed static as in prospect theory. Here, we propose a dynamic prospect theory model that combines prospect theory and RPE, and test this combined model using choice data on gambling behavior of captive macaques. We found that under standard prospect theory, monkeys, like humans, had a concave utility function. Unlike humans, monkeys exhibited a concave, rather than inverse-S shaped, probability weighting function. Our dynamic prospect theory model revealed that probability distortions, not the utility of rewards, solely and systematically varied with RPE: after a positive RPE, the estimated probability weighting functions became more concave, suggesting more optimistic belief about receiving rewards and over-weighted subjective probabilities at all probability levels. Thus, the probability perceptions in laboratory monkeys are not static even after extensive training, and are governed by a dynamic function well captured by the algorithmic feature of reinforcement learning. This novel evidence supports combining these two major theories to capture choice behavior under uncertainty. Significance statement We propose and test a new decision theory under uncertainty by combining pre-existing two influential theories in the neuroeconomics: prospect theory from economics and prediction error theory from reinforcement learning. Collecting a large dataset (over 60,000 gambling decisions) from laboratory monkeys enables us to test the hybrid model of these two core decision theories reliably. Our results showed over-weighted subjective probabilities at all probability levels after lucky win, indicating that positive prediction error systematically bias decision-makers more optimistically about receiving rewards. This trial-by-trial prediction-error dynamics in probability perception provides outperformed performance of the model compared to the standard static prospect theory. Thus, both static and dynamic elements coexist in monkey’s risky decision-making, an evidence contradicting the assumption of prospect theory.




Neural Population Dynamics Underlying Expected Value Computation

January 2021

·

91 Reads

·

20 Citations

The Journal of Neuroscience : The Official Journal of the Society for Neuroscience

Computation of expected values (i.e., probability × magnitude) seems to be a dynamic integrative process performed by the brain for efficient economic behavior. However, neural dynamics underlying this computation is largely unknown. Using lottery tasks in monkeys ( Macaca mulatta , male; Macaca fuscata , female), we examined (1) whether four core reward-related brain regions detect and integrate probability and magnitude cued by numerical symbols and (2) whether these brain regions have distinct dynamics in the integrative process. Extraction of the mechanistic structure of neural population signals demonstrated that expected value signals simultaneously arose in the central orbitofrontal cortex (cOFC; medial part of area 13) and ventral striatum (VS). Moreover, these signals were incredibly stable compared with weak and/or fluctuating signals in the dorsal striatum and medial OFC. Temporal dynamics of these stable expected value signals were unambiguously distinct: sharp and gradual signal evolutions in the cOFC and VS, respectively. These intimate dynamics suggest that the cOFC and VS compute the expected values with unique time constants, as distinct, partially overlapping processes. SIGNIFICANCE STATEMENT Our results differ from those of earlier studies suggesting that many reward-related regions in the brain signal probability and/or magnitude and provide a mechanistic structure for expected value computation employed in multiple neural populations. A central part of the orbitofrontal cortex (cOFC) and ventral striatum (VS) can simultaneously detect and integrate probability and magnitude into an expected value. Our empirical study on these neural population dynamics raises a possibility that the cOFC and VS cooperate on this computation with unique time constants as distinct, partially overlapping processes.


Citations (5)


... These neural properties might be related to the larger changes in carried information as a function of firing rates and dynamic range ( Figure 4B, compare FSNs and RSN regression slopes, Figure 4C, red). As a result, the output neurons in cortical (9, 10, 12, 13) and subcortical (40)(41)(42)(43) structures becomes active via feedforward inhibition ( Figure 4A) during economic behavior. ...

Reference:

Fast-spiking neurons in monkey orbitofrontal cortex underlie economic value computation
Formation of brain-wide neural geometry during visual item recognition in monkeys

iScience

... Indeed, cortical inhibitory dysfunction results in various diseases including mental disorders (6,7). Since excitatory neurons constitute the majority of neurons at the core cortical center, the orbitofrontal cortex (OFC), they have been well examined in relation to economic behavior to obtain rewards (8)(9)(10)(11)(12)(13)(14). ...

Stable Neural Population Dynamics in the Regression Subspace for Continuous and Categorical Task Parameters in Monkeys

eNeuro

... However, as captured in expected utility theory, decisionmakers are usually not indifferent; they have risk preferences. Tversky and Kahneman (1981) introduced these kinds of problems to illustrate critical tests of divergent predictions of expected utility theory versus prospect theory, still both widely used theories today (e.g., Barberis, 2013;Tymula et al., 2023). Prospect theory predicted gain-loss differences in risk preference, which was thought to rule out expected utility theory in its classic form. ...

Dynamic prospect theory: Two core decision theories coexist in the gambling behavior of monkeys and humans

Science Advances

... In general, we examine the outcome of our choice and adjust subsequent choice behavior using the outcome information to choose an appropriate action. Five significant studies on neurons (Kawai et al. (2015); Yamada et al. (2021); Imaizumi et al. (2022); Yang et al. (2022); Ferrari-Toniolo and Schultz (2023)) have examined neuronal responses to loss and gain. These studies suggest that two different neural systems may respond to loss and gain, resulting in a value function with a cusp as a reference point. ...

A neuronal prospect theory model in the brain reward circuitry

... In a previous study using a choice task, we showed that amygdala neurons do encode subjective values that reflected integrated reward probability and magnitude when these reward attributes were cued simultaneously 12 . Perhaps neurons in the prefrontal cortex, including the orbitofrontal cortex, and parietal cortex might be relatively more important in signaling to accumulate decision variables derived from sequential or otherwise complex cues 28,[63][64][65][66] . Some previous studies found largely similar coding of values and choices in the amygdala and orbitofrontal cortex 13,67 , while others emphasized differences in the time courses with which neurons in these structures track changing values 53,68 , and in the specificity with which single neurons encode complex, multisensory food rewards 69 . ...

Neural Population Dynamics Underlying Expected Value Computation

The Journal of Neuroscience : The Official Journal of the Society for Neuroscience