TY - JOUR
T1 - Computational models of exploration and exploitation characterise onset and efficacy of treatment in methamphetamine use disorder
AU - Robinson, Alex H.
AU - Chong, Trevor T.J.
AU - Verdejo-Garcia, Antonio
N1 - Funding Information:
We sincerely thank Adam Rubenis and Rebecca Fitzpatrick for their research efforts which provided the data for this paper. We also sincerely thank all those who participated in the original study. AHR was supported by an Australian Government Research Training Program Scholarship. TT‐JC was supported by the Australian Research Council (DP 180102383 and DE 180100389). AVG was supported by an Australian Medical Research Future Fund, Next Generation of Clinical Researchers, CDF‐2 Fellowship (MRF1141214). Open access publishing facilitated by Monash University, as part of the Wiley ‐ Monash University agreement via the Council of Australian University Librarians.
Funding Information:
We sincerely thank Adam Rubenis and Rebecca Fitzpatrick for their research efforts which provided the data for this paper. We also sincerely thank all those who participated in the original study. AHR was supported by an Australian Government Research Training Program Scholarship. TT-JC was supported by the Australian Research Council (DP 180102383 and DE 180100389). AVG was supported by an Australian Medical Research Future Fund, Next Generation of Clinical Researchers, CDF-2 Fellowship (MRF1141214). Open access publishing facilitated by Monash University, as part of the Wiley - Monash University agreement via the Council of Australian University Librarians.
Publisher Copyright:
© 2022 The Authors. Addiction Biology published by John Wiley & Sons Ltd on behalf of Society for the Study of Addiction.
PY - 2022/5
Y1 - 2022/5
N2 - People with Methamphetamine Use Disorder (PwMUD) spend substantial time and resources on substance use, which hinders their ability to explore alternate reinforcers. Gold-standard behavioural treatments attempt to remedy this by encouraging action towards non-drug reinforcers, but substance use often persists. We aimed to unravel the mechanistic drivers of this behaviour by applying a computational model of explore/exploit behaviour to decision-making data (Iowa Gambling Task) from 106 PwMUD and 48 controls. We then examined the longitudinal link between explore/exploit mechanisms and changes in methamphetamine use 6 weeks later. Exploitation parameters included reinforcement sensitivity and inverse decay (i.e., number of past outcomes used to guide choices). Exploration parameters included maximum directed exploration value (i.e., value of trying novel actions). The Timeline Follow Back measured changes in methamphetamine use. Compared to controls, PwMUD showed deficits in exploitative decision-making, characterised by reduced reinforcement sensitivity, U = 3065, p = 0.009, and less use of previous choice outcomes, U = 3062, p = 0.010. This was accompanied by a behavioural pattern of frequent shifting between choices, which appeared consistent with random exploration. Furthermore, PwMUD with greater reductions of methamphetamine use at 6 weeks had increased directed exploration (β = 0.22, p = 0.045); greater use of past choice outcomes (β = −0.39, p = 0.002) and greater choice consistency (β = −0.39, p = 0.002). Therefore, limited computational exploitation and increased behavioural exploration characterise PwMUD's presentation to treatment, while increased directed exploration, use of past choice outcomes and choice consistency predict greater reductions of methamphetamine use.
AB - People with Methamphetamine Use Disorder (PwMUD) spend substantial time and resources on substance use, which hinders their ability to explore alternate reinforcers. Gold-standard behavioural treatments attempt to remedy this by encouraging action towards non-drug reinforcers, but substance use often persists. We aimed to unravel the mechanistic drivers of this behaviour by applying a computational model of explore/exploit behaviour to decision-making data (Iowa Gambling Task) from 106 PwMUD and 48 controls. We then examined the longitudinal link between explore/exploit mechanisms and changes in methamphetamine use 6 weeks later. Exploitation parameters included reinforcement sensitivity and inverse decay (i.e., number of past outcomes used to guide choices). Exploration parameters included maximum directed exploration value (i.e., value of trying novel actions). The Timeline Follow Back measured changes in methamphetamine use. Compared to controls, PwMUD showed deficits in exploitative decision-making, characterised by reduced reinforcement sensitivity, U = 3065, p = 0.009, and less use of previous choice outcomes, U = 3062, p = 0.010. This was accompanied by a behavioural pattern of frequent shifting between choices, which appeared consistent with random exploration. Furthermore, PwMUD with greater reductions of methamphetamine use at 6 weeks had increased directed exploration (β = 0.22, p = 0.045); greater use of past choice outcomes (β = −0.39, p = 0.002) and greater choice consistency (β = −0.39, p = 0.002). Therefore, limited computational exploitation and increased behavioural exploration characterise PwMUD's presentation to treatment, while increased directed exploration, use of past choice outcomes and choice consistency predict greater reductions of methamphetamine use.
KW - computational modelling
KW - decision-making
KW - explore/exploit
KW - methamphetamine
KW - predictive
KW - substance use
UR - http://www.scopus.com/inward/record.url?scp=85128867607&partnerID=8YFLogxK
U2 - 10.1111/adb.13172
DO - 10.1111/adb.13172
M3 - Article
C2 - 35470564
AN - SCOPUS:85128867607
SN - 1355-6215
VL - 27
JO - Addiction Biology
JF - Addiction Biology
IS - 3
M1 - e13172
ER -