A Cohort-Level Evaluation of Thompson Sampling for Reducing Asthma Risk

Authors

  • Yuxiao Chen

DOI:

https://doi.org/10.54097/yk4etn78

Keywords:

Multi-armed bandits, Thompson Sampling, Empirical Bayes, mHealth, BRFSS.

Abstract

Public-health programs are continually deciding which outreach action will most reduce risk in the populations they serve, often with limited data and changing conditions. In this study, I apply Thompson Sampling (TS) to cohort-level decision making for asthma-related interventions using the Behavioral Risk Factor Surveillance System (BRFSS) 2015 data, I defined seven intervention “arms” that plausibly affect asthma outcomes including baseline, smoking cessation, vaccinations, inhaler adherence education, preventive check-ups, weight control, and air-quality awareness. I also compared two algorithms: (i) fixed Thompson Sampling (TS) with a Beta(1,1) prior per arm and (ii) Empirical-Bayes Thompson Sampling (EB-TS) that fits Beta priors from offline estimates. Over 10 replications of 30,000 rounds, fixed TS achieves mean expected reward 0.2660 and mean regret 0.002065 per round, concentrating ~99% of pulls on the best arm. EB-TS increases the overall reward to 0.2681 and lower the mean regret to 0.000046 by stabilizing early decisions with data-informed priors. Results suggest fixed TS is a strong, hyperparameter-free baseline for cohort-level outreach, while EB-TS helps when offline reward estimates are reliable and improved reward with dense value feedback.

Downloads

Download data is not yet available.

References

[1] W. R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 1933, 25(3–4): 285–294.

[2] Daniel J. Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen. A Tutorial on Thompson Sampling. Foundations and Trends in Machine Learning, 2018, 11(1): 1–96.

[3] Shipra Agrawal, Navin Goyal. Near-Optimal Regret Bounds for Thompson Sampling. Journal of the ACM, 2017, 64(5): 1–24.

[4] Steven Tomkins, Pei Liao, Predrag Klasnja, Susan A. Murphy. IntelligentPooling: Practical Thompson Sampling for mHealth. Machine Learning, 2021, 110(9): 2685–2727.

[5] Aleksandrs Slivkins. Introduction to Multi-Armed Bandits. Foundations and Trends® in Machine Learning, 2019, 12(1–2): 1–286.

[6] Centers for Disease Control and Prevention (CDC). Behavioral Risk Factor Surveillance System (BRFSS) 2015: Survey Data and Documentation. Technical report / public dataset, 2016. (Atlanta, GA: U.S. Department of Health and Human Services, CDC.)

[7] Tor Lattimore, Csaba Szepesvári. Bandit Algorithms. Cambridge University Press, 2020.

[8] Olivier Chapelle, Lihong Li. An Empirical Evaluation of Thompson Sampling. Advances in Neural Information Processing Systems, 2011, 24: 2249–2257.

[9] Shipra Agrawal, Navin Goyal. Thompson Sampling for Contextual Bandits with Linear Payoffs. Proceedings of the 30th International Conference on Machine Learning (ICML), 2013: 127–135.

Downloads

Published

13-03-2026

Issue

Section

Articles

How to Cite

Chen, Y. (2026). A Cohort-Level Evaluation of Thompson Sampling for Reducing Asthma Risk. Academic Journal of Science and Technology, 19(3), 355-359. https://doi.org/10.54097/yk4etn78