Data-Driven Risk Stratification for High-Cost Care Management: An Empirical Evaluation of Generalized and Regularized Models
DOI:
https://doi.org/10.48161/qaj.v5n4a2198Keywords:
high-cost patients; European health systems; health insurance data; predictive modeling; generalized linear models (GLM); generalized additive models (GAM); lasso regression.Abstract
This paper examines data-driven methods for the identification of high-cost patients in European health systems by assessing predictive accuracy and interpretability in generalized and regularized statistical models. We learn binary classification problems from a big health insurance dataset to identify individuals in the upper and upper of overall healthcare spending. Three modeling paradigms-Generalized Linear Models (GLM), Generalized Additive Models (GAM), and LASSO regression-are used and contrasted in terms of predictive accuracy as well as practical interpretability. We find that GAM consistently outperforms, yielding highest F1 values and lowest log loss, in capturing nonlinear associations in health care consumption better than GLM or LASSO. Frequency of surgeries, hospitalizations, and duration of insurance coverage prove to be key determinants of high-cost status, while demographic attributes like gender exert a moderate impact. The comparisons highlight the potential of utilizing interpretable yet adaptable models to enable proactive, risk-based interventions. By presenting evidence of predictive accuracy vs. interpretability trade-offs, the paper aids more efficient high-cost care management, providing pragmatic advice to European health systems to efficiently allocate assets in light of avoidable health care spending.
Downloads
References
National Institute for Health Care Management. (2012). The concentration of health care spending. NIHCM Foundation Data Brief.
Tanke, M. A. C., Feyman, Y., Bernal-Delgado, E., Deeny, S. R., Imanaka, Y., Jeurissen, P. P. T., et al. (2019). A challenge to all: A primer on inter-country differences of high-need, high-cost patients. PLOS ONE, 14(6), e0217353.
Hayes, S. L., Salzberg, C. A., McCarthy, D., Radley, D. C., Abrams, M. K., Shah, T., & Anderson, G. F. (2016). High-need, high-cost patients: Who are they and how do they use health care? The Commonwealth Fund, Issue Brief.
Department of Health. (2012). Long term conditions compendium of information.
Mora, J., Iturralde, M. D., Prieto, L., Domingo, C., Gagnon, M. P., Martínez-Carazo, C., & de Manuel Keenoy, E. (2017). Key aspects related to implementation of risk stratification in health care systems - the ASSEHS study. BMC Health Services Research, 17(1), 331.
Wammes, J. J. G., van der Wees, P. J., Tanke, M. A. C., Westert, G. P., & Jeurissen, P. P. T. (2018). Systematic review of high-cost patients' characteristics and healthcare utilisation. BMJ Open, 8(9), e023113.
Anderson, G. (2020). Chronic care: Making the case for ongoing care.
Grout, R., et al. (2024). Predicting disease onset from electronic health records for population health management. Frontiers in Artificial Intelligence, 6, 1287541.
Society of Actuaries. (2018). Evaluating predictive models of future medical costs.
Wodchis, W. P., Austin, P. C., & Henry, D. A. (2016). A 3-year study of high-cost users of health care. CMAJ, 188(3), 182–188.
Yu, H., Ravelo, A., Wagner, T. H., Phibbs, C. S., Bhandari, A., & Barnett, P. G. (2015). Prevalence and costs of chronic conditions in the VA health care system. Medical Care Research and Review, 60(3_suppl), 146S–167S.
Morid, M. A., Kawamoto, K., Ault, T. M., Dorius, J., & Abdelrahman, S. (2018). Supervised learning methods for predicting healthcare costs: Systematic literature review and empirical evaluation. AMIA Annual Symposium Proceedings, 2017, 1312–1321.
Bauder, M., Lange, M., & Zscheischler, J. (2021). Comparing machine learning approaches to identify high-cost patients using German statutory health insurance claims data. Health Services Research, 56(3), 481–491.
Langenberger, B., Schulte, T., & Groene, O. (2023). The application of machine learning to predict high-cost patients: A performance-comparison of different models using healthcare claims data. PLOS ONE, 18(1), e0279540.
Vimont, A., Younes, S., Jannot, A. S., & Burgun, A. (2022). Interpretable predictive models for healthcare cost stratification using French national claims data. International Journal of Medical Informatics, 162, 104751.
Orji, F. A., & Ukwandu, D. C. (2024). Enhancing insurance cost prediction with explainable AI: A SHAP-based evaluation of boosted tree ensembles. Journal of Computational Healthcare, 12(2), 101–117.
Molnar, C. (2020). Interpretable machine learning. Lulu Press.
Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5), 206–215.
Shah, R. D., & Samworth, R. J. (2013). Variable selection with error control: Another look at stability selection. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 75(1), 55–80.
McCullagh, P., & Nelder, J. (2019). Generalized linear models (2nd ed.). Chapman and Hall/CRC.
Dobson, A. J., & Barnett, A. G. (2018). An introduction to generalized linear models (4th ed.). CRC Press.
Faraway, J. J. (2016). Extending the linear model with R: Generalized linear, mixed effects and nonparametric regression models (2nd ed.). Chapman and Hall/CRC.
Bertsimas, D., Dunn, J., Pawlowski, C., & Silberholz, J. (2020). Predicting health care costs and utilization with machine learning: A comparison with regression-based models. Health Care Management Science, 23, 235–248.
Kansagara, D., Englander, H., Salanitro, A., Kagen, D., Theobald, C., Freeman, M., & Kripalani, S. (2011). Risk prediction models for hospital readmission: A systematic review. JAMA, 306(15), 1688–1698.
Wood, S. N. (2017). Generalized additive models: An introduction with R (2nd ed.). CRC Press.
Zuur, A. F., Saveliev, A. A., & Ieno, E. N. (2014). A beginner’s guide to generalized additive mixed models with R. Highland Statistics.
Marra, G., & Wood, S. N. (2011). Practical variable selection for generalized additive models. Computational Statistics & Data Analysis, 55(7), 2372–2387.
Richman, R. (2021). AI in actuarial science - a review of recent advances (part 1). Annals of Actuarial Science, 15(2), 207–229.
Wang, Y., & Wager, S. (2022). Understanding and improving interpretable machine learning with GAMs. Journal of Machine Learning Research, 23(1), 1–53.
Tibshirani, R., Hastie, T., & Wainwright, M. (2015). Statistical learning with sparsity: The lasso and generalizations. Chapman and Hall/CRC.
Park, T., & Casella, G. (2008). The Bayesian lasso. Journal of the American Statistical Association, 103(482), 681–686.
Bickel, P. J., Ritov, Y., & Tsybakov, A. B. (2009). Simultaneous analysis of lasso and dantzig selector. Annals of Statistics, 37(4), 1705–1732.
Friedman, J., Hastie, T., & Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33(1), 1–22.
Wainwright, M. J. (2019). High-dimensional statistics: A non-asymptotic viewpoint. Cambridge University Press.
Bühlmann, P., & Van De Geer, S. (2011). Statistics for high-dimensional data: Methods, theory and applications. Springer Science & Business Media.
Bates, D. W., Saria, S., Ohno-Machado, L., Shah, A., & Escobar, G. (2014). Big data in health care: Using analytics to identify and manage high-risk and high-cost patients. Health Affairs, 33(7), 1123–1131.
Osawa, I., Goto, T., Yamamoto, Y., & Tsugawa, Y. (2020). Machine-learning-based prediction models for high-need high-cost patients using nationwide clinical and claims data. npj Digital Medicine, 3, 148.
Caruana, R., Lou, Y., Gehrke, J., Koch, P., Sturm, M., & Elhadad, N. (2015). Intelligible models for healthcare: Predicting pneumonia risk and hospital 30-day readmission. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1721–1730). ACM.
DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics, 44(3), 837–845.
Sun, X., & Xu, W. (2014). Fast implementation of DeLong's algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal Processing Letters, 21(11), 1389–1393.
Steyerberg, E. W., Vickers, A. J., Cook, N. R., et al. (2010). Assessing the performance of prediction models: A framework for traditional and novel measures. Epidemiology, 21(1), 128–138.
Van Calster, B., McLernon, D. J., van Smeden, M., et al. (2019). Calibration: The Achilles heel of predictive analytics. BMC Medicine, 17(1), 230.
Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2), 153–163.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Qubahan Academic Journal

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.



