Preprint / Version 1

Machine learning early warning for financial distress in health plan operators

##article.authors##

  • Guilherme Coelho State University of Campinas (UNICAMP) image/svg+xml https://orcid.org/0000-0003-3045-4057
    • Conceptualization
    • Data Curation
    • Formal Analysis
    • Investigation
    • Methodology
    • Resources
    • Software
    • Validation
    • Visualization
    • Writing – Original Draft Preparation
    • Writing – Review & Editing
  • Clarimar José Coelho Pontifícia Universidade Católica de Goiás image/svg+xml https://orcid.org/0000-0002-5163-2986
    • Conceptualization
    • Data Curation
    • Formal Analysis
    • Writing – Original Draft Preparation
    • Writing – Review & Editing
    • Validation
    • Visualization
    • Supervision
    • Project Administration
    • Methodology

DOI:

https://doi.org/10.1590/SciELOPreprints.15181

Keywords:

Early Warning System, Financial Distress, Health Insurance Regulation, Machine Learning, Supplementary Health

Abstract

This study develops the first machine learning–based early warning system for financial distress among Brazilian health plan operators. Using 24,440 operator-quarter observations from public regulatory data (2018–2025), we train and temporally validate LASSO logistic regression, random forest, and XGBoost to predict distress two to four quarters ahead. Random forest achieved the highest discrimination (AUC = 0.847), but LASSO exhibited the smallest generalization gap (0.014), revealing a tension between accuracy and deployment reliability that carries direct implications for regulatory design. The extended combined ratio was the only consensus predictor across all methods, and temporal features added predictive value beyond static ratio levels. A retrospective case study on Brazil's largest operator demonstrated early detection capacity but also revealed model habituation under prolonged stress. The study extends early warning theory from banking to health insurance and demonstrates that open regulatory data can support proactive, transparent surveillance where operator failure directly affects healthcare access.

Downloads

Download data is not yet available.

Posted

03/04/2026

How to Cite

Machine learning early warning for financial distress in health plan operators. (2026). In SciELO Preprints. https://doi.org/10.1590/SciELOPreprints.15181

Section

Applied Social Sciences

Plaudit

Data statement