Big data and LASSO improve health insurance risk prediction

06/02/2026 TranSpread

Insurers must price and underwrite policies with incomplete information, while applicants often know more about their own health risks. This information gap can contribute to adverse selection and inefficient pricing. A new study published in Risk Sciences investigates whether alternative data sources (“big data”) and modern predictor-selection methods can improve health insurance risk assessment, and which data sources are most worth collecting.

The researchers, from Peking University and the University of International Business and Economics in China, analyzed proprietary critical illness insurance application and claim information from Chinese insurance company InsurTech. In addition to standard policy and demographic variables, the dataset includes applicant-authorized smartphone-related “label” information, such as device signals, location- and app-related indicators, and credit-inquiry-related signals, as well as public medical-claim records from hospitals.

“To capture health risk, we used outcomes tied to critical illness claims as well as information derived from individuals’ prior public medical-claim history,” explains lead author Ruo Jia. “We found that adding big data and applying LASSO-style methods improves out-of-sample prediction compared with models relying only on traditional underwriting information.”
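
As a rough illustration of the out-of-sample comparison described above, the Python sketch below fits a LASSO by coordinate descent on synthetic data, once with a single "traditional" variable and once with two added signals. The release does not give the paper's model specification, so the linear outcome, the variable names, the penalty value, and the simulated data are all assumptions made for illustration, not the authors' method or results.

```python
import random

def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values within [-lam, lam] become exactly 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0

def lasso_fit(X, y, lam, n_iter=200):
    """Coordinate descent for (1/2n)*||y - Xb||^2 + lam*||b||_1 (no intercept)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - Xb, with b = 0 at the start
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of column j with the residual that excludes j's own fit
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_bj - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_bj
    return beta

def mse(X, y, beta):
    """Mean squared prediction error of the linear fit."""
    preds = [sum(b * x for b, x in zip(beta, row)) for row in X]
    return sum((yi - pi) ** 2 for yi, pi in zip(y, preds)) / len(y)

# Synthetic data (assumption): one traditional variable, one informative
# alternative-data signal, and one uninformative signal.
rng = random.Random(0)

def make_rows(n):
    rows, ys = [], []
    for _ in range(n):
        trad, extra, junk = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        rows.append([trad, extra, junk])
        ys.append(1.0 * trad + 2.0 * extra + rng.gauss(0, 0.5))
    return rows, ys

X_train, y_train = make_rows(200)
X_test, y_test = make_rows(100)

# Baseline: traditional variable only. Enriched: all three columns.
base_train = [[row[0]] for row in X_train]
base_test = [[row[0]] for row in X_test]
beta_base = lasso_fit(base_train, y_train, lam=0.1)
beta_rich = lasso_fit(X_train, y_train, lam=0.1)

mse_base = mse(base_test, y_test, beta_base)
mse_rich = mse(X_test, y_test, beta_rich)
print("held-out MSE, traditional only:", round(mse_base, 3))
print("held-out MSE, enriched:", round(mse_rich, 3))
```

Both models are scored on rows that were not used for fitting, which is what "out-of-sample" means here. In practice the penalty would usually be tuned by cross-validation rather than fixed by hand.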

Notably, big data obtained from smartphone use offer additional predictive power beyond that of past medical histories.

“Because collecting and processing underwriting data can be expensive, we also applied Adaptive Group LASSO to identify which categories of variables are most useful,” says Jia. “We determined that the most fruitful data collection sources for health insurance underwriting are personal digital devices, recent travel experience, and insureds’ credit records.”
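
The idea of selecting whole categories of variables can be sketched in Python as follows. This is one common formulation of an adaptive group LASSO (a first-stage fit supplies per-group weights, then a weighted group penalty zeroes out entire groups), solved by proximal gradient descent. The group names, the synthetic data, the penalty values, and the solver are assumptions for illustration; the release does not describe the authors' exact procedure.

```python
import math
import random

def group_soft_threshold(v, t):
    """Shrink the whole vector v; it becomes exactly zero if ||v||_2 <= t."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm <= t:
        return [0.0] * len(v)
    scale = 1.0 - t / norm
    return [scale * x for x in v]

def group_lasso_fit(X, y, groups, lam, weights=None, n_iter=500):
    """Proximal gradient for (1/2n)||y - Xb||^2 + lam * sum_g w_g * ||b_g||_2.

    `groups` maps a group name to its column indices and must cover every column.
    """
    n, p = len(X), len(X[0])
    weights = weights or {g: 1.0 for g in groups}
    # Conservative step size: trace(X'X)/n is an upper bound on the largest eigenvalue.
    step = 1.0 / (sum(x * x for row in X for x in row) / n)
    beta = [0.0] * p
    for _ in range(n_iter):
        resid = [sum(b * x for b, x in zip(beta, row)) - yi for row, yi in zip(X, y)]
        grad = [sum(resid[i] * X[i][j] for i in range(n)) / n for j in range(p)]
        z = [b - step * g for b, g in zip(beta, grad)]
        for name, idx in groups.items():
            shrunk = group_soft_threshold([z[j] for j in idx], step * lam * weights[name])
            for j, val in zip(idx, shrunk):
                beta[j] = val
    return beta

# Hypothetical variable categories, two columns each (names are illustrative).
groups = {"device": [0, 1], "travel": [2, 3], "other": [4, 5]}

# Synthetic data (assumption): only "device" and "travel" columns carry signal.
rng = random.Random(1)
X, y = [], []
for _ in range(200):
    row = [rng.gauss(0, 1) for _ in range(6)]
    X.append(row)
    y.append(1.5 * row[0] - 1.0 * row[1] + 1.0 * row[2] + rng.gauss(0, 0.5))

def group_norms(beta):
    """Euclidean norm of the coefficients in each group."""
    return {g: math.sqrt(sum(beta[j] ** 2 for j in idx)) for g, idx in groups.items()}

# Stage 1: lightly penalized fit. Stage 2: weights inversely proportional to the
# stage-1 group norms, so weak groups are penalized more heavily.
init = group_norms(group_lasso_fit(X, y, groups, lam=0.01))
weights = {g: 1.0 / max(norm, 1e-8) for g, norm in init.items()}
final = group_norms(group_lasso_fit(X, y, groups, lam=0.05, weights=weights))

selected = [g for g, norm in final.items() if norm > 0.0]
print("groups kept:", selected)
```

Because the penalty acts on a group's coefficients jointly, a category is either kept or dropped as a whole, which matches the practical question of whether a data source is worth collecting at all.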

The authors emphasize that the analysis is predictive rather than causal: “we do not offer causal interpretations.” They also discuss limitations related to the study’s coverage and context.

###

References

DOI

10.1016/j.risk.2025.100028

Original Source URL

https://doi.org/10.1016/j.risk.2025.100028

Funding Information

National Natural Science Foundation of China; National Social Science Foundation of China; Research Seed Fund of the School of Economics, Peking University

About Risk Sciences

Risk Sciences is a general-interest journal that publishes academic research and industry practices on risks and disruptive technologies across all fields including agriculture, economics, engineering, environmental science, finance, health, law, management, natural sciences, and public administration.

Paper title: Data-enriched prediction of insurance risk
Attached files
  • Variable-group selection results indicate which categories of information are most informative for predicting the study’s health-risk proxies.
Regions: North America, United States, Asia, China
Keywords: Science, Mathematics, Society, Economics/Management, Applied science, Technology
