SKKU develops Bayesian inference for hidden dependence structures in multi-group high-dimensional data
en-GBde-DEes-ESfr-FR

SKKU develops Bayesian inference for hidden dependence structures in multi-group high-dimensional data


The research team of Professor Kyoungjae Lee of the Department of Statistics at Sungkyunkwan University, through joint research with Professor Won Chang of Seoul National University and Professor Xuan Cao of the University of Cincinnati, developed Bayesian inference for the hidden dependence structures of multi-group high-dimensional data.

A Dependence Map in High-Dimensional Data

In today’s scientific and industrial fields, high-dimensional data in which numerous variables are observed simultaneously, such as genomic, climate, financial, and sensor data, are rapidly increasing. In such data, an important problem is to learn the dependent structures connecting the variables and to identify a “dependence map” that reveals hidden information in massive datasets. For example, in climate data, temperatures in nearby regions may be related to one another, and in genomic data, genes located in adjacent positions may act together. If such dependence can be incorporated into inference, more efficient inference is possible than analyzing each variable separately.

Development of the j-LANCE Method for Joint Inference of Dependence Across Multiple Groups

The j-LANCE (joint LocAl depeNdence CholEsky) method proposed in this study focuses on the fact that, in real data such as genomic and climate data, variables have a natural ordering and are mainly related to nearby neighboring variables. Based on this idea, the method estimates the extent to which each variable is connected to neighboring variables and is designed to learn similar structures across multiple groups while allowing group-specific differences. In many existing methods, data from multiple groups are either analyzed separately or simplified by assuming that all groups have the same structure. In contrast, this study uses a Markov random field prior so that similarities and differences across groups can be flexibly learned from the data.

Simultaneously Achieving Theoretical Accuracy and Fast Computation

An important achievement of this study is that it simultaneously attains theoretical accuracy and computational efficiency even in high-dimensional settings. This study theoretically proved that j-LANCE can accurately estimate the dependence structures of multiple groups, and also showed that the rate at which the estimates approach the true values is nearly minimax-optimal. In addition, the methodology was designed to enable Bayesian inference without using MCMC, a complex iterative computation procedure, thereby securing the advantage that fast analysis is possible even for high-dimensional data.

Practical Applicability Confirmed Through Climate Data Analysis

In this study, ERA5 data were used to analyze temperatures at 30 locations in the Pacific Northwest region of the United States from 2019 to 2021, and the dependence structure of temperatures across regions was estimated based on a spatial ordering that reflects wind flow. As a result, j-LANCE was found to capture similar dependence patterns across years while also detecting distinctive dependence structures that appeared in a specific year. This confirmed the practical applicability of j-LANCE to real data, and the method is expected to be applicable in a wide range of fields that require simultaneous analysis of complex data from multiple groups, including climate, genomics, finance, and sensor time series.

*This research achievement was published in Bayesian Analysis, an international journal in the field of statistics.

Attached files
  • Temperature heatmap of ERA5 climate data
  • Cholesky factors estimated from ERA5 climate data. Each column corresponds, from right to left, to 2019, 2020, and 2021, and each row shows, from top to bottom, the results of j-LANCE, year-specific independent LANCE, the penalized likelihood-based inference method, and the group graphical lasso method
Regions: Asia, South Korea, North America, United States
Keywords: Science, Mathematics

Disclaimer: AlphaGalileo is not responsible for the accuracy of content posted to AlphaGalileo by contributing institutions or for the use of any information through the AlphaGalileo system.

Testimonials

For well over a decade, in my capacity as a researcher, broadcaster, and producer, I have relied heavily on Alphagalileo.
All of my work trips have been planned around stories that I've found on this site.
The under embargo section allows us to plan ahead and the news releases enable us to find key experts.
Going through the tailored daily updates is the best way to start the day. It's such a critical service for me and many of my colleagues.
Koula Bouloukos, Senior manager, Editorial & Production Underknown
We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.
Peter Dunn, Director of Press and Media Relations at the University of Warwick
AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.
Ben Deighton, SciDevNet

We Work Closely With...


  • The Research Council of Norway
  • SciDevNet
  • Swiss National Science Foundation
  • iesResearch
Copyright 2026 by AlphaGalileo Terms Of Use Privacy Statement