Hybrid Methods Improve Key Variable Prediction in Process Industry Using Small Noisy Datasets

20/03/2026 Frontiers Journals

Soft measurement based on data-driven models is widely used to predict key variables in the process industry because it is inexpensive and works in real time. However, these models struggle when the dataset is noisy and contains few samples. A study published in Frontiers of Chemical Science and Engineering presents data-mechanism hybrid driven methods to address this challenge.
The research team proposed four hybrid approaches integrating mechanism models with three common data-driven models: random forest, extreme gradient boosting, and artificial neural network. In these methods, mechanism calculations provide constraints that guide data-driven model training and prediction.
The first method uses mechanism model outputs as the inputs to the data-driven model. The second concatenates the original data with the mechanism calculations to form an enhanced set of input features. The third incorporates mechanism constraints into the loss function during neural network training. The fourth combines the first three.
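As a rough illustration of the second and third methods, the sketch below concatenates raw inputs with a mechanism estimate and adds a mechanism penalty to a squared-error loss. The release gives no formulas, so `mechanism_model`, `augment_features`, `hybrid_loss`, the toy mean-based mechanism, and the penalty weight are all hypothetical stand-ins, not the authors' implementation.

```python
from typing import Callable, List, Sequence


def mechanism_model(x: Sequence[float]) -> List[float]:
    """Hypothetical stand-in for a first-principles calculation.

    Toy assumption: the key variable is approximated by the mean of the inputs.
    """
    return [sum(x) / len(x)]


def augment_features(
    x: Sequence[float],
    mechanism: Callable[[Sequence[float]], List[float]],
) -> List[float]:
    """Second-method style: append mechanism outputs to the raw inputs."""
    return list(x) + list(mechanism(x))


def hybrid_loss(
    y_pred: Sequence[float],
    y_true: Sequence[float],
    y_mech: Sequence[float],
    weight: float = 0.1,
) -> float:
    """Third-method style (assumed form): data MSE plus a weighted penalty
    for deviating from the mechanism estimate."""
    n = len(y_true)
    data_term = sum((p - t) ** 2 for p, t in zip(y_pred, y_true)) / n
    mech_term = sum((p - m) ** 2 for p, m in zip(y_pred, y_mech)) / n
    return data_term + weight * mech_term
```

Under these assumptions, `augment_features` would feed a random forest or gradient boosting model, while `hybrid_loss` only applies to a model trained by minimising a loss, such as a neural network.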
The methods were validated on two industrial cases. The benzene-toluene-xylene distillation case used 14 input features to predict three product compositions. The steam methane reforming case used seven input variables to predict seven outputs, with three-step reaction kinetics. Datasets were generated by simulation, with Gaussian noise added at controlled levels.
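One way to add Gaussian noise at a controlled level to simulated data is sketched below. The release does not define how the noise percentage is measured; this sketch assumes the noise standard deviation is a fixed fraction of each value's magnitude, and the function name `add_gaussian_noise` is hypothetical.

```python
import random
from typing import List, Sequence


def add_gaussian_noise(
    values: Sequence[float], noise_level: float, seed: int = 0
) -> List[float]:
    """Return a noisy copy of `values`.

    Assumption: noise_level=0.1 means a standard deviation equal to
    10 percent of each value's magnitude. A fixed seed keeps runs repeatable.
    """
    rng = random.Random(seed)
    return [v + rng.gauss(0.0, noise_level * abs(v)) for v in values]
```

For example, `add_gaussian_noise(clean_outputs, 0.2)` would correspond to the 20 percent noise setting under this assumed definition, where `clean_outputs` is whatever list of simulated values is being corrupted.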
Results showed that hybrid methods consistently improved prediction accuracy, with the size of the improvement depending on noise intensity, sample size, and model choice. Under noise levels of 10 to 20 percent and sample sizes of 100 to 400, improvements in the coefficient of determination reached 5.2 percent for random forest, 17.7 percent for extreme gradient boosting, and 36.2 percent for artificial neural network.
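For readers unfamiliar with the metric, the coefficient of determination compares a model's squared error with that of always predicting the mean. The sketch below uses the standard definition; the function name `r_squared` is illustrative, and the release does not say whether its percentages are relative or absolute changes in this value.

```python
from typing import Sequence


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot
```

A perfect prediction scores 1, and a model that always predicts the mean of the targets scores 0.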
In the distillation case, hybrid methods with extreme gradient boosting and artificial neural network achieved maximum improvements of 6.7 percent under lower noise and 7.7 percent under 20 percent noise. In the reforming case, hybrid methods with extreme gradient boosting improved by 0.3 to 2.5 percent, while method (d), presumably the fourth, combined method, enhanced artificial neural network performance by 0.003 to 0.159.
A double hybrid method incorporating the law of conservation of mass further improved artificial neural network predictions by 0.005 to 0.177 and showed better stability in high-value regions.
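A mass conservation constraint can be expressed as an extra penalty on the training loss. The sketch below is only an assumed form, since the release does not describe the authors' formulation: it supposes the predicted outputs are outlet mass flows that should sum to a known inlet total, and `double_hybrid_loss`, `inlet_total`, and `weight` are hypothetical names.

```python
from typing import Sequence


def double_hybrid_loss(
    y_pred: Sequence[float],
    y_true: Sequence[float],
    inlet_total: float,
    weight: float = 1.0,
) -> float:
    """Data MSE plus a weighted squared mass-balance violation.

    Assumption: y_pred holds outlet mass flows whose sum should equal
    inlet_total when mass is conserved.
    """
    n = len(y_true)
    mse = sum((p - t) ** 2 for p, t in zip(y_pred, y_true)) / n
    imbalance = inlet_total - sum(y_pred)
    return mse + weight * imbalance ** 2
```

Predictions that fit the data but violate the balance are penalised, which nudges the network toward physically consistent outputs.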
This research demonstrates that data-mechanism hybrid driven methods offer superior predictive performance for key variables in the process industry, particularly with the small, noisy datasets common in real-world applications.
DOI: 10.1007/s11705-026-2632-z

Attached files
  • IMAGE: Methodology for regressing and predicting key variables of chemical processes based on different sample sizes and noise intensity data sets using data-mechanism hybrid driven methods.
Regions: Asia, China
Keywords: Science, Chemistry
