Researchers Improve AI’s Ability to Learn New Tasks Without Sacrificing Performance


A new framework allows AI models that have already been trained to learn new tasks without sacrificing performance when performing old tasks. The framework, called CHEEM, also improves an AI model’s operating efficiency by using fewer computational steps to perform simpler tasks.

“CHEEM addresses two longstanding challenges for AI models: continual learning and adaptive intelligence,” says Tianfu Wu, corresponding author of a paper on the work and an associate professor of computer engineering at North Carolina State University.

Continual learning refers to the ability of an AI model to take in new data and learn to perform new tasks. The challenge with continual learning is that training an AI model to perform new tasks often results in the model getting worse at tasks it was already trained to perform.

Adaptive intelligence refers to the ability of an AI model to change its computation process depending on the complexity of the task it is asked to perform. For example, many prominent AI models – including large language models – run the same chain of computations regardless of what they are being asked to do, which is inefficient. The challenge here is training an AI model so that it uses fewer computations to solve simple tasks and more computations to solve complex ones.

“We think these two challenges are intertwined, and that we can make progress toward adaptive intelligence by improving a model’s ability to engage in continual learning,” says Wu. “This is the fundamental idea behind CHEEM.”

CHEEM, which stands for Continual Hierarchical-Exploration-Exploitation Memory, gives models a great deal of flexibility in terms of how to use their existing computational architecture when learning a new task. A model can use an existing layer, modify an existing layer, skip an existing layer entirely, or add new layers. Ultimately, this flexibility helps a model find a good balance between leveraging its existing knowledge, integrating new data, and allocating computational resources depending on the complexity of the task it is being asked to perform.
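The four per-layer options described above (use, modify, skip, add) can be illustrated with a toy sketch. This is not the researchers' implementation: the `Backbone` class, its `add_task` and `run` methods, and the use of simple numeric functions as "layers" are all hypothetical, and the search procedure that would actually pick an option for each layer is not modelled here. The sketch also assumes that a "new" layer takes the place of an existing one on the new task's path.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

Layer = Callable[[float], float]

# The four per-layer choices named in the text (labels are illustrative).
REUSE, ADAPT, SKIP, NEW = "reuse", "adapt", "skip", "new"


@dataclass
class Backbone:
    """Toy stand-in for a trained model: an ordered list of frozen layers."""
    layers: List[Layer]
    # Maps a task name to the list of layers that task actually runs.
    task_paths: Dict[str, List[Layer]] = field(default_factory=dict)

    def add_task(
        self,
        task: str,
        choices: List[str],
        adapters: Optional[Dict[int, Layer]] = None,
        new_layers: Optional[Dict[int, Layer]] = None,
    ) -> None:
        """Build a task-specific path without mutating any existing layer."""
        if len(choices) != len(self.layers):
            raise ValueError("need exactly one choice per backbone layer")
        adapters = adapters or {}
        new_layers = new_layers or {}
        path: List[Layer] = []
        for i, (base, choice) in enumerate(zip(self.layers, choices)):
            if choice == REUSE:
                path.append(base)  # share the existing layer as-is
            elif choice == ADAPT:
                adapter = adapters[i]
                # Keep the frozen layer and apply a small task-specific change.
                path.append(lambda x, b=base, a=adapter: a(b(x)))
            elif choice == NEW:
                path.append(new_layers[i])  # fresh task-specific layer
            elif choice == SKIP:
                continue  # this task spends no computation here
            else:
                raise ValueError(f"unknown choice: {choice!r}")
        self.task_paths[task] = path

    def run(self, task: str, x: float) -> float:
        for layer in self.task_paths[task]:
            x = layer(x)
        return x


# Hypothetical usage: three frozen layers shared by three tasks.
model = Backbone([lambda x: x + 1, lambda x: x * 2, lambda x: x - 3])
model.add_task("old", [REUSE, REUSE, REUSE])
model.add_task("simple", [REUSE, SKIP, SKIP])
model.add_task(
    "different",
    [REUSE, ADAPT, NEW],
    adapters={1: lambda x: x + 10},
    new_layers={2: lambda x: x * x},
)
```

In this toy setup the "simple" task runs one layer instead of three, and adding the "different" task leaves the output of the "old" task unchanged, because the shared layers are never modified.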

To test the CHEEM framework, the researchers applied it to a state-of-the-art vision transformer model – a large, complex model that is already in widespread use. Specifically, they used CHEEM to train the vision transformer model on two benchmark datasets: MTIL and VDD.

“Both benchmarks are challenging, because they contain many different tasks and many different kinds of tasks,” says Wu. “That makes them good test cases.”

CHEEM significantly outperformed existing state-of-the-art continual learning methods on both benchmarks.

“CHEEM got very close to achieving the full fine-tuning upper bound for these new tasks, meaning that it was almost as good as if you had trained the model to only perform that one task,” says Wu.

“In addition, CHEEM improved the adaptive intelligence of the model significantly. The model tailored its computational structure depending on the complexity of the task, and it did so in a semantically meaningful way. In other words, if a new task was similar to a previous task, the model would use much of the pre-existing architecture; but if a new task was very different from any previous task, it would add new layers that allow it to perform the task.

“We’re excited about what we’ve been able to demonstrate with CHEEM,” says Wu. “At this point, we’re looking for collaborators who could help us access the computational resources necessary to evaluate CHEEM’s performance on large foundation models that have billions of parameters.”

The peer-reviewed paper, “CHEEM: Continual Learning by Reuse, New, Adapt and Skip – A Hierarchical Exploration-Exploitation Approach,” will be presented at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), being held June 3-7 in Denver, Colo. First author of the paper is Chinmay Savadikar, a Ph.D. student at NC State.

This work was done with support from the Army Research Office, under grants W911NF1810295 and W911NF2210010; the National Science Foundation, under grants 1909644, 2024688 and 2013451; and an NC State Goodnight Early Career Award.

“CHEEM: Continual Learning by Reuse, New, Adapt and Skip – A Hierarchical Exploration-Exploitation Approach”

Authors: Chinmay Savadikar and Tianfu Wu, North Carolina State University; Michelle Dai, Johns Hopkins University

Presented: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR), June 3-7, Denver, Colo.

DOI: 10.48550/arXiv.2303.08250