Clustering-Based Approach Accelerates AI Learning in Robotics and Gaming


01/07/2025 Frontiers Journals

Teaching AI to explore its surroundings is a bit like teaching a robot to find treasure in a vast maze—it needs to try different paths, but some lead nowhere. In many real-world challenges, like training robots or playing complex games, rewards are few and far between, making it easy for AI to waste time on dead ends.
To address this challenge, researchers at Nanjing University and UC Berkeley devised a new way to teach AI: Clustered Reinforcement Learning (CRL). Instead of wandering around aimlessly or only chasing big scores, the method sorts similar situations into “clusters” and rewards the AI both for trying new things and for building on past successes.
“By grouping experiences and balancing curiosity with proven success, we’ve given AI a more human-like way to learn,” says Prof. Wu-Jun Li, the project’s lead researcher.
The Two-Step Magic: Clustering Experiences and Rewarding Wins
So how does CRL work? Instead of treating every state as unique and unconnected, CRL groups similar states into clusters using a technique called K-means. For each cluster it then measures two things: how often the cluster has been visited (novelty, since rarely visited clusters are the more novel ones) and how good the average outcome there has been (quality). CRL assigns bonus rewards based on these two factors, encouraging the agent to explore areas that are not only new but also likely to yield good results. This contrasts with traditional methods that chase only novelty, which often leads the agent into unproductive areas.
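The two steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the farthest-point initialisation, the weights `beta` and `eta`, and the bonus formula itself are all hypothetical choices made for this example, and the published method may combine visit counts and average rewards differently.

```python
import math


def kmeans(points, k, iters=20):
    """Plain Lloyd-style K-means on a list of equal-length tuples.

    Initialisation is a deterministic farthest-point heuristic, chosen here
    only so the example is reproducible (an assumption, not from the paper).
    """
    centroids = [points[0]]
    while len(centroids) < k:
        # Pick the point farthest from all centroids chosen so far.
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each state goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centroids[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return centroids, labels


def cluster_bonuses(labels, rewards, k, beta=0.1, eta=0.5):
    """Per-cluster bonus mixing novelty and quality (hypothetical formula)."""
    counts = [0] * k
    totals = [0.0] * k
    for l, r in zip(labels, rewards):
        counts[l] += 1
        totals[l] += r
    bonuses = []
    for c in range(k):
        # Novelty shrinks as a cluster is visited more often.
        novelty = 1.0 / math.sqrt(counts[c]) if counts[c] else 1.0
        # Quality is the average reward observed in the cluster.
        quality = totals[c] / counts[c] if counts[c] else 0.0
        bonuses.append(beta * (eta * novelty + (1.0 - eta) * quality))
    return bonuses


# Toy data: four visits to an unrewarding region, two to a rewarding one.
states = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0), (5.1, 5.0)]
rewards = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0]
centroids, labels = kmeans(states, k=2)
bonuses = cluster_bonuses(labels, rewards, k=2)
```

In this toy run the less-visited, better-paying cluster receives the larger bonus, which is the behaviour the article describes: exploration is steered toward regions that are both relatively new and promising, rather than toward novelty alone.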
Results and Impact: Fast Learning, Real-World Utility
By blending curiosity with outcome-based guidance, CRL allows AI to learn faster and with fewer mistakes. It achieved top performance across multiple standard benchmarks, including robotic control tasks and difficult Atari games, outperforming several state-of-the-art methods. What’s more, CRL can be easily added to existing AI systems as a modular enhancement. This makes it especially promising for high-stakes domains like autonomous driving, energy optimization, and intelligent scheduling—where safe, sample-efficient learning is essential.

By combining simple clustering with light reward tweaks, CRL opens the door to safer, faster, and more reliable AI training. As intelligent machines move into our everyday lives—from warehouse robots to city-street navigation—methods like this will help them learn quickly, avoid costly mistakes, and need less human babysitting. The complete study is accessible via DOI: 10.1007/s11704-024-3194-1.

Regions: Asia, China
Keywords: Applied science, Computing

