A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
en-GBde-DEes-ESfr-FR

A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias

23/01/2026 Frontiers Journals

Multilingual Large Language Models (MLLMs) have achieved remarkable success in advancing multilingual natural language processing, enabling effective knowledge transfer from high-resource to low-resource languages. Despite their achievements, MLLMs still face numerous issues and challenges, which can be categorized into three main aspects: corpora, alignment, and bias.

To address these challenges, a research team led by Professor Xu Yue-Mei from Beijing Foreign Studies University published a survey titled "A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias" on 15 November 2025 in Frontiers of Computer Science co-published by Higher Education Press and Springer Nature.
The study reviews the challenges faced by MLLMs through three key aspects: corpora, alignment, and bias. It starts by presenting an overview of MLLMs, their evolution, techniques, and multilingual capabilities. Then, it explores the role of multilingual corpora and downstream datasets in enhancing model performance. The paper also examines the difficulty MLLMs face in learning universal language representations and reviews current approaches to multilingual alignment. Finally, it discusses the bias present in MLLMs, its categories, evaluation metrics, and debiasing techniques to address harmful outcomes.

Through an in-depth analysis of these dimensions, the researchers aim to shed light on practical strategies for optimizing MLLMs, offering valuable insights for the future development of fairer and more robust multilingual models.
DOI:10.1007/s11704-024-40579-4

Attached files
  • An illustration of the relationship between corpora, misalignment, and bias.
23/01/2026 Frontiers Journals
Regions: Asia, China
Keywords: Applied science, Computing

Disclaimer: AlphaGalileo is not responsible for the accuracy of content posted to AlphaGalileo by contributing institutions or for the use of any information through the AlphaGalileo system.

Testimonials

For well over a decade, in my capacity as a researcher, broadcaster, and producer, I have relied heavily on Alphagalileo.
All of my work trips have been planned around stories that I've found on this site.
The under embargo section allows us to plan ahead and the news releases enable us to find key experts.
Going through the tailored daily updates is the best way to start the day. It's such a critical service for me and many of my colleagues.
Koula Bouloukos, Senior manager, Editorial & Production Underknown
We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.
Peter Dunn, Director of Press and Media Relations at the University of Warwick
AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.
Ben Deighton, SciDevNet

We Work Closely With...


  • e
  • The Research Council of Norway
  • SciDevNet
  • Swiss National Science Foundation
  • iesResearch
Copyright 2026 by AlphaGalileo Terms Of Use Privacy Statement