Hidden Markov Models: Theory, Algorithms, and Applications in Bioinformatics
en-GBde-DEes-ESfr-FR

Hidden Markov Models: Theory, Algorithms, and Applications in Bioinformatics

24/10/2025 Compuscript Ltd

The hidden Markov model (HMM), a statistical model widely applied in machine learning, has proven effective in addressing various problems in bioinformatics. Once primarily regarded as a mathematical framework for modeling stochastic processes, HMMs have become indispensable tools for solving a wide range of biological sequence problems, from gene prediction to protein structure analysis.

In a recent review published in Genes & Diseases, researchers from Harbin Medical University systematically introduce the theoretical foundations of HMMs, including the three canonical problems—evaluation, decoding, and learning, along with the algorithms most commonly used to address them, such as the Viterbi and Baum-Welch algorithms.

This review emphasizes the wide-ranging applications of HMMs in bioinformatics, with a focus on five major domains: (i) Transmembrane protein prediction – Tools like HMMTOP apply HMM-based approaches to resolve protein topology, providing critical insights for drug discovery and structural biology. (ii) Gene finding – Programs such as GENSCAN and AUGUSTUS utilize generalized HMMs to predict exon–intron boundaries, facilitating accurate genome annotation across species. (iii) Multiple sequence alignment – Profile HMMs underpin widely used resources such as Pfam and HMMER, enabling homology detection, protein family classification, and functional annotation. (iv) CpG island prediction – HMMs offer statistically grounded methods to identify CpG-rich regions involved in epigenetic regulation and disease. (v) Copy number variation (CNV) detection – Algorithms including PennCNV and QuantiSNP rely on HMMs to detect CNVs with high sensitivity, providing insights into genetic diversity and disease susceptibility.

Furthermore, the review critically discusses the strengths and limitations of HMMs, noting their versatility, statistical rigor, and interpretability, while acknowledging challenges such as computational demands and assumptions of linearity. The authors highlight that integrating HMMs with next-generation sequencing, multi-omics, and advanced machine learning approaches will be essential for extending their relevance in modern computational biology.

By consolidating both theoretical insights and practical applications, this review positions HMMs as a cornerstone of bioinformatics research. As biological data continue to expand in scope and complexity, HMMs are expected to remain central to advancing genome annotation, functional genomics, and precision medicine.

Reference

Title of Original Paper: The hidden Markov model and its applications in bioinformatics analysis

Journal: Genes & Diseases

Genes & Diseases is a journal for molecular and translational medicine. The journal primarily focuses on publishing investigations on the molecular bases and experimental therapeutics of human diseases. Publication formats include full length research article, review article, short communication, correspondence, perspectives, commentary, views on news, and research watch.

DOI: https://doi.org/10.1016/j.gendis.2025.101729

Funding Information:
  • National Natural Science Foundation of China (No. 31970651, 92046018)
  • Mathematical Tianyuan Fund of the National Natural Science Foundation of China (No. 12026414)
# # # # # #

Genes & Diseases publishes rigorously peer-reviewed and high quality original articles and authoritative reviews that focus on the molecular bases of human diseases. Emphasis is placed on hypothesis-driven, mechanistic studies relevant to pathogenesis and/or experimental therapeutics of human diseases. The journal has worldwide authorship, and a broad scope in basic and translational biomedical research of molecular biology, molecular genetics, and cell biology, including but not limited to cell proliferation and apoptosis, signal transduction, stem cell biology, developmental biology, gene regulation and epigenetics, cancer biology, immunity and infection, neuroscience, disease-specific animal models, gene and cell-based therapies, and regenerative medicine.

Scopus Cite Score: 8.4 | Impact Factor: 9.4

# # # # # #

More information: https://www.keaipublishing.com/en/journals/genes-and-diseases/
Editorial Board: https://www.keaipublishing.com/en/journals/genes-and-diseases/editorial-board/
All issues and articles in press are available online in ScienceDirect (https://www.sciencedirect.com/journal/genes-and-diseases).
Submissions to Genes & Disease may be made using Editorial Manager (https://www.editorialmanager.com/gendis/default.aspx ).
Print ISSN: 2352-4820
eISSN: 2352-3042
CN: 50-1221/R
Contact Us: editor@genesndiseases.com
X (formerly Twitter): @GenesNDiseases (https://x.com/GenesNDiseases)
Attached files
  • (A) Schematic diagram of eukaryotic gene structure. (B) The overall process of using the HMM to predict eukaryotic genes. (C) HMM architecture for predicting eukaryotic genes. (D) The submodel of the coding region.
  • The amino acid sequence of the transmembrane protein and its corresponding positions on the cell membrane are transformed into a hidden Markov process. After evaluating the parameters, the Viterbi algorithm is used to identify the optimal state sequence.
  • (A) The learning problem and the Baum-Welch algorithm are used in constructing the hidden Markov model (HMM) of protein family A. After discovering the statistical characteristics of protein family A through multiple sequence alignment, the parameters were obtained with the help of the Baum-Welch algorithm to construct the HMM of protein family A. (B) The evaluation problem and forward algorithm are used in the identification of protein family A. A set of sequences is input into the HMM of protein family A, and each sequence is scored to determine whether it belongs to protein family A.
24/10/2025 Compuscript Ltd
Regions: Europe, Ireland, Iceland, Asia, China
Keywords: Science, Life Sciences

Disclaimer: AlphaGalileo is not responsible for the accuracy of content posted to AlphaGalileo by contributing institutions or for the use of any information through the AlphaGalileo system.

Testimonials

For well over a decade, in my capacity as a researcher, broadcaster, and producer, I have relied heavily on Alphagalileo.
All of my work trips have been planned around stories that I've found on this site.
The under embargo section allows us to plan ahead and the news releases enable us to find key experts.
Going through the tailored daily updates is the best way to start the day. It's such a critical service for me and many of my colleagues.
Koula Bouloukos, Senior manager, Editorial & Production Underknown
We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.
Peter Dunn, Director of Press and Media Relations at the University of Warwick
AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.
Ben Deighton, SciDevNet

We Work Closely With...


  • e
  • The Research Council of Norway
  • SciDevNet
  • Swiss National Science Foundation
  • iesResearch
Copyright 2025 by AlphaGalileo Terms Of Use Privacy Statement