Facilitating single-cell chromatin accessibility research with a user-friendly database
en-GBde-DEes-ESfr-FR

Facilitating single-cell chromatin accessibility research with a user-friendly database

23/01/2026 Frontiers Journals

Single-cell analyses have emerged as powerful tools for studying cellular heterogeneity and gene regulation. Single-cell chromatin accessibility sequencing (scCAS) is a key technology that enables the analysis of chromatin accessibility at the resolution of individual cells. However, there are three main challenges in the use of scCAS data: (1) Publicly available data in public research generated from diverse species, tissues, and experimental conditions are not systematically collected; (2) scCAS data with cell type, tissue, and other labels can be used to train machine learning methods for single-cell tasks such as cell type annotations, but such critically important annotated datasets have not been systematically collected; (3) The diversity of data formats across studies complicates efforts toward format standardization.
To solve these problems, a research team led by Shengquan Chen published their new research on 15 November 2025 in Frontiers of Computer Science co-published by Higher Education Press and Springer Nature.
The team developed scCASdb, a user-friendly and well-annotated scCAS database that standardized datasets in the h5ad format. By systematically collecting 80 well-annotated datasets from diverse species, tissues, and experimental conditions, scCASdb enables diverse single-cell analyses that were previously hindered by the lack of comprehensive collections. Moreover, the adoption of the h5ad format ensures efficient data accessibility and compatibility with both Python-based tools like Scanpy and machine learning models.
All data stored in the database are saved in h5ad formats, which efficiently manage large-scale single-cell data and can be seamlessly utilized with Python-based machine learning methods, enabling researchers to develop computational tools for single-cell analysis.
Each dataset in scCASdb contains three key components: (1) a cell-by-peak matrix, which records chromatin accessibility information for each single cell, providing a precise description of chromatin accessibility across different genomic regions; (2) cell type labels for cells in cell-by-peak matrix when available, which help researchers identify and classify cell populations, supporting the analysis of cellular heterogeneity; (3) metadata, such as species, genome, organs, diseases, sequencing technologies, and batch labels, which greatly facilitate researchers in diverse single-cell tasks.
Future work can focus on increasing the number of datasets and incorporating additional features to facilitate user access to the data.
DOI
10.1007/s11704-025-41390-5
23/01/2026 Frontiers Journals
Regions: Asia, China
Keywords: Applied science, Computing

Disclaimer: AlphaGalileo is not responsible for the accuracy of content posted to AlphaGalileo by contributing institutions or for the use of any information through the AlphaGalileo system.

Testimonios

We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.
Peter Dunn, Director of Press and Media Relations at the University of Warwick
AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.
Ben Deighton, SciDevNet
AlphaGalileo is a great source of global research news. I use it regularly.
Robert Lee Hotz, LA Times

Trabajamos en estrecha colaboración con...


  • e
  • The Research Council of Norway
  • SciDevNet
  • Swiss National Science Foundation
  • iesResearch
Copyright 2026 by DNN Corp Terms Of Use Privacy Statement