Information on Earth's biodiversity is increasingly collected using DNA-, image- and audio-based sampling. At the same time, new statistical analysis methods are being developed to make more out of the collected data, providing detailed information on the Earth's biodiversity and its status. Now, an international research group has developed a new statistical approach for jointly modeling the distributions of up to millions of species. The new approach improves predictions especially on rare species that have remained difficult to model with earlier approaches.
Earth is home to several millions of species, and only a fraction of nature’s diversity is currently known. One reason for this is that many rare species are difficult to detect, which means that we may not know even about their existence, or at least that there is a lack of information available to evaluate their endangerment status. This can lead to inaccurate and incomplete assessments of their abundance and the reasons for their endangerment.
- It is important to gather information on the distribution and habitat requirements of rare species to understand the current state and changes in biodiversity. This requires models that can for example accurately forecast how climate factors or land use changes will affect species of which we have only few observations, says Academy Professor Otso Ovaskainen from the University of Jyväskylä, Finland.
Statistical modeling advances will improve predictions on biodiversity change
Researchers at the University of Jyväskylä, in collaboration with international research groups, have now developed a new statistical method that enables the analysis of data sets comprising millions of species.
- We have developed the CORAL method, which enables comprehensive modeling and prediction of biodiversity. We illustrated that CORAL leads to much improved prediction and inference in the context of DNA metabarcoding data from Madagascar, comprising 255,188 arthropod species detected in 2874 samples, clarifies Professor David Dunson from Duke University.
By borrowing information from a backbone model of common species, CORAL makes it possible to model even the rarest species in a statistically effective manner by combining an informative prior model with the limited data available for each rare species.
- In our sample analysis of Madagascar, we were able to understand the climatic and evolutionary factors affecting the occurrence of species and seasonal variation. This allows us to produce more accurate information on biodiversity, particularly for rare species, for both decision-makers and researchers, says Professor Brian Fisher from the California Academy of Sciences who led the data sampling in Madagascar.
Towards more effective nature conservation
Academy Professor Otso Ovaskainen at the University of Jyväskylä is a world-class expert in mathematical and statistical modelling. He has developed new methods for empirical data collection and statistical analysis in ecological research. The methods have been used all over the world. Currently, he is leading an internationally unique study in which the diversity of nature is mapped at the same time in over 450 locations all over the world. The research received 12 million euros of ERC funding. The research project will end in March 2026.
- During the research project, we have collected a hundred years of audio samples, millions of camera trap photos, and billions of DNA sequences. We can use these data to estimate the number of still unknown species and their geographical distribution. We will continue to study the material and will develop even more accurate models also in the coming years, even though this research project is coming to an end, says Ovaskainen.
Further information: