Printer friendly version
The Norway spruce genome sequenced
21 May 2013
Swedish scientists have mapped the gene sequence of Norway spruce (the Christmas tree) – a species with huge economic and ecological importance - and that is the largest genome to have ever been mapped. The genome is complex and seven times larger than that of humans. The results have been published in the prestigious journal Nature.
This major research project has been led by Umeå Plant Science Centre (UPSC) in Umeå and the Science for Life Laboratory (SciLifeLab) in Stockholm.
In addition to its scientific interest this new knowledge has immense importance to the forestry industry in many countries.
“Forest tree breeding is now entering a new era, and Sweden has the potential to be in the forefront of development,” says Professor Ove Nilsson from UPSC. “Newer and more effective methods can begin to be used to ensure that the over 200 million tree seedlings planted each year in Sweden are as strong, healthy and well-adapted as possible for both poor and rich soil areas in different parts of the country.”
The scientists have identified about 29,000 functional genes, marginally more than humans have, but the question arises: why is the spruce genome still seven times larger than ours? According to the study an explanation is “genome obesity” caused by extensive repetitive DNA sequences, which have accumulated for several hundred million years of evolutionary history. Other plant and animal species have efficient mechanisms to eliminate such repetitive DNA, but these do not seem to operate so well in conifers.
“It is remarkable that the spruce is doing so well despite this unnecessary genetic load,” says Professor Pär Ingvarsson at UPSC. “Of course, some of this DNA has a function but it seems strange that it would be beneficial to have so very much. This appears to be something special for conifers.”
The greatest challenge in the project has been to get the approximately 20 billion "letters" found in spruce's genetic code into the correct order, rather than obtaining the actual DNA sequences.
“Imagine a library with ten thousand books as thick as the bible, written in a language with only four letters,” explains Professor Stefan Jansson at UPSC. “If someone took one hundred identical copies of each of the ten thousand titles, passed them all through a document shredder and mixed all the shreds, and you then were asked to piece together an accurate copy of each title, you can realize that it can be a bit problematic.”
“We had to customise computers and rewrite many of the computer programmes used in similar studies in order to handle the large amount of DNA sequences,” says Professor Joakim Lundeberg from SciLifeLab. The national data storage system was stretched to the limit, and there were many other practical problems that had to be solved along the way to pull through the project.
“But the timing was optimal; when the new DNA sequencing machines were unpacked at our newly established laboratory, DNA arrived from our model spruce tree. By sequencing and analysing the largest genome in the world so far, we have shown that SciLifeLab has both technical and scientific capacity for research at the highest international level,” concludes Joakim Lundeberg.
About the project:
The project was funded by the Knut and Alice Wallenberg Foundation with the Swedish University of Agricultural Sciences as the main applicant. Scientists from UPSC (Umeå University and Swedish University of Agricultural Sciences) and Science for Life Laboratory (Royal Institute of Technology, Stockholm University and Karolinska Institutet) have participated in the project. Several international leading scientists have also been involved as collaborative partners. The entire project has had a budget of 75 million SEK , in comparison with mapping the human genome which was a significantly higher; 20 billion SEK .
From left: Ove Nilsson, Pär Ingvarsson and Stefan Jansson
Björn Nystedt and Joakim Lundberg
The world's largest genome project – spruce genome sequenced