A stream of surprises from the Atlantic cod genome

Posted: 7 March 2017 | University of Oslo, Faculty of Mathematics and Natural Sciences | No comments yet

Researchers at the University of Oslo (UiO) keep discovering surprises in the Atlantic cod genome…

Researchers at the University of Oslo (UiO) keep discovering surprises in the Atlantic cod genome. The most recent study has revealed an unusual amount of short and identical DNA sequences, which might give cod an evolutionary advantage.

Close to ten years ago, researchers from the Norwegian Institute for Water Research (NIVA) caught “Calvin the Cod” and hauled him out of the cold Arctic waters during an oceanographic expedition to the Barents Sea and the northern coast of Norway and the Lofoten archipelago.

From Lofoten, Calvin’s journey went to NIVA’s research station close to Norway’s capital Oslo. The story could have ended there, but Calvin’s destiny took a sudden twist when researchers from the University of Oslo found him swimming in a tank, killed him with a blow to the head, and took samples from his body home to their big freezer at the Department of Biosciences.

An ordinary cod would have been eaten after being placed in the freezer, but Calvin the Cod instead started a new career. Calvin was in fact a healthy and characteristic representative of the population of skrei, which is the Norwegian term for cod that migrate between feeding grounds in the Barents Sea and spawning areas along the Norwegian coast. Thus, Calvin was chosen for the honourable task of donating his body parts and genes to science.

In 2008, researchers at the University of Oslo initiated a unique project: They wanted to map the genome of a fish of great economic importance, namely Atlantic cod. This project has later become a huge success, and the cod genome researchers have delivered a stream of surprises, based on their studies of Calvin’s genome.

Just to name the two most important discoveries: It made a huge impact when they discovered the strange and unique immune system of cod in 2011, and in 2016 they found a sex gene that can make fish farming more profitable.

New discovery in the cod genome

PhD candidate Ole Kristian Tørresen and senior engineer Lex Nederbragt at the Institute of Biosciences and Centre for Ecological and Evolutionary Synthesis (CEES) have now conducted a more detailed analysis than has been possible ever before, and once again the cod genome has come up with a surprise.

“The new achievement this time is that we have combined data from three different techniques for DNA sequencing. Thus, we managed to map the cod genome in much more detail than what was previously possible.

At the same time, we found the reason why it has been so difficult to map the genome in detail earlier. The reason is that this genome contains an extraordinary amount of so-called short tandem repeats, meaning that short sequences of DNA base molecules are repeated many times in succession”, recounts Ole Kristian Tørresen to Titan.uio.no.

The basis for this observation is that the genomes of all organisms are written in an “alphabet” that consists of only four nucleobase molecules: adenine (A), thymine (T), guanine (G) and cytosine (C).

93 percent of the genome is mapped

“We are talking about short tandem repeats when we find for example the combination “AC” several times in succession in the DNA sequences. But also when the DNA sequence reads for example only “A” several times in succession, or perhaps “CGA”, it is classified as a tandem repeat.

The bottom line is that it has been very difficult to understand what the contiguous DNA sequences really look like, when these tandem repeats appear to make up an enormous amount of exactly similar pieces in an enormous jigsaw puzzle”, explains Lex Nederbragt.

The Atlantic cod genome consists of approximately 700 million pairs of DNA bases (remember that the DNA molecule is a double helix with matching base pairs on each strand). With the recent study, the researchers have now surveyed a total of 93 per cent of the genome and managed to assign the sequences to the cod’s 23 chromosomes. Thus, they have managed to fill in many of the “holes” that remained after earlier surveys.

DNA mapping as a huge jigsaw puzzle

With the best technology available today, it is possible to map contiguous DNA sequences with a length of up to 10,000 base pairs. But that is a long stretch away from a complete cod chromosome, which contains approximately 25 million base pairs. The researchers must therefore divide the DNA strands into pieces that are then read separately.

DNA sequencing and genome mapping can thus be compared to dividing a very long text into lots of small pieces that are read separately – letter by letter, or more exactly: nucleobase per nucleobase. The next step is to create a digitised copy of the whole text, but without knowing exactly where each piece came from.

The result is that researchers are left with a large amount of fragments that they must try to put together in the digitised copy – much like a jigsaw puzzle with an extreme number of pieces. Lex Nederbragt explains that the jigsaw puzzle has an added problem because of the tandem repeats that look exactly the same, similar to a “normal” puzzle with a lot of blue skies. But even if the pieces look exactly the same, the researchers must find out exactly where they came from.

Combining three methods

Tørresen, Nederbragt and their collaborators have solved this problem by using three different sequencing technologies in parallel, and then combining their results. Moreover, they have re-used large amounts of data from previous genomic studies and analysed them again with new and better methods.

“The oldest method, called 454 sequencing(link is external), can identify fragments of up to 700 base pairs. But this method falls short if the fragments contain several similar base molecules after another. If the fragment for example contains the sequence AAAAA, the method is unable to always accurately determine how many A’s the sequence consists of”, explains Tørresen.

“We have also used a newer method called Illumina sequencing. This method can only generate fragments with a length of up to 100 base pairs, but the mapping is more accurate than with the 454 method. In addition, we supplemented with a third method called PacBio sequencing(link is external), which at the time of our experiments could identify DNA sequences with up to 2000-3000 base pairs. This helped us to see the big picture, even if the accuracy is a good deal poorer than with Illumina sequencing”, explains Nederbragt.

“The combination of three different methods allowed for the more accurate results in our study. It is perhaps slightly annoying that the technologies have evolved in the short period since the start of our project, so that we could have had even better results if we had started today. But we can’t just sit around doing nothing while we are waiting for the technology to develop further”, Nederbragt comments.

What is the significance of the copies?

The scientists would of course like to know what the large amount of short tandem repeats means for cod as a species.

“Our suggestion is that the phenomenon has an evolutionary significance, because many of the tandem repeats are found inside DNA sequences that encode the structure of proteins. They also have a tendency to vary in length between generations.

This might mean that the repeated sequences can give rise to many different varieties of the same proteins. We imagine that such a variety of proteins can make it easier for cod as a species to adapt to a new environment, but we can’t say anything definite about this yet”, comments Tørresen.

However, it is definitely possible to determine such things. Now that the scientists have mapped the cod genome in great detail, they can start identifying the genes that contain the code for specific proteins. Researchers at the Institute of Biosciences are just beginning their investigations into this area.

“We have already found a fish species that has even more tandem repeats than cod, namely the related haddock. Both cod (Latin name: Gadus morhua) and haddock (Melanogrammus aeglefinus) are members of the cod family (Gadidae). This may indicate that the whole group has an increased proportion of such repetitions”, adds Nederbragt.

Ole Kristian Tørresen and Lex Nederbragt emphasise that every study of the cod genome so far have been performed on samples from the same individual, namely Calvin. But in the newly established Aqua Genome Project, researchers at the University of Oslo and the Norwegian University of Life Sciences(link is external) (NMBU) are going to study the genomes of large numbers of cod and salmon.

They can for example examine how different populations – such as skrei, Norwegian coastal cod and Baltic Sea cod – compare. NMBU researchers are concentrating their efforts on salmon, while researchers at the University of Oslo are continuing their in-depth studies of the cod genome.

A never ending story

“This new version of the cod genome is a huge improvement and will have implications for the future fisheries management, by being a reference for the sequencing of other stocks and individuals. At the same time, the improved genome comes with new potential for developing cod stocks that are more suitable for aquaculture in terms of growth, maturation and disease resistance”, says professor Kjetill S. Jakobsen at the Institute of Biosciences and CEES.

He has led studies of the Atlantic cod genome at the University of Oslo from the start.

“We are very proud of this huge step forward in the understanding of the cod genome. But the assembly of such large genomes is a “never ending story”, and there is still room for improvement. I can guarantee that there will be a third version sooner or later, and it will be even better than this one but still not perfect, adds senior adviser Sissel Jentoft at CEES.

The new study was conducted in collaboration between scientists at the University of Oslo, NMBU, the Institute of Marine Research in Bergen, Yale School of Medicine and J. Craig Venter Institute.

Related regions

Europe

Cookie	Description
cookielawinfo-checkbox-advertising-targeting	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertising & Targeting".
cookielawinfo-checkbox-analytics	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Analytics".
cookielawinfo-checkbox-necessary	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Performance".
PHPSESSID	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zmember_logged	This session cookie is served by our membership/subscription system and controls whether you are able to see content which is only available to logged in users.

Cookie	Description
cf_ob_info	This cookie is set by Cloudflare content delivery network and, in conjunction with the cookie 'cf_use_ob', is used to determine whether it should continue serving “Always Online” until the cookie expires.
cf_use_ob	This cookie is set by Cloudflare content delivery network and is used to determine whether it should continue serving “Always Online” until the cookie expires.
free_subscription_only	This session cookie is served by our membership/subscription system and controls which types of content you are able to access.
ls_smartpush	This cookie is set by Litespeed Server and allows the server to store settings to help improve performance of the site.
one_signal_sdk_db	This cookie is set by OneSignal push notifications and is used for storing user preferences in connection with their notification permission status.
YSC	This cookie is set by Youtube and is used to track the views of embedded videos.

Cookie	Description
bcookie	This cookie is set by LinkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
GPS	This cookie is set by YouTube and registers a unique ID for tracking users based on their geographical location
lang	This cookie is set by LinkedIn and is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	This cookie is set by LinkedIn and used for routing.
lissc	This cookie is set by LinkedIn share Buttons and ad tags.
vuid	We embed videos from our official Vimeo channel. When you press play, Vimeo will drop third party cookies to enable the video to play and to see how long a viewer has watched the video. This cookie does not track individuals.
wow.anonymousId	This cookie is set by Spotler and tracks an anonymous visitor ID.
wow.schedule	This cookie is set by Spotler and enables it to track the Load Balance Session Queue.
wow.session	This cookie is set by Spotler to track the Internet Information Services (IIS) session state.
wow.utmvalues	This cookie is set by Spotler and stores the UTM values for the session. UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on.
_ga	This cookie is set by Google Analytics and is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. It stores information anonymously and assign a randomly generated number to identify unique visitors.
_gat	This cookies is set by Google Universal Analytics to throttle the request rate to limit the collection of data on high traffic sites.
_gid	This cookie is set by Google Analytics and is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.

Cookie	Description
advanced_ads_browser_width	This cookie is set by Advanced Ads and measures the browser width.
advanced_ads_page_impressions	This cookie is set by Advanced Ads and measures the number of previous page impressions.
advanced_ads_pro_server_info	This cookie is set by Advanced Ads and sets geo-location, user role and user capabilities. It is used by cache busting in Advanced Ads Pro when the appropriate visitor conditions are used.
advanced_ads_pro_visitor_referrer	This cookie is set by Advanced Ads and sets the referrer URL.
bscookie	This cookie is a browser ID cookie set by LinkedIn share Buttons and ad tags.
IDE	This cookie is set by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
li_sugr	This cookie is set by LinkedIn and is used for tracking.
UserMatchHistory	This cookie is set by Linkedin and is used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
VISITOR_INFO1_LIVE	This cookie is set by YouTube. Used to track the information of the embedded YouTube videos on a website.

Recommended

A stream of surprises from the Atlantic cod genome

New discovery in the cod genome

93 percent of the genome is mapped

DNA mapping as a huge jigsaw puzzle

Combining three methods

What is the significance of the copies?

A never ending story

Related topics

Related regions

Leave a Reply Cancel reply

Recommended

A stream of surprises from the Atlantic cod genome

New discovery in the cod genome

93 percent of the genome is mapped

DNA mapping as a huge jigsaw puzzle

Combining three methods

What is the significance of the copies?

A never ending story

Related topics

Related regions

Navigating the future of food safety and transparency

Müller and Dairy UK join forces to secure a sustainable dairy future

Turning blooms into ingredients: how ultrasound is repurposing edible flowers

The evolving and enduring supply chain

Resilience by design: transforming food systems for tomorrow

Leave a Reply Cancel reply