Targeted sequencing for high-resolution evolutionary analyses following genome duplication in salmonid fish

Proof of concept for key components of the insulin-like growth factor axis

Fiona M Lappin, Rebecca L Shaw, Daniel J Macqueen

Research output: Contribution to journalArticle

9 Citations (Scopus)
3 Downloads (Pure)

Abstract

High-throughput sequencing has revolutionised comparative and evolutionary genome biology. It has now become relatively commonplace to generate multiple genomes and/or transcriptomes to characterize the evolution of large taxonomic groups of interest. Nevertheless, such efforts may be unsuited to some research questions or remain beyond the scope of some research groups. Here we show that targeted high-throughput sequencing offers a viable alternative to study genome evolution across a vertebrate family of great scientific interest. Specifically, we exploited sequence capture and Illumina sequencing to characterize the evolution of key components from the insulin-like growth (IGF) signalling axis of salmonid fish at unprecedented phylogenetic resolution. The IGF axis represents a central governor of vertebrate growth and its core components were expanded by whole genome duplication in the salmonid ancestor ~95Ma. Using RNA baits synthesised to genes encoding the complete family of IGF binding proteins (IGFBP) and an IGF hormone (IGF2), we captured, sequenced and assembled orthologous and paralogous exons from species representing all ten salmonid genera. This approach generated 299 novel sequences, most as complete or near-complete protein-coding sequences. Phylogenetic analyses confirmed congruent evolutionary histories for all nineteen recognized salmonid IGFBP family members and identified novel salmonid-specific IGF2 paralogues. Moreover, we reconstructed the evolution of duplicated IGF axis paralogues across a replete salmonid phylogeny, revealing complex historic selection regimes - both ancestral to salmonids and lineage-restricted - that frequently involved asymmetric paralogue divergence under positive and/or relaxed purifying selection. Our findings add to an emerging literature highlighting diverse applications for targeted sequencing in comparative-evolutionary genomics. We also set out a viable approach to obtain large sets of nuclear genes for any member of the salmonid family, which should enable insights into the evolutionary role of whole genome duplication before additional nuclear genome sequences become available.

Original languageEnglish
Pages (from-to)15–26
Number of pages12
JournalMarine Genomics
Volume30
Early online date23 Jun 2016
DOIs
Publication statusPublished - 1 Dec 2016

Fingerprint

somatomedins
salmonid
Somatomedins
Fishes
genome
Genome
fish
insulin-like growth factor binding proteins
Insulin-Like Growth Factor Binding Proteins
phylogeny
vertebrates
Vertebrates
protein
vertebrate
Salmonidae
Public Opinion
transcriptome
phylogenetics
baits
nuclear genome

Keywords

  • sequence capture
  • target enrichment
  • second-generation sequencing
  • whole genome duplication
  • salmonid fish
  • insulin-like growth factor axis

Cite this

Targeted sequencing for high-resolution evolutionary analyses following genome duplication in salmonid fish : Proof of concept for key components of the insulin-like growth factor axis. / Lappin, Fiona M; Shaw, Rebecca L; Macqueen, Daniel J.

In: Marine Genomics, Vol. 30, 01.12.2016, p. 15–26.

Research output: Contribution to journalArticle

@article{45f5bc9e06ca4085a9998a26cbe74abe,
title = "Targeted sequencing for high-resolution evolutionary analyses following genome duplication in salmonid fish: Proof of concept for key components of the insulin-like growth factor axis",
abstract = "High-throughput sequencing has revolutionised comparative and evolutionary genome biology. It has now become relatively commonplace to generate multiple genomes and/or transcriptomes to characterize the evolution of large taxonomic groups of interest. Nevertheless, such efforts may be unsuited to some research questions or remain beyond the scope of some research groups. Here we show that targeted high-throughput sequencing offers a viable alternative to study genome evolution across a vertebrate family of great scientific interest. Specifically, we exploited sequence capture and Illumina sequencing to characterize the evolution of key components from the insulin-like growth (IGF) signalling axis of salmonid fish at unprecedented phylogenetic resolution. The IGF axis represents a central governor of vertebrate growth and its core components were expanded by whole genome duplication in the salmonid ancestor ~95Ma. Using RNA baits synthesised to genes encoding the complete family of IGF binding proteins (IGFBP) and an IGF hormone (IGF2), we captured, sequenced and assembled orthologous and paralogous exons from species representing all ten salmonid genera. This approach generated 299 novel sequences, most as complete or near-complete protein-coding sequences. Phylogenetic analyses confirmed congruent evolutionary histories for all nineteen recognized salmonid IGFBP family members and identified novel salmonid-specific IGF2 paralogues. Moreover, we reconstructed the evolution of duplicated IGF axis paralogues across a replete salmonid phylogeny, revealing complex historic selection regimes - both ancestral to salmonids and lineage-restricted - that frequently involved asymmetric paralogue divergence under positive and/or relaxed purifying selection. Our findings add to an emerging literature highlighting diverse applications for targeted sequencing in comparative-evolutionary genomics. We also set out a viable approach to obtain large sets of nuclear genes for any member of the salmonid family, which should enable insights into the evolutionary role of whole genome duplication before additional nuclear genome sequences become available.",
keywords = "sequence capture , target enrichment, second-generation sequencing, whole genome duplication, salmonid fish, insulin-like growth factor axis",
author = "Lappin, {Fiona M} and Shaw, {Rebecca L} and Macqueen, {Daniel J}",
note = "Acknowledgements This study was funded by a Natural Environment Research Council grant (NERC, project code: NBAF704). FML is funded by a NERC Doctoral Training Grant (Project Reference: NE/L50175X/1). RLS was an undergraduate student at the University of Aberdeen and benefitted from financial support from the School of Biological Sciences. DJM is indebted to Dr. Steven Weiss (University of Graz, Austria), Dr. Takashi Yada (National Research Institute of Fisheries Science, Japan), Dr. Robert Devlin (Fisheries and Oceans Canada, Canada), Prof. Samuel Martin (University of Aberdeen, UK), Mr. Neil Lincoln (Environment Agency, UK) and Prof. Colin Adams/Mr. Stuart Wilson (University of Glasgow, UK) for providing salmonid material or assisting with its sampling. We are grateful to staff at the Centre for Genomics Research (University of Liverpool, UK) (i.e. NERC Biomolecular Analysis Facility – Liverpool; NBAF-Liverpool) for performing sequence capture/Illumina sequencing and providing us with details on associated methods that were incorporated into the manuscript. Finally, we are grateful to the organizers of the Society of Experimental Biology Satellite meeting 'Genome-powered perspectives in integrative physiology and evolutionary biology' (held in Prague, July 2015) for inviting us to contribute to this special edition of Marine Genomics and hosting a really stimulating meeting.",
year = "2016",
month = "12",
day = "1",
doi = "10.1016/j.margen.2016.06.003",
language = "English",
volume = "30",
pages = "15–26",
journal = "Marine Genomics",
issn = "1874-7787",
publisher = "Elsevier",

}

TY - JOUR

T1 - Targeted sequencing for high-resolution evolutionary analyses following genome duplication in salmonid fish

T2 - Proof of concept for key components of the insulin-like growth factor axis

AU - Lappin, Fiona M

AU - Shaw, Rebecca L

AU - Macqueen, Daniel J

N1 - Acknowledgements This study was funded by a Natural Environment Research Council grant (NERC, project code: NBAF704). FML is funded by a NERC Doctoral Training Grant (Project Reference: NE/L50175X/1). RLS was an undergraduate student at the University of Aberdeen and benefitted from financial support from the School of Biological Sciences. DJM is indebted to Dr. Steven Weiss (University of Graz, Austria), Dr. Takashi Yada (National Research Institute of Fisheries Science, Japan), Dr. Robert Devlin (Fisheries and Oceans Canada, Canada), Prof. Samuel Martin (University of Aberdeen, UK), Mr. Neil Lincoln (Environment Agency, UK) and Prof. Colin Adams/Mr. Stuart Wilson (University of Glasgow, UK) for providing salmonid material or assisting with its sampling. We are grateful to staff at the Centre for Genomics Research (University of Liverpool, UK) (i.e. NERC Biomolecular Analysis Facility – Liverpool; NBAF-Liverpool) for performing sequence capture/Illumina sequencing and providing us with details on associated methods that were incorporated into the manuscript. Finally, we are grateful to the organizers of the Society of Experimental Biology Satellite meeting 'Genome-powered perspectives in integrative physiology and evolutionary biology' (held in Prague, July 2015) for inviting us to contribute to this special edition of Marine Genomics and hosting a really stimulating meeting.

PY - 2016/12/1

Y1 - 2016/12/1

N2 - High-throughput sequencing has revolutionised comparative and evolutionary genome biology. It has now become relatively commonplace to generate multiple genomes and/or transcriptomes to characterize the evolution of large taxonomic groups of interest. Nevertheless, such efforts may be unsuited to some research questions or remain beyond the scope of some research groups. Here we show that targeted high-throughput sequencing offers a viable alternative to study genome evolution across a vertebrate family of great scientific interest. Specifically, we exploited sequence capture and Illumina sequencing to characterize the evolution of key components from the insulin-like growth (IGF) signalling axis of salmonid fish at unprecedented phylogenetic resolution. The IGF axis represents a central governor of vertebrate growth and its core components were expanded by whole genome duplication in the salmonid ancestor ~95Ma. Using RNA baits synthesised to genes encoding the complete family of IGF binding proteins (IGFBP) and an IGF hormone (IGF2), we captured, sequenced and assembled orthologous and paralogous exons from species representing all ten salmonid genera. This approach generated 299 novel sequences, most as complete or near-complete protein-coding sequences. Phylogenetic analyses confirmed congruent evolutionary histories for all nineteen recognized salmonid IGFBP family members and identified novel salmonid-specific IGF2 paralogues. Moreover, we reconstructed the evolution of duplicated IGF axis paralogues across a replete salmonid phylogeny, revealing complex historic selection regimes - both ancestral to salmonids and lineage-restricted - that frequently involved asymmetric paralogue divergence under positive and/or relaxed purifying selection. Our findings add to an emerging literature highlighting diverse applications for targeted sequencing in comparative-evolutionary genomics. We also set out a viable approach to obtain large sets of nuclear genes for any member of the salmonid family, which should enable insights into the evolutionary role of whole genome duplication before additional nuclear genome sequences become available.

AB - High-throughput sequencing has revolutionised comparative and evolutionary genome biology. It has now become relatively commonplace to generate multiple genomes and/or transcriptomes to characterize the evolution of large taxonomic groups of interest. Nevertheless, such efforts may be unsuited to some research questions or remain beyond the scope of some research groups. Here we show that targeted high-throughput sequencing offers a viable alternative to study genome evolution across a vertebrate family of great scientific interest. Specifically, we exploited sequence capture and Illumina sequencing to characterize the evolution of key components from the insulin-like growth (IGF) signalling axis of salmonid fish at unprecedented phylogenetic resolution. The IGF axis represents a central governor of vertebrate growth and its core components were expanded by whole genome duplication in the salmonid ancestor ~95Ma. Using RNA baits synthesised to genes encoding the complete family of IGF binding proteins (IGFBP) and an IGF hormone (IGF2), we captured, sequenced and assembled orthologous and paralogous exons from species representing all ten salmonid genera. This approach generated 299 novel sequences, most as complete or near-complete protein-coding sequences. Phylogenetic analyses confirmed congruent evolutionary histories for all nineteen recognized salmonid IGFBP family members and identified novel salmonid-specific IGF2 paralogues. Moreover, we reconstructed the evolution of duplicated IGF axis paralogues across a replete salmonid phylogeny, revealing complex historic selection regimes - both ancestral to salmonids and lineage-restricted - that frequently involved asymmetric paralogue divergence under positive and/or relaxed purifying selection. Our findings add to an emerging literature highlighting diverse applications for targeted sequencing in comparative-evolutionary genomics. We also set out a viable approach to obtain large sets of nuclear genes for any member of the salmonid family, which should enable insights into the evolutionary role of whole genome duplication before additional nuclear genome sequences become available.

KW - sequence capture

KW - target enrichment

KW - second-generation sequencing

KW - whole genome duplication

KW - salmonid fish

KW - insulin-like growth factor axis

U2 - 10.1016/j.margen.2016.06.003

DO - 10.1016/j.margen.2016.06.003

M3 - Article

VL - 30

SP - 15

EP - 26

JO - Marine Genomics

JF - Marine Genomics

SN - 1874-7787

ER -