Research Article |
Corresponding author: Arne Jacobs ( arne.jacobs@glasgow.ac.uk ) Academic editor: Zarah Pattison
© 2024 Carolin Dahms, Samuel Roch, Kathryn R. Elmer, Albert Ros, Alexander Brinker, Arne Jacobs.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Dahms C, Roch S, Elmer KR, Ros A, Brinker A, Jacobs A (2024) Intra-lake origin and rapid expansion of invasive pelagic three-spined stickleback in Lake Constance. NeoBiota 92: 259-280. https://doi.org/10.3897/neobiota.92.117430
|
The rapid expansion of three-spined stickleback (Gasterosteus aculeatus Linnaeus, 1758 (Perciformes, Gasterosteida)) in the pelagic zone of Lake Constance, Central Europe, since 2012 contributed to stark ecosystem-wide effects, such as food-web shifts and declines in native biodiversity, including commercially important fish species. Yet, the origin of this invasive pelagic population remains unclear. Using RAD-sequencing of Lake Constance sticklebacks, we show that the pelagic Lake Constance population likely arose recently within the lake, potentially from the littoral population. We did not detect any substantial genome-wide genetic differentiation between individuals from different habitats, supporting a recent origin of the pelagic population and/or ongoing gene flow. This is further supported by minimal differences in meristic and morphometric traits. However, we also identified multiple outlier loci between littoral and pelagic individuals across the genome, potentially suggesting early signs of adaptation despite high connectivity. In this study, we provide an important example of rapid within-lake ecological diversification of an invasive species from standing genetic variation. Ultimately, our findings will have major implications for the management of invasive pelagic stickleback, as they indicate that the stickleback population has to be managed as a whole and that management efforts cannot only focus on the hyper-abundant pelagic population.
Genomics, invasion, pelagic, RAD-seq, three-spined stickleback
The introduction and establishment of non-native species into novel habitats pose a serious threat to endemic biodiversity, ecosystem and human health globally (
Lake Constance in Central Europe represents such a system; a large pre-alpine, oligotrophic lake, where the introduction of invasive three-spined stickleback (Gasterosteus aculeatus Linnaeus, 1758) has contributed to substantial ecosystem-wide changes (
Lake Constance three-spined sticklebacks provide an excellent opportunity to study the processes and ecosystem-wide effects of freshwater invasions and they have long been used as a model system for studying contemporary evolution across ecological niches (
This study aims to gain a better understanding of the evolutionary origin of the pelagic three-spined stickleback population in Lake Constance. Using Restriction-site Associated DNA (RAD) sequences, we investigated whether pelagic sticklebacks are genetically distinct from the littoral and tributary (here referred to as ‘inflow’) sticklebacks. Population genetic analyses were complemented by morphological analyses. Overall, this study provides important insights into the potential of an invasive species to rapidly colonise and/or expand into novel habitats, which will directly inform the management of invasive pelagic stickleback in Lake Constance and comparable waterbodies, such as the Eastern Baltic Sea.
Sampling took place in Upper Lake Constance, Germany, in spring and summer 2019 using three different methods: trawling in the pelagic zone, gillnetting in the littoral zone and electrofishing in three tributaries of Lake Constance (Fig.
Summary of individual sampling events and sampling success in Lake Constance. n = number of individuals sampled.
Sampling site | Sampling date | Sampling method | Habitat | n total | n female | n male |
---|---|---|---|---|---|---|
Pelagic zone | 26.03.2019 | Trawling | Pelagic zone | 32 | 13 | 20 |
Littoral zone | 04.06.2019/ 05.07.2019 | Gillnets | Littoral zone | 32 | 19 | 13 |
Alter Rhein | 16.05.2019 | Electrofishing | Inflow | 11 | 7 | 4 |
Nonnenbach | 08.04.2019/ 17.04.2019 | Electrofishing | Inflow | 10 | 6 | 4 |
Brunnisach | 10.04.2019/ 17.04.2019 | Electrofishing | Inflow | 10 | 6 | 4 |
Overview map showing the position of the individual sampling locations in Lake Constance and locations of outgroup populations (insert) used in the genetic analysis. Population information and geographic coordinates for all samples are provided in Suppl. material
To determine the number of lateral plates, formalin-fixed sticklebacks were stained with alizarin red according to a protocol modified from
Determination of morphometric and meristic traits in sticklebacks and results of the principal component analysis (PCA) using landmarks A (Top) Stickleback with stained bony structures (for more details, please refer to the text), allowing the determination of the lateral plate number. (Bottom) Position of 18 unique reference points (“landmarks”) on the body for shape analysis. (Right) Description of the individual locations of the landmarks B combined boxplot and violin plot to illustrate variation in size (left) and lateral plate number (right) of sticklebacks from different sampling sites. Lower case letters indicate statistically significant differences (size: ANOVA + Tukey-Kramer HSD post hoc test, lateral plate number: pairwise Steel-Dwass test). Box plots defined in the insert panel on the right C scatterplot showing the first two principal components (PCs), which explain most of the variance of the data (see axis labelling in percent). Sticklebacks were grouped according to sampling site (lit_zone = littoral zone, pel_zone = pelagic zone, Bru_inflow = Brunnisach inflow, Non_inflow = Nonnenbach inflow, Alt_inflow = Alter Rhein inflow) and sex (m = male, f = female). Wireframe graphs of the shape changes along the first two PCs in the PCA are shown on the right.
For the morphometric analysis, 18 unique reference points (‘landmarks’) were placed on digital images using TPSDig v.2.31 (Rohlf, Stony Brook University, New York, USA) (Fig.
Due to problems with storage of caudal fin tissue samples, brain tissue of the fish stored at -80 °C was dissected and used for DNA extraction. DNA for RAD sequencing was extracted from frozen brain tissue using the PureLink Genomic DNA Mini Kit (Invitrogen, Carlsbad, USA) according to the manufacturer’s instructions. Tissue was homogenised using 2.4 mm diameter metal beads (3 s, max. speed; Bead Ruptor 4, OMNI international, Kennesaw, USA), the homogenate digested for three hours at 55 °C (150 RPM) and RNA was removed using a 1h RNAse treatment. DNA was eluted in 50 μl elution buffer (NanoDrop 2000c, Thermo Fisher Scientific, Waltham, USA), sample concentrations checked using fluorometry (Qubit 3, Life Technologies, USA) and DNA quality determined using agarose gel electrophoresis. Non-degraded samples were shipped to Floragenexs (Eugene, Oregon, USA) for RAD library preparation and sequencing. Restriction-site Associated DNA Sequencing (RAD-seq) libraries were generated as detailed in
FASTQ data files were demultiplexed using GBSX v.1.3 (
Geographic structure across all samples was explored using PCA on filtered SNP data (n = 11,184 unlinked SNPs) in ade4 v.1.7-16 (
Genetic differentiation between sampling sites was investigated using haplotype-based relative allelic differentiation (FST') and absolute divergence (Dxy') between population pairs (populations module in Stacks) for all loci containing filtered SNPs. Haplotype-based estimates have the advantage of accommodating loci with more than two alleles contrary to SNP-based statistics (
To identify genomic regions potentially under selection between littoral and pelagic sticklebacks, z-transformed FST' and Dxy' estimates for the pelagic-littoral population comparison were computed. Loci with z-transformed FST' ≥ 3 were classified as differentiated outlier loci. Loci with increased absolute divergence, zDxy' ≥ 3, potentially highlight the differential sorting of ancient alleles between habitats. A z-transformed value ≥ 3 approximately corresponds to a p-value below 0.01. Outlier loci were further tested for signs of selection by comparing their interpopulation gene diversity differences (∆Hs = Hslittoral – Hspelagic) to the genome-wide background. The expectation was that ∆Hs would be higher in loci under selection in the pelagic populations compared to the genomic background, driven by reduced Hs in the pelagic population. We compared median ∆Hs values for outliers (∆Hsoutlier) to the genomic background (∆Hsbg) using a non-parametric two-sided Wilcoxon rank sum test and further compared the distributions of values using a Kolmogorov-Smirnov test. Genetic differentiation amongst the remaining populations and outlier loci overlaps amongst all pairwise population comparisons were also estimated.
Furthermore, we performed phylogenetic analyses of Lake Constance together with whole-genome data from outgroup populations from across Europe to confirm that Lake Constance stickleback cluster with the Baltic lineage (see Suppl. material
Genetic association mapping using Genome-wide Efficient Mixed Model Association (GEMMA v.2.1;
We tested if phenotype-associated SNPs were also significant outlier loci or showed increased genetic differentiation between littoral and pelagic sticklebacks, which would suggest potential selection acting on these phenotypes. To test if phenotype-associated SNPs showed increased genetic differentiation and divergence compared to a random genomic background, we performed random resampling of the same number of SNPs from the entire SNP dataset, estimated the mean FST' and Dxy' for the corresponding haplotype and repeated this 10,000 times to create a null distribution. Subsequently, we compared the means FST' and Dxy' of the phenotype-associated SNPs and the null distribution using a Wilcoxon test. We did this for each phenotype and for the sex-chromosome and autosomes separately.
Total length of sticklebacks differed between sampling sites (ANOVA: F4,90 = 23.1534, p < 0.001; Fig.
The morphometric analysis, based on landmarks, showed a significant effect of size on body shape (ANOVA: p < 0.001; Suppl. material
Results of the pairwise comparison of the shape of sticklebacks from different sampling sites (littoral = littoral zone, pelagic = pelagic zone, Inflow = Nonnenbach, Brunnisach, Alten Rhein). Upper triangle: pairwise procrustes distances between means. Lower triangle: pairwise p-values between means.
Alten Rhein | Brunnisach | Littoral | Nonnenbach | Pelagic | |
---|---|---|---|---|---|
Alten Rhein | 0.02370511 | 0.03856235 | 0.01409519 | 0.01461575 | |
Brunnisach | 1.000 | 0.04817884 | 0.02631996 | 0.02773204 | |
Littoral | 0.001* | 0.005* | 0.04335706 | 0.03863615 | |
Nonnenbach | 1.000 | 1.000 | 0.001* | 0.01534973 | |
Pelagic | 1.000 | 1.000 | 0.001* | 1.000 |
The admixture analysis suggested that sticklebacks in Lake Constance were highly admixed, as the genetic structure was best explained by 1 cluster (K = 1; Suppl. material
Population structure. Principal Component Analysis representing individual structuring across Lake Constance populations. Analysis was performed on pruned data excluding the sex chromosome (11,184 SNPs). Colours indicate different sampling sites, while shapes represent habitats – inflow (circle), littoral (square), pelagic (triangle). Smaller, lighter data points show individual variation, while the larger shapes with a black centre indicate population Principal Component centroids, which were calculated as the mean of both the 1st and 2nd axes.
Furthermore, phylogenetic analyses showed that stickleback from Lake Constance clustered with the Baltic Lineage (Eastern European) stickleback (Suppl. material
We detected 333 loci with zFST' ≥ 3, distributed across the entire genome (Fig.
Signatures of selection A Z-transformed haplotype-based FST' (zFst') estimates for loci (dots) across all chromosomes (noted on the x-axis). Outlier loci with zFST' ≥ 3 are shown in orange B absolute divergence (Dxy') between outlier loci (orange) and the genomic background on autosomes and the sex chromosome. Individual dots denote genomic loci and the distribution of values is shown by density plots. The sex chromosome was analysed separately due to lower recombination rates compared to autosomes and, therefore, potentially higher absolute divergence C comparison of delta gene diversity (∆Hs) between the genomic background (zFst' < 3; grey), outlier loci (orange) and outlier loci showing increased absolute divergence (zDxy' ≥ 3; red). Delta gene diversity was estimated by subtracting gene diversity in pelagic individuals from gene diversity in littoral individuals. Positive ∆Hs values are indicative of reduced gene diversity in littoral individuals and vice versa. Individual dots denote genomic loci. Box plots defined in Fig.
We further tested for signals of divergent selection by comparing gene diversity (Suppl. material
Genome-wide association analyses for lateral plate number identified 41 associated SNPs (mean PIP > 0.01), with 7 SNPs (17.1%) showing very strong associations (mean PIP > 0.1). These were mainly located on chromosome 4, with one strongly-associated SNP on chromosome 2 (Fig.
The proportion of variance explained by all loci was similar for lateral plate number (PVEPN = 88.4%) and body shape (PVEBS = 85.3%), but the proportion of PVE explained by sparse effect loci (PGEBS = 61.5%, PGEPN = 77.8%) and the estimated number of sparse effect loci (mean n gammaBS = 28; mean n gammaPN = 6) were smaller for body shape compared to lateral plate number (Fig.
Genome-wide association analyses (GWAS) for phenotypic traits A posterior Inclusion Probabilities (PIP) from BSLMMs for all SNPs (dots) across the genome are shown, with outliers SNPs passing significance threshold (PIP > 0.01) shown in red. Manhattan plots are shown for GWAS results with mean lateral plate number, body shape PC1-6 and total length B hyperparameters from BLSMMs are plotted, as the mean (large dot) and 95% confidence intervals (grey lines), for body shape (BS: yellow) and lateral plate number (PN: blue) C the distribution of genetic differentiation (FST‘) values for the permuted null distribution is shown as a histogram and mean differentiation for phenotype-associated loci is indicated as a red line. Results are shown for SNPs associated with later plate number (blue) and body shape (yellow), for autosomes and the sex chromosome separately. No trait-associated SNPs were detected for later plate number on the sex chromosome.
Body shape-associated loci were not strongly differentiated (i.e. outlier loci), but autosomal loci associated with body shape showed increased genetic differentiation between littoral and pelagic sticklebacks compared to the genomic background (Fig.
The aim of this study was to investigate the evolutionary origin of the pelagic three-spined stickleback population in Lake Constance. We found that pelagic sticklebacks in Lake Constance likely originated within the Lake from the already established littoral population without any recent colonisation and/or introgression from external populations. Despite the absence of genome-wide divergence amongst lake habitats, some regions across the genome show increased genetic differentiation. We found that body shape-associated loci, a trait divergent between littoral and pelagic stickleback in Lake Constance, show increased genetic differentiation between littoral and pelagic individuals. Overall, this suggests that the phenotypic difference in pelagic stickleback and its dramatic demographic expansion is best explained by the colonisation of the pelagic zone by stickleback from other lake habitats and the sorting of adaptive standing genetic diversity present within Lake Constance, rather than by recent colonisation. Thus, effective management strategies must focus on the entire stickleback population rather than only on the pelagic population.
Sticklebacks exhibit a high degree of phenotypic diversity between habitats, with some morphological traits having evolved in parallel during their postglacial dispersal into new freshwater habitats (
Littoral and pelagic sticklebacks differed slightly in snout length and body depth, with longer snouts and deeper bodies in littoral fish (
Observed size differences between sticklebacks from the pelagic and littoral zone are likely related to differences in sampling time rather than growth rate, with fish from the pelagic zone having been captured in late March, while fish from the littoral zone were captured in June and July. It can, therefore, be assumed that this is a time-dependent increase in size over the course of the year. Future common garden experiments, temporal sampling throughout the year and more detailed phenotypic and trophic analyses could shed light on the eco-morphological basis of the rapid pelagic invasion of Lake Constance stickleback.
To date, it has been unclear whether pelagic stickleback in Lake Constance, which have increased rapidly in abundance since 2012 (
Stickleback from pelagic and littoral habitats were sampled during slightly different times in our study, potentially biasing estimates of genome-wide genetic differentiation between habitats. However, differences in sampling time would likely result in even lower estimates of genetic differentiation (compared to the ‘true’ differentiation), if pelagic sticklebacks, which were sampled in the spring, move into the littoral zone to spawn in the summer and are caught together with littoral sticklebacks. Thus, we believe that, overall, there is no strong genome-wide differentiation between habitats, in line with a recent expansion under ongoing gene flow.
The phylogenetic clustering of sticklebacks from Lake Constance as sister to Baltic stickleback from northern Germany (Suppl. material
Despite the likely recent colonisation of the pelagic zone and minimal genome-wide differentiation between habitats, or lack thereof, we detected a polygenic signal of divergence with hundreds of outlier SNPs across the genome showing increased genetic differentiation between individuals from littoral and pelagic habitats. Such polygenic patterns of divergence between benthic and limnetic sticklebacks were also observed in Canadian populations (
Genetic differentiation can occur without divergent selection, for example, through linked selection in low recombination regions or genetic drift due to population bottlenecks, yet these are unlikely explanations in this system. Firstly, linked selection is less likely to lead to increased differentiation over such short evolutionary timescales (
Overall, the fact that we observed genetic differentiation of many loci across the genome, despite low levels of genome-wide differentiation, indicates that habitat preferences might be, at least partially, genetically determined and not purely plastic, although a plastic component cannot be excluded.
Increased differentiation of body shape-associated autosomal loci between pelagic and littoral sticklebacks suggests that body shape, a trait that seems to differ between populations in these habitats, is under divergent selection between habitats. The observed signal is likely not due to chance, as loci associated with lateral plate number, a trait that does not differ between pelagic and littoral sticklebacks, do not show increased differentiation. We also recovered a well-studied lateral plate number associated genomic region on chromosome 4, further suggesting that we had sufficient power to detect large-effect loci. Hence, the genetic differentiation of body shape-associated loci suggests that the observed divergence between littoral and pelagic stickleback is not purely due to phenotypic plasticity, but is at least partly genetically determined. Variation in morphology could ultimately lead to assortative mating and divergence into distinct ecotypes over time (
We did not test for genotype-association with variation in body size in our dataset, as individuals were sampled at slightly different time-points throughout the year. Whilst body size divergence between Lake Constance and stream sticklebacks has been demonstrated to be plastic and driven by differences in food availability (
Our results suggest that pelagic three-spined stickleback in Lake Constance, which already have had ecosystem-wide effects on biodiversity and food-web integrity, likely arose within Lake Constance. Divergence in body shape between littoral and pelagic habitats and potentially other relevant ecological and physiological traits, is potentially reflective of divergent polygenic selection on trait-associated genes.
The limited SNP-density across the genome precludes us from determining the genomic targets of selection and phenotype-associated loci. Furthermore, temporal sampling of stickleback throughout the year will be needed to determine if there are seasonal differences in genetic and phenotypic patterns. Lastly, common garden experiments and temporal sampling in the wild could help to better understand the roles of evolutionary change versus plasticity in the rapid invasion of the pelagic zone and identify putatively adaptive phenotypic traits.
A better understanding of the processes facilitating the rapid invasion of the pelagic zone of Lake Constance could aid management of this population and in other systems with rapid pelagic invasions, such as the Baltic Sea. Our results suggest that the observed pelagic colonisation was potentially facilitated by large standing genetic variation and the sorting of potentially adaptive alleles between habitats. The lack of genome-wide differentiation and large amount of standing genetic variation suggest that the entire stickleback population and not only the pelagic sub-population, is potentially capable of colonising the pelagic zone and re-invasions of the pelagic zone from other habitats are a possibility if the pelagic population is removed through control measures. Hence, the entire stickleback population in Lake Constance should be managed as a whole, rather than focusing efforts on the pelagic sub-population.
We thank Sarah Gugele, Jan Baer and Andreas Revermann, who planned and carried out the sampling. Sarah Gugele performed the initial phenotyping. Thanks to the SeeWandel L13 team, especially Cameron Hudson, for funding the RAD-seq, providing organisational support, contributing to lab work and shipping the samples. Thanks to Jonas Reiter for the assistance in staining the fish and conducting the meristic counts. Sam Fenton and Darren Hunter are thanked for helping with coding aspects of the analyses. Thank you to Jasminca Behrmann-Godel without whom this collaboration would not have taken place.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This study was supported by the grant “SeeWandel: Life in Lake Constance - the past, present and future” within the framework of the Interreg V programme “Alpenrhein-Bodensee-Hochrhein (Germany/Austria/Switzerland/Liechtenstein)” whose funds are provided by the European Regional Development Fund as well as the Swiss Confederation and cantons. Arne Jacobs was supported by a Leverhulme Trust Early Career Fellowship (ECF-2020-509) and a University of Glasgow Lord Kelvin/Adam Smith Leadership Fellowship. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
A.B. conceived the project. Sampling was performed by A.R. and phenotypic analysis was performed by S.R. Genomic analysis was performed by C.D and A.J, with input from K.R.E. C.D. and S.R. wrote the first version of the manuscript with input from A.J. All authors contributed to the final version.
Carolin Dahms https://orcid.org/0000-0002-3283-7820
Samuel Roch https://orcid.org/0000-0003-3441-9830
Kathryn R. Elmer https://orcid.org/0000-0002-9219-7001
Albert Ros https://orcid.org/0000-0003-3241-9722
Alexander Brinker https://orcid.org/0000-0002-2433-56520
Arne Jacobs https://orcid.org/0000-0001-7635-5447
All newly-generated sequence data for this study are available in the NCBI SRA under the BioProject PRJNA1090479 with the following run accessions: SRR28409948–SRR28410042.
Supplementary methods and results
Data type: docx
Sample information
Data type: xlsx