Introduction
Atlantic salmon (
Salmo salar) are a culturally, ecologically, and economically important fish species that historically ranged from New York (USA) to Iceland in North America and from Portugal to northwestern Russia in Europe (
MacCrimmon and Gots 1979). Over the past several decades, however, their native range has exhibited a shrinking southern margin, with many of the southernmost populations in North America and Europe now either extirpated or in strong decline (
Parrish et al. 1998;
Lehnert et al. 2019). These declines have been attributed to myriad threats, including loss of habitat quality and quantity, warming winter temperatures, increasing marine mortality rates, and introduced pathogens and parasites (
Chaput 2012;
Forseth et al. 2017;
Lehnert et al. 2019). Furthermore, declines in population sizes have coincided with decreases in effective population sizes (
Ne) and genome-wide changes in patterns of adaptive variation (
Lehnert et al. 2019). These genetic losses may be exacerbated by introgression of maladaptive alleles from escaped farmed salmon or hatchery-reared fish into remaining wild populations (
Bourret et al. 2011;
Ozerov et al. 2016;
Karlsson et al. 2016;
Wringe et al. 2018). In contrast to these losses of genetic diversity, Atlantic salmon populations continue to maintain a wide variety of life history and migratory strategies, with numerous anadromous and landlocked populations distributed throughout their range (
Hutchings et al. 2019).
In North America, landlocked populations of salmon formed around 10 000 years ago, coinciding with the end of the Last Glacial Maximum and the gradual rise of land masses previously depressed by the weight of glaciers (
Yesner et al. 1984;
Lumme et al. 2016). In many instances, this uprising disrupted hydrological connections between freshwater lakes and the ocean. Affected lake populations of salmon became physically isolated from their anadromous progenitor populations and began migrating between freshwater lakes and rivers, rather than between marine and freshwater environments, to complete their life cycles. Adaptation to this new set of environments required shifts in a broad suite of functions, including osmoregulatory, immunological, behavioral, and metabolic traits, as has been described for other fish species with populations secondarily adapted to fresh water (
Deagle et al. 2013;
Kozak et al. 2014;
Willoughby et al. 2018). Despite these adaptations, Atlantic salmon populations are declining throughout their native range, with many landlocked and anadromous populations supported by hatchery supplementation (
Parrish et al. 1998;
Naish et al. 2007;
Lehnert et al. 2019).
Hatchery programs are often used to support imperiled fish populations — particularly for salmonids — either via the restoration of previously extirpated populations or by supplementing declining populations. Initiation of hatchery rearing can result in genetic bottlenecks, increasing the effects of genetic drift and decreasing heterozygosity and
Ne in subsequent hatchery-supplemented generations (
Ryman and Laikre 1991;
Christie et al. 2012). Overall, the effects of increased genetic drift are predicted to reduce genetic diversity throughout the genome due to the random sampling of small numbers of individuals for breeding. More targeted reductions in genetic diversity can result from new agents of selection imposed by hatchery rearing practices, often called domestication selection, affecting genes involved in metabolism, behavior, immunity, and wound healing (
Christie et al. 2016;
Naval-Sanchez et al. 2020;
Jin et al. 2020). Both genetic drift and domestication selection can reduce genetic variation at loci under selection in the natural environment and can diminish individual and population fitness after release into the wild (
Araki et al. 2007;
Milot et al. 2013;
Willoughby and Christie 2019).
To identify candidate loci associated with adaptation to an entirely freshwater environment, we examined genome-wide patterns of genetic variation for landlocked and anadromous populations of Atlantic salmon. Two landlocked populations, Lake Champlain and Sebago Lake, were compared against the anadromous LaHave River population (
Fig. 1), and we examined 23 326 SNPs from RNA-seq data to identify outlier loci associated with differences between landlocked and anadromous life history strategies. We used the same populations to identify candidate loci that have responded to differences in selection between distinct hatchery rearing environments. Individuals from the Sebago Lake and LaHave River populations were maintained in the same hatchery facility prior to sampling, whereas individuals from Lake Champlain were reared in a different hatchery. To determine whether distinct hatchery rearing environments may have led to detectable patterns of selection, we compared Lake Champlain to both Sebago Lake and LaHave River and identified outlier loci shared by the two comparisons. We detect putative signals of selection across the three populations with respect to both life history strategy and hatchery rearing environment, and we highlight strongly differentiated SNPs in genes with physiological functions relevant to freshwater adaptation and hatchery-imposed selection. Our results provide further support for the hypothesis that genetic adaptation to the hatchery environment can occur over brief timescales and demonstrate that associated genetic patterns can be shared among populations with disparate life history strategies. Over broader evolutionary timescales, we find that adaptation to an entirely freshwater life cycle has involved changes in genes related to immune, osmoregulatory, and metabolic functions, consistent with previous investigations of adaptive differences between freshwater and marine fish populations.
Discussion
Of the three populations compared in this study, we found the closest genetic relationship between Lake Champlain and Sebago Lake individuals. This result likely reflects the inclusion of Sebago Lake salmon in the Lake Champlain breeding program. In addition to receiving genetic input from Sebago Lake individuals, the Lake Champlain population has also been supplemented by other source populations (
Dimond and Smitka 2005). Although salmon originating from Sebago Lake were deemed most successful in Lake Champlain by management agencies, the high genetic diversity (i.e.,
HP) in Lake Champlain relative to both Sebago Lake and the LaHave River suggests that fish sourced from other populations may have also been reproductively successful in the Lake Champlain system and genetically contributed to the contemporary population. Alternatively, elevated
HP in Lake Champlain may be due to differences in breeding designs or variance in family sizes between Lake Champlain hatcheries and hatcheries that have propagated Sebago Lake and LaHave River individuals (i.e., the Normandale and Harwood Fish Culture Stations). Disentangling these hypotheses would require a more extensive genetic survey of hatchery broodstock through time and the identification of all sources for Lake Champlain reintroduction efforts.
The history of shared ancestry between Lake Champlain and Sebago Lake also likely explains the relatively low number of outlier SNPs shared between these two populations, but this trend may also be influenced by environmental similarities between the two populations. Both Lake Champlain and Sebago Lake Atlantic salmon are landlocked (i.e., no fish migrate to the ocean), and both lakes also contain prey species with high concentrations of thiaminase in their tissues (
Dimond and Smitka 2005). Thiaminase is an enzyme that degrades thiamine (vitamin B
1), and consumption of this enzyme has been linked to thiamine deficiency in salmonids (reviewed in
Harder et al. 2018). Relative to Sebago Lake salmon, LaHave River fish (with a natural diet lacking thiaminase) have been shown to experience greater decreases in liver tissue thiamine concentrations when fed an experimental diet containing thiaminase, suggesting that the Sebago Lake population may have genetically adapted to dietary thiaminase (
Houde et al. 2015). The genetic variation associated with this putative adaptation may also be advantageous in Lake Champlain, and evidence suggests that the genetic variation required for adaptation to low thiamine availability may already be present in the Lake Champlain population (
Harder et al. 2020).
Of the 37 outlier SNPs identified between Lake Champlain and Sebago Lake, 21 are also outliers in the Lake Champlain – LaHave River comparison. From 2006 until sampling in 2011, the Sebago Lake and LaHave River fish were reared in the same hatchery (
Houde 2015), and shared differences between these two populations and Lake Champlain may reflect parallel patterns of hatchery-imposed selection. Responses to selection in hatcheries can occur over short timescales (e.g., a single generation;
Christie et al. 2016) and may also vary among specific rearing strategies (
Berejikian et al. 2000;
Venney et al. 2021). We identified putative hatchery-associated SNPs annotated to genes with roles in transcription, protein folding, and intracellular signaling, among other functions. Two SNPs are annotated to genes important for blood platelet formation (transcription factor NF-E2 45 kDA subunit-like; LOC106572584) and epithelial cell adhesion (chloride intracellular channel 4;
CLIC4) (
Shivdasani et al. 1995;
Argenzio et al. 2014). Specifically,
CLIC4 knockdown and suppression experiments have demonstrated its critical roles in skin and corneal wound healing (
Padmakumar et al. 2012), and expression of
NF-E2 is required for platelet production by megakaryocytes (
Shivdasani et al. 1995). Differentiation at these genes is consistent with previous findings that the earliest stages of domestication in fishes can lead to new patterns of gene expression in wound healing pathways (
Christie et al. 2016;
Konstantinidis et al. 2020).
We used a second set of comparisons to identify loci that may be associated with life history by comparing the landlocked Lake Champlain and Sebago Lake populations with the anadromous LaHave River population. For putative life history-associated SNPs located in regulatory regions, 77 SNPs were annotated to 89 unique gene sequences (Supplementary Table S5
1). Of these genes, one was previously identified as differentially expressed in response to supplemental thiamine treatment and with respect to family-level survival in the face of thiamine deficiency in Lake Champlain Atlantic salmon by
Harder et al. (2020) (Supplementary Fig. S4
1). This gene, suppressor of cytokine signaling 2-like (LOC106576177;
SOCS2), is a negative regulator of growth hormone signaling and may also be important in adaptation to salinity (
Greenhalgh et al. 2005;
Komoroske et al. 2016;
Dalongeville 2018). In anadromous Arctic char (
Salvelinus alpinus),
SOCS2 is strongly upregulated in response to fasting, indicating a role for this gene in maintenance of energy homeostasis in another migratory fish species (
Jørgensen et al. 2013). The expression patterns of
SOCS2 associated with thiamine deficiency in Lake Champlain coupled with strong differences in allele frequencies between the two lake populations and the LaHave River suggest that differences in diet and ecology in the two types of environments may have led to adaptive differences.
Twelve additional life history-associated SNPs were missense variants annotated to genes with roles in intracellular signaling, translation, and immune function, including one MHC class I gene (major histocompatibility complex class I-related gene protein-like; LOC106588402). Of the genes involved in intracellular signaling, one encodes adenylate cyclase type 6-like (ADCY6; LOC106571946) — a protein that promotes increased water reabsorption from urine in kidney collecting ducts (
Konno et al. 2010;
Jung and Kwon 2016;
Olesen and Fenton 2017) (
Fig. 4) Specifically, activated ADCY6 initiates a signaling cascade that leads to insertion of water channel proteins (aquaporin-2) into collecting duct cell membranes. This allows for increased water reabsorption from urine, a process critical for maintenance of tissue water homeostasis
(Jung and Kwon 2016). Variation in the coding sequence of the
ADCY6 gene between the two landlocked populations and the LaHave River population is likely driven by different osmotic demands in freshwater and marine environments, as has been described for other osmoregulatory genes in fish populations secondarily adapted to entirely freshwater life cycles (
Deagle et al. 2013;
Kozak et al. 2014;
Willoughby et al. 2018).
When considering all outlier SNPs (i.e., with
Z(
FST) ≥ 3 in at least one population comparison), only three genes contained multiple missense variants. One of these genes is zinc finger and BTB domain-containing protein 18-like (LOC106610379;
ZBTB18). Three missense SNPs located in exons of
ZBTB18 are
Z(
FST) outliers in the Lake Champlain – LaHave River comparison (
Figs. 3A,
3B). Substantial allele frequency shifts were also detected between Sebago Lake and LaHave River at these three missense SNPs (mean
FST = 0.77, but they were not identified as
Z(
FST) outliers (
Figs. 3B,
3C).
ZBTB18 is a transcriptional repressor involved in a wide variety of functions including brain and muscle development (
Yokoyama et al. 2009;
Okado 2019) and is also differentially expressed in response to hyperosmotic conditions in cell culture (
Boyd et al. 2005;
Ni et al. 2017). Differentiation at multiple SNPs within this gene between landlocked and anadromous populations may be related to differences in growth and development (e.g., age and size at maturity;
Riley and Power 1987;
Hutchings et al. 2019) or to differences in salinity between the two environments.
It is important to note that differences among the populations in tissue type (whole fry vs. spleen) and developmental stage sampled (fry vs. subadult) likely limited the number of loci that could be detected in all three populations (i.e., locus-specific transcripts upregulated in one study may have been absent in the other, and vice versa). However, because our analyses focused on SNPs that were shared among all three populations, and individual SNPs were only called from reads that aligned to the exact same genomic position, there should not be any bias related to tissue type or stage in the allele frequency estimates of the loci that we did use in this study. Nevertheless, we may be missing additional SNPs important to freshwater and hatchery adaptation, and future studies should use similar experimental designs (e.g., matching tissue types and developmental stage) to maximize the number of shared SNPs for outlier analyses.
By comparing landlocked and anadromous Atlantic salmon populations, we identified outlier SNPs in genes related to immune, osmoregulatory, and metabolic functions, consistent with previous investigations of adaptive differences between freshwater and marine fish populations. In addition to SNPs that may be subject to selection in the natural environment, we also identified outlier loci putatively associated with hatchery rearing conditions by comparing Lake Champlain against Sebago Lake and LaHave River. Some of these SNPs were annotated to genes critical for proper wound healing, similar to findings of other genetic studies examining domestication selection in fishes. Together, these two sets of results highlight putative responses to both natural selection imposed by major environmental changes and artificial selection imposed by differing hatchery environments. Our findings reinforce the importance of minimizing the number of generations spent in captivity, particularly for species already experiencing range-wide declines in population sizes and genetic diversity, and highlight the utility of genomic data for answering questions in both evolutionary biology and fisheries management.