Early life exploration behaviour and life‐history loci are colocalized in an adaptive genomic hotspot in Atlantic salmon
Tutku Aykanat, Jaakko Erkinaro

TL;DR
Atlantic salmon's early exploration behavior and age at maturity are genetically linked but not directly caused by the same genes, suggesting a complex adaptation mechanism.
Contribution
Identifies a genomic hotspot where exploration and maturity traits are genetically associated but not pleiotropically linked in wild salmon populations.
Findings
Exploration behavior was genetically linked to a genomic region in one population but not the other.
A missense SNP in the akap11 gene showed strong association with exploration behavior.
SNPs in vgll3 and akap11 showed marginal interaction, indicating complex genetic architecture.
Abstract
The traits that are important for adaptation may exhibit genetic correlation due to pleiotropy or as a result of linkage. Understanding the genetic architecture of such correlations is important for predicting the selection response of populations. Exploration in fishes is a behavioural trait by which individuals may find habitats with better foraging and growth opportunities that subsequently improve their fitness. For example, in Atlantic salmon (Salmo salar), all individuals originate in spawning rivers where females lay eggs, but some juveniles migrate to small tributaries which are not spawning areas but provide favourable habitat patches for young salmon. The increased growth in these nursery streams may facilitate earlier sexual maturation, implying a potential pleiotropy between exploration and maturation traits.In this study, by sampling juveniles from two wild populations in…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
FIGURE 1
FIGURE 2- —Research Council of Finland10.13039/501100002341
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic diversity and population structure · Fish Ecology and Management Studies · Animal Behavior and Reproduction
INTRODUCTION
1
Elucidating the genetic architecture of adaptive traits is essential to predict the evolutionary trajectories linked to natural selection in the wild. While genome‐wide scans are essential for locating genomic regions associated with phenotypic variation or adaptation, their utility to assess fine‐scale adaptive landscape is limited, especially in natural populations. For example, a genomic region may harbour multiple loci associated with trait variation and, depending on the number of loci and the linkage disequilibrium between them, the predicted response to selection will vary. Close genomic proximity or close genetic linkage distances between adaptive genetic loci can constrain adaptation (Connallon & Hall, 2018), but may also be selected for to preserve coadapted allele combinations in the form of epistatic interaction (Yeaman, 2013; Bank, 2022). For example, the evolution of inversions reduces recombination along the inversion and allows gene complexes to be preserved over large genomic distances (Kirkpatrick, 2010; Küpper et al., 2015). Such an epistasis has been suggested as an important form of genetic architecture that facilitates the coevolution of coadapted trait complexes, such as behaviour and reproduction or behaviour and life history (Funk et al., 2021; Küpper et al., 2015; Tuttle et al., 2016). Likewise, population models suggest such nonrecombining regions may be preserved even without a need for coadaptation or epistasis (Hoffmann & Rieseberg, 2008; Kirkpatrick & Barton, 2006), suggesting their potential prevalence in natural populations.
Loci that are in close proximity are less likely to be broken up by recombination due to their tight linkage. Hence, similar to long nonrecombining genomic regions, they could also harbour coadapted gene complexes or independent adaptive loci. However, the prevalence of such mechanisms is not well understood in natural populations. For example, it is challenging to distinguish whether multiple traits are genetically associated with independent loci within the region (spurious pleiotropy) or due to true pleiotropy (linked to one causal genic polymorphism, see, for example Solovieff et al., 2013). Distinguishing these two modes of inheritance has implications. In the evolutionary sense, traits coupled through pleiotropy are harder to decouple than when inherited by tightly linked but different loci (Chebib & Guillaume, 2021); hence, they may be less responsive to evolutionary pressures. On the other hand, synergistic pleiotropy may evolve as a result of strong natural selection, in which case mutations effecting multiple fitness traits may be selected for (McGee et al., 2016). Likewise, multivariate phenotypes associated with tightly linked loci might indicate coadaptation (between traits) within local demes (Yeaman & Whitlock, 2011), while phenotypes under a pleiotropic effect of a gene may have a developmental basis (Wagner et al., 2007). Therefore, linking genetic architecture to ecological processes and adaptation requires information not only on the phenotypes associated with the genomic region, but also on the fine‐scale genetic association between phenotypes and polymorphisms within genomic regions.
The vgll3 locus in Atlantic salmon (Salmo salar) has strong potential as a genomic tool to better understand the adaptive dynamics in the wild associated with a small linked genomic region. The vgll3 genomic region, which spans approximately 70 kb and encompasses the vgll3 and akap11 genes, was initially identified as a major genomic region that explains age at maturity, a life‐history trait with substantial fitness effects (Ayllon et al., 2015; Barson et al., 2015). Further research identified that variation in other traits was also linked with genetic variation in the region, such as precocious male maturation (Debes et al., 2021), repeat spawning (Aykanat et al., 2019), metabolism (Prokkola et al., 2022, 2024), foraging behaviour (Aykanat et al., 2024), exploration and aggression (Bangura et al., 2022, 2024), morphology (Aykanat et al., 2025), and early life fitness (Aykanat et al., 2024). The region harbours a large number of single nucleotide polymorphisms (SNP), as revealed by whole genome sequencing (e.g. Barson et al., 2015), but four SNPs remain the most important candidates for the association of phenotypes. These include two potentially significant functional missense polymorphisms within the coding regions in the vgll3 gene (vgll3 Met54Thr and vgll3 Asn323Lys), one missense polymorphism in the akap11 gene (akap11 Val214Met), and a putatively regulatory SNP with the strongest association with maturation age, so‐called vgll3 TOP (Barson et al., 2015). However, in most of these studies, the trait‐gene association was not further refined. For example, in studies that involved cross‐breeding designs, parental fish were usually selected to have a homogeneous vgll3 haplotype block with full linkage among these four important SNPs, primarily to increase the inference power to detect the additive genetic relation. In one exceptional case, cross‐breeding was designed to segregate the genetic variation between the vgll3 and akap11 genes, which allowed further refinement of the location of causal SNPs within the genomic region (Sinclair‐Waters et al., 2022). Genetic variation across tightly linked loci can also be refined in wild populations by exploiting natural segregation within the region. For example, Aykanat et al. (2019) attempted to narrow down the genetic variation in the vgll3 region associated with the repeat spawning phenotype along four important SNPs in the region, but none of the four SNPs explained the phenotypic variation better than the others, indicating insufficiency of power. Taken together, the presence of multiple adaptive traits and putatively functionally important SNPs linked to the genomic region signifies the importance of better characterizing the genetic architectures. This will provide insight into the dynamics of evolutionary change of the genomic region within and among Atlantic salmon populations.
Exploration in fishes is an important behavioural trait, enabling populations to spread and occupy new habitat patches and resources. It can reduce the negative effects of density dependence (Einum et al., 2006) and improve population stability (Baker et al., 2025). In juvenile Atlantic salmon, exploration within freshwater (before emigration to the ocean) has the potential to contribute to improved feeding opportunities, better growth and ultimately higher survival (Erkinaro & Erkinaro, 1998; Erkinaro & Niemelä, 1995; Johansen et al., 2005a, 2005b). Such habitat shifts may also affect individual life‐history patterns (Erkinaro et al., 1997) via improved feeding and growth opportunities (e.g. Ueda et al., 2025), making the individuals more likely to exceed the maturation threshold (Jonsson & Jonsson, 2014; Metcalfe, 1998; Thorpe et al., 1998). At the genetic level, exploration and covarying traits such as metabolism and aggression have been linked with the major loci affecting age at maturity, vgll3 (Bangura et al., 2022, 2024; Prokkola et al., 2022, 2024). For example, using a common garden design in an aquarium setting, Bangura et al. (2024) showed that juvenile Atlantic salmon with the late‐maturing vgll3 ^LL^ genotype (L referring to the allele associated with late maturation) were significantly more active in the 30‐min period after their release into a novel environment than individuals with the early‐maturing vgll3 ^EE^ genotype (E referring to the allele associated with early maturation). This indicates that exploration and maturation could be influenced by the same genetic locus. However, in the cross‐design, the vgll3 haplotype block was in complete linkage, and the study could not further refine the genetic basis within the genomic region.
Traditionally, Atlantic salmon juveniles are considered relatively site‐attached and sedentary in their early years in freshwater (see, e.g. Jonsson & Jonsson, 2011). However, in some river systems, older age groups of juvenile salmon (parr) disperse to more favourable habitat patches within freshwater systems, including lacustrine areas and smaller tributaries (Erkinaro, 1995; Erkinaro, Julkunen, & Niemelä, 1998; Halvorsen & Jørgensen, 2006). These small tributaries are nursery streams that are not suitable for salmon reproduction but may provide favourable growth and survival conditions for salmon parr by means of increased food availability and decreased competition and predation (Erkinaro & Erkinaro, 1998; Erkinaro, Julkunen, & Niemelä, 1998; Erkinaro & Niemelä, 1995; Johansen et al., 2005b). The extent of exploratory behaviour to nursery streams and its adaptive importance may be influenced by various factors, including environmental conditions, genetic predispositions and population density (e.g. Erkinaro et al., 1997; Johansen et al., 2010). For example, the use of nursery streams by juvenile Atlantic salmon has been observed especially in northern areas with depauperate fish fauna that exhibit less competition and predation threat to juvenile salmon (Erkinaro, 1995; Erkinaro & Gibson, 1997; Lillehammer, 1974; Shustov et al., 2012).
Measuring the active migration of juvenile Atlantic salmon to nursery streams is a good proxy for quantifying exploration behaviour in natural settings (Erkinaro, Julkunen, & Niemelä, 1998) and may help us to quantify the genetic basis of exploration across populations. Furthermore, such migratory behaviour may be linked to maturation age during adulthood. The relative proportion of early adult maturation is shown to be higher among individuals that occupy nursery streams during the juvenile stage (Erkinaro et al., 1997), which may be facilitated by improved growth opportunities and the likelihood of reaching the maturation threshold (e.g. Thorpe et al., 1998). This suggests that the observed phenotypic correlation between two traits may be explained by pleiotropy, that is mediated pleiotropy via a SNP locus affecting one trait which then causally influences the second trait (Solovieff et al., 2013).
Here, we quantified the genetic basis of exploration to nursery streams in two wild Atlantic salmon populations in the Teno River system in northernmost Fennoscandia. We first ask whether the vgll3 genomic region is associated with exploratory behaviours to nursery streams. By further exploring the genetic architecture within the vgll3 haploblock, we asked if exploration and age at maturity traits are pleiotropic or associated with different genetic loci, and if epistasis within the region may play a role in shaping trait variation.
MATERIALS AND METHODS
2
Study area and sampling
2.1
The Teno River (Deatnu in Sámi, Tana in Norwegian) catchment is located in far‐north Europe (68–70° N, 25–27° E), runs northward between Finland and Norway and drains into the Barents Sea (Figure S1). The Teno system supports a number of Atlantic salmon populations with wide diversity in life‐history characteristics and genetic structure within and among populations (Erkinaro et al., 2019; Vähä et al., 2017). Juvenile salmon samples used in this study were originally collected for a series of studies characterizing the exploration behaviours of juvenile salmon from the main stem spawning areas into small nursery streams in relation to environmental conditions and fish age (Erkinaro, Julkunen, & Niemelä, 1998; Erkinaro & Niemelä, 1995). Samples were collected from two populations that reproduce in the Teno main stem and Inarijoki (Anárjohka in Sámi and Norwegian), a large headwater tributary of the Teno River. These two populations exhibit a low level of genetic divergence (F ST = 0.012, Aykanat et al., 2015; Vähä et al., 2008), but have substantial differences in their life histories, with the former population having high proportions of large, later‐maturing individuals and the latter population primarily consisting of younger individuals at maturity (Czorlich et al., 2018; Erkinaro et al., 2019). In the spawning rivers, the Teno main stem and Inarijoki, where adult Atlantic salmon reproduce and females lay eggs, juvenile salmon were captured using 600‐ to 900‐V pulsed DC electrofishing equipment. Scales were sampled (initially for age determination, but later used for DNA extraction) and after sampling, the fish were released unharmed at the site of capture.
The sampling sites in both spawning rivers were located in close proximity (up to 100 m) to the outlets of two nursery streams, Baððá and Guoldnájohka (Figure S1), which drain into the Teno main stem and Inarijoki, respectively (Erkinaro, Julkunen, & Niemelä, 1998). Juvenile salmon in the streams were captured with a counting fence installed at the lowest parts of the streams, 20–50 m from the outlets, and as above, the scales were sampled. After sampling, fish were released in the direction in which they had been heading (cf., Erkinaro, Julkunen, & Niemelä, 1998). Migrant fish were classified as smolt or parr based on external characteristics, especially coloration, visibility of parr marks and size (Erkinaro & Niemelä, 1995). Overall, we sampled and genotyped 785 individuals, of which 635 were used in the main analysis after filtering out IDs with missing genotype information in focal SNPs (see below) or missing cohort year information (i.e. due to poor‐quality scale samples; Table S1).
DNA extraction, SNP genotyping by targeted sequencing
2.2
DNA was extracted from juvenile Atlantic salmon scales using QIAamp 96DNA QIAcube HTKit (Qiagen) following the manufacturer's protocol. Individuals were genotyped by targeted sequencing using a SNP panel consisting of 173 SNP markers using a GTSeq approach (Campbell et al., 2015) as described in (Aykanat et al., 2020). The SNP panel contained four SNPs from the vgll3 genomic regions including the SNP that is most highly associated with age at maturity (vgll3 TOP), two missense mutations in the vgll3 gene (vgll3 Met54Thr and vgll3 Asn323Lys), and one missense mutation in the akap11 gene (akap11 Val214Met, Barson et al., 2015). These SNPs are located on chromosome 25 and are 0.8 kb (vgll3 Met54Thr), 2.9 kb (vgll3 Asn323Lys), 10.9 kb (vgll3 TOP), and 65.5 kb (akap11 Val214Met) downstream of the transcription start site of the vgll3 gene. The SNP panel also included six6 locus, a SNP highly spatially divergent between Atlantic salmon populations (Barson et al., 2015), and the so‐called outlier and baseline SNPs. The outlier module includes 53 highly differentiated SNPs between the Inarijoki and Teno main stem populations, allowing confirmation of the population of origin of individuals between these two closely related populations (data not shown). The baseline module included 136 putatively neutral markers in low linkage disequilibrium, distributed throughout the whole genome and previously filtered to have a minor allele frequency greater than 0.05 and heterozygosity greater than 0.2 for these two populations (Aykanat et al., 2016). Individuals who did not have genotype information on any of these SNPs were excluded from the analysis. Bonferroni correction was applied when multiple SNPs were compared.
Statistical analysis
2.3
We first tested if the exploration trait is colocalized in the vgll3 genomic region by comparing single SNP association to exploration behaviour, which may also address whether exploration and age at maturity are pleiotropic or associated with different loci within the region. We modelled this separately for the Teno main stem and Inarijoki populations using a generalized linear mixed model with a binomial error structure as follows:
where Y is a Boolean response variable for the explorative behaviour of individuals, in which individuals sampled in the main stem were assumed to be sedentary, and coded as ‘0’, and individuals sampled in the streams, either descending or ascending, were assumed to be explorative and coded as ‘1’. SNP is the genotype score of SNPs, coded as 0 when individuals are homozygous for the reference alleles, 1 when homozygous for the alternative alleles, and 0.5 when heterozygous. Age of the fish was determined from the sampled scales. For the Inarijoki population, ε hatchyear is the random variance associated with the hatch year (i.e. cohort year) calculated as the year sampled minus the age of the fish. Hatch year controls for potential cohort specific allele frequency differences between different years. Neither the hatch year nor the sampling year overlapped for the Teno main stem populations between nursery streams and main stem collections, so for this population, the model did not include the random term ε hatchyear. The model was implemented for four focal SNPs in the vgll3 genomic region (vglll3 TOP, vgll3 Met54Thr, vgll3 Asn323Lys, akap1 Val214Met),six6 locus that is shown to be associated with age at maturity and metabolism (Prokkola et al., 2022, 2024; Sinclair‐Waters et al., 2020), and the rest of 169 SNPs in the SNP panel to assess baseline association levels in the putatively unrelated loci. Using a similar model structure as above, we then implemented a set of models to quantify the effect of epistasis within the genomic region by modelling pairwise interactions of SNPs within the vgll3 genomic regions:
whereby, SNP_1_ and SNP_2_ are two of the four focal SNPs, and we all possible models that included any pairwise comparisons between them.
Finally, we compared the differences in gene trait association between populations, that is Inarijoki and Teno main stem, which are in close proximity and genetically similar, but with substantial differences in their life histories. Such a difference may potentially indicate gene‐by‐environmental, gene‐by‐gene (i.e. due to populations' genetic background) or a combination of these two effects.
All models were run using the glmmTMB package, version 1.1.5, (Brooks et al., 2017) in R software version 4.4.1 (R Core Team, 2019), using a binomial error structure. Marginal means for genotype effects and their confidence intervals were calculated using the emmeans package, version 1.8.5 (Lenth, 2023). Model diagnostics were performed using the DHARMa package, version 0.4.6 (Hartig, 2022).
RESULTS
3
Overall, the genotyping success rate was generally low (85%, Figure S2), mainly due to the small amount of material available for DNA extraction. After removing individuals with no genotype success in any of the focal SNPs (N = 143) and those with no scale aging data (N = 15, of which eight also did not have genotype success), 635 individuals remained in the final dataset out of an initial 785.
In the Teno main stem population, exploration behaviour towards the nursery stream, Baððá, was not significantly associated with any of the four SNPs in the vgll3 genomic region, or any other SNPs in the panel after the Bonferroni correction for multiple comparisons (Figure 1a). In the Inarijoki population, the only SNP that reached the Bonferroni‐corrected threshold was akap11 Val214Met (Figure 1b). Specifically, the akap11 Val214Met ^EE^ genotype was 8.04 times (95% CI: 2.13–30.34) more likely to explore the nursery stream, Guoldnájohka, than the akap11 Val214Met ^LL^ allele (p < 0.001, Figure 2a). The six6 locus did not exhibit a significant association in any of the populations.
SNP—exploration association across 173 SNPs in Teno main stem (a) and Inarijoki (b) populations in Atlantic salmon. Focal SNPs are coloured red. Uncorrected (p = 0.05) and Bonferroni‐corrected p‐value thresholds are indicated with dashed and continuous lines.
(a) Odds of exploration in Inarijoki Atlantic salmon as a function of the akap11 Val214Met SNP, and (b) as a function of the akap11 Val214Met and vgll3 TOP interaction. The error bars indicate 95% CI of the marginal estimates, and the first and second numbers in parentheses indicate the sample sizes in the nursery stream and the main stream, respectively. The Y axis is drawn on a logarithmic scale.
When pairwise interactions within the vgll3 genomic region were modelled in the Teno main stem population, none of the models were more significant than the null model that did not contain genetic effects. For the Inarijoki population, the most parsimonious model explaining the exploration included the interaction of akap11 Val214Met and vglll3 TOP, which was superior to the null model (ΔAIC = 13.07), but only marginally to the model that only included akap11 Val214Met (ΔAIC = 1.12, p = 0.077). This model was also not significantly better than the interaction between akap11 Val214Met and two other SNPs in the vgll3 genomic region (ΔAIC <2, Figure S3). This marginal interaction effect was driven by a decrease in exploration behaviour associated with the genotype combination of akap11 Val214Met ^EL^ and vgll3 TOP ^EL^ (Figure 2b, Figure S4). As such, the model predicts that the odds of exploration for individuals with akap11 Val214Met ^EL^ and vgll3 TOP ^EL^ were 3.69 (95% CI 1.63–8.36, p < 0.001) and 3.04 (95% CI 0.80–11.47, p = 0.14) times less than for akap11 Val214Met ^EE^ and vgll3 TOP ^EL^ and akap11 Val214Met ^EL^ and vgll3 TOP ^EL^, respectively (Figure 2b). The fit of the model with only akap11 genotype modelled, nor the model with the vgll3 TOP and akap11 interaction did not exhibit substantial deviation from model assumption, as assessed visually by DHARMa package's (Hartig, 2022) diagnostics plots (Figure S5). For both rivers, exploration to nursery streams increased with age (p < 0.001 for both rivers), concordant with the previous reports (Erkinaro, 1995).
The fish sampled in Guoldnájohka contained 83 individuals that had undergone smoltification (or were in the process of becoming smolt), which is a suite of physiological, morphological, and behavioural changes in Atlantic salmon juveniles for becoming ready for seaward migration (McCormick & Saunders, 1987), which may have altered their exploration behaviour. However, when the smolts were excluded from the analysis, the results were mostly qualitatively similar (Figure S6). One notable difference was that a model with the akap11 Val214Met and vgll3 TOP interaction was the best model. As in the full dataset, the fit of the interaction model was not significantly better than the fit of the model with only additive effects of only akap11 Val214Met (p = 0.12, ΔAIC = 0.19, Figure S6). However, the change was not meaningfully different from an effect that might have resulted from a reduced sample size in a random manner. As such, when 83 parr were randomly sampled and excluded from the dataset (i.e. the same number as the smolts in the dataset) or 83 randomly selected parr in the dataset were used as a replacement for the excluded smolts, the ΔAIC range of these models (each permuted 1000 times) was similar to the ΔAIC values obtained in the original model (Figure S7).
DISCUSSION
4
The factors shaping the genetic architecture of ecologically important traits are fundamentally important to understand adaptive dynamics in the wild. We have demonstrated that the exploration behaviour in juvenile Atlantic salmon is linked to a genomic region on chromosome 25 in one of the two populations in the Teno River. This region contains the vgll3 and akap11 genes and is associated with a suite of traits involved in salmon adaptation (reviewed in Mobley et al., 2021) including age at maturity (Barson et al., 2015). We have found that a missense SNP in the akap11 gene (akap11 Val214Met) was substantially linked to exploratory behaviour, while SNPs in the vgll3 gene exhibited a marginally supported epistasis with this SNP, suggesting a complex genetic architecture in the region. Furthermore, the genetic effect was only observed in one of the two populations studied, indicating the importance of environment or population‐specific factors in triggering gene‐dependent exploration behaviour.
Our results showed that the early maturation allele in the akap11 gene was associated with increased exploration to nursery streams in the Inarijoki population. This is consistent with the findings of Erkinaro et al. (1997) who, by analysing freshwater growth patterns of salmon, suggested that juveniles entering nursery streams had a greater tendency to mature and return from their sea journey earlier than those that stayed in the main stem for their juvenile years. Differences in growth opportunity in Atlantic salmon juveniles, which are linked to efficient acquisition of resources, may be instrumental in exceeding the maturation threshold in subsequent years (Jonsson & Jonsson, 2014; Metcalfe, 1998; Thorpe et al., 1998). As such, causality between exploration and migrating to nursery streams may be suggested based on the increased likelihood of individuals in the streams to reach the maturation threshold (e.g. Thorpe et al., 1998) upon increased growth opportunities. However, the association was predominantly driven by genetic variation in the akap11 Val214Met SNP, which is about 70 kb downstream of the vgll3 gene and exhibits a weaker association with sea age at maturity (Barson et al., 2015). The akap11 Val214Met SNP was also previously ruled out as the genetic basis for early male maturation in Atlantic salmon in controlled settings (Sinclair‐Waters et al., 2022). Combined, while exploration behaviour may facilitate earlier maturation in a context‐dependent manner, our results do not provide evidence to support a direct genetic causation between the two, but that a small region explains both maturation and exploration behaviours, and that the two traits are genetically correlated by linkage.
A multitude of phenotypic traits may be linked within a genomic region via pleiotropy or linkage disequilibrium (that is, spurious pleiotropy, Solovieff et al., 2013). This makes it challenging to identify the phenotypic traits under selection, for example by using genomic scans (Paaby & Rockman, 2013; Solovieff et al., 2013). Our results indicate that the vgll3 genomic region contains multiple independent SNPs linked to adaptive traits, ruling out pleiotropy as the basis of multi‐trait association. The presence of multiple adaptive traits within a tightly linked genomic region raises the question of whether the genetic covariation between these loci harbours coadapted trait combinations. Linked architectures may facilitate adaptation by enabling co‐inheritance of adapted gene complexes (Mullon et al., 2017; Oomen et al., 2020). For example, genomic inversions have been shown to be important in maintaining coadapted trait combinations, including migratory strategies and dispersal (Mullon et al., 2017; Rubenstein et al., 2019; Wellenreuther & Bernatchez, 2018). Similarly, genetic variation in migratory ecotypes in rainbow trout (Oncorhynchus mykiss, Pearse et al., 2014) and Atlantic cod (Gadus morhua) has been shown to be associated with genetic markers in a large inverted region (Kirubakaran et al., 2016). A tight linkage between adaptive traits might also be selected for when locally adapted populations may have varying local fitness optima (for these traits) and exhibit high gene flow between them (Yeaman & Whitlock, 2011). In this case, a tight linkage between locally adapted traits would help in purging maladaptive genetic variants with a relatively reduced genetic load.
In a practical sense, our results also highlight the insufficiency of drawing conclusions when only one SNP is used as a tag marker to represent the genetic makeup of tightly linked genomic regions. In common garden cross‐breeding designs, one SNP marker may be utilized to tag the whole region when exploring the trait‐genotype association of the region (e.g. Bangura et al., 2024; Prokkola et al., 2022). While such designs provide substantial statistical and cost reduction benefits, they would not be able to discern associations across SNPs in close proximity, and their interactions. For example, using a common garden setup, Bangura et al. (2024), suggested that vgll3 is likely to play a role in exploration behaviours. However, in the cross‐breeding design, parents were chosen to be in complete linkage regarding the genetic variation in akap11 and vgll3 SNPs; thereby, the deduction of SNP effects independently is not possible (see also, Bangura et al., 2022). In fact, when Sinclair‐Waters et al. (2022) refined the association of SNPs in the vgll3 genomic region, in relation to their effect on male parr maturation, they found an opposite effect of SNPs in the vgll3 and akap11 gene, whereby the early maturation allele in the akap11 gene was indeed marginally associated with reduced maturation, further signifying the importance of refining genetic associations when multiple candidate SNPs may underlie the phenotypic variation.
Our study provides a further outlook on the dynamics of behavioural syndromes and correlated behaviours (Axling et al., 2023; Garamszegi et al., 2013; Höjesjö et al., 2004; Mathot et al., 2019). The classic paradigm suggests that dispersal is negatively correlated with dominance and aggression, whereby dominant individuals monopolize favourable habitat patches and force subordinate individuals to displace in search of new habitats (Stamps, 1994, 1999). However, in Atlantic salmon, migration to nursery streams might be carried out by individuals holding higher positions in the dominance hierarchy (Erkinaro & Niemelä, 1995). Using seasonal growth in scale depositions, Erkinaro and Niemelä (1995) estimated growth in previous seasons, a trait correlated with dominance and aggression in early life stages (Cutts et al., 2005), and showed that faster growing individuals are more likely to migrate to the small nursery streams. A similar tendency in individual growth patterns has also been identified in individuals shifting their juvenile rearing habitat to lacustrine areas of the catchment (Erkinaro, Niemelä, et al., 1998). A niche shift, that is, by migrating to nursery streams, may be hypothesized to be a mechanism to overcome potential resource mismatch in the original habitat. Therefore, we would predict that the EE allele in the vgll3 genomic region is likely associated with behavioural traits correlated with exploration in salmon, such as dominance.
On the other hand, common garden experiments under controlled conditions do not support our prediction. For example, by quantifying the exploration of new environments in a confined space (an aquarium with a surface area of 25 × 30 cm), Bangura et al. (2024) found that the vgll3 L allele was associated with greater exploration in a novel environment. Similarly, using a similar experimental setup, Bangura et al. (2022) also found increased aggression associated with the vgll3 L allele. These results suggest that there is a genetic covariation between aggression and novel environmental exploration, but with an opposite direction of allelic effect to our study. (Note that in the crosses used by Bangura et al., 2022, 2024; akap11 and vgll3 were in full linkage, so conclusions related to the genetic variation in the vgll3 SNPs are equally valid for the akap11 SNP). These contrasting results can reflect contextual differences between natural and common garden settings. As such, a meta‐analysis has shown that these two personality traits (aggression and exploration) only correlate when there is a contextual overlap that underlies the similarity of trait expression (Garamszegi et al., 2013). Therefore, the contrasting conclusions in these two settings (natural vs. laboratory) may stem from different functional physiological processes. Alternatively, strong environment (common garden vs. natural setting) by genotype, or population by genotype interactions may explain the contrast. For the latter, for example, the parental fish used in Bangura et al. (2022) and Bangura et al. (2024) are obtained from a hatchery broodstock of salmon managed by the Natural Resources Institute Finland (LUKE), initially originated from a Baltic Sea population (River Neva). Therefore, differences in the genetic background of fish may also explain the discrepancies. In fact, the genetic association between SNPs in the vgll3 genomic region and exploration was significant only in the Inarijoki population, while no detectable effect was found in the Teno main stem population, further highlighting a potential role of population‐specific differences in the genetic basis of trait variation. Our study illustrates the complex dynamics of the correlated behaviours of Atlantic salmon and their genetic background.
Finally, we found marginal support for epistasis between akap11 and the vgll3 gene, in which the fit of the interaction model was better than the fit model with only akap11 SNP, but this did not reach a significance at alpha value = 0.05. The statistical power was also insufficient to determine which of the SNPs in the vgll3 gene are the most likely candidates. However, the suggestive signals of epistasis may be an indication of a complex genetic architecture within the genomic region. Epistasis contributes to the trait variance and helps maintain additive genetic variation upon selection (Cheverud & Routman, 1995). Furthermore, the trait complex between exploration and age at maturity may contribute to the adaptation dynamics. For example, optimal trait combinations (of traits co‐inherited within a haplotype block) may be locally variable among populations and may improve overall fitness compared to what would be predicted by only additive gene action. Such an epistatic interaction would also facilitate purging of maladapted genetic combinations (Yeaman & Whitlock, 2011).
We used historical field data that were not originally intended for a genetic study such as this. Therefore, the design has some drawbacks. For example, the hatching year, which represents the year that the offspring originate from, was not modelled for the Teno main stem and Baððá stream due to the lack of temporal overlap between sites. Therefore, we cannot rule out the possibility that the (lack of) difference in genetic response is due to temporal changes in allele frequencies between cohorts, even though the time span between sampling periods is only a few years. However, the hatch year was accounted for in the model for the Inarijoki population, by which we controlled for temporal genetic changes between cohort years. Second, genotype calling failed for a substantial portion of individuals due to the suboptimal amount of tissue remaining in the storage envelopes. In addition, we used a targeted genotyping approach by scoring only the most likely candidate SNPs in the genomic region. An alternative approach to sequence the whole genomic region might provide finer resolution in trait association and help to better narrow down the causal region.
In this study, two life‐history genomic regions were the primary focus (vgll3 and six6), and other SNPs genotyped were used to account for potential inflation in the statistical tests. Therefore, by design, we did not perform large‐scale scans to explore the signals of association at genomic scale, which may provide insights into the molecular mechanisms underlying trait variation. Genome‐wide association analyses have identified a number of small effect loci linked to age‐at‐maturity (Ayllon et al., 2015; Gutierrez et al., 2015; Sinclair‐Waters et al., 2020), but such studies for traits such as movement or exploration are mostly missing in non‐model organisms, for example Atlantic salmon. The theory of omnigenic genetic architecture predicts that major regulatory proteins may drive covariation in multiple traits via highly pleiotropic function (Boyle et al., 2017; Liu et al., 2019). In humans, genome‐wide meta‐analyses have been used to genetically link personality traits to larger psychological disorders (Gupta et al., 2024; Lo et al., 2017). Such approaches as in human studies may help to better understand linking exploration and life history at the molecular level and potential regulatory networks shared by these two traits (Boyle et al., 2017; Liu et al., 2019; Mathieson, 2021), once the genome‐wide associations are available for the traits of interest (i.e. exploration).
Our system included only two populations from a northern river, which are generally low‐nutrient, low‐productivity areas with depauperate fish species communities, and whether the observed genetic effects are parallel across the species range is unclear. The association between age‐at‐maturity and the vgll3 genomic region is sex and region specific for the North American populations (Kess et al., 2024; Kusche et al., 2017) and may have an environmental component in some European populations (Ayllon et al., 2019; Besnier et al., 2024). Exploration behaviour may alter life‐history variation context dependently for example via means of enabling improved feeding opportunities (Chapman et al., 2012; Wright et al., 2019). Therefore, it is intriguing to ask whether such context‐dependent association between the vgll3 genomic region and age‐at‐maturity may have been linked to the akap11 gene.
Movement patterns in the Salmonidae family are mostly studied in the context of the phenology of migration, which is central to their life histories, has a heritable component and has strong fitness consequences (Niemelä et al., 2006; Stewart et al., 2006). Many species exhibit a strong dichotomy of the trait within and among populations, as evidenced by the presence of both resident and migratory forms in, for example brown trout, rainbow/steelhead trout and Arctic char (Jonsson & Jonsson, 1993). Movement within the freshwater habitat at the juvenile stage and its association with migration at later stages are much less explored. Our study demonstrates a complex genetic architecture associated with freshwater exploration and maturity age. The linked inheritance of these two functional traits (exploration and maturation) may help to maintain coadapted trait complexes and may be instrumental in local adaptation processes, fitness and the long‐term survival of populations.
AUTHOR CONTRIBUTIONS
Tutku Aykanat: Conceptualization, writing—the original draft (lead), review and editing, DNA extraction, genotyping, and data analyses. Jaakko Erkinaro: Conceptualization, sample collection and maintenance, writing—the original draft (support), review and editing.
CONFLICT OF INTEREST STATEMENT
The authors declare no competing interests.
Supporting information
Table S1. Age and hatch year of individuals used in this study across populations (Teno, Inarijoki) and their main stems and small tributary (nursery) streams. The numbers in parenthesis are smolts. Table S2. Observed number of individuals in the Inarijoki population, main stem (0) vs. the Guoldnájohka stream (1) in relation to akap11 and vgll3 TOP genotypes. Figure S1. The study system and the Teno River Basin. River sections with blue and red colours indicate Teno main stem and Inarijoki, respectively, and respective nursery streams, Baððá and Guoldnájohka. Note that the sampling in both spawning rivers were carried out in close proximity (up to 100 m) to the outlets of two nursery streams. Figure S2. Genotyping success per individuals (a) and SNPs (b). Focal SNPs are marked in red in (a). Excluded individuals were marked in red in (b). Figure S3. Parsimony of different model structures as evaluated by Akaike information criteria (AIC). Models in the grey shaded area indicates similarly parsimonious model (i.e. models within 2 AIC units to the best model). Figure S4. Predicted odds of exploration in Inarijoki Atlantic salmon as a function of akap11 and vgll3 interaction with all genotype combinations are visualized. Error bars indicate 95% CI of the marginal estimates and numbers in parenthesis indicates sample size. Y axis is drawn in log scale. Figure S5. Diagnostic plots of the models that has only akap11 as the genotype factor (a) and that modelled akap11 and vgll3 interactions. Figure S6. Comparing the parsimony of model structures between modelled with and without smolts included in the dataset (dark vs. light symbols, respectively), as evaluated by Akaike information criteria (AIC). Models in the grey shaded area indicates similarly parsimonious model (i.e. models within 2 AIC units to the best model). Figure S7. 1000 permutated models to evaluate the effect of including or exluding smolts (individuals that had undergone parr‐smolt transformation) to the parsimony of (ΔAIC) akap11 additive vs. vgll3 TOP x akap11 interaction models. In (a), equal number of parrs as the number of smolts were randomly excluded from the dataset, and parsimony between two model structures were compared, that is between the real model that excludes smolts (red dashed lines) and 1000 permuted models that randomly excluded parr. In (b), equal number of parrs as the number of smolts in the data were randomly included to the dataset in exchange for excluding smolts, and parsimony between two model structures were compared, that is between the real model that still include smolts (red dashed lines) and 1000 permuted models.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Axling, J. , Vossen, L. E. , Peterson, E. , & Winberg, S. (2023). Boldness, activity, and aggression: Insights from a large‐scale study in Baltic salmon (Salmo salar L). P Lo S One, 18, e 0287836. 10.1371/journal.pone.0287836 37471414 PMC 10358900 · doi ↗ · pubmed ↗
- 2Aykanat, T. , Debes, P. V. , Jansouz, S. , Gueguen, L. , House, A. H. , Ruokolainen, A. , Erkinaro, J. , Pritchard, V. L. , Primmer, C. R. , & Bolstad, G. H. (2025). Large effect life‐history genomic regions are associated with functional morphological traits in Atlantic salmon. G 3 (Bethesda), 15.10.1093/g 3journal/jkaf 106PMC 1223963840408779 · doi ↗ · pubmed ↗
- 3Aykanat, T. , Johnston, S. E. , Orell, P. , Niemelä, E. , Erkinaro, J. , & Primmer, C. R. (2015). Low but significant genetic differentiation underlies biologically meaningful phenotypic divergence in a large Atlantic salmon population. Molecular Ecology, 24, 5158–5174. 10.1111/mec.13383 26363183 · doi ↗ · pubmed ↗
- 4Aykanat, T. , Lindqvist, M. , Pritchard, V. L. , & Primmer, C. R. (2016). From population genomics to conservation and management: A workflow for targeted analysis of markers identified using genome‐wide approaches in Atlantic salmon Salmo salar . Journal of Fish Biology, 89, 2658–2679. 10.1111/jfb.13149 27709620 · doi ↗ · pubmed ↗
- 5Aykanat, T. , Mc Lennan, D. , Metcalfe, N. B. , & Prokkola, J. M. (2024). Early survival in Atlantic salmon is associated with parental genotypes at loci linked to timing of maturation. Evolution, 78, 1441–1452. 10.1093/evolut/qpae 072 38736399 · doi ↗ · pubmed ↗
- 6Aykanat, T. , Ozerov, M. , Vähä, J. P. , Orell, P. , Niemelä, E. , Erkinaro, J. , & Primmer, C. R. (2019). Co‐inheritance of sea age at maturity and iteroparity in the Atlantic salmon vgll 3 genomic region. Journal of Evolutionary Biology, 32, 343–355. 10.1111/jeb.13418 30697850 · doi ↗ · pubmed ↗
- 7Aykanat, T. , Rasmussen, M. , Ozerov, M. , Niemelä, E. , Paulin, L. , Vähä, J. P. , Hindar, K. , Wennevik, V. , Pedersen, T. , Svenning, M. A. , & Primmer, C. R. (2020). Life‐history genomic regions explain differences in Atlantic salmon marine diet specialization. Journal of Animal Ecology, 89, 2677–2691. 10.1111/1365-2656.13324 33460064 · doi ↗ · pubmed ↗
- 8Ayllon, F. , Kjaerner‐Semb, E. , Furmanek, T. , Wennevik, V. , Solberg, M. F. , Dahle, G. , Taranger, G. L. , Glover, K. A. , Almen, M. S. , Rubin, C. J. , Edvardsen, R. B. , & Wargelius, A. (2015). The vgll 3 locus controls age at maturity in wild and domesticated Atlantic Salmon (Salmo salar L.) males. P Lo S Genetics, 11, e 1005628. 10.1371/journal.pgen.1005628 26551894 PMC 4638356 · doi ↗ · pubmed ↗
