How Grouping Data over Time Can Hide Signs of Stock Status: A Case Study Using LBSPR on Frigate Tuna (Auxis thazard, Lacépède, 1800) in the Northeast Atlantic Ocean
Mustapha Sly Bayon, Kindong Richard, Amidu Mansaray, Edwin Egbe Atem, Komba Jossie Konoyima, Jiangfeng Zhu

TL;DR
This paper shows that how fish size data is grouped over time can hide or reveal important changes in fish populations, using frigate tuna in the Northeast Atlantic as an example.
Contribution
The study demonstrates that temporal data grouping significantly affects fish stock assessments and should be treated as a critical sensitivity factor in data-limited fisheries.
Findings
Finer temporal groupings reveal extended periods of reproductive depletion and recovery in frigate tuna stocks.
Broader data groupings can mask overfishing signals and lead to misleading stock status conclusions.
Temporal grouping is a structural decision that influences length-based stock assessment outcomes.
Abstract
It is important to understand the status of fish populations for the well-being of marine ecosystems and for regions reliant on fishing. Scientists face challenges in assessing some fish stocks due to insufficient data. A significant challenge of using length-based spawning potential ratio (LBSPR) lies in how to organise and group the collected data. Broad data groupings, such as merging multiple years, can hide crucial changes in fish populations, including overfishing periods and stock recovery. This study used LBSPR, a method that estimates fish population conditions using fishery-dependent size data from the Northeast Atlantic Ocean Frigate tuna (Auxis thazard) as an illustrative case to examine how different temporal grouping choices ranging from broad multi-year pooling to finer time blocks (groups) affect the interpretation of an illustrative case stock status within the same…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —College of Marine Living Resources Science and Management, Shanghai Ocean University
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarine and fisheries research · Ichthyology and Marine Biology · Fish Ecology and Management Studies
1. Introduction
Fishery stock assessments interpret limited observations into management-relevant conclusions, yet the reliability of these conclusions is significantly influenced by the available data and how they are processed and integrated [1,2,3]. For several exploited stocks, especially small pelagics and small tunas, age-structured data and standardised abundance indices are either unavailable or inadequate, necessitating the development and implementation of data-limited assessment methodologies [4,5]. In this context, length-based methods are widely used due to the flexibility of length data collection and their capacity for preserving biological information regarding growth, maturation, selectivity, and exploitation [6,7]. Length-based approaches infer stock status by relating observed size composition to underlying life history parameters and fishing selectivity, which makes them particularly sensitive to what portion of the size range is represented in the data and how well the assumed processes approximate reality [4,7]. In practice, assessment conclusions can differ when data inputs, selectivity patterns, or key biological assumptions differ—even when the same general modelling framework is used [5,8]. This sensitivity is not unique to length-based methods; it is a general feature of inference under uncertainty in fishery models and suggests that there must be explicit evaluation of methodological choices that may affect the data signal presented to the assessment [9].
A practical challenge inherent in all length-based assessment methods is obtaining an adequate and representative sample of length measurements. To achieve sample sizes that facilitate stable estimation, length-frequency data are often grouped over multiple years or seasons [8]. In practice, length-frequency sampling may be infrequent or inconsistent, prompting researchers to pool observations over time to enhance sample size, reduce sampling variability, and conform more closely to equilibrium assumptions [10]. As a result, a common but not always explicitly evaluated methodological choice in length-based assessments is how length observations are grouped temporally prior to model fitting. However, this form of temporal grouping relies on explicit assumptions of stationarity inherent to length-based assessment models. Specifically, LBSPR assumes that fishing selectivity, fishing mortality relative to natural mortality, and the underlying sampling process remain approximately constant within the period over which length-frequency data are pooled [11]. Under these assumptions, the observed length distribution is interpreted as reflecting a stable equilibrium between growth, mortality, and selectivity. The choice of temporal grouping scheme can therefore directly influence conclusions about reproductive potential, fishing pressure, and perceived stock status [4,6,7], because length-based estimation is driven by size composition and the truncation of the upper tail of the length distribution. In dynamic fisheries where fishing pressure fluctuates, gear technology evolves (e.g., the widespread deployment of drifting fish aggregating devices, FADs), and environmental conditions vary, these stationarity assumptions are likely to be violated [12,13]. Consequently, the selection of a temporal grouping scheme should be viewed not merely as a statistical convenience but as a structural modelling decision with the potential to alter assessment outcomes.
The length-based spawning potential ratio (LBSPR) method provides a useful framework for evaluating the implications of temporal grouping because it estimates the spawning potential ratio (SPR), relative fishing mortality (e.g., F/M), and selectivity parameters directly from length-frequency data using life history priors [14,15]. SPR is a widely used biological reference metric and can be interpreted as the reproductive capacity relative to an unfished state, which makes it particularly relevant for data-limited management contexts [14,16]. Assessments of the performance of various data-limited models prove that length-based approaches, such as LBSPR frameworks, can yield valuable insights while remaining susceptible to the nuances of data characteristics and underlying assumptions. This underscores the importance of examining how outcomes fluctuate with reasonable data-grouping decisions [5,8]. Although the way we group data over time can significantly influence how we infer stock status, this methodological detail is often overlooked. Most assessments rely on just one, sometimes arbitrary, method for combining years of data, such as lumping all years together or dividing them into decades, without considering other possible approaches. This lack of exploration is evident in recent studies (e.g., [17,18,19]), which typically present results based on a single grouping scheme rather than testing different options. This approach leaves an important question unanswered: does the temporal grouping of length data mask or reveal trends in exploitation, and to what extent can it alter management advice? Coarse grouping may smooth over periods of overfishing or recovery, presenting a misleading picture of “average” stock status, while finer grouping may better capture dynamics but with increased estimate uncertainty [9,11].
To address this critical methodological gap, we analyse the effects of various temporal grouping schemes utilising a dataset from a tropical purse seine fishery in the Northeast Atlantic, characterised by the prevalent use of drifting fish aggregating devices (FADs) that yield juvenile-dominated and size-truncated catches [12,13,20]. Frigate tuna (Auxis thazard) serves as the illustrative case species due to the availability of long-term Task II length frequency observations for this fishery within the International Commission on the Conservation of Atlantic Tunas (ICCAT) spatial reporting framework (ICCAT, Task II database). Hence, for our study, we focused on assessing how different temporal grouping schemes that are relevant to practice influence LBSPR-derived prediction when applied to the same length dataset. We also identified the extent to which apparent stock status signals may be concealed or revealed as temporal resolution changes. The evaluation of biological parameter uncertainty serves as a secondary factor to contextualise sensitivity; however, the primary emphasis is on the modifications in inference resulting from the temporal grouping decision itself.
2. Materials and Methods
2.1. Study Area
The study region (Figure 1) covered the Northeast Atlantic Ocean, aligned with the 1° × 1° International Commission on the Conservation of Atlantic Tunas (ICCAT) stock assessment grid, extending from latitude 0.5° N to 25.5° N and longitude −40.5° W to −0.5° W. This area encompasses significant tropical and subtropical ecosystems, including the Canary Current Large Marine Ecosystem and prominent tuna aggregation sites along the West African coastline. This area is highly productive for purse seine fisheries focused on small tunas, especially Frigate tuna [21], and has experienced a substantial increase in purse seine fishing on floating objects over the past two decades [13,22]. These conditions help small pelagic and epipelagic species grow in large numbers and make it easier for tuna to gather in large groups along the continental shelf and in nearby offshore waters. Coastal upwelling, mesoscale circulation, and frontal systems all work together to create patterns of tuna distribution that are different in different parts of the region but stay the same over time [20].
2.2. Length Data Source
The length-frequency data for this study come from the International Commission for the Conservation of Atlantic Tunas (ICCAT) Task II database, the main public repository for fishery-dependent data submitted by member countries for stock assessment purposes [4,23]. The dataset, called “t2sz-SMTuna1950-2023” (available at https://www.iccat.int/en/accesingdb.html, accessed on 5 August 2025), includes records of size compositions from different fleets and gears. These data represent length measurements taken from fish caught by commercial boats, not from a standardised scientific survey. Consequently, sample size per year and area is proportional to fishing effort and catch, as illustrated in Figure 2. The data were validated during the 2022 ICCAT meeting for skipjack tuna and subsequent standing committee sessions [24]. There are various fleets in the Atlantic Ocean exploiting small tuna, and ref. [4] have demonstrated the selectivity patterns of various fleets targeting small tuna in the Atlantic Ocean and proffered recommendations for management based on purse seine fisheries. Furthermore, among the primary fleets and gears engaged in the catch of our illustrative case species, it was suggested that purse seine fisheries can offer improved spatial coverage for this Northeast Atlantic Ocean small scombrid stock [25]. The illustration of the size distribution of Northeast Atlantic Ocean Frigate tuna from 1994 to 2023, derived from principal fleets (i.e., higher catches and major time series) in the ICCAT statistics database, consists of purse seines, bait boats, trolls, and gill nets, with only one or two specimens as potential outliers, exceeding 80 cm in length as documented in Supplementary Figure S1. This indicates that the stock’s predominant maximum length (L_max_) is between 70 and 80 cm, aligning with data from FishBase [26]. The largest sizes of Frigate tuna were observed in trap fisheries (Supplementary Figure S1), which exclusively target mature specimens. This practice may render them unsuitable, as LBSPR could yield skewed results when analysing data from fleets that focus on the adult segment of the population [4]. In addition, when compared to major fleets, the time series data from trap fisheries is inadequate. Consequently, the size ranges of the Northeast Atlantic Ocean Frigate tuna across all primary fisheries are analogous. According to [25], the purse seine fisheries exhibit a more desirable length-frequency distribution for the Northeast Atlantic Ocean Frigate tuna (Supplementary Figure S1), with a L_ma_x (FAD: 71 cm; FSC: 80 cm fork length (FL)). The length at first maturity (Lmat50), defined as the length at which 50% of individuals mature, as reported in this study corresponds approximately to the modal length of the length-frequency distribution (Supplementary Figure S1), which asymmetrically distinguishes the sizes of juveniles and adults. Finally, we only used length data from ICCAT’s Northeast Atlantic small tuna stock, using a 1 cm bin size from FAD-associated purse seine sets to make sure the model inputs were correct. Because FSC sets were infrequent and missing in some years, constructing evenly distributed temporal blocks for FSC length compositions was not feasible; therefore, we restricted the temporal-grouping evaluation to the more complete FAD-associated purse seine length data.
2.3. Life History Parameters of Case Study
Models that rely on length are particularly sensitive to the values of the input life history parameters [14,27]. The life history parameters required as inputs to this study’s length-based models include growth metrics (asymptotic length [L∞], von Bertalanffy growth coefficient [K]), natural mortality (M), and reproductive factors (length at maturity [Lmat50, Lmat95]). Despite on-the-ground research, ref. [23] provides an updated list of life history parameters for the small tuna species in the Atlantic Ocean. However, the parameters are unsatisfactory for assessing the Northeast Atlantic Ocean Frigate tuna stock, as they were originally established by [28] nearly 40 years ago under environmental and fishery conditions that are likely to differ from those prevailing today. To derive life history priors, this study utilised length frequency-based methods considered suitable for estimating stock status in data-limited fisheries (e.g., [6]). We use the FishLife R package (version 4.4.3) [29]. This package can reliably estimate life history parameters (in our case, L∞, K, M, t_max_, t_m_, and L_mat_), and applies a multivariate hierarchical Bayesian model to infer missing parameters from correlated life history traits across marine fish taxa. The package draws on the FishBase and SeaLifeBase databases and provides posterior means and credible intervals for parameters such as asymptotic length (L∞), growth coefficient (K), natural mortality (M), and maximum age (t_max_). Definitions of life history parameters and notation used in the LBSPR analysis are provided in Table 1. For this study, species-specific priors for our illustrative case species were estimated (Supplementary Figure S3) and used as inputs for establishing a reference point. Additionally, under the assumption of knife-edge maturity, ref. [15] utilised fish length at maturity to differentiate between juvenile and adult size classes. Recent studies (e.g., [18,19,30]) have extensively used this relationship, which can be expressed as Lmat50 = 1.1 × Lmat95. This method was used to estimate Lmat95 for the case study, using the length at first maturity (Lmat50) as input.
2.4. Temporal Data Grouping Schemes (Scenarios)
Time-based grouping of fisheries data can often be used for minimising sparse sampling and to approximate equilibrium conditions, especially in data-limited assessment [10]. Nonetheless, temporal grouping can alter the size composition provided to length-based models, which are known for their sensitivity to size structure and selectivity patterns [4,6,7]. In practice, researchers may group data over long periods to maximise sample size or over shorter periods to capture temporal variability in exploitation and selectivity. However, the potential consequences of these alternative grouping decisions for length-based stock status inference are rarely evaluated explicitly. To evaluate the impact of data grouping on the illustrative case status inference, we refer to “Temporal grouping” as the process of pooling length-frequency observation over specific time intervals prior to model fitting, without altering the underlying observations or applying additional data transformation. In this study, each alternative temporal configuration is referred to as a “grouping scheme” (or “analytical scenario”), representing a specific implementation of temporal grouping applied to the same underlying data. The data were arranged into four grouping schemes on the same 30-year length-frequency dataset (1994–2023), reflecting realistic options that an analyst could reasonably apply in practical stock assessment. The following are the grouping schemes used in this present study:
S1: Full pooled period (1994–2023): This scheme takes all available length data and combines them into one composite distribution. This kind of pooling is often used when data are sparse or when the objective is to estimate an average long-term stock status under assumed equilibrium conditions.
S2: Broad temporal regime blocks (1994–2008; 2009–2023): This scheme represents a coarse subdivision of the time series into two multi-year periods, which may be motivated by perceived changes in fishing practices, fleet behaviour, or data coverage. Similar regime-based groupings are frequently adopted in applied assessments when researchers seek to compare “early” and “recent” conditions while retaining sufficient sample size.
S3: Decadal groupings (1994–2003; 2004–2013; 2014–2023): Decadal grouping is an intermediate level between sample size and temporal resolution. Such groupings are often used in retrospective or exploratory analyses to examine gradual changes in stock status over time without assuming annual equilibrium.
S4: Five-year groupings (1994–1998; 1999–2003; 2004–2008; 2009–2013; 2014–2018; 2019–2023): Five-year blocks provide finer temporal resolution while maintaining adequate sample sizes for length-based modelling. This scheme allows the detection of shorter-term changes in exploitation patterns and size structure that may be obscured under broader temporal pooling.
Meanwhile, it is worth noting that we made changes only on the temporal grouping of observations; length measurements themselves were not re-binned or modified. In other words, for each scenario, length data within a group were pooled as a single composite sample. All other model inputs were held constant across scenarios to isolate the effect of temporal grouping.
2.5. Model Application, Inputs, and Assumptions
Following the estimation of species-specific life history parameters (Section 2.3), we applied the LBSPR model (ShinyApp at http://barefootecologist.com.au/lbspr.html (accessed on 20 October 2025)) to evaluate the illustrative case inferred status under alternative temporal grouping schemes. LBSPR is a length-based assessment approach that estimates SPR, (F/M), and selectivity parameters from length-frequency data of exploited populations [15,31]. A summary of the principal model inputs, assumptions, and outputs is provided in Table 2. We fitted the model independently to the length composition associated with each temporal group across the four grouping schemes. To examine the effect of temporal grouping on the inferred status of the illustrative case, this study used the identical life history parameters and prior distributions derived as described in Section 2.3 across all model runs.
Model inputs included length-frequency data, asymptotic length (L∞), the ratio of natural mortality to growth (M/K), maturity parameters (Lmat50 and Lmat95), and assumptions regarding variability in length at age (Table 2). Selectivity was modelled using a logistic ogive parameterised by the lengths at 50% and 95% retention (SL50 and SL95), which were estimated jointly with relative fishing mortality (F/M) using maximum likelihood methods [14,31].
Study Assumptions
To isolate the impact of temporal data grouping as the principal experimental variable, the subsequent essential model components were maintained constant across all four grouping scenarios (S1–S4):
- Life history parameters (L∞, M/K, Lmat50, Lmat95): These were obtained from the FishLife package and were based on empirical relationships (see Section 2.3). They were used as fixed input priors. Keeping the same biological parameters across scenarios makes sure that any changes in the estimated inferred status of illustrative case (SPR, F/M) are due to differences in the input length composition data structure, not changes in the biological assumptions.
- For all model runs, a logistic selectivity ogive was assumed for the selectivity model structure. This standardisation means that the differences in the estimated selectivity parameters (SL50, SL95) are due to the data given to the model, not because the shape of the assumed selectivity curve has changed.
- Equilibrium condition within temporal groupings: The LBSPR model assumes that the illustrative case species is in a state of equilibrium within each period analysed. We applied this assumption uniformly to every temporal group, from the 30-year pooled period (S1) down to the 5-year blocks (S4).
This approach allows us to test how the scale of temporal grouping, and thus the potential violation of the equilibrium assumption when pooling over dynamic periods, influences assessment outcomes, which is the central question of this study. The controlled scenario-based approach is methodologically similar to simulation testing frameworks used to evaluate assessment performance [5,8]. It allows for a clear diagnosis of how a single, common preprocessing decision temporal grouping can alter inference within an otherwise consistent assessment workflow.
We used conservative benchmark values of SPR = 0.30 and F/M = 1.0 to determine the stock status. These values are often used in fisheries with limited data to show possible reproductive risk [32]. A study suggested that SPR targets of about 40% could be used as management goals [33], but this study mainly uses SPR as a diagnostic metric to see how the inferred status of the illustrative case changes when different temporal grouping schemes are used.
2.6. Sensitivity Studies
To contextualise the influence of biological uncertainty relative to temporal grouping effects, sensitivity analyses were conducted by varying key life history parameters within plausible bounds. These analyses were not intended to exhaustively characterise parameter uncertainty but rather to assess whether the direction of stock status inference across grouping schemes was robust to reasonable misspecification of growth and mortality parameters. Therefore, we performed sensitivity analysis exclusively under scenario S4 (five-year time blocks), which provides the highest temporal resolution among the grouping schemes. This scenario retains the strongest temporal signal in terms of fishing pressure, selectivity, and spawning potential, and therefore represents the most informative and conservative framework for evaluating estimates of uncertainty in life history parameters. For growth-related parameters (L∞ and M/K), which are usually less well constrained, we used ±20%. For length at maturity (Lmat50), which is usually less uncertain because of physiological and reproductive constraints [31], we used ±10%. This generated a series of scenarios reflecting underestimation (L∞ × 0.8, M/K × 0.80, Lmat50 × 0.9) and overestimation (L∞ × 1.2, M/K × 1.2, Lmat50 × 1.1) of each parameter, alongside the reference base case model (B_c). We maintained the main reference points F/M, SPR, SL50, SL95 to measure bias across. A comparison was made between the original estimates of stock status reference points (reference setting or B_c) using the unmodified life history priors and the modified reference points resulting from the input prior misspecification. The percentage in the stock status was calculated using the following relationship in Equation (1):
where P is the value of the reference points when the life history priors are not misspecified and Q is the value obtained when the priors are misspecified by ±20% and ±10. The uncertainty in the life history parameters increases with the percentage change in value being farther from zero [19].
2.7. Comparative Analysis of Stock Status Inference
We focused our primary analysis on comparing the inferred status of the illustrative case diagnosis across grouping scenarios. For each scenario, we calculated the percentage of temporal blocks where SPR fell below 0.30 and F/M exceeded 1.0 using baseline LBSPR estimates obtained [29] with unmodified life history priors (B_c) in Table 3. Agreement and disagreement in the overall inferred status of the illustrative case (classified as “sustainable” or “unsustainable” based on these thresholds) among scenarios were then assessed to highlight the consequences of the grouping choice.
3. Results
3.1. Life History Parameters Used for LBSPR Estimation
Life history parameters informing the LBSPR analysis are summarised in Table 4 and Supplementary Figure S3. Parameter estimates indicate a fast-growing, short-lived stock, characterised by a relatively high natural mortality-to-growth ratio (M/K > 1) and early maturation relative to maximum observed size. These parameters were held constant across all temporal aggregation scenarios to ensure that differences in stock status inference arose solely from data structuring rather than from variation in biological inputs.
3.2. Effects of Temporal Aggregation on LBSPR Reference Points
LBSPR-derived reference points varied with temporal grouping (Table 5). Fully pooled data (S1) yielded SPR and F/M estimates near standard biological thresholds. Introducing temporal structures (S2–S4) increased variability, with SPR often below the precautionary threshold (≈0.30) and F/M frequently above one. Finer grouping (S4) produced SPR values ranging from 0.15 to 0.39 and F/M from 1 to over 9, while broader schemes lessened variability and exploitation signals. Selectivity parameters also shifted: earlier, temporally resolved periods showed longer SL50 and SL95; recent periods had shorter lengths and narrower spreads, with selectivity spread (ΔL_c_) decreasing over time in S3 and S4 (Table 5). Detailed LBSPR model-derived reference points for individual years and model fits to annual length-frequency data for the illustrative case species are documented in Supplementary Table S1 and Figure S2, respectively.
3.3. Stability and Agreement of Inferred Status of Illustrative Case Classification Across Grouping Schemes
The inferred status of the illustrative case differed depending on the temporal grouping scheme applied. S1 classified the stock as sustainable based on a single grouped estimate. In contrast, all scenarios incorporating the temporal structure classified the stock as unsustainable, with the proportion of periods exceeding the biological thresholds increasing with temporal resolution. In S4, 83% of periods exhibited SPR values below 0.30 and F/M values above 1. These results are summarised in Table 6.
3.4. Temporal Patterns in LBSPR Output Indicators
Figure 3 summarises the temporal patterns in LBSPR reference points across the grouping scheme. For SPR and F/M, inverse trajectories were displayed, with periods of elevated fishing pressure corresponding to reduced reproductive potential. Under S2, S3, and S4, prolonged intervals of low SPR and high F/M were evident prior to the most recent period, whereas these signals were absent under S1. Selectivity parameters exhibited concurrent temporal shifts. Earlier periods were characterised by higher SL50 and SL95 values and broader selectivity spreads (ΔL_c_), while more recent periods showed reduced selectivity lengths and narrower ΔL_c_ values. These patterns were most clearly resolved under the five-year grouping scheme and were progressively muted under broader grouping. A general overview of the length-based selectivity ogives for Frigate tuna purse seine fisheries (1994–2023) estimated by the LBSPR model is presented in Supplementary Figure S4.
3.5. Sensitivity of LBSPR Estimates to Life History Uncertainty
Sensitivity analyses performed in S4 demonstrated that LBSPR-derived reference points consistently reacted to changes in life history parameters (Figure 4 and Supplementary Table S2). Major changes in SPR and F/M happened when the L∞ was misspecified, and the second most changes were documented when the M/K ratio was wrongly recorded. In contrast, changes in length at 50% maturity (Lmat50) caused only small differences from baseline estimates. In all sensitivity scenarios, selectivity parameters (SL50 and SL95) showed little response to life history misspecification, and the relative temporal patterns in SPR and F/M remained the same.
4. Discussion
4.1. Temporal Grouping Alters Length-Based Inference
The primary finding of this study is that the temporal grouping of length-frequency data, a routine practice in data-limited stock assessments, can substantially alter the inference of stock status within a single length-based model. When length data were fully pooled (highly grouped dataset (S1)) across the entire period under consideration, LBSPR produced an apparently benign assessment for the illustrative case species. On the other hand, progressively finer temporal groupings (S2–S4) revealed historical or extended periods of reproductive depletion followed by partial and recent recovery. This divergence is not merely a matter of precision with increased resolution but a fundamental change in the inference arising from how data are grouped or structured before being subjected to analysis. Importantly, the “period” effects observed here may reflect the aggregation of multiple temporally structured processes that are embedded within the ICCAT Task II length data, including seasonal variation in fishing activity, fish availability, and recruitment pulses that are characteristic of tropical tuna purse seine fisheries. When length composition is averaged across periods that encompass contrasting temporal conditions or exploitation intensities, these structured signals can be obscured. By averaging length composition across periods characterised by contrasting exploitation histories, coarse pooling can hide transient dynamics and effectively shift the reference baseline against which the status is interpreted. This effect is conceptually analogous to the “shifting baseline” phenomenon described by [35] and [36] whereby long-term averaging masks historical depletion. Importantly, this masking occurs even when the same data, biological/life history priors, and the assessment model are used, underscoring that grouping is a structural decision rather than a neutral preprocessing step.
4.2. Mechanisms Underlying Grouping Effects
There are several mechanisms that explain why temporal grouping influences LBSPR inference and subsequently the illustrative case species. In the first instance, LBSPR assumes that length distributions within an assessment unit reflect a broadly stable selectivity pattern and sampling process over the analysis window [14,15]. When data are pooled across periods spanning changes in fishing operation, fishing grounds, or gear deployment, the resulting length distribution becomes a mixture of selectivity regimes. This combination of mixtures can obscure the estimated selectivity ogive and reduced signals of size truncation that drive SPR estimation [5]. This mechanism is especially important for tropical tuna purse seine fisheries, where the growth and improvement of drifting fish aggregating devices (FADs) have changed how fish are caught and the size of the catch over time [13,37]. Pooling across these phases may therefore obscure shifts in exploitation patterns that are biologically meaningful but temporally structured.
A second related mechanism arises from demographic non-stationarity. In short-lived species with rapid population turnover, interannual variability in recruitment and cohort strength can drive pronounced and directional shifts in length structure that reflect genuine changes in population dynamics rather than random sampling noise. When data from different periods are grouped, these dynamics are treated as random variation around an assumed equilibrium state. Research using simulations and statistical estimates has shown that changes in equilibrium and recruitment can influence the accuracy of length-based assessment methods, such as LBSPR [8]. Further, when length data from such periods are temporally grouped, these cohort-driven signals are effectively averaged out and treated as equilibrium variation, despite the population being in transition. The implications of such averaging for length-based reference points have been discussed by [7], who showed that departures from equilibrium and interacting recruitment variability can bias length-based inference when equilibrium assumptions are violated [7]. In reality, such grouping can create confusion, as equilibrium assumptions may not hold when applied to data from non-equilibrium situations, potentially leading to the estimation of inaccurate reference points, including SPR [11].
4.3. Implications for Length-Based Assessment Practice
SPR is widely used in data-limited fisheries because it links exploitation to reproductive capacity through the per-recruit theory and can be interpreted without full age-structured assessments [16]. The LBSPR framework operationalises this concept by estimating SPR using length compositions based on assumptions about growth, natural mortality, maturation, and selectivity [14]. This approach could possibly allow fisheries managers to make informed decisions regarding sustainable harvesting levels while accounting for the biological characteristics of the stock. By integrating these factors, the LBSPR framework enhances the ability to assess the health of fish populations even with limited data (e.g., [38]). Because SPR inference in LBSPR is driven by size truncation relative to life history, input under equilibrium approximation [14,27], the choice of temporal grouping scheme directly influences how result should be interpreted [11]. Pooling across the full time series addresses long-term average status under an implicit stationarity assumption [14], whereas finer temporal grouping allows evaluation of whether reproductive signals are consistent or vary through time [15]. Recruitment and fishing mortality are often time-varying in real fisheries and may violate equilibrium assumptions [9]. In the present study, these alternative grouping choices produced meaningfully different SPR inferences from the same dataset within a single assessment framework. This diagnostic approach can help avoid over-interpreting pooled results when non-stationarity is plausible [5,11], and this approach not only reduces the risk of overly optimistic inferences about stock status from coarse temporal pooling but also supports precautionary interpretation in data-limited fisheries, which is a key principle in biodiversity conservation and sustainable resource use, particularly in data-poor fisheries where assessment uncertainty is high and management decisions often rely on limited information. According to [1], such a conservation of biodiversity resources, particularly small pelagic commercial fish stocks, can also be achieved through the reduction in fishing pressure for resource sustainability.
From a practical standpoint, reliance on a single, arbitrarily chosen grouping scheme risks making stock status conclusions based on inference contingent on that choice rather than on underlying stock dynamics. For species that have short lifespans, fast-turnover species, aggregation blocks exceeding several years may be insufficient to detect changes in exploitation pressure. It is important to confirm key results such as SPR and F/M using different methods of grouping the data over time. This should be a routine part of the assessment process, much like testing how sensitive results are to changes in biological assumptions [39,40]. This approach follows ICES recommendations, which emphasise the value of transparency, careful diagnostics, and a cautious approach when interpreting results in situations where data are limited [41,42].
4.4. Aggregation Bias Within the Broader Uncertainty Framework and Case Study Perspective
Uncertainty in fishery stock assessments is commonly classified into process, observation, and model structural uncertainty [43,44,45]. Temporal grouping bias is a type of model structural uncertainty, and more specifically, it relates to how data are organised. In our sensitivity analysis, we used the most detailed time grouping on purpose. The results show that uncertainty in biological parameters can change the value of LBSPR outputs but does not outweigh the strong effect that the way data are grouped over time has on conclusions about stock status. This finding aligns with the increasing acknowledgement that “meta-decisions” in assessment, such as data filtering, standardisation, and aggregation, can have effects that are equal to or exceed those of parameter uncertainty [8,45]. The observed reduction in selectivity spread (ΔL_c_) with a finer grouping scheme (S4) further demonstrates that pooling may conceal trends in fishery operations, including alterations in size selectivity, which are critical for understanding exploitation dynamics [12].
Using Frigate tuna in the Northeast Atlantic as an illustrative case strengthens the methodological argument rather than limiting its generality. The fishery’s evolution, characterised by increasing FAD use and fishing efficiency, creates temporal variability that coarse grouping is ill-suited to capture [13]. While the recovery signal in the most recent five-year block is plausible, this case demonstrates how long-term pooling can place disproportionate weight on recent conditions and obscure historical dynamics. This scenario is common in data-limited fisheries, where researchers face a trade-off between sample size and temporal resolution. Our analysis demonstrates that, for this illustrative case, disaggregated analyses, even with higher variance, are more likely to reveal biologically meaningful trends relevant for adaptive management [18]. Importantly, this study isolates temporal grouping as a single source of uncertainty, holding other factors constant to clarify its specific influence on the illustrative case.
4.5. Limitation
This study is designed to figure out the effect of temporal grouping within a single length-based framework, and its limitations follow from standard LBSPR assumptions. In particular, equilibrium is assumed within each temporal grouping period or block. If exploitation intensity, fishing effort, or selectivity changes rapidly within a block, even five-year groupings may oversimplify underlying dynamics [14].
From an exploitation view, the LBSPR inference indicates the cumulative impact of fishing pressure throughout the grouping period rather than temporary variations in exploitation. Thereby, temporal grouping can smooth out times when exploitation is high or low, which could hide short-term periods of overfishing or recovery when broad groupings are used. Additionally, compositional sampling design, non-independence, and changes in effective sample size from year to year can influence length-based inference, which can make stability unstable when using high-resolution groupings [46,47]. Furthermore, fishery-specific factors like moving fishing grounds, changing FAD densities, or improving buoy technology can cause selectivity processes that are not stable and are difficult to show fully in a simple length-based diagnostic model [13,37]. Future research could address these limitations through simulation estimation studies that explicitly treat temporal aggregation as an experimental factor, complementing existing evaluations of data-limited methods [5,8]. Further development of workflows that integrate temporally resolved length-based diagnostics with explicit modelling of compositional overdispersion would also strengthen inference [48,49].
5. Conclusions
The current study demonstrates that stock status inference from length-based assessments can depend materially on how length observations are grouped over time. Coarse temporal pooling can obscure patterns that become evident under finer temporal resolution (S1) even when identical data, models, and biological priors are used. Using Frigate tuna as an illustrative case strengthens this conclusion by showing the effect in a real fishery context where temporal non-stationarity is plausible, rather than only in simulation. The practical implication is that temporal grouping should be treated as a transparent, justified, and routinely tested decision in length-based assessment workflows, particularly in fisheries where selectivity and exploitation dynamics are likely to change over time. This study shifts the focus from a stock-specific assessment to a critical examination of a common assessment practice. By reframing temporal aggregation as a structural decision rather than a neutral preprocessing step, this work shifts attention from stock-specific outcomes to a critical, yet often overlooked, aspect of assessment practice. Based on these findings, we suggest the following diagnostic considerations for length-based assessments: (i) treat temporal grouping as a sensitivity dimension, not a single fixed preprocessing choice, (ii) justify grouping choices based on practical factors, and (iii) report selectivity diagnostics and size range coverage, ensuring transparent reporting and scenario-based evaluation in a data-limited fisheries context.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hong X. Zhang K. Li J. Xu Y. Sun M. Xu S. Chen Z. Stock assessment of the commercial small pelagic fishes in the Beibu Gulf, the South China Sea, 2006–2020 Biology 20241322610.3390/biology 1304022638666839 PMC 11048411 · doi ↗ · pubmed ↗
- 2Hilborn R. Walters C.J. Quantitative Fisheries Stock Assessment: Choice, Dynamics and Uncertainty Springer Science & Business Media Berlin/Heidelberg, Germany 2013 Available online: https://www.academia.edu/download/48361346/bf 0004288320160827-27539-a 5xhv 8.pdf(accessed on 10 October 2024)
- 3Maunder M.N. Punt A.E. A review of integrated analysis in fisheries stock assessment Fish. Res.2013142617410.1016/j.fishres.2012.07.025 · doi ↗
- 4Pons M. Kell L. Rudd M.B. Cope J.M. Lucena-Fredou F. Performance of length-based data-limited methods in a multifleet context: Application to small tunas, mackerels, and bonitos in the Atlantic Ocean ICES J. Mar. Sci.20197696097310.1093/icesjms/fsz 004 · doi ↗
- 5Chong L. Mildenberger T.K. Rudd M.B. Taylor M.H. Cope J.M. Branch T.A. Wolff M. Stäbler M. Performance evaluation of data-limited, length-based stock assessment methods ICES J. Mar. Sci.2020779710810.1093/icesjms/fsz 212 · doi ↗
- 6Froese R. Keep it simple: Three indicators to deal with overfishing: Ethimology of Ghoti George Fish Fish.20045869110.1111/j.1467-2979.2004.00144.x · doi ↗
- 7Cope J.M. Punt A.E. Length-based reference points for data-limited situations: Applications and restrictions Mar. Coast. Fish.2009116918610.1577/C 08-025.1 · doi ↗
- 8Pons M. Cope J.M. Kell L.T. Comparing performance of catch-based and length-based stock assessment methods in data-limited fisheries Can. J. Fish. Aquat. Sci.2020771026103710.1139/cjfas-2019-0276 · doi ↗
