Formation-Constrained Cooperative Localization for UAV Swarms in GNSS-Denied Environments

Qin Li; Peng Wang; Xiaochun Li; Jieyong Zhang; Ying Luo; Wangsheng Yu; Haiyan Cheng

PMC · DOI: 10.3390/s26061984 · March 22, 2026

Open Access

TL;DR

This paper introduces a new method for improving drone swarm localization in areas without GPS by using formation geometry.

Contribution

A formation-constrained cooperative localization method is proposed for UAV swarms in GNSS-denied environments.

Findings

(1) The proposed method improves localization success rate, reliability, and stability in simulations.

(2) It adapts well to asymmetric formations, making it suitable for practical applications.

(3) Formation constraints are integrated into localization algorithms to enhance accuracy.

Abstract

Cooperative localization is critical for UAV swarm operations in GNSS-denied environments. The backbone-listener scheme, using a small subset of agents as active backbone nodes and others as passive listeners, offers notable advantages in reducing communication overhead and enhancing swarm scalability. Building on this scheme, we propose a formation-constrained cooperative localization method to improve accuracy by integrating known formation geometry into the localization process. First, backbone node selection uses a formation-constrained greedy node activation (GNA) strategy with weighted distance fusion, combining measured and ideal formation distances to enable near-optimal selection aligned with formation structure. Second, listener node localization incorporates formation constraints into Chan’s algorithm, paired with angle-of-arrival (AOA) refinement, to ensure estimated…

Funding

Natural Science Basic Research Program of Shaanxi Province
China Postdoctoral Science Foundation

Keywords

cooperative localization · formation constraints · UAV swarm · GNSS-denied environments · backbone-listener scheme

Taxonomy

Topics: Indoor and Outdoor Localization Technologies · UAV Applications and Optimization · Distributed Control Multi-Agent Systems

Full text

1. Introduction

With the continuous development of swarm control technologies and the advancement of fields such as the low-altitude economy and intelligent infrastructure monitoring, multi-UAV swarms have become a key platform for transforming aerial operations [1,2]. Their application value is increasingly prominent in scenarios such as full-domain reconnaissance, disaster rescue, and collaborative power grid inspection [3]. Accurate positioning and navigation capabilities are a key prerequisite for efficient collaborative operations of UAV swarms [4]. Currently, mainstream solutions mainly rely on the Global Navigation Satellite System (GNSS) to obtain position information. However, in complex environments like urban canyons, mountain gullies, and electromagnetically jammed battlefields, receivers tend to lose lock on GNSS signals because of blockage, multipath reflection, or intentional jamming. This leads to problems such as swarm positioning drift, interrupted collaborative tasks, and even UAV collisions [5,6]. Cooperative localization has emerged as a promising solution for GNSS-denied scenarios [7]. Ultra-Wideband (UWB) technology offers high-precision ranging capability (centimeter-level positioning accuracy) due to its ultra-wide operating bandwidth. It also has strong resistance to multipath interference and electromagnetic interference, as well as excellent adaptability to non-line-of-sight (NLOS) environments. Thus, it has become the preferred technology for UAV swarm positioning in GNSS-denied environments [8,9,10,11].

Cooperative localization for multi-UAV systems has received significant attention. Existing localization algorithms can be classified into three categories: anchor node-based, multidimensional scaling (MDS)-based, and distributed cooperative methods.

Anchor node-based positioning algorithms achieve target position estimation by deploying fixed, known-position anchor nodes to establish a local positioning network. Their core mechanism relies on obtaining signal parameters between unknown nodes (target) and anchor nodes through radio frequency (RF) communications—including Time of Arrival (TOA), Time Difference of Arrival (TDOA), and Angle of Arrival (AOA)—to estimate positions via triangulation or hyperbolic methods. This type of algorithm has several typical variants for different application scenarios: TOA/TDOA/AOA localization algorithms use a single signal metric, with simple principles but poor resistance to multipath noise interference, thus requiring high-density anchor nodes to guarantee accuracy [4,12,13,14,15]. The Chan algorithm improves TDOA localization accuracy via two-step least squares estimation, with high computational efficiency [16]. Hybrid TDOA/AOA localization algorithms combine time difference and angle information to reduce the localization ambiguity of single-metric algorithms, thus improving stability under non-line-of-sight (NLOS) conditions [17]. A common limitation of such algorithms is the high cost of anchor node deployment. Moreover, fixed anchor nodes cannot dynamically follow mobile multi-agents, making them difficult to adapt to dynamically changing network topologies and limiting their applicability in scenarios such as UAV swarms.

To reduce dependence on anchor nodes, MDS-based positioning algorithms are developed. They do not require fixed anchor nodes, construct a similarity matrix based on inter-node relative measurements (e.g., distance or angle), and recover relative positions via MDS, serving as a core solution for infrastructure-free localization. Classical MDS (C-MDS) constitutes the foundation of such algorithms and achieves position reconstruction with high accuracy via distance measurements in fully connected networks [18]. Nyström approximation-based MDS reduces complexity via node sampling, balancing accuracy and efficiency [19]. Hierarchical MDS employs network clustering to enable distributed computation, thereby reducing centralized bottlenecks [20]. Distributed noise-robust MDS adds noise suppression to reduce error propagation [21]. Three-dimensional distributed MDS extends to 3D space for UAV networks [22]. Complex MDS optimizes nonlinear model solution efficiency for wide-band and mobile scenarios [23]. Continuous optimization has addressed issues of complexity, robustness, and dimension adaptation, making it the preferred solution for infrastructure-free scenarios.

Although continued refinement has made MDS-based algorithms the preferred choice for infrastructure-free localization, their applicability in large-scale UAV swarms is limited by the inherent need for full-mesh communication. This causes excessive overhead, which conflicts with the real-time requirements of swarm cooperative localization and has spurred the development of distributed cooperative localization algorithms.

Distributed cooperative localization algorithms are specifically designed for distributed multi-agent networks, achieving global relative positioning through local inter-node communication and cooperation without a centralized processing unit [24]. Guo et al. proposed a UWB-IMU fusion method for multi-UAV positioning, reducing anchor node dependence [25]. Lv et al. proposed a UWB-based distributed Kalman filtering method for cooperative localization, enabling high-precision positioning in partially GNSS-denied environments [26]. Additionally, the backbone-listener localization algorithm proposed in recent studies selects backbone nodes via the GNA strategy, enables passive localization of listener nodes, and optimizes accuracy through back calibration. This distributed architecture achieves decimeter-level accuracy and supports large-scale network expansion [27].

Distributed cooperative localization algorithms have addressed the limitations of anchor node-based and MDS-based methods, with promising performance in UAV swarms. However, for complex swarm operations, conventional methods fail to sufficiently exploit formation geometry information as complementary prior knowledge, limiting accuracy and robustness in dynamic conditions. Notably, in practical UAV swarm operations, the formation geometry, as a basic constraint for mission execution, contains valuable spatial correlation information that has not been sufficiently integrated into localization optimization. While extensive literature has focused on formation control [28,29], few existing studies utilize relevant formation information in localization. Specifically, Schindler et al. developed a relatively infrastructure-free localization algorithm for swarm formations [30], yet their work lacks the explicit incorporation of formation constraints as a priori knowledge. This reflects a broader research gap wherein formation control and localization have traditionally been investigated as isolated research domains, with insufficient attention paid to the cross-integration of formation-derived information into localization processes.

This gap motivates the need for a systematic approach that explicitly incorporates formation constraints as a priori knowledge in the localization optimization process. This work aims to add to existing research by directly integrating a priori formation geometric constraints into cooperative localization optimization, thereby seeking to develop a unified optimization framework that may jointly improve localization accuracy and robustness. In practical UAV swarm operations, maintaining predetermined formation patterns (e.g., linear, circular, or grid formation) is a basic requirement for mission execution; when appropriately used as an a priori constraint, such formation-related geometric information may provide supplementary spatial correlation indicators to help reduce localization ambiguity. The primary contributions of this work are outlined as follows:

(1) A formation-constrained greedy node activation (GNA) strategy is proposed, which integrates formation geometry information through weighted distance fusion to improve backbone node selection and may enable near-optimal selection that aligns with the formation structure.

(2) A formation-constrained listener localization method is proposed, which integrates formation constraints into Chan’s algorithm with angle-of-arrival (AOA) refinement, helping ensure estimated positions conform to expected inter-agent distances.

(3) A global formation constraint optimization stage is proposed, which uses gradient descent-based refinement to strengthen formation constraints across all agent positions after initial localization.

The remainder of this paper is organized as follows: Section 2 formulates the problem. Section 3 presents the cooperative localization algorithm with formation constraints. Section 4 analyzes theoretical aspects of the proposed method. Section 5 presents numerical experiments. Section 6 concludes the paper.

2. Problem Formulation

This section introduces the network setting with a backbone-listener scheme, explains the measurement model, and formulates the cooperative localization problem with formation constraints. Table 1 summarizes the mathematical notation used throughout this section and the remainder of the paper.

2.1. Network Setting

We consider a two-dimensional network that consists of N agents, whose set is denoted as $[eqn]$. (The present analysis is confined to the planar case for clarity. Extension to 3D requires augmenting the state and AOA models; the main algorithmic structure remains applicable, with limitations discussed in the conclusion.) For agent $[eqn]$, the global position is denoted as $[eqn]$, and the set of its neighbor agents is denoted as $[eqn]$, with $[eqn]$ elements. In the local coordinate system of agent i, the relative position of any other agent $[eqn]$ is $[eqn]$. The relative position parameter of the formation around agent i is then denoted as $[eqn]$.

Additionally, the global orientation parameter vector is denoted as $[eqn]$ , where $[eqn]$ is the orientation of agent i. The global position parameter vector is defined as $[eqn]$ , and the estimations of $[eqn]$ and $[eqn]$ are denoted as $[eqn]$ and $[eqn]$ , respectively.

The network adopts a backbone-listener scheme [27]. Backbone nodes actively transmit and receive wireless signals to establish mutual constraints; listener nodes passively receive signals for localization, avoiding redundant communication. The set of $[eqn]$ backbone agents is denoted as $[eqn]$ , with $[eqn]$ , and the set of $[eqn]$ listener agents as $[eqn]$ , with $[eqn]$ . The scheme is illustrated in Figure 1. Roles may be fixed or switched; backbone selection can be preset or determined by a node activation strategy (see Section 3.1).

2.2. Measurement Model

Agent $[eqn]$ transmits a wideband signal to agent i. Agent i receives the signal and measures the Time of Arrival (TOA) and Angle of Arrival (AOA).

TOA Measurement: The measured distance between agent i and agent j is

[eqn]

where c is the signal propagation speed and $[eqn]$ is the time measurement noise. The TOA measurement is

[eqn]

For each time slot, the distance between agent i and agent j is:

[eqn]

The unknown parameter θ in the localization problem is

[eqn]

where $[eqn]$ , $[eqn]$ .
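As a rough illustration of the TOA ranging model above, the sketch below turns a noisy time of flight into a distance. The additive zero-mean Gaussian timing noise, the free-space propagation speed, the function names `true_distance` and `measured_distance`, and all numeric values are assumptions made for this example; the paper's own expressions are the equations given above.

```python
import math
import random

C = 299_792_458.0  # assumed propagation speed in m/s (radio signal in free space)

def true_distance(p_i, p_j):
    """Euclidean distance between two 2D positions."""
    return math.hypot(p_i[0] - p_j[0], p_i[1] - p_j[1])

def measured_distance(p_i, p_j, sigma_t, rng):
    """Range from a noisy time of arrival: c * (tau + n), n ~ N(0, sigma_t^2) (assumed noise model)."""
    tau = true_distance(p_i, p_j) / C
    return C * (tau + rng.gauss(0.0, sigma_t))

rng = random.Random(0)
p_i, p_j = (0.0, 0.0), (30.0, 40.0)
d_hat = measured_distance(p_i, p_j, sigma_t=1e-9, rng=rng)
range_sigma = C * 1e-9  # a 1 ns timing error corresponds to roughly 0.3 m of range error
```

The last line makes the scale explicit: timing noise is multiplied by the propagation speed, so nanosecond-level timing errors already produce decimeter-level range errors.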

AOA Measurement: Note that the orientation $[eqn]$ and the AOA measurement $[eqn]$ are different parameters, as shown in Figure 1. The AOA measurement model differs for backbone and listener agents due to their different roles in the network.

For backbone agents, both sides of AOA measurements are available; thus, the relationship between orientation and AOA is given by

[eqn]

where $[eqn]$ is the AOA measurement at agent i from agent j, and $[eqn]$ is the orientation of agent i.

For listener agents, only one direction of AOA is available. As illustrated in Figure 1, when listener agent $[eqn]$ receives a signal from backbone agent $[eqn]$, the AOA measurement $[eqn]$ is available. However, the measurement $[eqn]$ is missing. Therefore, the orientation of a listener agent is estimated from the listener’s position estimate $[eqn]$ as

[eqn]

where $[eqn]$ is the estimated position of listener agent k.

Then, the cooperative localization problem is formulated as

[eqn]

where $[eqn]$ and $[eqn]$ are the predicted range and bearing between agents i and j.
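To make the structure of such a range-and-bearing fitting problem concrete, here is a minimal sketch of a least-squares cost over predicted and measured values. The residual form, the convention that a predicted AOA equals the global bearing minus the receiver's orientation, the noise scales `sigma_d` and `sigma_a`, and the dictionary layout are all assumptions for illustration, not the paper's exact objective.

```python
import math

def wrap_angle(a):
    """Wrap an angle to [-pi, pi]."""
    return math.atan2(math.sin(a), math.cos(a))

def measurement_cost(positions, orientations, ranges, bearings,
                     sigma_d=1.0, sigma_a=1.0):
    """Sum of squared, noise-normalized range and bearing residuals.

    ranges[(i, j)]   : distance measured at agent i from agent j
    bearings[(i, j)] : AOA measured at agent i from agent j (i's local frame)
    """
    cost = 0.0
    for (i, j), d_meas in ranges.items():
        dx = positions[j][0] - positions[i][0]
        dy = positions[j][1] - positions[i][1]
        cost += ((math.hypot(dx, dy) - d_meas) / sigma_d) ** 2
    for (i, j), a_meas in bearings.items():
        dx = positions[j][0] - positions[i][0]
        dy = positions[j][1] - positions[i][1]
        # Assumed convention: predicted AOA = global bearing - orientation of i.
        a_pred = math.atan2(dy, dx) - orientations[i]
        cost += (wrap_angle(a_pred - a_meas) / sigma_a) ** 2
    return cost

positions = {0: (0.0, 0.0), 1: (4.0, 3.0)}
orientations = {0: 0.2, 1: -0.1}
ranges = {(0, 1): 5.0}
bearings = {(0, 1): math.atan2(3.0, 4.0) - 0.2}
exact_cost = measurement_cost(positions, orientations, ranges, bearings)
```

Wrapping the angular residual keeps a bearing error near ±π from being counted as a large error when the two angles are in fact close.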

3. Cooperative Localization with Formation Constraints

This section presents the cooperative localization algorithm with formation constraints based on the backbone-listener scheme. We first introduce the formation-constrained greedy node activation strategy for activating backbone nodes. Subsequently, we perform localization of backbone and listener nodes based on the backbone-listener scheme, with a detailed description of the formation-constrained listener localization method.

3.1. Formation-Constrained Greedy Node Activation (GNA)

The selection of backbone agents can be formulated as an optimization problem that aims to minimize the relative localization error of the network. Directly minimizing this error is a non-convex problem, and convex-relaxation-based approximations tend to be computationally intensive for practical swarm systems. Reference [27] adopts a suboptimal yet computationally efficient GNA strategy, which follows two principles: (1) the localization performance degrades when listener agents lie outside the convex hull formed by the transmitting agents, and (2) the direction of the received signal affects the major axis of the “information ellipse”. Thus, backbone agents should be spread as widely as possible around the listener agents to maximize information gain.

The standard GNA relies only on noisy measured distances, which may not reflect the actual formation structure. To address this limitation, this work proposes a formation-constrained GNA strategy that integrates formation geometry information to select more suitable backbone nodes. To achieve this integration, this work introduces a weighted distance fusion approach that fuses measured distances with ideal inter-agent distances. The core insight is that the measured distance $[eqn]$ (from noisy measurements) may be unreliable, while the ideal inter-agent distance $[eqn]$ (derived from prior formation knowledge) reflects the intrinsic geometric potential. Thus, this work fuses these two distance estimates through a weighted fusion equation,

[eqn]

where $[eqn]$ is the measured distance between agents i and j, and $[eqn]$ is the expected formation distance. The adaptive weight $[eqn]$ is defined as

[eqn]

where $[eqn]$ is a threshold parameter that determines when formation information becomes dominant. When $[eqn]$ is small (weak formation constraints), $[eqn]$ and the algorithm behaves like standard GNA. When $[eqn]$ is large (strong formation constraints), $[eqn]$ and formation geometry plays a more significant role. The fused distance $[eqn]$ provides a more reliable estimate that utilizes both measurement data and formation prior knowledge. When measurement noise causes $[eqn]$ to deviate significantly, the ideal distance $[eqn]$ pulls it back to the true geometric level, preventing the GNA algorithm from being misled by noisy measurements.
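The fusion idea can be sketched in a few lines. The convex-combination form and, in particular, the saturating weight `lam / (lam + lam_0)` are stand-ins chosen only to reproduce the limiting behavior described above (weight near 0 for weak constraints, near 1 for strong ones); the names `adaptive_weight`, `fused_distance`, `lam`, and `lam_0` are hypothetical, and the actual weight is the one defined in the equation above.

```python
def adaptive_weight(lam, lam_0):
    """Hypothetical weight in [0, 1): ~0 when lam << lam_0, approaching 1 when lam >> lam_0."""
    return lam / (lam + lam_0)

def fused_distance(d_meas, d_ideal, lam, lam_0):
    """Convex combination of the measured and the ideal formation distance (assumed form)."""
    w = adaptive_weight(lam, lam_0)
    return (1.0 - w) * d_meas + w * d_ideal

# A measurement of 12 m against an ideal spacing of 10 m:
no_prior = fused_distance(12.0, 10.0, lam=0.0, lam_0=1.0)      # measurement only
balanced = fused_distance(12.0, 10.0, lam=1.0, lam_0=1.0)      # equal trust
strong_prior = fused_distance(12.0, 10.0, lam=1e6, lam_0=1.0)  # close to the ideal distance
```

With this form, a noisy measurement can never move the fused value outside the interval between the measured and the ideal distance.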

The improved potential field function is then computed based on the fused distance

[eqn]

where $[eqn]$ is a scaling parameter.

The formation-constrained GNA algorithm is shown in Algorithm 1.

Algorithm 1 Formation-constrained Greedy Node Activation

1:Input: $[eqn]$ , $[eqn]$ ( $[eqn]$ ), $[eqn]$ , $[eqn]$ , $[eqn]$
2:Output: $[eqn]$
3:Compute $[eqn]$ and $[eqn]$ from $[eqn]$
4:Initialize $[eqn]$ , $[eqn]$
5:for each starting agent $[eqn]$ do
6: Compute fused distances from i: $[eqn]$ , $[eqn]$
7: Find $[eqn]$ ; compute $[eqn]$ from j and find $[eqn]$
8: Initialize $[eqn]$ , $[eqn]$
9: while $[eqn]$ do
10: Find $[eqn]$ and add $[eqn]$ to $[eqn]$
11: Set $[eqn]$
12: end while
13: Compute $[eqn]$
14: if $[eqn]$ then
15: Update $[eqn]$ , $[eqn]$
16: end if
17:end for
18:Return $[eqn]$
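The loop structure of Algorithm 1 can be approximated by the following greedy dispersal heuristic. This is a sketch in the spirit of the algorithm, not a transcription of it: the exponential potential, the seeding rule (farthest agent from the start, then farthest from that one), and the names `potential`, `greedy_backbone`, and `sigma` are assumptions, and the input is taken to be a symmetric matrix of already fused distances.

```python
import math
from itertools import combinations

def potential(d, sigma):
    """Assumed repulsive potential: large for close pairs, small for distant pairs."""
    return math.exp(-d / sigma)

def greedy_backbone(fused, n_backbone, sigma=10.0):
    """Greedy selection of n_backbone >= 2 agents from a symmetric fused-distance matrix."""
    n = len(fused)
    best_set, best_cost = None, math.inf
    for start in range(n):
        # Seed with the agent farthest from `start`, then the agent farthest from that one.
        j = max((a for a in range(n) if a != start), key=lambda a: fused[start][a])
        k = max((a for a in range(n) if a != j), key=lambda a: fused[j][a])
        chosen = [j, k]
        while len(chosen) < n_backbone:
            rest = [a for a in range(n) if a not in chosen]
            # Add the agent that contributes the least potential to the chosen set.
            nxt = min(rest, key=lambda a: sum(potential(fused[a][b], sigma) for b in chosen))
            chosen.append(nxt)
        cost = sum(potential(fused[a][b], sigma) for a, b in combinations(chosen, 2))
        if cost < best_cost:
            best_set, best_cost = sorted(chosen), cost
    return best_set, best_cost

# Four corners of a 10 m square plus its center; exact distances stand in for fused ones.
points = [(0.0, 0.0), (10.0, 0.0), (10.0, 10.0), (0.0, 10.0), (5.0, 5.0)]
fused = [[math.hypot(p[0] - q[0], p[1] - q[1]) for q in points] for p in points]
backbone, cost = greedy_backbone(fused, n_backbone=4)
```

In this toy layout the heuristic prefers the four corners over the center, which matches the stated principle that backbone agents should surround the listeners.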

Remark 1. Formation constraints and GNA are complementary: the combination yields more consistent measurement configurations that match the expected formation geometry; additional constraints from known inter-agent distances help resolve localization ambiguities, especially under sparse or noisy measurements.

3.2. Backbone Relative Localization

The backbone agents are localized relative to each other using the classical multidimensional scaling (C-MDS) method. The position parameter of backbone agents is denoted as $[eqn]$ .

For a selected agent $[eqn]$ , the coordinate system is established by finding the agent j closest to it and setting the direction from i to j as the x-axis. According to (5), this implies

[eqn]

With backbone agents in signal-transmitting mode, the TOA and AOA measurements are collected as

[eqn]

[eqn]

The distance matrix is constructed as $[eqn]$ . The angle between links from $[eqn]$ and $[eqn]$ towards i is

[eqn]

The angle-based correlation matrix $[eqn]$ is constructed as

[eqn]

Performing singular value decomposition (SVD) of $[eqn]$ yields $[eqn]$ , where $[eqn]$ contains the left singular vectors and $[eqn]$ is a diagonal matrix of singular values. The relative positions of other agents with respect to agent i are estimated as

[eqn]

where $[eqn]$ denotes the first two rows and first $[eqn]$ columns of the distance matrix $[eqn]$ .

The algorithm above is derived under the assumption that there is no noise in the measurement; however, it can still perform estimation in the presence of noise.

3.3. Listener Relative Localization with Formation Constraint

Under the backbone-listener scheme (Section 2.1), listener agents rely on backbone nodes for passive localization. As noted in Section 2.2, $[eqn]$ is unavailable for listeners; hence, it cannot be estimated from (5). The listener relative localization algorithm therefore proceeds as follows. First, Chan’s algorithm [16] obtains an initial position estimate based on TOA measurements. Then, the localization result is refined by incorporating AOA information and formation constraints derived from the initial solution. The detailed procedure is described below.

3.3.1. Listener Relative Localization

Let $[eqn]$ denote the set of backbone neighbors of listener agent k, with $[eqn]$ . The initial position $[eqn]$ is estimated as $[eqn]$ using the Chan algorithm [16]. For listener agent k with $[eqn]$ backbone neighbors, Chan’s algorithm constructs a linear system from TOA measurements. The measurement matrix $[eqn]$ and measurement vector $[eqn]$ are derived from the distance measurements between listener k and its backbone neighbors. The initial position estimate is obtained by solving

[eqn]

where $[eqn]$ is the weighted covariance matrix. Solving (17) yields an initial estimated position $[eqn]$ , which is used to calculate the initial orientation $[eqn]$ through the geometric relationship in (6). The weighted covariance matrix is computed as

[eqn]

where $[eqn]$ is a diagonal distance matrix with elements $[eqn]$ (the measured distance from listener k to backbone neighbor $[eqn]$ ), and $[eqn]$ is the noise covariance matrix with $[eqn]$ being the variance of the TOA measurement noise.
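For intuition about how range measurements lead to a linear system, the sketch below uses a simplified, unweighted linearization: the first range equation is subtracted from the others, which cancels the quadratic terms and leaves a linear least-squares problem in (x, y). This is not the weighted Chan formulation described above; the function name `toa_least_squares` and the anchor layout are hypothetical.

```python
import math

def toa_least_squares(anchors, dists):
    """Linearized 2D TOA fix solved through the 2x2 normal equations.

    Each row is 2(xi - x1) x + 2(yi - y1) y = d1^2 - di^2 + xi^2 + yi^2 - x1^2 - y1^2.
    """
    (x1, y1), d1 = anchors[0], dists[0]
    s_xx = s_xy = s_yy = t_x = t_y = 0.0
    for (xi, yi), di in zip(anchors[1:], dists[1:]):
        ax, ay = 2.0 * (xi - x1), 2.0 * (yi - y1)
        b = d1 ** 2 - di ** 2 + xi ** 2 + yi ** 2 - x1 ** 2 - y1 ** 2
        s_xx += ax * ax
        s_xy += ax * ay
        s_yy += ay * ay
        t_x += ax * b
        t_y += ay * b
    det = s_xx * s_yy - s_xy ** 2
    if abs(det) < 1e-12:
        raise ValueError("anchors are collinear or too few for a 2D fix")
    x = (s_yy * t_x - s_xy * t_y) / det
    y = (s_xx * t_y - s_xy * t_x) / det
    return x, y

# Hypothetical backbone positions and noise-free ranges to a listener at (3, 4).
anchors = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0), (10.0, 10.0)]
truth = (3.0, 4.0)
dists = [math.hypot(truth[0] - ax, truth[1] - ay) for ax, ay in anchors]
estimate = toa_least_squares(anchors, dists)
```

At least three non-collinear backbone neighbors are needed; with noisy ranges, a weighting such as the covariance described above would replace the plain sums.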

Based on the initial estimation $[eqn]$ and estimated orientation $[eqn]$ , AOA information is used to refine the position estimation. According to (6), by linearizing the geometric relationships and applying appropriate approximations, the augmented measurement model becomes

[eqn]

where $[eqn]$ is the augmented measurement matrix combining TOA and AOA measurements, $[eqn]$ is the augmented measurement vector, $[eqn]$ is the number of backbone neighbors of listener agent k, and $[eqn]$ denotes the AOA measurement from listener k to backbone agent j. The covariance matrix $[eqn]$ is updated as

[eqn]

where $[eqn]$ is the variance of the AOA measurement noise from listener k to backbone neighbor j (indexed by $[eqn]$ ). The refined position estimate is then obtained by

[eqn]

The orientation estimation can be updated using $[eqn]$ through the geometric relationship in (6).

3.3.2. Formation-Constrained Listener Localization

When formation constraints are enabled, the constraint terms are incorporated into the optimization. For listener agent k, the formation-constrained refinement extends the augmented system in (19) as:

[eqn]

where $[eqn]$ is the linearized formation constraint matrix for listener agent k, $[eqn]$ is the corresponding constraint vector containing expected formation distances, and $[eqn]$ is a constraint weight parameter for listener localization. The formation-constrained position estimate is then obtained by solving:

[eqn]

where $[eqn]$ is the formation constraint term for agent k.

The formation structure is represented by a distance matrix $[eqn]$ , where $[eqn]$ denotes the expected distance between agent i and agent j in the formation. The formation constraint loss is unified as:

[eqn]

where $[eqn]$ if agents i and j have a formation relationship, and $[eqn]$ otherwise.
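A distance-based formation loss of this kind might be evaluated as follows. The squared distance-residual form, the choice to count each unordered pair once, and the name `formation_loss` are assumptions for illustration; the paper's definition is the equation above.

```python
import math

def formation_loss(positions, d_form, adjacency):
    """Sum over formation-linked pairs of (||p_i - p_j|| - d_ij)^2, each pair counted once."""
    n = len(positions)
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            if adjacency[i][j]:
                dist = math.hypot(positions[i][0] - positions[j][0],
                                  positions[i][1] - positions[j][1])
                total += (dist - d_form[i][j]) ** 2
    return total

# Two linked agents that should be 4 m apart but are estimated 5 m apart.
positions = [(0.0, 0.0), (3.0, 4.0)]
d_form = [[0.0, 4.0], [4.0, 0.0]]
adjacency = [[0, 1], [1, 0]]
loss = formation_loss(positions, d_form, adjacency)
```

Pairs without a formation relationship contribute nothing, so the adjacency pattern controls how rigidly the estimate is tied to the prior geometry.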

3.4. Back Calibration

After listener localization (Section 3.3), backbone positions are refined using measurements from listener agents (Section 2.1), improving overall accuracy [27].

For a listener agent k, let $[eqn]$ denote the estimated positions of its backbone agent neighbors. Let $[eqn]$ and $[eqn]$ be the covariance matrices for agent k and its backbone neighbors, respectively.

Then, a linear back calibration step is written as

[eqn]

where $[eqn]$ stacks the range and AOA measurements between listener k and its backbone neighbors as defined in Section 2.2 (with entries $[eqn]$ , $[eqn]$ , etc.), and $[eqn]$ and $[eqn]$ are parameter matrices with appropriate dimensions to be solved.

A first-order approximation of the measurement model in Section 2.2 around the current estimates yields an equivalent linear model

[eqn]

where $[eqn]$ is the Jacobian matrix, and $[eqn]$ . Here, $[eqn]$ accounts for both measurement noise and the uncertainty of the listener estimate via error propagation

[eqn]

where $[eqn]$ is the measurement noise covariance of $[eqn]$ , and $[eqn]$ is the Jacobian with respect to the listener state $[eqn]$ (Section 2.2).

The goal of back calibration is to minimize the mean squared error (MSE) of backbone agents. The optimization problem can be expressed as

[eqn]

Solving (28) by taking derivatives with respect to $[eqn]$ and $[eqn]$ and setting them to zero, we obtain the closed-form solution

[eqn]

[eqn]

Substituting $[eqn]$ and $[eqn]$ back into (25) yields the calibrated coordinates of the backbone agents.

Next, we update the orientation estimation of backbone agent $[eqn]$ . For such i, let $[eqn]$ denote the set of its backbone neighbors. Using the calibrated positions $[eqn]$ and the geometric relationship in (5), the preliminary orientation estimation is

[eqn]

where $[eqn]$ and $[eqn]$ are the corresponding elements in $[eqn]$ . Combined with the geometric relationship (5), a set of equations can be obtained

[eqn]

Solving (32) by the least squares method, we obtain the calibrated orientation $[eqn]$ of each backbone agent.

3.5. Formation Constraint Optimization

Finally, a global formation constraint optimization is performed to ensure consistency with the known formation geometry.

The formation constraint optimization problem is

[eqn]

subject to maintaining reasonable agreement with the measurement-based estimates.

The optimization uses gradient descent with the following update rule.

Gradient Computation: For distance-based formation constraints, the gradient with respect to agent i’s position is:

[eqn]

provided $[eqn]$ for a small threshold $[eqn]$ .

Update Rule:

[eqn]

where $[eqn]$ is a learning rate and $[eqn]$ is a formation constraint weight for post-processing optimization.

Convergence Criterion: The algorithm terminates when the change between two consecutive iterations becomes sufficiently small, i.e.,

[eqn]

where $[eqn]$ is a prescribed tolerance. A maximum iteration limit $[eqn]$ is also enforced to avoid excessive computation.
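The refinement loop described above can be sketched as plain gradient descent on a distance-based formation loss. This version uses only the formation term and relies on starting from the measurement-based estimates to stay close to them, which is a simplification; the loss form, the names `formation_gradient` and `refine`, and the step size, weight, and tolerance values are assumptions.

```python
import math

def formation_gradient(positions, d_form, adjacency, i, eps=1e-9):
    """Gradient of sum_j a_ij (||p_i - p_j|| - d_ij)^2 with respect to p_i."""
    gx = gy = 0.0
    xi, yi = positions[i]
    for j, (xj, yj) in enumerate(positions):
        if j == i or not adjacency[i][j]:
            continue
        dist = math.hypot(xi - xj, yi - yj)
        if dist <= eps:  # direction undefined for coincident points, so skip the pair
            continue
        scale = 2.0 * (dist - d_form[i][j]) / dist
        gx += scale * (xi - xj)
        gy += scale * (yi - yj)
    return gx, gy

def refine(positions, d_form, adjacency, eta=0.05, lam=1.0, tol=1e-9, max_iter=5000):
    """Update all agents simultaneously until the total position change falls below tol."""
    pos = [tuple(p) for p in positions]
    for _ in range(max_iter):
        grads = [formation_gradient(pos, d_form, adjacency, i) for i in range(len(pos))]
        new = [(x - eta * lam * gx, y - eta * lam * gy)
               for (x, y), (gx, gy) in zip(pos, grads)]
        step = math.sqrt(sum((nx - x) ** 2 + (ny - y) ** 2
                             for (nx, ny), (x, y) in zip(new, pos)))
        pos = new
        if step < tol:
            break
    return pos

# Three agents that should form an equilateral triangle with 10 m sides.
d_form = [[0.0, 10.0, 10.0], [10.0, 0.0, 10.0], [10.0, 10.0, 0.0]]
adjacency = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
refined = refine([(0.0, 0.0), (10.5, 0.0), (5.0, 8.0)], d_form, adjacency)
```

Because the loss depends only on inter-agent distances, the refinement fixes the shape but not the absolute position or rotation of the formation.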

The overall procedure of the formation-constrained cooperative localization algorithm with optimization is given in Algorithm 2.

Algorithm 2 Formation-Constrained Cooperative Localization Algorithm

1:Input: All agents $[eqn]$ ; TOA measurements $[eqn]$ ; AOA measurements $[eqn]$ ; formation matrix $[eqn]$ ; constraint weights $[eqn]$ , $[eqn]$
2:Output: Estimated positions $[eqn]$ for all agents $[eqn]$
3:GNA: Select $[eqn]$ backbone agents by minimizing potential field (see Algorithm 1)
4:Backbone Localization: Given backbone set $[eqn]$ and distance matrix $[eqn]$ , use MDS to localize backbone agents relative to each other and establish reference frame
5:for each listener agent $[eqn]$ do
6: Step 1—Chan Algorithm: Solve (17) using TOA measurements to obtain initial position estimate $[eqn]$ and orientation $[eqn]$
7: Step 2—AOA Refinement: Construct augmented system $[eqn]$ and $[eqn]$ as in (19), update covariance $[eqn]$ as in (20), and solve (21) to obtain refined position $[eqn]$
8: if formation constraints enabled then
9: Formation-Constrained Refinement: Extend the augmented system as in (22) and solve (23) to obtain formation-constrained position estimate $[eqn]$
10: end if
11:end for
12:Back Calibration: For each listener k, form $[eqn]$ and Jacobians $[eqn]$ and $[eqn]$ , compute $[eqn]$ , then update $[eqn]$ by (25)–(30)
13:if formation constraints enabled then
14: Formation Optimization: Perform gradient descent for all agents:

[eqn]

15: Repeat until convergence: $[eqn]$ or maximum iterations reached
16:end if

Remark 2. The current formulation assumes a known, fixed formation geometry encoded in the distance matrix $[eqn]$ (defined in Section 3.3; $[eqn]$ ). For dynamic formation reconfiguration in real missions—e.g., when the swarm switches from one formation pattern to another— $[eqn]$ can be updated at each time step to reflect the new expected inter-agent distances; the algorithmic structure (GNA, listener localization, back calibration, formation optimization) remains applicable without modification. Thus, the method can adapt to time-varying formation geometry provided the updated $[eqn]$ is supplied (e.g., by a higher-level mission planner). Online estimation of time-varying formation geometry from measurements is beyond the scope of this work and is left for future study.

4. Performance Analysis

This section analyzes how formation constraints may improve localization accuracy in terms of information gain (relevant to backbone selection) and error propagation mitigation (relevant to listener localization and global optimization).

Formation constraints enhance localization by incorporating geometric prior information. The measurement objective function $[eqn]$ (defined in (7)) minimizes the squared errors between predicted and measured TOA and AOA values. The combined objective becomes

[eqn]

where $[eqn]$ penalizes deviations from the expected formation geometry and $[eqn]$ is the constraint weight.

4.1. Information Gain

The Fisher Information Matrix (FIM) quantifies the information that measurements provide about the unknown state $[eqn]$ (e.g., agent positions and orientations). The unconstrained FIM is

[eqn]

where $[eqn]$ is the measurement likelihood. With formation constraints, the effective FIM becomes

[eqn]

where $[eqn]$ is the information contributed by the formation term $[eqn]$. The matrix $[eqn]$ is positive semi-definite when it is taken as the Gauss-Newton approximation of the Hessian of the squared distance-residual penalty (the exact Hessian can be indefinite away from the formation geometry). Since $[eqn]$ is positive semi-definite, $[eqn]$ is also positive semi-definite for $[eqn]$. Therefore, $[eqn]$ is positive semi-definite, which implies that $[eqn]$ provides no less information than $[eqn]$, so the total information does not decrease and the Cramér-Rao Lower Bound (CRLB) does not increase.

The improvement can be expressed via error covariance. The unconstrained covariance satisfies

[eqn]

while the constrained one satisfies

[eqn]

where the approximation holds when the FIM is well-conditioned. Because $[eqn]$ is positive semi-definite, $[eqn]$ , thus the estimation error is reduced. The relative improvement depends on $[eqn]$ and the strength of the formation prior. A larger $[eqn]$ or a more informative formation structure yields a smaller $[eqn]$ .
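The ordering of the two covariances follows from a standard matrix fact, shown below together with a scalar illustration. Here $\mathbf{J}$, $\mathbf{J}_F$, and $\lambda$ are placeholder symbols for the unconstrained FIM, the formation information, and the constraint weight; the numbers are assumed purely for illustration and are not taken from the paper.

```latex
% For a positive definite J and a positive semi-definite J_F:
\mathbf{J}_c = \mathbf{J} + \lambda \mathbf{J}_F,
\qquad \mathbf{J} \succ 0,\ \mathbf{J}_F \succeq 0,\ \lambda \ge 0
\;\Longrightarrow\; \mathbf{J}_c \succeq \mathbf{J}
\;\Longrightarrow\; \mathbf{J}_c^{-1} \preceq \mathbf{J}^{-1}.

% Scalar illustration with assumed values J = 4 and \lambda J_F = 1:
\mathrm{CRLB} = \frac{1}{4} = 0.25
\quad\longrightarrow\quad
\mathrm{CRLB}_c = \frac{1}{4 + 1} = 0.20 .
```

In the scalar case the bound drops by 20 percent; the gain shrinks toward zero as $\lambda \mathbf{J}_F$ becomes small relative to $\mathbf{J}$.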

4.2. Error Propagation Mitigation

In cooperative localization, errors propagate along the dependency chain from backbone to listener agents. For a listener k with position $[eqn]$ depending on backbone positions $[eqn]$ ,

[eqn]

where $[eqn]$ are backbone errors and $[eqn]$ is the error from listener measurements. Without constraints, the $[eqn]$ can be large and weakly correlated; thus, errors accumulate.

Formation constraints enforce geometric consistency. For agents with a formation relationship, $[eqn]$ with small $[eqn]$ . This ties the relative positions of neighboring agents to the prior $[eqn]$ , which bounds relative errors and reduces error accumulation along the chain. As a result, both backbone and listener estimates are pulled toward a geometrically consistent configuration.

Remark 3. Formation constraints act as a Tikhonov-type regularizer that guides the solution toward physically reasonable configurations. When λ is chosen appropriately, they may provide three benefits. (1) Ambiguity resolution: When the measurement model has multiple solutions (e.g., symmetric or degenerate geometries), the formation term prefers the one that matches the expected inter-agent distances, thus helping resolve ambiguities. (2) Outlier suppression: Large, isolated deviations from the formation are penalized by $[eqn]$ , so estimates that would otherwise explain measurements through a few poor links are pulled back toward the prior geometry. (3) Noise robustness: Under strong measurement noise, $[eqn]$ alone can yield unstable or scattered estimates; the formation term stabilizes the objective and reduces the optimizer’s sensitivity to individual noisy measurements. These effects may moderately improve both accuracy and robustness.

4.3. Complexity Analysis

In summary, the overall computational complexity of the proposed formation-constrained cooperative localization algorithm is given by $[eqn]$ , where N is the total number of UAV agents, $[eqn]$ is the number of backbone agents, $[eqn]$ denotes the number of listener agents, and $[eqn]$ is the maximum number of iterations in the formation optimization stage.

Since N and $[eqn]$ are typically moderate in swarm deployment scenarios, with $[eqn]$ and $[eqn]$ being a bounded constant, the lower-order term $[eqn]$ from the WLS-based listener localization step is negligible in practice. The dominant computational costs, namely $[eqn]$ for the formation-constrained GNA backbone selection, $[eqn]$ for backbone node localization, and $[eqn]$ for the global formation optimization refinement, remain computationally tractable for standard embedded or onboard UAV hardware. Thus, the proposed algorithm is feasible for real-time or near-real-time formation-constrained cooperative localization applications.

5. Numerical Experiments

This section presents numerical experiments to validate the effectiveness of the proposed formation-constrained cooperative localization method.

5.1. Experimental Setup

The simulation environment is a $[eqn]$ m² area containing $[eqn]$ agents. The TOA noise standard deviation is $[eqn]$ s and the AOA noise standard deviation is $[eqn]$ rad. Experiments were performed across multiple formation types to evaluate the method’s adaptability. Each experiment uses $[eqn]$ agents in total, comprising $[eqn]$ backbone nodes and $[eqn]$ listener nodes, and is repeated over 5000 Monte Carlo trials. We focus on small-scale UAV swarms in this work, but the proposed method can also be applied to large-scale UAV swarm localization by using the Distributed Geometry Merging approach from [27].

The formation geometries used in the localization simulation are defined as follows:

Line + Wingman Formation: This formation consists of $[eqn]$ agents arranged in a straight line with spacing s, and a single wingman agent positioned off the line:

[eqn]

where $[eqn]$ m. This formation is commonly used in convoy protection and patrol missions, where the main line provides forward coverage while the wingman offers lateral surveillance and early threat detection.

Grid Formation: This formation consists of agents arranged in a rectangular grid with uniform spacing s in both x and y. The grid has $[eqn]$ columns and $[eqn]$ rows, with nodes indexed in row-major order:

[eqn]

where $[eqn]$ is the row index, $[eqn]$ is the column index, and $[eqn]$ m. The grid formation provides regular spatial distribution suitable for area coverage, distributed sensing, and coordinated inspection tasks where uniform node spacing is desired.

Diamond Formation: This formation consists of agents on the four edges of a diamond. The four corner nodes and six edge nodes are:

[eqn]

where $[eqn]$ m. The corners (1–4) are top, right, bottom, and left; nodes 5–10 lie on the edges. The diamond formation provides 360-degree coverage suitable for defense and escort missions.

Circular Formation: This formation consists of agents uniformly distributed on a circle:

[eqn]

where $[eqn]$ is the angular position of agent i on the circle (distinct from the state $[eqn]$ in Section 2.2) and $[eqn]$ m is the circle radius. The circular formation provides symmetric geometric distribution and is typically employed in perimeter surveillance, area monitoring, and coordinated search operations where equal coverage in all directions is required.
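As an illustration of how such formation templates and their ideal inter-agent distances might be generated, the sketch below builds a grid and a circular formation. The coordinate expressions of the paper are not reproduced in this text, so the sketch assumes common parameterizations: a row-major grid with its first node at the origin, and a circle centred at the origin with angular positions 2πi/N. The function names, agent counts, spacing, and radius are placeholders, not the paper's settings.

```python
import numpy as np


def grid_formation(n_rows: int, n_cols: int, s: float) -> np.ndarray:
    """Row-major rectangular grid with uniform spacing s (node 0 at origin)."""
    rows, cols = np.divmod(np.arange(n_rows * n_cols), n_cols)
    return np.column_stack((cols * s, rows * s)).astype(float)


def circular_formation(n: int, radius: float) -> np.ndarray:
    """n agents uniformly spaced on a circle centred at the origin."""
    theta = 2.0 * np.pi * np.arange(n) / n
    return radius * np.column_stack((np.cos(theta), np.sin(theta)))


def ideal_distances(positions: np.ndarray) -> np.ndarray:
    """Pairwise Euclidean distances of the formation template."""
    diff = positions[:, None, :] - positions[None, :, :]
    return np.linalg.norm(diff, axis=-1)


# Placeholder sizes: 10 agents, 10 m grid spacing, 20 m circle radius.
grid = grid_formation(n_rows=2, n_cols=5, s=10.0)
circle = circular_formation(n=10, radius=20.0)
D = ideal_distances(circle)

print(grid[6])      # node 6 sits in row 1, column 1 of the grid
print(D[0, 1])      # chord length between adjacent agents on the circle
```

The matrix returned by `ideal_distances` is the kind of prior that a formation-constrained method can compare against measured distances; the line + wingman and diamond templates could be added in the same style once their coordinates are fixed.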

The following metrics are used to evaluate localization performance.

Mean/Median RMSE (over trials):

[eqn]

The median RMSE is a robust metric that is less sensitive to outliers than the mean RMSE.
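The difference between the two statistics can be seen in a short sketch. It assumes a conventional definition, namely a per-trial RMSE taken over agents followed by the mean or median over trials; the exact expression used in the paper is not reproduced in this text. The function name and the three hand-made trials are hypothetical.

```python
import numpy as np


def trial_rmse(est: np.ndarray, truth: np.ndarray) -> np.ndarray:
    """Per-trial RMSE over agents.

    est has shape (trials, agents, 2); truth has shape (agents, 2).
    """
    err = np.linalg.norm(est - truth, axis=-1)      # (trials, agents)
    return np.sqrt(np.mean(err ** 2, axis=-1))      # (trials,)


# Two agents at the origin, three trials with a common position offset each.
truth = np.zeros((2, 2))
offsets = np.array([[1.0, 0.0],    # trial 1: 1 m error
                    [0.0, 2.0],    # trial 2: 2 m error
                    [6.0, 8.0]])   # trial 3: 10 m error (an outlier)
est = truth[None, :, :] + offsets[:, None, :]

rmse = trial_rmse(est, truth)
mean_rmse = float(np.mean(rmse))
median_rmse = float(np.median(rmse))
print(rmse, mean_rmse, median_rmse)
```

Here the per-trial values are 1, 2, and 10 m. The single outlier pulls the mean to about 4.33 m, while the median stays at 2 m.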

5.2. Performance of Formation-Constrained GNA

The performance of the proposed formation-constrained backbone selection strategy is compared with several benchmark methods, as illustrated in Figure 2. The random strategy (marked as squares) selects backbone agents uniformly at random from all agents. The convex strategy (marked as diamonds) selects backbone agents on the convex hull of the agent positions. The standard GNA strategy (marked as circles) applies the reference method [27] without using formation information. The formation-constrained GNA strategy (marked as pentagrams) applies Algorithm 1. Finally, the brute force solution (marked as stars) is obtained by exhaustively traversing all candidate backbone sets and choosing the one that minimizes the theoretical rSPEB (root squared position error bound) [31].

As shown in Figure 2, for highly regular formations such as circular, grid, and diamond, the standard GNA strategy already yields backbone sets that are very close to the brute force solution. As a result, the proposed formation-constrained GNA provides only marginal improvement in the theoretical rSPEB metric in these symmetric cases. This behavior is consistent with the intuition that, when the node geometry is well-conditioned and almost symmetric, there is limited room for further improvement in terms of Fisher information. In contrast, for irregular and asymmetric “line + wingman” formations, the standard GNA may select suboptimal backbones, while the proposed formation-constrained GNA can better exploit the known formation pattern and achieve a more significant reduction in rSPEB.
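The relationship between a greedy selection and the brute force reference can be sketched on a simplified surrogate problem. This is not Algorithm 1 or the GNA of [27]: it scores a candidate backbone set by the range-only error bound for a single hypothetical target, whereas the paper's rSPEB is defined for the swarm's localization problem. The function names, the ridge term used to keep small sets invertible, the candidate layout, and the target are all assumptions.

```python
from itertools import combinations

import numpy as np


def rspeb(anchors, target, sigma=1.0, ridge=1e-9):
    """Root squared position error bound for range-only measurements."""
    u = target - anchors
    u = u / np.linalg.norm(u, axis=1, keepdims=True)   # unit bearing vectors
    fim = (u.T @ u) / sigma**2 + ridge * np.eye(2)     # ridge avoids singularity
    return float(np.sqrt(np.trace(np.linalg.inv(fim))))


def greedy_select(cands, target, k):
    """Add, one at a time, the node that most reduces the bound."""
    chosen = []
    for _ in range(k):
        remaining = [i for i in range(len(cands)) if i not in chosen]
        best = min(remaining, key=lambda i: rspeb(cands[chosen + [i]], target))
        chosen.append(best)
    return chosen


def brute_force_select(cands, target, k):
    """Exhaustively search all size-k subsets for the smallest bound."""
    best = min(combinations(range(len(cands)), k),
               key=lambda idx: rspeb(cands[list(idx)], target))
    return list(best)


# Line + wingman style layout: five nodes on a line and one off the line.
cands = np.array([[0.0, 0.0], [10.0, 0.0], [20.0, 0.0],
                  [30.0, 0.0], [40.0, 0.0], [20.0, 15.0]])
target = np.array([25.0, 5.0])
k = 3

greedy = greedy_select(cands, target, k)
brute = brute_force_select(cands, target, k)
print("greedy:", greedy, rspeb(cands[greedy], target))
print("brute :", brute, rspeb(cands[brute], target))
```

By construction the brute force bound is never larger than the greedy one, and the gap between them is what Figure 2 reports for each strategy. The exhaustive search grows combinatorially with the number of candidates, which is why a greedy strategy is attractive for all but the smallest swarms.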

5.3. Performance of Formation-Constrained Cooperative Localization

To validate the effectiveness of the proposed method, we conducted simulations under multiple typical formation configurations. Figures 3–6 compare the estimation performance and RMSE of the proposed formation-constrained localization (FCL) method with those of two benchmark methods: (1) the listener relative localization (LRL) method reported in [27], and (2) the hybrid TDOA/AOA localization method [17], which adopts the same GNA-based backbone node selection strategy as the scheme proposed in [27]. Neither benchmark exploits formation geometry. In the box plots, the median is marked by the red line, the interquartile range by the blue box, the whiskers by black dashed lines, and outliers by red crosses. Additional statistical markers, including the mean (red circles) and median (blue squares) of the full dataset, are provided with corresponding numerical annotations.

The simulation results demonstrate that the proposed FCL method consistently outperforms both the LRL and the hybrid TDOA/AOA benchmarks across all tested formation types. The hybrid TDOA/AOA method, although it fuses TOA and AOA measurements, does not use formation constraints; its RMSE is comparable to or slightly higher than LRL in most formations, and both are clearly outperformed by FCL. This indicates that integrating formation geometry into the localization process yields measurable gains over methods that use the same backbone selection but lack formation constraints. While the theoretical rSPEB analysis shows only modest gains for symmetric formations, the full localization experiments show that the proposed formation-constrained method consistently reduces the RMSE compared with both benchmark methods. Beyond the information-theoretic lower bound, formation constraints help stabilize the non-linear estimation process and mitigate error propagation in practice.

The improvement is most pronounced in the line + wingman formation, where both LRL and the hybrid TDOA/AOA method exhibit clear deviations for wingman nodes, while the proposed FCL method localizes both the main-line and wingman nodes more accurately. Its RMSE distribution is also narrower and shifted lower, with fewer large errors than either benchmark. For the diamond, grid, and circular formations, the proposed method likewise outperforms LRL and the hybrid TDOA/AOA method: it preserves the original formation structure, its estimates cluster closely around the true positions, and its RMSE distribution is tighter and lower, indicating better consistency and stability.

Table 2 presents the single-trial runtime of all compared methods under different formations, averaged over 5000 Monte Carlo runs. All timings were obtained using MATLAB R2022b on a PC equipped with a 13th Gen Intel Core i9-13900HX processor (2.20 GHz) and 64 GB RAM. The results show that the proposed FCL method maintains millisecond-level runtime per trial across all tested formation configurations, with only a marginal increase over the benchmark LRL and hybrid TDOA/AOA methods, which is compatible with the real-time requirements of cooperative formation localization systems.

6. Conclusions

This work proposes a formation-constrained cooperative localization method for UAV swarms in GNSS-denied environments that systematically integrates known formation geometry into the localization process. First, the formation-constrained GNA strategy helps improve measurement geometry quality by integrating formation geometry into backbone selection. Second, the formation-constrained listener localization reduces ambiguity and enhances accuracy through explicit constraint integration during position estimation. Third, the global optimization stage helps maintain geometric consistency across all agents, mitigating error propagation. Simulation results indicate that the method achieves good success rate, stability, and accuracy, and that its performance is consistent across the line + wingman, grid, diamond, and circular formations, which represent different single-task formation patterns. In addition, its ability to adapt to asymmetric geometries (line + wingman) highlights its potential practical value for applications such as tunnel and corridor operations.

Potential extensions include applying the proposed method to three-dimensional (3D) UAV formations and to online estimation of time-varying formation geometry; the integration of multi-task collaboration with formation-constrained localization also merits further investigation. Generalizing the approach to 3D would require elevation-capable AOA measurements, at least four non-coplanar anchors for backbone localization, and careful treatment of vertical observability when formations are coplanar or near-planar. Addressing these aspects would strengthen the applicability of formation-constrained localization to full 3D swarm operations and is left for future work.

Bibliography

1. Zhou J., Yi Y., Hu X., Deng X., Yang L.T., Zhang Z. A novel scheme for dynamic task collaboration among large-scale UAV swarms in the era of low-altitude economy. IEEE Trans. Cogn. Commun. Netw. 2026, 12, 3117–3132. doi:10.1109/TCCN.2025.3618271
2. Cui L., Zhou K., Wang J., Du Z., Jiang C., Qin H. Research on UAV swarm inspection path and defect identification based on LLM multi-agent collaborative optimization. Microchem. J. 2025, 219, 115838. doi:10.1016/j.microc.2025.115838
3. Li W., Feng Y., Zhou F., Kostromitin K.I., Wang J., Zhang P. Overlapping coalition formation for resource allocation in post-disaster rescue UAV swarms. Drones 2025, 9, 837. doi:10.3390/drones9120837
4. Gezici S., Tian Z., Giannakis G., Kobayashi H., Molisch A., Poor H., Sahinoglu Z. Localization via ultra-wideband radios: A look at positioning aspects for future sensor networks. IEEE Signal Process. Mag. 2005, 22, 70–84. doi:10.1109/MSP.2005.1458289
5. Dardari D., Conti A., Ferner U., Giorgetti A., Win M.Z. Ranging with ultrawide bandwidth signals in multipath environments. Proc. IEEE 2009, 97, 404–425. doi:10.1109/JPROC.2008.2008846
6. Shahzad F., Sheltami T.R., Shakshuki E.M. DV-max Hop: A fast and accurate range-free localization algorithm for anisotropic wireless networks. IEEE Trans. Mob. Comput. 2017, 16, 2494–2505. doi:10.1109/TMC.2016.2632715
7. Amjad M., Ali M.S., Yao S., Rahaman M.F., Zheng C., Raza M.K., Zouaoui B. Self and target locating with cooperation of heterogeneous unmanned vehicles in the denial environment. IEEE Access 2025, 13, 64699–64718. doi:10.1109/ACCESS.2025.3558873
8. Chen Y.-E., Liew H.-H., Chao J.-C., Wu R.-B. Decimeter-accuracy positioning for drones using two-stage trilateration in a GPS-denied environment. IEEE Internet Things J. 2023, 10, 8319–8326. doi:10.1109/JIOT.2022.3231704