Navigating an Infinite Space with Unreliable Movements
Anders Martinsson, Jara Uitto

TL;DR
This paper studies a search problem on an infinite grid where an agent with unreliable movements tries to find its home, revealing a phase transition in solvability depending on the fault probability.
Contribution
It establishes conditions under which the agent can or cannot reliably find home, and provides an algorithm for small fault probabilities with polynomial hitting times.
Findings
No finite expected time solutions exist for certain fault probabilities.
An algorithm achieves finite expected hitting time for sufficiently small p.
Hitting time decays polynomially as a function of time for p<1.
Abstract
We consider a search problem on a -dimensional infinite grid with a single mobile agent. The goal of the agent is to find her way home, which is located in a grid cell chosen by an adversary. Initially, the agent is provided with an infinite sequence of instructions, that dictate the movements performed by the agent. Each instruction corresponds to a movement to an adjacent grid cell and the set of instructions can be a function of the initial locations of the agent and home. The challenge of our problem stems from faults in the movements made by the agent. In every step, with some constant probability , the agent performs a random movement instead of following the current instruction. This paper provides two results on this problem. First, we show that for some values of , there does not exist any set of instructions that guide the agent home in finite expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Computational Geometry and Mesh Generation · Advanced Graph Theory Research
Navigating an Infinite Space with Unreliable Movements
Anders Martinsson
ETH Zürich
Jara Uitto
ETH Zürich and University of Freiburg
[email protected] Supported by ERC Grant No. 336495 (ACDC)
Abstract
We consider a search problem on a -dimensional infinite grid with a single mobile agent. The goal of the agent is to find her way home, which is located in a grid cell chosen by an adversary. Initially, the agent is provided with an infinite sequence of instructions, that dictate the movements performed by the agent. Each instruction corresponds to a movement to an adjacent grid cell and the set of instructions can be a function of the initial locations of the agent and home. The challenge of our problem stems from faults in the movements made by the agent. In every step, with some constant probability , the agent performs a random movement instead of following the current instruction.
This paper provides two results on this problem. First, we show that for some values of , there does not exist any set of instructions that guide the agent home in finite expected time. Second, we complement this impossibility result with an algorithm that, for sufficiently small values of , yields a finite expected hitting time for home. In particular, we show that for any , our approach gives a hitting rate that decays polynomially as a function of time. In that sense, our approach is far superior to a standard random walk in terms of hitting time. The main contribution and take-home message of this paper is to show that, for some value of , there exists a phase transition on the solvability of the problem.
1 Introduction
We study a search problem on an infinite -dimensional grid, where the task of a mobile agent is to find home, i.e., a designated grid cell chosen by an adversary. The agent is endowed with a sense of orientation, i.e., at all times, the agent is able to distinguish between the four globally consistent cardinal directions. The time is divided into discrete time steps and in every step, the agent is able to move to an adjacent grid cell. Initially, the agent is placed in the origin of the grid and the agent operates according to an infinite sequence of instructions, where an instruction corresponds to a movement to one of the cardinal directions. The set of instructions can be a function of the location of home. We incorporate errors in navigation during the search by introducing a parameter , that corresponds to the probability of making a mistake in any time step . More precisely, in each time step, with probability the agent executes the next instruction in its sequence and, with probability , ignores the instruction and instead moves to an adjacent cell chosen uniformly at random.
The search process can be seen as a random walk augmented by a set of deterministic moves. The case of is rather unexciting, since the shortest path home yields deterministic strategy that clearly has a finite expected hitting time. Also, if we set , we get into a setting where the agent simply follows an unbiased random walk. This case is also degenerate in the sense that it is well-known that an unbiased random walk has an infinite expected hitting time on any cell of the grid. Hence, it is natural to ask whether we can bias the random walk with some determinism in order to obtain better (i.e., smaller) hitting times. Indeed, our main research questions is: Which values of admit a finite expected hitting time for home?
Definition 1.1**.**
Let a sequence of instructions consisting of an infinite walk in with and a fixed probability that the agent makes a mistake be given. We call the random process defined by our search protocol a guided random walk. Formally, this is defined as a countable state Markov chain , where and
[TABLE]
Our search problem is inspired by search tasks with limited information and traveling in unmarked terrains. In practical settings, there are inherently errors in distance estimations and orientations. In particular, in natural settings, such as foraging behavior of ants, the agents need to operate on imperfect information and need to be able to tolerate errors. For example, a foraging desert ant needs to find its nest without knowing its exact location and without being able to leave markers on the ground. However, when heading back to the nest from their search, the ants are able to estimate their distance to the nest quite well [WWW06] and have a good sense of the direction towards the nest [WS81].
Observations and Some Notation.
An initial observation about the guided random walk is that, for any sequence of instructions, the average position of the walk after steps is with a standard deviation of . We note that finding home becomes strictly harder when increases and similarly, easier when gets smaller in the following sense: If home cannot be found for some value of , the same holds for any and conversely, a strategy that works for can be adapted to work for any . For the remainder of the paper, we denote the location of home by and the random variable captures the time agent finds home.
Contributions and Technical Challenges.
In this paper, we examine the proposed navigation problem from two different angles. Our first contribution, captured in 1.2, is to show that no sequence of instructions guarantees finite expected hitting time if is sufficiently close to one. The technical challenge is to show that the position of the guided random walk after steps is anti-concentrated in the sense that the probability of to equal a certain vertex in cannot be higher than . Moreover, this picture does not change too much if one conditions on certain properties of the history of .
Guided by this analysis, we derive an upper bound on the probability that, given that a guided random walk has not found home at time , it finds home before time . Informally speaking, this needs to be at least on average in order for the hitting time to be finite. We get the following result.
Theorem 1.2**.**
Let be the random variable that captures the time step where the agent finds home. If , then .
Proof.
Using the estimate from Proposition 2.3 we see that the analytic condition from Proposition 2.7 fails for . Hence the expected hitting time of any guided random walk must be infinite in this range. ∎
Remark**.**
In fact, by using computer assistance to calculate the bound on from Proposition 2.3 for higher values of , it is possible to improve this bound to .
Our second contribution, captured in 1.3, is an algorithm that finds a sequence of instructions such that, for sufficiently small , the agent is able to locate home in expected finite time. The intuition behind our strategy is relatively simple: After steps of a guided random walk, we expect the agent to be at distance from its estimated position. Keeping this in mind, we repeatedly tell the agent to perform sweeps covering an area within this distance of the current estimation of the location of home. Once the sweep is done, we re-estimate the location and continue to the next sweep.
There are two main challenges of this analysis. We first show that, if the true position of the agent at the beginning of a sweep is close to its estimated position, then the sweep has a good chance to get the agent home at some time step. Second, we show that is unlikely to deviate too much from its estimated position, even when conditioning on the agent not having been home in any previous phase. This makes use of a Chernoff-type bound, which is updated recursively to account for dependencies.
Theorem 1.3**.**
Let be the random variable that captures the time step when the agent finds home. There exists a sequence of instructions such that, for any and any , the guided random walk following these instructions satisfies .
Proof.
By Proposition 3.7, the strategy described in Section 3 with parameter yields a finite expected hitting time for any if
[TABLE]
Optimizing the left-hand side over , we get a maximum of at . Using this value of , the analytic condition holds for any , as desired. ∎
Remark 1.4**.**
By a slightly more careful consideration of the analysis of our strategy, one can see that, for any , the asymptotic hitting probability converges significantly faster than in the unbiased case. More precisely, it is well-known that an unbiased random walk satisfies whereas, for any , our strategy obtains a hitting rate of for .
A Phase Transition Behavior:
Let denote the expected time to find home given an error parameter . Then, there is a constant value such that
for any , there exists a sequence of instructions that yields and
for any , all sequences of instructions yield .
Our work shows the existence of such a threshold and narrows down the range of possible values of the threshold. Finding the exact value of is left as an intriguing open question.
1.1 Related Work
Our work falls under a much wider umbrella of graph exploration. In the standard graph exploration setting a mobile agent (or a group of them) is placed on one of the nodes of the graph. The goal of the agent is to explore the graph, i.e., visit every node or every edge of the graph by traversing the graph along its edges. There are many variants of graph exploration. For example, in directed [DP99, AH00] and undirected graph exploration [DKK06, DHK16] the edge traversals are uni- and bidirectional, respectively. Another distinction is to consider graphs where nodes are equipped with unique identifiers [PP98] and anonymous graphs [Rol79, BS94]. Our setting corresponds to the undirected and anonymous setting. Typically, the efficiency of a graph exploration algorithm is measured in terms of its time complexity, i.e., how many edge traversals are required to complete the exploration task.
Another measure of interest is the number of bits of memory the agent has [FI04]. For example, Fraigniaud et al. [FIP*+*05] showed that a deterministic agent needs memory to explore a graph with diameter and maximum degree . A measure close to the number of bits of memory is the precision of randomness the agent is allowed to use [LLNR14]. In the case of a random agent in a finite graph, the agent can simulate a random walk without any memory (and with precision) and achieve a polynomial hitting time for every node [AKL*+*79]. In a -dimensional infinite grid, it is known that the random walk is null-recurrent, i.e., it will eventually reach every node, but the hitting time tends to infinity. If the dimension grows larger, the walk is not guaranteed to reach every node. In our setting, the agent has no memory, but blindly follows the instructions that it is given. Cast in the terms of random walks, our research question is whether we can improve the random walk to have a finite expected hitting time on some node in the grid.
A classic graph exploration problem is the cow-path problem proposed by Baeza-Yates, Culberson, and Rawlins [BYCR93]. The cow is located in some node of a path and its task is to find a dedicated node on the path. In the case of the cow-path problem, the cow needs to fix her strategy without knowing the location of the dedicated node. The authors showed that a simple spiral search strategy achieves a constant competitive ratio. In a sense, our approach is an adaptation of the spiral search strategy that is resilient to the challenges caused by the random steps.
Also, exploring an infinite grid with a group of agents has been intensively studied [LOS01, Rol79, FGKP06, ELS*+*15]. A recent work that falls relative close to ours is by Cohen et al. [CELU17], where they showed that even allowing two agents and a constant amount of memory per agent (i.e., essentially two correlated random walks) cannot make the expected hitting time finite.
Paper’s Organization.
In Section 2, we provide the necessary tools for the proof of 1.2. Then, Section 3 gives the means to prove 1.3.
2 The Impossibility Result
In this section, we give the necessary claims to prove 1.2. We start with a technical lemma, where we derive an anti-concentration property of a guided random walk using Fourier analysis. Then, the second step is to introduce a measure, , that very roughly speaking, bounds from below the drift of the guided random walk within steps, where will be chosen to be a small constant. A more formal statement will be provided later in this section. Then, as the third step, we split the execution into exponentially growing intervals. We show that, conditioned on the event that home was not found prior to the current interval, home is unlikely to be found in the current interval.
2.1 Step 1: An Anti-Concentration Result for Guided Random Walks
Proposition 2.1**.**
Let be a denote a guided random walk with any sequence of instructions and any (possibly random) initial position. Then
[TABLE]
Proof.
In what follows, we consider fixed, and let . For any , let
[TABLE]
This function is -periodic in both and and, for any , satisfies
[TABLE]
Moreover, we can rewrite this function as
[TABLE]
where are the numbers of times up until when the agent has been instructed to walk north, south, west and east respectively, and
[TABLE]
One can observe that the functions in (2) have absolute value one if and only if, up to periodicity, or , and have absolute value less than one everywhere else. Moreover, by Taylor expanding around either or we get the estimates
[TABLE]
and
[TABLE]
Let be any sufficiently small constant. We can bound the integrand in the right-hand side of (1) by the above expressions for any point -close to or , and by everywhere else to obtain
[TABLE]
where and . The product is minimized when either and or and , which implies that . Letting equal a sufficiently large constant times yields the desired bound. ∎
2.2 Step 2: Bounding Hit Probabilities
In order to make use of the estimate from Step 1, we would like to be able to bound the probability that for some in terms of the expected number of such . One simple way of obtaining such a bound would be Markov’s inequality. However, the bound one would obtain in this way is slightly too weak for our purposes. So, in this step we will derive a stronger bound. This makes use of the fact that, due to the nature of a (mostly) random walk, if the agent reaches , it is likely to return there in subsequent time steps.
Proposition 2.2**.**
For any positive integer , let
[TABLE]
be the expected number of times between [math] and a guided random walk will return to to its initial position, where the minimum is taken over all possible sequences of instructions the agent could receive.
Then, for any , and any , a guided random walk with any (possibly random) initial position satisfies
[TABLE]
Proof.
Conditioned on the event that is the smallest integer such that , the process behaves as a guided random walk starting in the origin. Thus the expected number of times , that is times when is at least . ∎
Estimating .
Next, we compute estimates for . We derive a general upper bound for for any , and then compute this quantity explicitly for , which is what will be used later in the proof. Using larger values of would result in in stronger bounds on , but as the coefficients get increasingly hard to compute by hand, we will not do this here.
Proposition 2.3**.**
Let denote the number of -step walks in the grid from [math] to and let denote the minimum of over all such that and . Then, for any even ,
[TABLE]
In particular,
[TABLE]
Proof.
Let us initially consider any fixed , , and any sequence of instructions . By linearity of expectation, the expected number of times for is given by
Condition on the event that the agent has taken precisely random steps before time , and the times at which these occurred. Here we consider a step as random if it is produced by walking in a random direction even if this direction turns out to match the instructions. Given this information, we can split the movement of the agent up until time into two terms: the translation produced by the non-random steps, which is some vector where and , and the translation produced by the random steps, which would then have to equal in order for .
Using this, we get the lower bound
[TABLE]
The first formula in the statement of the proposition immediately follows.
It is easily seen that , and , and it can further be seen without too much effort that and , which yields the second formula in the statement of the proposition. Equality follows from the fact that the lower bound is attained exactly if the instructions follow a straight line in any cardinal direction.
∎
2.3 Step 3: Handling Dependencies
So far in the proof, we have only dealt with properties of the guided random walk at some time without any conditions on the history of the process up to time . Here, we combine the results from the previous two steps to obtain an estimate for the probability that given that .
To achieve this, we pick a time . (It turns out optimal to choose .) Intuitively, if the agent has not found home yet at time , and we know that it is unlikely for her to get home during the interval , then conditioning on her not finding home during this interval should not affect the behavior of the agent after time too much. Let denote the location of home.
Proposition 2.4**.**
For any and any , we have
[TABLE]
Proof.
Let and denote the events that for some , some , and some respectively. Then, we can rewrite the inequality above as
[TABLE]
But by the multiplication rule for conditional probabilities, the left-hand side equals , which is clearly less than the right-hand side. ∎
By combining Propositions 2.1 and 2.2, we can further obtain an estimate for the numerator in Proposition 2.4. Here we assume that is sufficiently large, and . Conditioned on , the agent will move according to an unconditioned guided random walk starting from time . Hence, for any , we have
[TABLE]
Moreover, as it is only possible for at every second time step, due to parity, there are times during when the hitting probability could be non-zero. Hence,
[TABLE]
Finally, by applying Proposition 2.2, we obtain the following:
Proposition 2.5**.**
For any , any sufficiently large and any ,
[TABLE]
2.4 Step 4: A Necessary Condition for Finite Expected Hitting Time
Lemma 2.6**.**
Let be fixed. Suppose there exists a choice of such that for any sequence of instructions. Then, for any choice of apart from the origin.
Proof.
Suppose there exists a starting position such that . Then, for any choice of there exists a time such that, with positive probability, and for all . But the conditional expected hitting time from this state is infinite, thus . ∎
Proposition 2.7**.**
Suppose, for a given and , there exists a sequence of instructions such that . Then the analytic condition
[TABLE]
holds for any positive integer .
Proof.
Let be a sufficiently small constant. By Lemma 2.6, we may, without loss of generality, assume that is sufficiently far from the origin to make the argument below follow through.
For divide the process up into time intervals . We note that if
[TABLE]
for all , then dominates , which has infinite expectation. Thus, in order for there must exist such that (6) is satisfied for all , but not for . By choosing sufficiently far away, we may assume that is much larger than .
Let and where is the smallest positive integer such that . As corresponds to the first interval where hitting probability is “high” we have
[TABLE]
Hence, by Propositions 2.4 and 2.5, we have
[TABLE]
On the other hand, by assumption of , we have
[TABLE]
The analytic condition in the statement of the proposition follows by combining these two bounds, and taking the limit as . ∎
3 Finding Home
Throughout this section, we assume that and that home is in some vertex at constant distance from the origin. The goal of this section is to construct a sequence of instructions such that the guided random walk following these instructions has a finite finite expected hitting time to any vertex in for any , where is sufficiently small constant.
Our strategy proceeds in phases, which are performed iteratively until the guided random walk hits the designated vertex in . To simplify the correctness proof, we construct the strategy such that at the beginning of each phase. Our strategy is an adaptation of the following canonical approach: At time , fix a box with side length roughly around home. Then, explore every cell in the box and iterate.
Duration of a Phase.
An issue with the canonical approach is that a phase starting at time takes roughly steps to complete. During the phase, missing home is relatively likely and hence, we end up using a lot of time scanning all of the vertices in the area. To mend this, in a phase starting at time , we only walk along roughly horizontal lines over a box with side lengths and centered around the origin, where and are parameters given to the strategy. By a careful design and choice of these parameters, we can guarantee that the probability of finding home is roughly .
3.1 Step 1: The Strategy
Our strategy is divided into phases and, in each phase, we explore an area (and a few other grid cells) that has a shape of a rectangle. For simplicity, we will assume that, at the beginning of each phase, we have . Moreover, we suppose that the agent is likely, say with probability , to be contained in a box around the origin.
3.2 Step 1: The Strategy
Let and be two positive constants to be defined later. For a phase starting at time , we let
[TABLE]
We give the following sequence of instructions:
Move steps west and steps south to . 2. 2.
Move steps north where . 3. 3.
Form horizontal lines at horizontal distance by alternatingly moving steps east, steps north, steps west, steps north, and so on. 4. 4.
Move steps north, and then return to .
Remark**.**
In our strategy, Step (4) is performed to make the total number of steps in a phase non-random and hence, the number of steps does not bring any extra randomness into our analysis.
3.3 Step 2: Analyzing a Single Iteration
Proposition 3.1**.**
Fix . Suppose we start a phase at time . Condition on the initial position of the agent at the beginning of the phase being inside the box . Then with probability at least , the agent visits home at some point during the phase.
Notation.
Before formulating the proof we introduce some notation. Let be the vector denoting the actual translation that the agent performs during , the translation during , and for any let be the actual translation performed by the agent during the first steps of . Note that , and are independent.
Our proof will consist of a coupling argument over the different values of . In doing so, we extend to a guided random walk starting in the origin, and where the directions are to always move north. We will show that, with high probability, there is at least one choice of that would make the agent get home during the phase.
Lemma 3.2**.**
Let be any -step guided random walk. Then, with high probability, for all , where .
In particular, with high probability, , and all deviate from their expectations by at most .
Proof.
The first part is a direct consequence of Azuma’s inequality, or a suitable version of a standard Chernoff bound. The second statement is a consequence of the first, observing that the random variables can be expressed as three guided random walks with steps, yielding a maximum deviation of from their respective maximum values. ∎
Proof of Proposition 3.1..
Let and be as above, and condition on the event that these never deviate in max-norm from their expectations by more than . We show that, under this condition, and assuming is contained in there is at least one choice of by which the agent would get home at some point during the phase. For an illustration, refer to Figure 1.
For the claim to hold, it suffices to show that there exist a such that
[TABLE]
for some . Consider the set of points the left- and right-hand sides of this expression covers as and goes over the respective intervals. In particular, the left-hand side contains mostly horizontal lines going between -coordinates
[TABLE]
and with -coordinates
[TABLE]
for each . In particular, the last line has -coordinate at least
On the other hand, the right-hand side has -coordinates contained in the slightly smaller interval of , and with its highest -coordinate contained inside and its highest and lowest -coordinate differing by . As this difference is larger than the gap between two horizontal lines, there must be a where the two sets intersect, as desired. ∎
3.4 Step 3: How Far is the Guided Random Walk from the Origin?
With Proposition 3.1 at hand, our next goal is to get a handle on how far from the origin we can expect the agent to be at the beginning of a phase, conditioned on home not being found before this phase. Fix a parameter and let denote the starting time of phase . To simplify the analysis, we will assume that
[TABLE]
Model Modification.
To ensure this bound, we modify the model in the following way. Every time the agent is about to hit home, there is a small probability that the agent does not “realize” that home is found, and simply moves normally in the next step. Clearly, any such modification would increase , hence any upper bound to in the modified model would immediately yield the same bound for the original model.
For each cardinal direction, i.e., North, East, South and West, we define
[TABLE]
where and . That is, is the standard deviation of one step of a guided random walk, parallel to the given direction, and is the standard deviation in the orthogonal direction.
Proposition 3.3**.**
Let be as above, and let be any positive constant. Conditioned on , the probability that is outside the box is .
Before proving this proposition, we need some ground work.
Claim 3.4**.**
For any , we have
[TABLE]
Proof.
We show the inequality for . The remaining cases can be shown analogously. We first note that for any , we have
[TABLE]
Thus,
[TABLE]
and therefore
[TABLE]
Hence,
[TABLE]
Notice that in the estimate marked by , we use the bound given by Inequality 7 to get rid of the term. By Jensen’s inequality
[TABLE]
Moreover, following a standard derivation of a Chernoff bound, we have
[TABLE]
where, in the last step, we used that most instructions between and go either west or east, meaning has standard deviation in most steps. ∎
Claim 3.5**.**
For any , we have
[TABLE]
Proof.
For any fixed , pick sufficiently large such that the exponent in the right-hand side of 3.4 is at most for any .
Thus, picking a constant such that and observing that, as is concave, , it follows by induction that
[TABLE]
for all , as desired. ∎
Proof of Proposition 3.3..
By 3.5 and Markov’s inequality, we can bound the probability of this event by Minimizing over yields the claimed probability. ∎
3.5 Step 4: Wrapping it up
Now, we have gathered almost all necessary claims to complete the proof of 1.3. As the last steps, we first provide a (rather) simple observation and then bundle up the statements of this section into Proposition 3.7, that almost immediately yields the theorem.
Claim 3.6**.**
Suppose there exists an such that for all . (That is, combined with Inequality 7, we have equality for all large .) Then .
Proof.
As is a convex function, we have . This means that for all . Thus, for any sufficiently large , we get
[TABLE]
Proposition 3.7**.**
Let be given. Suppose there exists a constant such that
[TABLE]
Then, by setting the parameters and in our sweeping strategy, it holds that .
Proof.
We know by Proposition 3.3 that the agent will be inside the box with probability at least . Moreover, by Proposition 3.1 we know that, given this event, one phase of our sweeping approach finds home with probability at least , where the iteration takes time, hence yielding a success probability per time step that, up to lower order terms, equals the left-hand side of the analytic expression in Proposition 3.7. Under the hypothesis of the proposition, it follows that, eventually, Inequality (7) is saturated, so eventually the success probability of an iteration will always equal . But then, by 3.6, it follows that , as desired. ∎
Remark**.**
It seems likely that the bound , as obtained from our analysis, is far from optimal. In fact, we believe that the sweeping strategy described in this section should be able to perform much better than our analysis indicates – perhaps even close to optimal. The main source of loss seems to be the concentration bounds on . However, it seems like new ideas are required to gain a better understanding of the limits of our approach.
Acknowledgements
We would like the thank Lucas Boczkowski, Sebastian Brandt and Manuela Fischer for fruitful discussions. A special thanks goes to Amos Korman, who introduced the problem.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AH 00] Susanne Albers and Monika Henzinger. Exploring Unknown Environments. SIAM Journal on Computing , 29:1164–1188, 2000.
- 2[AKL + 79] Romas Aleliunas, Richard M. Karp, Richard J. Lipton, Laszlo Lovasz, and Charles Rackoff. Random Walks, Universal Traversal Sequences, and the Complexity of Maze Problems. In IEEE Annual Symposium on Foundations of Computer Science (FOCS) , pages 218–223, 1979.
- 3[BS 94] M. A. Bender and D. K. Slonim. The Power of Team Exploration: Two Robots can Learn Unlabeled Directed Graphs. In IEEE Annual Symposium on Foundations of Computer Science (FOCS) , pages 75–85, 1994.
- 4[BYCR 93] Ricardo A. Baeza-Yates, Joseph C. Culberson, and Gregory J. E. Rawlins. Searching in the Plane. Information and Computation , 106:234–252, 1993.
- 5[CELU 17] Lihi Cohen, Yuval Emek, Oren Louidor, and Jara Uitto. Exploring an Infinite Space with Finite Memory Scouts. In ACM-SIAM Symposium on Discrete Algorithms (SODA) , pages 207–224, 2017.
- 6[DHK 16] Yann Disser, Jan Hackfeld, and Max Klimm. Undirected Graph Exploration with Θ ( log log n ) Θ 𝑛 \Theta(\log\log n) Pebbles. In ACM-SIAM Symposium on Discrete Algorithms (SODA) , pages 25–39, 2016.
- 7[DKK 06] Christian A. Duncan, Stephen G. Kobourov, and V. S. Anil Kumar. Optimal Constrained Graph Exploration. ACM Trans. Algorithms , 2(3):380–402, 2006.
- 8[DP 99] Xiaotie Deng and Christos Papadimitriou. Exploring an Unknown Graph. Journal of Graph Theory , 32:265–297, 1999.
