Floyd's Cycle detection algorithm | Determining the starting point of cycle
-
16-10-2019 - |
Question
I am seeking help understanding Floyd's cycle detection algorithm. I have gone through the explanation on wikipedia (http://en.wikipedia.org/wiki/Cycle_detection#Tortoise_and_hare)
I can see how the algorithm detects cycle in O(n) time. However, I am unable to visualise the fact that once the tortoise and hare pointers meet for the first time, the start of the cycle can be determined by moving tortoise pointer back to start and then moving both tortoise and hare one step at a time. The point where they first meet is the start of the cycle.
Can someone help by providing an explanation, hopefully different from the one on wikipedia, as I am unable to understand/visualise it?
Solution 3
I found the answer on stackoverflow. Thanks if anyone was looking into this for me. And for those who like me wanted an explanation, please refer to: https://stackoverflow.com/questions/3952805/proof-of-detecting-the-start-of-cycle-in-linked-list The chosen answer to the question, explains it!
OTHER TIPS
You can refer to "Detecting start of a loop in singly linked list", here's an excerpt:
Distance travelled by slowPointer
before meeting $= x+y$
Distance travelled by fastPointer
before meeting $=(x + y + z) + y$
= x + 2y + z
Since fastPointer
travels with double the speed of slowPointer
, and time is constant for both when the reach the meeting point. So by using simple speed, time and distance relation (slowPointer
traveled half the distance):
\begin{align*} 2*\operatorname{dist}(\text{slowPointer}) &= \operatorname{dist}(\text{fastPointer})\\ 2(x+y) &= x+2y+z\\ 2x+2y &= x+2y+z\\ x &= z \end{align*}
Hence by moving slowPointer
to start of linked list, and making both slowPointer
and fastPointer
to move one node at a time, they both have same distance to cover.
They will reach at the point where the loop starts in the linked list.
I have seen the accepted answer as proof elsewhere too. However, while its easy to grok, it is incorrect. What it proves is
$x = z$ (which is obviously wrong, and the diagram just makes it seem plausible due to the way it is sketched).
What you really want to prove is (using the same variables as described in the diagram in the accepted answer above):
$z = x\ mod\ (y + z)$
$(y + z)$ is the loop length, $L$
so, what we want to prove is:
$z = x\ mod\ L$
Or that z is congruent to x (modulo L)
Following proof makes more sense to me:
Meeting point, $M = x + y$
$2(x + y) = M + kL$, where $k$ is some constant. Basically, distance travelled by the fast pointer is $x + y$ plus some multiple of loop length, $L$
$x + y = kL$
$x = kL - y$
The above equation proves that $x$ is the same as some multiple of loop length, $L$ minus $y$. So, if the fast pointer starts at the meeting point, $M$ or at $x + y$, then it will end up at the start of the loop.