EIS A001405 Analysis

Someone on 知乎 invited me to answer this question. 中文版在此: https://www.zhihu.com/question/420632736/answer/1469911740

Problem Description

A point started at the origin, each step it can either move to left or right by one unit with equal probability.

Q: what is the probability for after m steps, this point never touch any negative point?

Solution

Classes

We are studying string like “+-+-” in this problem. Define the set of all possible strings that satisfying the condition of this problem to be \mathcal{A}, so \mathcal{A} = \{\epsilon, +, ++, +-, \dots \}. Introduce the size function for the strings to be size = s \mapsto len(s) which means we use the string length to be size function. With size function attached, we have the math structure class. Using the same size function, we define more classes:

  • \mathcal{A}: all strings (including empty string) that match the problems condition (any partial leading sum is not negative)
  • \mathcal{B}: all strings (including empty string) that match the problems condition and also end at the original point
  • \mathcal{C}: all strings (including empty string) that match the problems condition and also end at some positive point

Construction Equations

It is straightforward to get \mathcal{A} = \mathcal{B} + \mathcal{C}. And B is known as the Catalan class. So now we only need one more equation to find all the secrete of A.

Let’s study the recurrent structure of \mathcal{A}. The element in \mathcal{A} is:

  • empty string,
  • or a “+” append at the end of some element from \mathcal{B}
  • otherwise, is a “+” or “-” append at the end of some element from \mathcal{C}

Generating function

We can write the above as \mathcal{A} = \epsilon + \mathcal{B} * \{-\} + \mathcal{C} * \{+,-\}. Then turn all the equations in to generating function equations:

  • A(z) = B(z) + C(z)
  • A(z) = 1+ zB(z) + 2zC(z)

After some computation, we get the generating function for class \mathcal{A}:

A(z) = \frac{1-zB(z)}{1-2z}

Catalan class’s generating function is Catalan(z) = \frac{1-\sqrt{1-4z}}{2z}, this is a little different from the class \mathcal{B}, because the size of \mathcal{B} is doubled. Thus the length of all elements in \mathcal{B} is even. It is just insert 0 into Catalan class: 1, 0, 1, 0, 2,0, 5, 0,14,0, 42,\dots. With the above analysis, we get:

B(z) = Catalan(z^2) = \frac{1-\sqrt{1-4z^2}}{2z^2}

And finally the generating function of \mathcal{A}:

A(z) = \frac{\sqrt{1-4z^2}+2z-1}{2z(1-2z)}

Compute [z^N]A(z)

During the analysis, I find the problem’s even term and odd term can be taken separately. We can adopt this idea to compute [z^N]A(Z).

A(z) = \frac{1}{\sqrt{1-4z^2}} + \frac{1-\sqrt{1-4z^2}}{2z\sqrt{1-4z^2}}

It is easy to show that the first part contributes only even terms in the final result, and the second part contributes only odd terms in the final results.

\frac{1}{\sqrt{1-4z^2}} = (1-4z^2)^{-\frac{1}{2}} = \sum_{N\ge 0}{-\frac{1}{2} \choose N}(-4z^2)^N=\sum_{N\ge 0}{2N \choose N}z^{2N}

For the odd terms, let’s first try to resolve the function f(z) = \frac{1-\sqrt{1-z}}{\sqrt{1-z}}

f(z) = (1-z)^{-\frac{1}{2}}-1 = \sum_{N\ge 1} (-1)^N{-\frac{1}{2} \choose N}z^N

Thus for the odd part’s generating function:

\frac{1-\sqrt{1-4z^2}}{2z\sqrt{1-4z^2}} = \frac{f(4z^2)}{2z} = \frac{1}{2} \sum_{N \ge 1} {-\frac{1}{2} \choose N}(-4)^Nz^{2N-1} = \sum_{N \ge 1} {2N-1 \choose N}z^{2N-1}

The computation with {-\frac{1}{2} \choose N} is using the following formula:

{-\frac{1}{2} \choose N} = \frac{(-\frac{1}{2})^{\underline{N}}}{N!} = (-1)^{N} \frac{1*3*5\dots *2N-1}{2^NN!} = (-1)^{N} \frac{(2N)!}{4^NN!N!} = (-\frac{1}{4})^{N}{2N \choose N}

We reach the final answer: [z^N]A(z) = {N \choose \lfloor N/2 \rfloor} .

A: the probability is \frac{{m \choose \lfloor m/2 \rfloor}}{2^m}

Asymptotical Results

Let’s do some asymptotical analysis of the previous result. The math tool needed here is Gamma function and Stirling formula:

  • {x \choose y} = \frac{x!}{y!(x-y)!} = \frac{\Gamma(x+1)}{\Gamma(y+1)\Gamma(x-y+1)}
  • \Gamma(x+1) \sim \sqrt{2\pi x}(\frac{x}{e})^x

With the above tools, we have

{m \choose \lfloor m/2 \rfloor} \sim \frac{\Gamma(m+1)}{\Gamma(\frac{m}{2}+1)^2} \sim \frac{2^{m+1}}{\sqrt{2\pi m}}

And the probability asymptotic result is \frac{2}{\sqrt{2\pi m}}

Python Program Simulation Results

Print some values in the sequence and search in OEIS, I find it is https://oeis.org/A001405.

Let’s write a program to print the exact values and the asymptotic results:

Exact ValueAsymptotic ResultDiffAsymptotic Probability
m=51011.4214.18%0.178
m=10252258.372.53%0.126
m=20184756187078.971.26%0.089
m=2552003005354512.652.97%0.080
m=284011660040476311.020.90%0.080
Asymptotic Results

Python program:

from math import sqrt, pi

def asym(m):
    return 2**(m+1)/sqrt(2*pi*m)

exact_ans = [1, 2, 3, 6, 10, 20, 35, 70,
             126, 252, 462, 924, 1716, 3432,
             6435, 12870, 24310, 48620, 92378,
             184756, 352716, 705432, 1352078,
             2704156, 5200300, 10400600, 20058300,
             40116600]

asym_ans = [asym(i) for i in range(1,len(exact_ans)+1)]

m = 1
for e,a in zip(exact_ans, asym_ans):
    diff = abs(e-a)/e*100
    print("m=%d: Exact=%d, Asym=%lf, Diff=%lf%%, AsymProbability=%lf" %
          (m, e, a, diff, a/(2**(m+1))))
    m += 1

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.