The Poisson Distribution
The Poisson and the binomial distribution are much more closely related than they first appear: the Poisson is what the binomial becomes in a suitable limit.
Let us try to derive the Poisson distribution from nothing but the binomial distribution.
We assume that at any moment in time an event may happen, that no two events can occur at exactly the same time, and that over any two periods of the same duration the chance of an event happening is the same. Let's also say that over some fixed period of time the expected number of events is $\lambda$.
Let's just assume for now that no two events can happen within a second of one another. What this would mean is that in any period of 1 second there is some probability $p$ of an event happening. If there are $n$ seconds in our period, then the probability of exactly $r$ events happening over this period would be modelled by the binomial distribution with
$$P(\#\text{ of events} = r) = \binom{n}{r} p^r q^{n-r}$$
We also know that during this period of $n$ seconds we would expect there to be $np$ events. So if we know that during these $n$ seconds we should expect there to be $\lambda$ events, we have that $p = \frac{\lambda}{n}$.
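To make this concrete (the numbers here are purely illustrative): say we expect $\lambda = 3$ events per minute and we split the minute into $n = 60$ one-second slots. Then $p = \frac{\lambda}{n} = \frac{3}{60} = 0.05$ and, for example,
$$P(\#\text{ of events} = 2) = \binom{60}{2}(0.05)^2(0.95)^{58} \approx 0.226$$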
But of course we don't actually know that two events cannot happen within a second of one another. What if two atoms just happen to decay within a second of one another? What if two customers just happen to click order within a second of one another? We just don't know.
Now the trick is that we can consider a period of time even smaller than a second, even smaller than a millisecond: we take the limit as the period of time in which no two events can happen becomes infinitely small. In other words, we let the number of "seconds" $n$ within our time period get really, really large. This works because at some point the probability of two events happening within 0.000000…00001 seconds of one another is so small that it cannot meaningfully affect the final probability.
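To see this convergence in action, here is a minimal numerical sketch (the values of `lam` and `r` below are arbitrary choices for illustration): as $n$ grows, the binomial probability with $p = \frac{\lambda}{n}$ settles towards a fixed value, which is exactly the Poisson probability we are about to derive.

```python
# A minimal numerical sketch of the limiting argument: as n grows,
# the Binomial(n, lam/n) probability of exactly r events approaches a fixed limit.
# lam and r are arbitrary illustrative values.
from math import comb, exp, factorial

lam = 4.0   # expected number of events in the whole period
r = 2       # number of events we ask about

def binomial_pmf(n, p, r):
    """P(exactly r successes in n independent trials with success probability p)."""
    return comb(n, r) * p**r * (1 - p)**(n - r)

for n in (10, 100, 1_000, 100_000):
    p = lam / n                      # probability of an event in one tiny sub-interval
    print(n, binomial_pmf(n, p, r))

# The limiting value is the Poisson probability e^{-lam} * lam^r / r!
print("limit:", exp(-lam) * lam**r / factorial(r))
```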
If we know generating functions, we may recognize that the probability generating function of $X$, the number of events, is now
$$G_X(t) = \lim_{n \to \infty}\left(1 - \frac{\lambda}{n} + \frac{\lambda}{n}\cdot t\right)^n = \lim_{n \to \infty}\left(1 + \frac{\lambda}{n}\cdot (t-1)\right)^n$$
and then using the limit definition of $e^x$ we have that
$$G_X(t) = e^{\lambda(t-1)} = e^{-\lambda}\cdot e^{\lambda t} = e^{-\lambda}\cdot\left(1 + \frac{\lambda t}{1!} + \frac{(\lambda t)^2}{2!} + \frac{(\lambda t)^3}{3!} + \dots\right)$$
Reading off the coefficient of $t^r$ gives us exactly what we are looking for, that is
$$P(\#\text{ of events} = r) = \frac{e^{-\lambda}\lambda^r}{r!}$$
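As a quick sanity check (purely illustrative, using sympy), we can take derivatives of $G_X(t) = e^{\lambda(t-1)}$ at $t = 0$ and confirm that the coefficient of $t^r$ really is $\frac{e^{-\lambda}\lambda^r}{r!}$ for the first few values of $r$:

```python
# Symbolic sanity check: the coefficient of t^r in G_X(t) = exp(lambda*(t-1))
# equals exp(-lambda) * lambda^r / r!. Illustrative only.
import sympy as sp

t, lam = sp.symbols('t lambda', positive=True)
G = sp.exp(lam * (t - 1))                     # the generating function derived above

for r in range(5):
    # the coefficient of t^r is the r-th derivative at t = 0, divided by r!
    coeff = sp.diff(G, t, r).subs(t, 0) / sp.factorial(r)
    assert sp.simplify(coeff - sp.exp(-lam) * lam**r / sp.factorial(r)) == 0

print("coefficients match e^(-lambda) * lambda^r / r! for r = 0..4")
```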
But it is okay if we do not know generating functions. We would have that
$$P(\#\text{ of events} = r) = \lim_{n \to \infty}\binom{n}{r}\cdot\left(\frac{\lambda}{n}\right)^r\cdot\left(1 - \frac{\lambda}{n}\right)^{n-r}$$
Now let's just evaluate this limit.
$$\begin{aligned}
&= \lim_{n \to \infty}\binom{n}{r}\cdot\left(\frac{\lambda/n}{1-\frac{\lambda}{n}}\right)^r\cdot\lim_{n \to \infty}\left(1-\frac{\lambda}{n}\right)^n\\
&= e^{-\lambda}\cdot\lim_{n \to \infty}\binom{n}{r}\cdot\left(\frac{\lambda/n}{1-\frac{\lambda}{n}}\right)^r\\
&= e^{-\lambda}\cdot\lambda^r\cdot\lim_{n \to \infty}\binom{n}{r}\cdot\left(\frac{1/n}{1-\frac{\lambda}{n}}\right)^r\\
&= e^{-\lambda}\cdot\lambda^r\cdot\lim_{n \to \infty}\binom{n}{r}\cdot\left(\frac{1/n}{1}\right)^r\\
&= e^{-\lambda}\cdot\lambda^r\cdot\lim_{n \to \infty}\binom{n}{r}\cdot\left(\frac{1}{n}\right)^r\\
&= e^{-\lambda}\cdot\lambda^r\cdot\lim_{n \to \infty}\frac{n!}{r!\,(n-r)!\,n^r}\\
&= \frac{e^{-\lambda}\lambda^r}{r!}\cdot\lim_{n \to \infty}\frac{n!}{(n-r)!\,n^r}\\
&= \frac{e^{-\lambda}\lambda^r}{r!}\cdot\lim_{n \to \infty}\left(\frac{n}{n}\cdot\frac{n-1}{n}\cdot\dots\cdot\frac{n-r+1}{n}\right)
\end{aligned}$$
Now each of the $r$ factors in this last limit tends to 1, so the limit evaluates to 1, which gives us the Poisson distribution formula again.
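If we want to double-check that claim, here is a small symbolic verification (again purely illustrative, using sympy) that the product $\frac{n}{n}\cdot\frac{n-1}{n}\cdots\frac{n-r+1}{n}$ tends to 1 for a few fixed values of $r$:

```python
# Symbolic check that n/n * (n-1)/n * ... * (n-r+1)/n -> 1 as n -> infinity,
# for a few fixed values of r. Illustrative only.
import sympy as sp

n = sp.symbols('n', positive=True)

for r in range(1, 6):
    expr = sp.Integer(1)
    for k in range(r):
        expr *= (n - k) / n              # one factor of the product for each k
    print(r, sp.limit(expr, n, sp.oo))   # prints 1 for every r
```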
Now let's prove a cool fact about the Poisson distribution:
if $X \sim \mathrm{Po}(\lambda)$ then $\mathrm{Var}(X) = E(X) = \lambda$
We know that in a binomial distribution the variance is equal to $npq$, where $np = \lambda$. Let's recall how we derived the Poisson distribution in the first place: we considered what happens as $n$ approaches infinity. When this happens, $q = 1 - \frac{\lambda}{n}$ approaches 1, so $npq = \lambda\left(1 - \frac{\lambda}{n}\right)$ approaches $\lambda$ as $n$ approaches infinity.
And in the limiting case, when we get the Poisson distribution, the variance is equal to $\lambda$. The mean is $np = \lambda$ for every $n$, so $E(X) = \lambda$ as well.
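We can also see this empirically. The sketch below (the rate 6.5 and the sample size are arbitrary choices for illustration) draws a large Poisson sample with numpy and checks that the sample mean and sample variance both land near $\lambda$:

```python
# Empirical check that the mean and variance of a Poisson(lam) sample are both near lam.
# The rate and sample size are arbitrary illustrative choices.
import numpy as np

rng = np.random.default_rng(0)
lam = 6.5
samples = rng.poisson(lam, size=1_000_000)

print("sample mean    :", samples.mean())   # should be close to 6.5
print("sample variance:", samples.var())    # should also be close to 6.5
```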
Let's prove one more cool fact about the Poisson distribution:
if $X \sim \mathrm{Po}(a)$ and $Y \sim \mathrm{Po}(b)$ are independent,
then $X + Y \sim \mathrm{Po}(a+b)$
To do this we use the generating function of the Poisson distribution:
$$G_X(t) = e^{a(t-1)} \quad\text{and}\quad G_Y(t) = e^{b(t-1)}$$
Since $X$ and $Y$ are independent, $G_{X+Y}(t) = G_X(t)\cdot G_Y(t)$, so we then have that
$$G_{X+Y}(t) = e^{a(t-1)}\cdot e^{b(t-1)} = e^{(a+b)(t-1)}$$
which is exactly the generating function of a $\mathrm{Po}(a+b)$ distribution, so $X + Y \sim \mathrm{Po}(a+b)$.
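And as a final sanity check (illustrative, using sympy), we can confirm symbolically that multiplying the two generating functions gives an exponential whose exponent is $(a+b)(t-1)$:

```python
# Symbolic check that G_X(t) * G_Y(t) is the Po(a+b) generating function. Illustrative only.
import sympy as sp

t, a, b = sp.symbols('t a b')

G_X = sp.exp(a * (t - 1))
G_Y = sp.exp(b * (t - 1))

product = sp.powsimp(G_X * G_Y)        # combines into exp(a*(t-1) + b*(t-1))
print(product)

# exponents of equal exponentials must agree: a(t-1) + b(t-1) = (a+b)(t-1)
print(sp.expand(a*(t - 1) + b*(t - 1) - (a + b)*(t - 1)))   # prints 0
```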