The Poisson Distribution
The poisson and the binomial distribution are actually the same distribution.
Let us try to derive the poisson distribution from nothing but the binomial distribution.
We assume that in any moment in time an event may happen and that no two events can occur at the same time and that during any given period of time of the same duration the chance of an event happening is the same. Lets also say that in some period of time the expected number of events is λ.
Let's just assume for now that no two events can happen within a second of one another. Well what this would mean that in a period of 1 second there is a probability p of the event happening. If there are n seconds in a period of time then the probability of exactly r events happening over this period would be modelled by the binomial distribution with
P(# of events=r)=(rn)prqn−r
We also know that during this period of n seconds we would expect there to be np events. So if we know that during these n seconds we should expect there to be λ events, we have that p=nλ.
But of course we don't actually know that two events cannot happen within a second of one another, what if two atoms just happen to decay within a second of one another. What if two customers just happen to click order within a second of one another. We just don't know.
Now the trick is that we can just consider a period of time even smaller than a second, even smaller than a millisecond, what if we just take the limit as our period of time in which no events can happen becomes infinitely small. Or in other words, we assume that the number of "seconds" within our time period n gets really really large. This is perfect because at some point the probability of two events happening within 0.000000…00001 seconds of one another is so small that it could not possibly impact the final probability.
If we know generating functions we may recognize that the probability generating function for the probability of X events is now
GX(t)=x→∞lim(1−nλ+nλ⋅t)n=x→∞lim(1+nλ⋅(t−1))n
and then using the definition of erx we have that
Gx(t)=eλ(t−1)=e−λ⋅(1+1!λ+2!λ2+3!λ3+…)
This gives us exactly what we are looking for, that is
P(# of events=r)=r!eλλr
But it is okay if we do not know generating functions. We would have that
P(# of events=r)=n→∞lim(rn)⋅(nλ)r⋅(1−nλ)n−r
Now lets just evaluate this limit.
=n→∞lim(rn)⋅(1−nλnλ)r⋅n→∞lim(1−nλ)n=e−λ⋅n→∞lim(rn)⋅(1−nλnλ)r=e−λ⋅λr⋅n→∞lim(rn)⋅(1−nλn1)r=e−λ⋅λr⋅n→∞lim(rn)⋅(1n1)r=e−λ⋅λr⋅n→∞lim(rn)⋅(n1)r=e−λ⋅λr⋅n→∞limr!⋅(n−r)!⋅nrn!=r!e−λ⋅λrn→∞lim(n−r)!⋅nrn!=r!e−λ⋅λrn→∞lim(nn⋅nn−1⋅⋯⋅nn−r+1)
Now it is not too much work to show that the limit evaluates to 1 which gives us the poisson distribution formula again.
By realising that the poisson distribution is the limit of the binomial distribution we can instantly get the following fact about the poisson distribution:
If X∼Po(λ) then Var(X)=E(X)=λ
We know that in a binomial distribution the variance is equal to npq. We also know that np=λ. As p approaches 0, q=1−p approaches 1, So np=npq, Mean = Variance.