Birthday Problem
Consider the probability
that no
two people out of a group of
will have matching
birthdays out of
equally possible birthdays. Start with
an arbitrary person's birthday, then note that the probability that the second person's
birthday is different is
, that the
third person's birthday is different from the first two is
,
and so on, up through the
th person. Explicitly,
|
(1)
| |||
|
(2)
|
But this can be written in terms of factorials as
|
(3)
|
so the probability
that two or more people out of
a group of
do have the same birthday is therefore
|
(4)
| |||
|
(5)
|
In general, let
denote the probability that a
birthday is shared by exactly
(and no more) people
out of a group of
people. Then the probability that a birthday
is shared by
or more people is given by
|
(6)
|
In general,
can be computed using the recurrence relation
|
(7)
|
(Finch 1997). However, the time to compute this recursive function grows exponentially with
and so rapidly becomes unwieldy.
If 365-day years have been assumed, i.e., the existence of leap days is ignored, and the distribution of birthdays is assumed to be uniform throughout the year (in
actuality, there is a more than 6% increase from the average in September in the
United States; Peterson 1998), then the number of people needed for there to be at
least a 50% chance that at least two share birthdays is the smallest
such that
. This is given by
, since
|
(8)
| |||
|
(9)
|
The number
of people needed to obtain
for
, 2, ..., are 2, 2, 3, 3, 3, 4, 4,
4, 4, 5, ... (OEIS A033810). The minimal number
of people to give a 50% probability of having at least
coincident birthdays
is 1, 23, 88, 187, 313, 460, 623, 798, 985, 1181, 1385, 1596, 1813, ... (OEIS A014088; Diaconis and Mosteller 1989).
The probability
can be estimated as
|
(10)
| |||
|
(11)
|
where the latter has error
|
(12)
|
(Sayrafiezadeh 1994).
can be computed explicitly as
|
(13)
| |||
|
(14)
|
where
is a binomial
coefficient and
is a hypergeometric
function. This gives the explicit formula for
as
|
(15)
| |||
|
(16)
|
where
is a regularized
hypergeometric function.
A good approximation to the number of people
such that
is some given value can be given by solving
the equation
|
(17)
|
for
and taking
, where
is the ceiling
function (Diaconis and Mosteller 1989). For
and
, 2, 3, ...,
this formula gives
, 23, 88, 187, 313, 459, 622, 797,
983, 1179, 1382, 1592, 1809, ... (OEIS A050255),
which differ from the true values by from 0 to 4. A much simpler but also poorer
approximation for
such that
for
is given
by
|
(18)
|
(Diaconis and Mosteller 1989), which gives 86, 185, 307, 448, 606, 778, 965, 1164, 1376, 1599, 1832, ... for
, 4, ... (OEIS
A050256).
The "almost" birthday problem, which asks the number of people needed such that two have a birthday within a day of each other, was considered by Abramson and
Moser (1970), who showed that 14 people suffice. An approximation for the minimum
number of people needed to get a 50-50 chance that two have a match within
days out of
possible is given by
|
(19)
|
(Sevast'yanov 1972, Diaconis and Mosteller 1989).
birthday problem



