Sample Variance
The sample variance
(commonly written
or sometimes
) is the second sample
central moment and is defined by
|
(1)
|
where
the sample
mean and
is the sample
size.
To estimate the population variance
from
a sample of
elements with a priori unknown mean (i.e., the mean is estimated
from the sample itself), we need an unbiased estimator
for
. This estimator
is given by k-statistic
, which is defined
by
|
(2)
|
(Kenney and Keeping 1951, p. 189). Similarly, if
samples are taken
from a distribution with underlying central moments
, then the expected value of the observed sample
variance
is
|
(3)
|
Note that some authors (e.g., Zwillinger 1995, p. 603) prefer the definition
|
(4)
|
since this makes the sample variance an unbiased estimator for the population variance. The distinction between
and
is a common
source of confusion, and extreme care should be exercised when consulting the literature
to determine which convention is in use, especially since the uninformative notation
is commonly used for both. The unbiased sample
variance
is implemented as Variance[list].
Also note that, in general,
is not an unbiased estimator of the
standard deviation
even if
is an unbiased
estimator for
.
5-ary Lyndon words of length 12

