Re: [math-fun] Trimmed means and Multi-dimensional medians

29 Sep 2005

      --- joshua sweetkind-singer <sweetkindsinger@gmail.com> wrote:
...
Here's an interesting question: suppose we have data X_1, ..., X_n
drawn
from a Gaussian distribution with unknown mean
mu and known variance 1. We wish to estimate mu with a guess muhat.
Virtually everyone uses the sample mean of the dataset as an estimate
of mu,
but note that mu is also the *median* of the distribution. Under what
circumstances would we be justified in prefering the sample median of
the
data to estimate mu? Since the sample average is a sufficient
statistic, the
answer might be never, but I'm not sure. Might it be the case that
that the
sample median is preferable if we are using L1 loss, i.e., seeking to
minimize E_mu |mu - muhat| ?
I would solve this problem using the Bayesean method.  Then the
posterior distribution for mu will be a Gaussian with mean equal to the
sample mean, and variance 1/n.  This is all you can know about mu on
the basis of the given information.  For this particular estimation
problem, where we are given that the underlying distribution is a
Gaussian with unit variance, I would have no need for the sample
median.

Now then, if you must pick a number muhat, and make some decision on
that basis, and there is a cost c(mutrue,muhat) for being wrong, then
you can calculate the muhat that minimizes the expected cost, using the
p(mu) derived above.

Gene

__________________________________ 
Yahoo! Mail - PC Magazine Editors' Choice 2005 
http://mail.yahoo.com

Re: [math-fun] Trimmed means and Multi-dimensional medians

Eugene Salamin