Re: using data for Bayesian prior
From: JRKRideau (JohnKane9996_at_hotmail.com)
Date: 11/18/04
- Next message: George Kahrimanis: "Re: Prevalence estimates from population characteristics - HELP!"
- Previous message: Bruce Weaver: "Re: SPSS power calculation with MANOVA and MACRO (Break out the hatchet ;-)"
- In reply to: Herman Rubin: "Re: using data for Bayesian prior"
- Messages sorted by: [ date ] [ thread ]
Date: 18 Nov 2004 07:07:31 -0800
hrubin@odds.stat.purdue.edu (Herman Rubin) wrote in message news:<cmvuo0$5lpe@odds.stat.purdue.edu>...
> In article <cmvgag$6ec$1@planja.arnes.si>,
> Aleks Jakulin <a_jakulin@@hotmail.com> wrote:
> >Geert Verdoolaege wrote:
> >> I am not sure why exactly, in Bayesian analysis, it is a bad idea to
> >> use information on the measured data for the construction of a prior
> >> distribution of the parameters.
>
> For a mechanical Bayesian, this is correct. If you have an
> infinitely fast computer with zero cost, you could take any
> combination of prior, likelihood, and cost and compute the
> Bayes procedure.
>
> >The basic explanation is that you're double-counting the evidence.
> >First, you use the data to construct a prior, and then you update the
> >prior with the *same* data.
>
> There is another way of looking at it which points out that
> this is not exactly what is being done. It could be that
> the true prior is essentially incomputable, but if one
> looks at it as a prior on computable priors, one can
> estimate the computable prior from the data, and use this
> to compute the action to be taken. In the case of presumably
> independent estimation, there are even Bayes empirical Bayes
> procedures.
>
> Otherwise, this is not optimal, but it may be close, and
> it is possible to get general results here.
>
> >The empirical Bayes procedure uses a part of the data to construct the
> >prior, and the other part of the data to update it. This is
> >reminiscent of cross-validation, where you use a part of the data to
> >construct a model, and the other part to examine how well the model
> >fits the data that was not used for the model.
>
> See the above. I do not agree with the above; the entire
> data can be used for both, and usually should be.
>
> >However, hard-core Bayesians do not like empirical Bayes very much
> >because they see it as an approximation to hierarchical Bayesian
> >analysis. There, you don't specify the prior, but instead specify a
> >model of the prior's parameters.
>
> What is important is the risk of the procedure. There are
> those who use computationally simple, but very definitely
> unreasonable, priors, often based on the data, and look at
> the problem of approximating the prior. This may or may
> not be a reasonable way to look at it. When testing a null
> hypothesis with a reasonable amount of data, the prior
> probability that the hypothesis is true turns out not to be
> of importance at all. Looking at the Bayes risk shows this
> to be the case.
>
> Many of the "hard-core" Bayesians seem to have no qualms in
> choosing a convenient prior which cannot be reasonable, and
> often depends on the form of the experiment. This violates
> the consistency assumptions, which call for minimizing the
> prior Bayes risk, treating the "prior" as weights, and not
> as probabilities. The loss and prior cannot be operationally
> separated, as only the product enters. Using the data to
> estimate both may well be needed for high-dimensional problems;
> this includes almost all of the so-called "nonparametric" ones.
I have never used Bayesian stats and am certainly not a statistican.
Can anyone recommend a "Bayesian Stats for Dummies" text to give me a
feel for them?
John Kane
Kingston ON
- Next message: George Kahrimanis: "Re: Prevalence estimates from population characteristics - HELP!"
- Previous message: Bruce Weaver: "Re: SPSS power calculation with MANOVA and MACRO (Break out the hatchet ;-)"
- In reply to: Herman Rubin: "Re: using data for Bayesian prior"
- Messages sorted by: [ date ] [ thread ]
Relevant Pages
|