[R] bootstrap: definition of original statistic
Frank E Harrell Jr
f.harrell at vanderbilt.edu
Fri Feb 22 16:10:36 CET 2008
Gad Abraham wrote:
> Hi,
>
> In the boot package, the original statistic is simply the statistic
> function evaluated on the original data (called t0).
>
> However, in Harrell et al 1996 "Multivariable prognostic models..."
> Stats Med vol 15, pp. 361--387, it is different (p. 372):
> The statistic function evaluated on the original data is called
> "D_app" (apparent statistic), whereas "D_orig" (original statistic)
> is derived by first fitting the model to a bootstrap sample, then
> freezing the model and applying it to the original data. The
> bootstrap statistic D_boot is derived by applying the function to
> the bootstrap sample. Then optimism O = average(D_boot - D_orig),
> and the bias-corrected estimate is D_app - O.
>
> Can someone explain this difference, and especially why boot
> doesn't evaluate the frozen model again on the original data?
>
> Thanks,
> Gad
>
>
The optimism bootstrap is a different kind of bootstrap to estimate bias
from overfitting. boot is for the regular bootstrap.
--
Frank E Harrell Jr Professor and Chair School of Medicine
Department of Biostatistics Vanderbilt University
More information about the R-help
mailing list