Resampling Validation of a Fitted Model's Indexes of Fit
The validate function when used on an object created by one of the
rms series does resampling validation of a
regression model, with or without backward step-down variable deletion.
# fit <- fitting.function(formula=response ~ terms, x=TRUE, y=TRUE)
validate(fit, method="boot", B=40,
bw=FALSE, rule="aic", type="residual", sls=0.05, aics=0,
force=NULL, estimates=TRUE, pr=FALSE, ...)
## S3 method for class 'validate'
print(x, digits=4, B=Inf, ...)
## S3 method for class 'validate'
latex(object, digits=4, B=Inf, file='', append=FALSE,
title=first.word(deparse(substitute(x))),
caption=NULL, table.env=FALSE,
size='normalsize', extracolsize=size, ...)
## S3 method for class 'validate'
html(object, digits=4, B=Inf, caption=NULL, ...)fit |
a fit derived by e.g. |
method |
may be |
B |
number of repetitions. For |
bw |
|
rule |
Applies if |
type |
|
sls |
significance level for a factor to be kept in a model, or for judging the residual chi-square. |
aics |
cutoff on AIC when |
force |
see |
estimates |
see |
pr |
|
... |
parameters for each specific validate function, and parameters to
pass to For For |
x,object |
an object produced by one of the |
digits |
number of decimal places to print |
file |
file to write LaTeX output. Default is standard output. |
append |
set to |
title, caption, table.env, extracolsize |
see
|
size |
size of LaTeX output. Default is |
It provides bias-corrected indexes that are specific to each type
of model. For validate.cph and validate.psm, see validate.lrm,
which is similar.
For validate.cph and validate.psm, there is
an extra argument dxy, which if TRUE causes the dxy.cens
function to be invoked to compute the Somers' Dxy rank correlation
to be computed at each resample. The values corresponding to the row
Dxy are equal to 2 * (C - 0.5) where C is the
C-index or concordance probability.
For validate.cph with dxy=TRUE,
you must specify an argument u if the model is stratified, since
survival curves can then cross and X beta is not 1-1 with
predicted survival.
There is also validate method for
tree, which only does cross-validation and which has a different
list of arguments.
a matrix with rows corresponding to the statistical indexes and columns for columns for the original index, resample estimates, indexes applied to the whole or omitted sample using the model derived from the resample, average optimism, corrected index, and number of successful re-samples.
prints a summary, and optionally statistics for each re-fit
Frank Harrell
Department of Biostatistics, Vanderbilt University
fh@fharrell.com
# See examples for validate.cph, validate.lrm, validate.ols
# Example of validating a parametric survival model:
n <- 1000
set.seed(731)
age <- 50 + 12*rnorm(n)
label(age) <- "Age"
sex <- factor(sample(c('Male','Female'), n, TRUE))
cens <- 15*runif(n)
h <- .02*exp(.04*(age-50)+.8*(sex=='Female'))
dt <- -log(runif(n))/h
e <- ifelse(dt <= cens,1,0)
dt <- pmin(dt, cens)
units(dt) <- "Year"
S <- Surv(dt,e)
f <- psm(S ~ age*sex, x=TRUE, y=TRUE) # Weibull model
# Validate full model fit
validate(f, B=10) # usually B=150
# Validate stepwise model with typical (not so good) stopping rule
# bw=TRUE does not preserve hierarchy of terms at present
validate(f, B=10, bw=TRUE, rule="p", sls=.1, type="individual")Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.