Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!
Get Started for Free

Lung

VA lung cancer data set


Description

Data from a randomised trial of two treatment regimens for lung cancer. This is a standard survival analysis data set from the classic textbook by Kalbfleisch and Prentice.

  • X: A design matrix with 137 observations (rows) and 9 predictor variables (columns). The columns are:

    • trt: Treatment indicator (1=control group, 2=treatment group)

    • karno: Karnofsky performance score (0=bad, 100=good)

    • diagtime: Time from diagnosis to randomization (months)

    • age: Age (years)

    • prior: Prior therapy (0=no, 1=yes)

    • squamous: Indicator for whether the cancer type is squamous cell carcinoma (0=no, 1=yes)

    • small: Indicator for whether the cancer type is small cell lung cancer (0=no, 1=yes)

    • adeno: Indicator for whether the cancer type is adenocarcinoma (0=no, 1=yes)

    • large: Indicator for whether the cancer type is large cell carcinoma (0=no, 1=yes)

  • y: A two column matrix (Surv object) containing the follow-up time (in days) and an indicator variable for whether the patient died while on the study or not.

Usage

data(Lung)

Format

A list containing the design matrix X and response matrix y

Source

References

  • Kalbfleisch D and Prentice RL (1980), The Statistical Analysis of Failure Time Data. Wiley, New York.

See Also


ncvreg

Regularization Paths for SCAD and MCP Penalized Regression Models

v3.13.0
GPL-3
Authors
Patrick Breheny [aut, cre] (<https://orcid.org/0000-0002-0650-1119>)
Initial release
2021-03-29

We don't support your browser anymore

Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.