Khan microarray data with random missing values
A text file containing the Khan micorarray data with random missing values introduced for illustrative purposes
data(khanmiss)
The data set khanmiss
consists of 2310 rows and 65
columns. Row 1 has the
sample labels, Row 2 has the class labels.
The remaining rows are gene expression. Column 1 is a dummy gene number.
Column 2 is the gene name. Remaining columns are gene expression.
Please note that this dataset was derived from the original by introducing some random missing values purely for the purpose of illustration.
Khan, J. and Wei, J.S. and Ringner, M. and Saal, L. and Ladanyi, M. and Westermann, F. and Berthold, F. and Schwab, M. and Antonescu, C. and Peterson, C. and and Meltzer, P. (2001) Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural network. Nature Medicine 7, 673-679.
Robert Tibshirani, Trevor Hastie, Balasubramanian Narasimhan, and Gilbert Chu (2002). Diagnosis of multiple cancer types by shrunken centroids of gene expression PNAS 99: 6567-6572. Available at www.pnas.org
data(khanmiss)
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.