Simulation of validity indexes based on random clusterings
For a given dataset this simulates random clusterings using
stupidkcentroids
, stupidknn
,
stupidkfn
, and stupidkaven
. It then
computes and stores a set of cluster validity indexes for every
clustering.
randomclustersim(datadist,datanp=NULL,npstats=FALSE,useboot=FALSE, bootmethod="nselectboot", bootruns=25, G,nnruns=100,kmruns=100,fnruns=100,avenruns=100, nnk=4,dnnk=2, pamcrit=TRUE, multicore=FALSE,cores=detectCores()-1,monitor=TRUE)
datadist |
distances on which validation-measures are based, |
datanp |
optional observations times variables data matrix, see
|
npstats |
logical. If |
useboot |
logical. If |
bootmethod |
either |
bootruns |
integer. Number of resampling runs. If
|
G |
vector of integers. Numbers of clusters to consider. |
nnruns |
integer. Number of runs of |
kmruns |
integer. Number of runs of |
fnruns |
integer. Number of runs of |
avenruns |
integer. Number of runs of |
nnk |
|
dnnk |
|
pamcrit |
|
multicore |
logical. If |
cores |
integer. Number of cores for parallelisation. |
monitor |
logical. If |
List with components
nn |
list, indexed by number of clusters. Every entry is
a data frame with |
fn |
list, indexed by number of clusters. Every entry is
a data frame with |
aven |
list, indexed by number of clusters. Every entry is
a data frame with |
km |
list, indexed by number of clusters. Every entry is
a data frame with |
nnruns |
number of involved runs of |
fnruns |
number of involved runs of |
avenruns |
number of involved runs of |
kmruns |
number of involved runs of |
boot |
if |
Hennig, C. (2019) Cluster validation by measurement of clustering characteristics relevant to the user. In C. H. Skiadas (ed.) Data Analysis and Applications 1: Clustering and Regression, Modeling-estimating, Forecasting and Data Mining, Volume 2, Wiley, New York 1-24, https://arxiv.org/abs/1703.09282
Akhanli, S. and Hennig, C. (2020) Calibrating and aggregating cluster validity indexes for context-adapted comparison of clusterings. Statistics and Computing, 30, 1523-1544, https://link.springer.com/article/10.1007/s11222-020-09958-2, https://arxiv.org/abs/2002.01822
set.seed(20000) options(digits=3) face <- rFace(10,dMoNo=2,dNoEy=0,p=2) randomclustersim(dist(face),datanp=face,npstats=TRUE,G=2:3, nnruns=2,kmruns=2, fnruns=1,avenruns=1,nnk=2)
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.