Tools for statistical disclosure control (sdc)
Labeling and removing unique replicates of unique actual (observed) individuals.
sdc(object, data, label = NULL, rm.replicated.uniques = FALSE, uniques.exclude = NULL, recode.vars = NULL, bottom.top.coding = NULL, recode.exclude = NULL, smooth.vars = NULL)
object |
an object of class |
data |
the original (observed) data set. |
label |
a single string with a label to be added to the synthetic data sets as a new variable to make it clear that the data are synthetic/fake. |
rm.replicated.uniques |
a logical value indicating whether unique replicates of units that are also unique in the original data set should be removed. |
uniques.exclude |
a single string or a vector of strings with name(s) of variable(s) to be excluded from the identification of uniques. |
recode.vars |
a single string or a vector of strings with name(s) of variable(s) to be bottom- and/or top-coded. |
bottom.top.coding |
a list of two-element vectors specifying
bottom and top codes for each variable in |
recode.exclude |
a list specifying for each variable in
|
smooth.vars |
a single string or a vector of strings with name(s)
of numeric variable(s) to be smoothed ( |
An object
provided as an argument adjusted in accordance with the
other parameters' values.
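library(synthpop)  # provides syn(), sdc() and the SD2011 example data
# observed data: first 1000 rows and five variables of SD2011
ods <- SD2011[1:1000, c("sex", "age", "edu", "marital", "income")]
# create two synthetic data sets
s1 <- syn(ods, m = 2)
# label the synthetic data, drop replicated uniques, and bottom/top-code age and income
s1.sdc <- sdc(s1, ods, label = "false_data", rm.replicated.uniques = TRUE,
              recode.vars = c("age", "income"),
              bottom.top.coding = list(c(20, 80), c(NA, 2000)),
              recode.exclude = list(NA, c(NA, -8)))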
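As a rough illustration of two of the adjustments described in the arguments (bottom/top coding and removal of replicated uniques), the following base R sketch mimics them on a toy data frame. It is not the code used by sdc(): the helper names top_bottom_code and drop_replicated_uniques, and the toy data, are hypothetical, and the package function may handle edge cases differently.

```r
# Hypothetical helper (not part of synthpop): clamp x to [bottom, top].
# NA for bottom or top means "no limit on that side"; values listed in
# `exclude` (e.g. a missing-data code such as -8) are left unchanged.
top_bottom_code <- function(x, bottom = NA, top = NA, exclude = NA) {
  keep <- !is.na(x) & !(x %in% exclude)
  if (!is.na(bottom)) x[keep & x < bottom] <- bottom
  if (!is.na(top))    x[keep & x > top]    <- top
  x
}

# Hypothetical helper (not part of synthpop): drop synthetic rows that are
# unique in the synthetic data and identical to a row that is unique in the
# observed data.
drop_replicated_uniques <- function(syn, obs) {
  key  <- function(d) do.call(paste, c(d, sep = "\r"))  # one string per row
  uniq <- function(k) k[!(k %in% k[duplicated(k)])]     # keys occurring once
  k_syn <- key(syn)
  k_obs <- key(obs)
  risky <- k_syn %in% intersect(uniq(k_syn), uniq(k_obs))
  syn[!risky, , drop = FALSE]
}

# Toy data, invented for this sketch
obs <- data.frame(age = c(18, 35, 35, 90), income = c(-8, 1500, 1500, 5000))
syn <- data.frame(age = c(18, 35, 60, 60), income = c(-8, 1500, 2500, 2500))

# Only the first synthetic row is unique in both data sets, so it is dropped
safe <- drop_replicated_uniques(syn, obs)

# Recode age to [20, 80]; top-code income at 2000 while keeping the -8 code
safe$age    <- top_bottom_code(safe$age, bottom = 20, top = 80)
safe$income <- top_bottom_code(safe$income, top = 2000, exclude = -8)
print(safe)
```

Keeping the two steps as separate helpers mirrors how the corresponding arguments can be switched on independently in the call to sdc().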