grep for nonstandard characters
Return the indices of elements of x
containing characters that are not in
standardCharacters
.
grepNonStandardCharacters(x, value=FALSE, standardCharacters=c(letters, LETTERS, ' ', '.', ',', 0:9, '\"', "\'", '-', '_', '(', ')', '[', ']', '\n'), ... )
x |
character vector in which it is desired to
identify elements containing characters not
in |
value |
logical:
|
standardCharacters |
Characters to overlook in |
... |
optional arguments for
|
1. x. <- strsplit(x, '')
: convert
the input character vector to a list of
vectors of character vectors with
nchar(x.[i])
== 1 for i in
1:length(x)
.
2. sapply(x., ...)
to identify all
elements for which any element of x[[i]] is
not in standardCharacters
.
an integer vector identifying all elements
of x
containing a character not in
standardCharacters
.
Spencer Graves
Names <- c('Raul', 'Ra`l', 'Torres,Raul', 'Torres, Raul') # confusion in character sets can create # names like Names[2] chk <- grepNonStandardCharacters(Names) all.equal(chk, 2) chkv <- grepNonStandardCharacters(Names, TRUE) all.equal(chkv, Names[2])
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.