[R] replace values in vector from a replacement table
Liviu Andronic
landronimirc at gmail.com
Tue Jul 31 12:44:57 CEST 2012
On Mon, Jul 30, 2012 at 6:00 PM, jim holtman <jholtman at gmail.com> wrote:
> try this:
>
>> (x <- rep(letters,2))
>  [1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "j" "k" "l" "m" "n" "o" "p" "q" "r" "s" "t" "u" "v" "w"
> [24] "x" "y" "z" "a" "b" "c" "d" "e" "f" "g" "h" "i" "j" "k" "l" "m" "n" "o" "p" "q" "r" "s" "t"
> [47] "u" "v" "w" "x" "y" "z"
>> values <- c("aa", "a", "b", NA, "d", "zz")
>> repl <- c("aa", "A", "B", NA, "D", "zz")
>> (repl.tab <- cbind(values, repl))
>      values repl
> [1,] "aa"   "aa"
> [2,] "a"    "A"
> [3,] "b"    "B"
> [4,] NA     NA
> [5,] "d"    "D"
> [6,] "zz"   "zz"
>> indx <- match(x, repl.tab[, 1], nomatch = 0)
>> x[indx != 0] <- repl.tab[indx, 2]
>> x
>  [1] "A" "B" "c" "D" "e" "f" "g" "h" "i" "j" "k" "l" "m" "n" "o" "p" "q" "r" "s" "t" "u" "v" "w"
> [24] "x" "y" "z" "A" "B" "c" "D" "e" "f" "g" "h" "i" "j" "k" "l" "m" "n" "o" "p" "q" "r" "s" "t"
> [47] "u" "v" "w" "x" "y" "z"
>>
>
Based on this code I came up with the following function.
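replace2 <- function(x, ind, repl) {
  ## treat NA indices (e.g. from match() without nomatch = 0) as "no match"
  ind[is.na(ind)] <- 0
  if (is.vector(x) && is.vector(repl)) {
    ## vectors: replace matched elements
    x[ind != 0] <- repl[ind]
    return(x)
  } else if (identical(ncol(x), ncol(repl))) {
    ## matrices/data frames: replace whole rows
    x[ind != 0, ] <- repl[ind, ]
    return(x)
  }
  ## neither case applies: fail loudly instead of silently returning NULL
  stop("'x' and 'repl' must both be vectors, or have the same number of columns")
}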
Whereas replace() can be used only on vectors of the same dimension,
replace2() can be used on vectors and matrices/data frames, and the
replacement data can have a different number of rows. It also works with
index vectors containing NAs.
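
As a minimal sketch of the NA handling described above, the index can
be zeroed before it is used for subsetting. The data here (x, values,
repl) are made up for illustration and are not the objects from the
session:

```r
## Illustrative data only; not the objects used elsewhere in this post.
x      <- c("a", "b", "c", "d")
values <- c("a", "d")
repl   <- c("A", "D")

ind <- match(x, values)    # NA marks elements of x with no match
ind[is.na(ind)] <- 0L      # turn "no match" into index 0

out <- x
## a 0 index selects nothing, so repl[ind] holds only the matched
## replacements, in the same order as the TRUE positions of ind != 0
out[ind != 0] <- repl[ind]
out
```

Zero is convenient here because subsetting with 0 drops the element,
so the left and right sides of the assignment end up the same length.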
> ##for vectors
> (indx <- match(x, repl.tab[, 1], nomatch = 0))
 [1] 2 3 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 3 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
[46] 0 0 0 0 0 0 0
> head(replace2(x, indx, repl.tab[, 2]))
[1] "A" "B" "c" "D" "e" "f"
> (indx <- match(x, repl.tab[, 1])) ##index vector with NAs
 [1]  2  3 NA  5 NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA  2  3 NA  5
[31] NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA
> head(replace2(x, indx, repl.tab[, 2]))
[1] "A" "B" "c" "D" "e" "f"
> ##for matrices/dataframes
> head(xx <- cbind(x, x))
     x   x
[1,] "a" "a"
[2,] "b" "b"
[3,] "c" "c"
[4,] "d" "d"
[5,] "e" "e"
[6,] "f" "f"
> (repl.tab2 <- cbind(repl.tab[, 2], repl.tab[, 2]))
     [,1] [,2]
[1,] "aa" "aa"
[2,] "A"  "A"
[3,] "B"  "B"
[4,] NA   NA
[5,] "D"  "D"
[6,] "zz" "zz"
> head(replace2(xx, indx, repl.tab2))
     x   x
[1,] "A" "A"
[2,] "B" "B"
[3,] "c" "c"
[4,] "D" "D"
[5,] "e" "e"
[6,] "f" "f"
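
The session above only exercises matrices. A rough sketch of the same
row-wise replacement on a data frame might look like the following.
The objects df, repl_df and keys are hypothetical, and the assignment
is written out inline rather than through replace2() so that the
snippet runs on its own:

```r
## Hypothetical data frames, for illustration only.
df      <- data.frame(key = c("a", "b", "c"), val = c(1, 2, 3),
                      stringsAsFactors = FALSE)
repl_df <- data.frame(key = c("A", "C"), val = c(10, 30),
                      stringsAsFactors = FALSE)
keys    <- c("a", "c")     # row i of repl_df replaces rows matching keys[i]

ind <- match(df$key, keys, nomatch = 0L)   # 1 0 2

## rows with a match are overwritten by the corresponding rows of repl_df;
## the 0 index drops out of repl_df[ind, ], so both sides have two rows
df[ind != 0, ] <- repl_df[ind, ]
df
```

Note that repl_df has fewer rows than df; only the number and order of
columns need to agree for the row assignment to line up.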
Does this function have any generic value? Are there obvious
implementation mistakes?
Regards
Liviu