[R] avoiding timconsuming for loop renaming identifiers
    François Pinard 
    pinard at iro.umontreal.ca
       
    Sat Jul 21 04:03:01 CEST 2007
    
    
  
[toby909 at gmail.com]
>I was wondering if I can avoid a time-consuming for loop on my 600000 
>obs dataset.
>school_id   y
>8           9.87
>8           8.89
>8           7.89
>8           8.88
>20          6.78
>20          9.99
>20          8.79
>31          10.1
>31          11
>There are, say, 143 different schools in this 600000 obs dataset.
>I need to thave sequential identifiers, 1,2,3,4,5,...,143.
Hello, Toby.  Maybe:
   dta$id <- cumsum(c(1, diff(dta$school_id) != 0))
-- 
François Pinard   http://pinard.progiciels-bpi.ca
    
    
More information about the R-help
mailing list