[R] Problem to generate training data set and test data set
    Aimin Yan 
    aiminy at iastate.edu
       
    Mon Dec 25 18:35:17 CET 2006
    
    
  
I have a full data set like this:
    aa bas    aas bms   ams bcu        acu     omega       y
1 ALA   0 127.71   0 69.99   0 -0.2498560  79.91470 outward
2 PRO   0  68.55   0 55.44   0 -0.0949008  76.60380 outward
3 ALA   0  52.72   0 47.82   0 -0.0396550  52.19970 outward
4 PHE   0  22.62   0 31.21   0  0.1270330 169.52500  inward
5 SER   0  71.32   0 52.84   0 -0.1312380   7.47528 outward
6 VAL   0  12.92   0 22.40   0  0.1728390 149.09400  inward
......................................................................................
aa have 19 levels, and there are different number of observation for each 
levels.
I want to pick 75% of observations of each levels randomly to generate a 
training set,
and 25% of observation of each levels to generate a testing set.
Does anyone know to do this?
Thanks
Aimin Yan
    
    
More information about the R-help
mailing list