[R] Computing means of multiple variables based on a condition
    KMNanus 
    kmnanus at gmail.com
       
    Thu May 26 00:37:54 CEST 2016
    
    
  
I have a large dataset, a sample of which is:
a<- c(“A”, “B”,“A”, “B”,“A”, “B”,“A”, “B”,“A”, “B”)
b <-c(15, 35, 20,  99, 75, 64, 33, 78, 45, 20)
c<- c( 111, 234, 456, 876, 246, 662, 345, 480, 512, 179)
d<- c(1.1, 3.2, 14.2, 8.7, 12.5, 5.9, 8.3, 6.0, 2.9, 9.3) 
df <- data.frame(a,b,c,d)
I’m trying to construct a data frame that shows the means of c & b based on the condition of d and grouped by a.
I want to create the data frame below, then use ggplot2 to create a line plot of b at various conditions of d.
I can compute the grouped means (d>=2, d>=4, etc.) one at a time using dplyr but haven’t figured out how to put them all together or put them in one data frame.
I’d rather not use a loop and am relatively new to R.  Is there a way i can use tapply and set it to the conditions above so that I can create the df below?
        condition    mean(b)     mean(c)    
A        d>=2          ____         _____
B        d>=2          ____         _____
A        d>=4          ____         _____
B        d>=4         ____         _____
A        d>=6         ____         _____
B       d>=6         ____         _____
Ken
kmnanus at gmail.com
914-450-0816 (tel)
347-730-4813 (fax)
    
    
More information about the R-help
mailing list