[R] Help with IF command strings
    arun 
    smartpink111 at yahoo.com
       
    Fri Jul 12 17:53:28 CEST 2013
    
    
  
Hi,
Regarding the 2nd issue of mean=3.8 being "too high", could you explain it.
#Using the same example:
 dat1$V21[dat1$V2==1|dat1$V2==0]
#[1]  6  2  1 10  0
 (6+2+1+10+0)/5
#[1] 3.8
 mean(dat1$V21[dat1$V2==1|dat1$V2==0])
#[1] 3.8
About missing data:
set.seed(55)
dat2<- as.data.frame(matrix(sample(c(NA,0:4),26*10,replace=TRUE),ncol=26))  ####new example dataset
 dat2$V2
 #[1]  4 NA  0  0  1  3  2  4  2  1
dat2$V21
 #[1] NA  3  0  0  2  0  4  0  3 NA
(dat2$V2==1|dat2$V2==0) &!is.na(dat2$V2)
# [1] FALSE FALSE  TRUE  TRUE  TRUE FALSE FALSE FALSE FALSE  TRUE
 dat2$V21[(dat2$V2==1|dat2$V2==0) &!is.na(dat2$V2)]
#[1]  0  0  2 NA
mean(dat2$V21[(dat2$V2==1|dat2$V2==0) &!is.na(dat2$V2)],na.rm=TRUE)
#[1] 0.6666667
 (0+0+2)/3
#[1] 0.6666667
If this doesn't solve the problem, please provide a reproducible example using ?dput() 
ex:
dput(head(dataset,20))
A.K.
When I enter that formula I get "NA" or NaN" as an answer.  I have some 
missing data, which was entered in as NA, so I'm not sure if that is the
 problem.  Originally I thought I would need to do the entire set of 
equations you posted, but that gave me 3.8 as a mean, which I know is 
too high to be the mean for this data set. 
Thanks 
----- Original Message -----
From: arun <smartpink111 at yahoo.com>
To: R help <r-help at r-project.org>
Cc: 
Sent: Friday, July 12, 2013 8:21 AM
Subject: Re: Help with IF command strings
Hi,
Not sure I understand your question.
Suppose `data1` is your real data, but if the column names are different, change "V21", "V2" by those in the real data. Based on your initial post, the column names seemed to be the same.
mean(data1$V21[data1$V2==1|data1$V2==0])
A.K.  
What values would I substitute by real data.  I did everything the way 
you posted, and I got 3.8 as well.  So I'm curious what values I would 
change to get the mean for the actual data? 
----- Original Message -----
From: arun <smartpink111 at yahoo.com>
To: R help <r-help at r-project.org>
Cc: 
Sent: Thursday, July 11, 2013 9:21 PM
Subject: Re: Help with IF command strings
HI,
Try this:
set.seed(485)
dat1<- as.data.frame(matrix(sample(0:10,26*10,replace=TRUE),ncol=26))
mean(dat1$V21[dat1$V2==1|dat1$V2==0])
#[1] 3.8
#or
with(dat1,mean(V21[V2==1|V2==0]))
#[1] 3.8
A.K.
I have data in 26 columns, I'm trying to get a mean for column 21 only for the participants that are either 0 or 1 in column 2. 
One of the commands I tried looked something like this 
mean(data1$V21, if(V2 = 1))   
So basically I need to have the program run a mean (and later 
other forms of analysis) on participants based on their condition. 
either 0 or 1. 
Help is greatly appreciated. 
Thanks
    
    
More information about the R-help
mailing list