[R] Re: point-biserial correlation
    Noel Yvonnick 
    noel at univ-lille3.fr
       
    Mon Mar 31 17:07:00 CEST 2003
    
    
  
> has anyone written a package/function in R for computing a point-
> biserial resp. biserial correlation?
Note that the point-biserial correlation is nothing but the standard 
correlation coefficient when one of the variables is dichotomous, so that 
cor(.) is OK.
The biserial is different and includes a correction for the so-called "point 
of dichotomy". The following should work (translating a formula found in a 
psychometric manual) :
cor.biserial = function(x,y)
{
  stopifnot(is.factor(x))
  stopifnot(length(levels(x))==2)
  stopifnot(length(x)==length(y))
  N = length(y)
  # Success / Failure frequencies
  f = table(x)/length(x)
  # Means of success/failure groups on the global score
  m = tapply(y,x,mean)
  # Variance of the global score
  Sy = sqrt(var(y)*(N-1)/N)
  # Biserial correlation
  # Be cautious in interpreting the sign : 
  # depends upon the ordering of levels(x)
  ((m[1]-m[2])/Sy)*(f[1]*f[2]/dnorm(f[1]-.5))
}
Yvonnick Noel, PhD.
Department of Psychology
U. of Lille 3
    
    
More information about the R-help
mailing list