[R] Strange dataframe behavior
Charles C. Berry
cberry at tajo.ucsd.edu
Tue Oct 23 20:23:21 CEST 2007
On Tue, 23 Oct 2007, Sergey Goriatchev wrote:
> Hello,
>
> I have a question regarding the following output:
>
>> database <- read.delim(file=path.input.file, header=TRUE, dec=".", sep="\t", na.strings = "#NV")
>> str(database)
> 'data.frame': 314 obs. of 13 variables:
> $ S : Factor w/ 314 levels "307073","400212",..: 147 72 299 137
> 162 62 189 236 134 307 ...
> $ A : Factor w/ 314 levels "Alfa",...: 285 258 197 3 81 162 183 272
> 73 301 ...
> $ M: Factor w/ 19 levels "@NA","A",..: 18 10 11 6 7 12 17 17 11 6 ...
> $ W : num 0 0 0 0 0 ...
> $ T : num 0.0467 0.1095 0.0252 0.0821 -0.0275 ...
> $ C : num 0 0 0 0 0 ...
> $ MF : num -0.658 0.261 0.922 -1.897 -1.884 ...
> $ V : num 0.0585 -1.0852 -0.3156 -1.0592 0.2810 ...
> $ G : num -0.568 -1.302 0.225 -1.473 -0.541 ...
> $ Mo : num 0.34967 0.42807 -0.41407 -0.18216 -0.00305 ...
> $ R : num -0.5413 -2.0000 0.5353 -1.1437 -0.0776 ...
> $ Tr : num -0.12816 1.04148 0.00647 -0.02424 -1.66834 ...
> $ Su : num -1.611 1.160 -0.528 -0.091 -1.148 ...
>> which(is.na(database))
> [1] 675 704 774 887
>
> So, I have 314 observations, but there are unknown NA observations!
> I remove one observation (for certain reasons), and remove the
> corresponding factor level, then:
>> str(database)
> 'data.frame': 313 obs. of 13 variables:
> ....
>> which(is.na(database))
> [1] 673 702 772 885
>
> The removal of ONE observation moves NAs by two positions.
>
> Maybe someone have an idea what these NA observations mean????
If you mean why they move by two positions, I do.
But you can figure this out, if you do this:
which( is.na(database) , arr.ind = TRUE )
and maybe this
row( database )
col( database )
Chuck
> Thanks in advance for your time and help!
>
> Sergey
> University of Zurich
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
Charles C. Berry (858) 534-2098
Dept of Family/Preventive Medicine
E mailto:cberry at tajo.ucsd.edu UC San Diego
http://famprevmed.ucsd.edu/faculty/cberry/ La Jolla, San Diego 92093-0901
More information about the R-help
mailing list