[BioC] Error when using impute to get missing values
Martin Morgan
mtmorgan at fhcrc.org
Sat Mar 24 19:18:38 CET 2012
On 03/24/2012 10:07 AM, Tim Triche, Jr. wrote:
> This is a recurring problem with impute. I've tried tracing it and
> eventually hit a dead end; if the bug is reproducible (it isn't always, for
> me), running R as a gdb subprocess might help debugging it. If you want, I
> can give it a shot, time permitting; assuming this happens with this
> experiment every time, send me a script to reproduce it (as in, retrieve
> the data, put it in a matrix, and try imputing it) and I will see what I
> can do.
Also, on linux at any rate it's easy to
R -d valgrind -f script.R
and this usually points to the problem. valgrind is slow so it ends up
paying to make the example minimal (save hest and then the commands in
script.R will load impute, the data, and then evaluate impute.knn).
Martin
>
> Or ask the maintainer, Balasubramanian Narasimhan (help(package='impute')
> for email address), who may be able to do it faster. Or he may be busier.
> Could go either way :-)
>
>
> On Sat, Mar 24, 2012 at 9:36 AM, David Westergaard
> <s093629 at student.dtu.dk>wrote:
>
>> Hello,
>>
>> I am currently working on the dataset from ArrayExpress,
>> http://www.ebi.ac.uk/arrayexpress/experiments/E-GEOD-1560. It has a
>> lot of missing values filled by null, and I am trying to fill these in
>> using impute.knn. However, when I try to do so, I get a lot of errors:
>> *** caught segfault ***
>> address 0x3f94a7e775b5bc59, cause 'memory not mapped'
>> aswell as
>> 2000000007d68000-2000000007d78000 r-xp 00000000 08:03 235217090
>> /lib/libgcc_s.so.1
>>
>> which causes R to crash.
>>
>> Sample code looks like:
>>
>> # Read table, which contains two rows of headers
>> Data<-
>> read.table(file=file,header=FALSE,stringsAsFactors=FALSE,sep="\t",skip=2,na.string='null')
>> hl<- readLines(file,2)
>> hl<- strsplit(hl, '\t')
>> names(Data)<- sub('_$', '', paste(hl[[1]], hl[[2]], sep="_"))
>> # Select only those columns which have the actual preprocessed value,
>> x<- c(1,grep("C57_T40_.*AGILENT_VALUE",names(Data),perl=TRUE))
>> signals<- Data[,x]
>> hest<- as.matrix(signals[,-1])
>> # Error occurs at this step.
>> hest2<- impute.knn(hest)
>>
>> Any help as to why this happens is greatly appreciated.
>>
>>> sessionInfo()
>> R version 2.14.1 (2011-12-22)
>> Platform: ia64-unknown-linux-gnu (64-bit)
>>
>> locale:
>> [1] C
>>
>> attached base packages:
>> [1] stats graphics grDevices utils datasets methods base
>>
>> other attached packages:
>> [1] impute_1.28.0
>>
>> Best Regards,
>> David Westergaard
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at r-project.org
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives:
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
>
>
>
--
Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109
Location: M1-B861
Telephone: 206 667-2793
More information about the Bioconductor
mailing list