[BioC] How to convert from IRanges(List) to Rle(List)
Nicolas Delhomme
delhomme at embl.de
Sun Apr 8 14:10:10 CEST 2012
Hi Martin,
I had to change my data structure, but using transcriptLocs2refLocs, computation time is down to one second!!!
Amazing, really. Thanks!!
Nico
---------------------------------------------------------------
Nicolas Delhomme
Genome Biology Computational Support
European Molecular Biology Laboratory
Tel: +49 6221 387 8310
Email: nicolas.delhomme at embl.de
Meyerhofstrasse 1 - Postfach 10.2209
69102 Heidelberg, Germany
---------------------------------------------------------------
On 7 Apr 2012, at 20:12, Martin Morgan wrote:
> On 04/07/2012 05:39 AM, Nicolas Delhomme wrote:
>> Hi all,
>>
>> I'm just wondering if there would be a direct way to convert an
>> IRanges to an Rle, as in: as(rng,"Rle"). At the moment, I can convert
>> my IRanges into an integer vector and cast that as an Rle
>> (Rle(as.integer(rng)), but that is not extremely efficient on a long
>> IRangesList (with> 700,000 IRanges in it). Takes ~10 mins with an
>> sapply.
>>
>> Why I want that is for the following: I have an IRangesList of
>> transcripts (describing exons at the genome level) and for every one,
>> I have a bp position at the transcript level that I want to convert
>> into a genomic bp position. Basically, I need to be able to convert a
>> given transcript coordinate into the corresponding genomic
>> coordinate. My IRanges contain the genomic coordinates of every
>> transcript and by converting it into an integer vector, I can select
>> the right genomic bp coordinate by using the transcript bp coordinate
>> as an index (as.integer(rng)[transcript.pos]).
>>
>> I considered the IRanges approach because I keep the transcript name
>> and I'm sure that I looking up the right coord in the right
>> transcript, but I'm open to other suggestions.
>
> Hi Nico -- VariantAnnotation::refLocsToLocalLocs, GenomicFeatures::transcriptLocs2refLocs and IRanges::map might do this for you; no direct experience on my part, though. Martin
>
>>
>> Thanks for any pointers,
>>
>> Cheers,
>>
>> Nico
>>
>> ---------------------------------------------------------------
>> Nicolas Delhomme
>>
>> Genome Biology Computational Support
>>
>> European Molecular Biology Laboratory
>>
>> Tel: +49 6221 387 8310 Email: nicolas.delhomme at embl.de
>> Meyerhofstrasse 1 - Postfach 10.2209 69102 Heidelberg, Germany
>>
>> _______________________________________________ Bioconductor mailing
>> list Bioconductor at r-project.org
>> https://stat.ethz.ch/mailman/listinfo/bioconductor Search the
>> archives:
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>
>
> --
> Computational Biology
> Fred Hutchinson Cancer Research Center
> 1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109
>
> Location: M1-B861
> Telephone: 206 667-2793
More information about the Bioconductor
mailing list