[Bioc-sig-seq] trimLRPatterns question
Harris A. Jaffee
hj at jhu.edu
Fri Oct 9 22:23:23 CEST 2009
Allowing 2 errors, the first two letters of Rpattern3 match the last
two letters of the subject.
Maybe you want max.Rmismatch=2, which is effectively c(rep(-1,11),2),
or something else to prevent
unintended trimming.
On Oct 9, 2009, at 3:53 PM, joseph wrote:
> Hello
> In the example below, Rpattern2 (2 mismatches) trims the subject
> correctly, while Rpattern3 (3 mismatches) which is not supposed to
> do any trimming, it trims the last 2 nucleotides. Can you explain why?
>
>
>> subject <- DNAStringSet("GCTGGAACCCAGGGTGTTGTACCTGTAGGCACCA")
>> Rpattern2 <- "CTGaAGGCACgA" # 2 mismatches
>> Rpattern3 <- "tTGaAGGCACgA" # 3 mismatches
>> trimLRPatterns(Rpattern = Rpattern2, subject = subject,
>> max.Rmismatch = rep(2, 12))
> A DNAStringSet instance of length 1
> width seq
> [1] 22 GCTGGAACCCAGGGTGTTGTAC
>> trimLRPatterns(Rpattern = Rpattern3, subject = subject,
>> max.Rmismatch = rep(2, 12))
> A DNAStringSet instance of length 1
> width seq
> [1] 32 GCTGGAACCCAGGGTGTTGTACCTGTAGGCAC
>
>
>> sessionInfo()
> R version 2.10.0 alpha (2009-10-01 r49909)
> x86_64-apple-darwin9.8.0
>
> locale:
> [1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] rtracklayer_1.5.17 RCurl_1.2-0 bitops_1.0-4.1
> ShortRead_1.3.36
> [5] lattice_0.17-25 BSgenome_1.13.16 Biostrings_2.13.48
> IRanges_1.3.87
>
> loaded via a namespace (and not attached):
> [1] Biobase_2.5.8 grid_2.10.0 hwriter_1.1 tools_2.10.0 XML_2.6-0
>>
>
>
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioc-sig-sequencing mailing list
> Bioc-sig-sequencing at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing
More information about the Bioc-sig-sequencing
mailing list