[BioC] ChIPpeakAnno problem w getAllPeakSequence()
Ou, Jianhong
Jianhong.Ou at umassmed.edu
Wed Oct 17 18:03:45 CEST 2012
Hi Anna,
To get the peak sequences you can try BSgenome + BStrings first. Try ?getSeq to extract the sequence from local database, faster than remote mart.
Because I can not repeat your error, could you please share the subset of you data to me for testing?
Yours sincerely,
Jianhong Ou
jianhong.ou at umassmed.edu
On Oct 17, 2012, at 9:14 AM, Anna Ehrlund wrote:
> Dear list,
>
> I've just started analyzing a ChIP-seq data set. I generated a peak list using "standard" utilities (bowtie, MACS) and loaded it into R in the ChIPpeakAnno package. I managed to annotate the peaks but when I tried to retrieve the peak sequences using the getAllPeakSequence() function I ran into a problem:
>
>
> peaksequences<-getAllPeakSequence(mergedpeakannotations, upstream=100, downstream=100, genome=mart, AnnotationData=getAnnotation(mart, featureType="TSS"))
> Error in getBM(c(seqType, type), filters = c(type, "upstream_flank"), :
> Query ERROR: caught BioMart::Exception::Usage: Filter upstream_flank NOT FOUND
>
> The error message sometimes states downstream_flank instead of upstream. At one point I got it to run on a small subset of the data (first 10 rows) but later the same command failed with this same error message.
>
> Some info about the objects:
> mergedpeakannotations: RangedData object, created using mergedpeakannotations<- annotatePeakInBatch(mergedpeak.rd, AnnotationData=getAnnotation(mart, featureType="TSS"))
>
> mart=useMart(biomart="ensembl", dataset="hsapiens_gene_ensembl")
>
> Anybody got a clue about why this fails? All help would be much appreciated!
>
> Best regards!
> Anna
>
>
>
> sessionInfo()
> R version 2.15.1 (2012-06-22)
> Platform: x86_64-pc-mingw32/x64 (64-bit)
>
> locale:
> [1] LC_COLLATE=Swedish_Sweden.1252 LC_CTYPE=Swedish_Sweden.1252
> [3] LC_MONETARY=Swedish_Sweden.1252 LC_NUMERIC=C
> [5] LC_TIME=Swedish_Sweden.1252
>
> attached base packages:
> [1] grid stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] ChIPpeakAnno_2.6.0 limma_3.14.1
> [3] org.Hs.eg.db_2.8.0 GO.db_2.8.0
> [5] RSQLite_0.11.2 DBI_0.2-5
> [7] AnnotationDbi_1.20.1 BSgenome.Ecoli.NCBI.20080805_1.3.17
> [9] BSgenome_1.26.1 GenomicRanges_1.10.2
> [11] Biostrings_2.26.2 IRanges_1.16.2
> [13] multtest_2.14.0 Biobase_2.18.0
> [15] biomaRt_2.14.0 BiocGenerics_0.4.0
> [17] VennDiagram_1.5.1
>
> loaded via a namespace (and not attached):
> [1] MASS_7.3-22 parallel_2.15.1 RCurl_1.95-1.1 splines_2.15.1 stats4_2.15.1
> [6] survival_2.36-14 tools_2.15.1 XML_3.95-0.1
>
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list