[Bioc-sig-seq] ShortRead, readAligned() and qa()
Martin Morgan
mtmorgan at fhcrc.org
Tue Jun 9 13:43:06 CEST 2009
Ivan Gregoretti <ivangreg at gmail.com> writes:
> Hello everybody,
>
> Can qa() work on compressed *_export.txt files?
>
>
>
> The function readAligned() is very smart because it does not care if the
> export file is plain or compressed. You just do
>
> aln <- readAligned(sp, fileNamePattern)
>
> and the lane is loaded up.
>
> I tried to run qa() on a directory with compressed export files and I got:
>
>> sp <- SolexaPath(experimentPath=myExperimentPath,
> analysisPath=myAnalysisPath)
>> sp
> class: SolexaPath
> experimentPath: /data/igregore/all/runs/090529/
> dataPath: NA
> scanPath: NA
> imageAnalysisPath: NA
> baseCallPath: NA
> analysisPath: GERALD_01-06-20...
>> qa <- qa(sp)
> Error: Input/Output
> no input files found
> dirPath: /data/igregore/all/runs/090529/GERALD_01-06-2009_niddk/
> pattern: .*_export.txt$
>
>
> I guess that qa()'s inability to read .txt.gz is a feature rather than a
> bug. If so, would you please consider adding this capability on a future
> ShortRead release?
Hi Ivan -- I think you should be able to specify a pattern
qa(sp, pattern=".*_export.txt.gz")
and failing that provide a full directory path
qa(analysisPath(sp), pattern=".*_export.txt.gz")
let me know if that does not work.
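
A minimal base-R sketch of why the default pattern shown in the error message skips compressed files, while the pattern suggested above picks them up. The file names are hypothetical, and the stricter pattern at the end is an assumption for illustration, not something taken from ShortRead itself:

```r
# Hypothetical file names, for illustration only
files <- c("s_1_export.txt", "s_2_export.txt.gz", "s_3_sequence.txt")

# Default pattern from the error message: "$" anchors the match at ".txt",
# so a trailing ".gz" prevents a match
default_hits <- grepl(".*_export.txt$", files)

# Pattern suggested above: requires the ".gz" suffix
gz_hits <- grepl(".*_export.txt.gz", files)

# Stricter variant (an assumption): escape the literal dots and anchor the end
strict_hits <- grepl("_export\\.txt\\.gz$", files)

files[default_hits]  # only the uncompressed export file
files[gz_hits]       # only the compressed export file
files[strict_hits]   # same file as gz_hits for these names
```

An unescaped "." matches any character, so the looser pattern could in principle also match unrelated names; the escaped form is narrower if that matters for a given directory.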
Martin
> Thank you,
>
> Ivan
>
>> sessionInfo()
> R version 2.9.0 (2009-04-17)
> x86_64-unknown-linux-gnu
>
> locale:
> LC_CTYPE=en_US;LC_NUMERIC=C;LC_TIME=C;LC_COLLATE=C;LC_MONETARY=C;LC_MESSAGES=en_US;LC_PAPER=en_US;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_US;LC_IDENTIFICATION=C
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] ShortRead_1.2.0 lattice_0.17-22 BSgenome_1.12.0 Biostrings_2.12.1
> [5] IRanges_1.2.0
>
> loaded via a namespace (and not attached):
> [1] Biobase_2.4.1 grid_2.9.0 hwriter_1.1
>
>
> Ivan Gregoretti, PhD
> National Institute of Diabetes and Digestive and Kidney Diseases
> National Institutes of Health
> 5 Memorial Dr, Building 5, Room 205.
> Bethesda, MD 20892. USA.
> Phone: 1-301-496-1592
> Fax: 1-301-496-9878
>
--
Martin Morgan
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109
Location: Arnold Building M1 B861
Phone: (206) 667-2793