[BioC] Gene names on topTable by limma

Wed Oct 7 16:21:38 CEST 2009

Hi Marcos,

It seems from your code that you already know 
this, but I'll state it for the archives: you can 
put a variety of annotation information in 
fit$genes, and all of it will be automatically 
pulled into the output of topTable, which is 
handy if you'll be using topTable for several 
different contrasts. You already had this in your code to add the gene symbols:

 > fit2$genes$Symbol=getSYMBOL(fit2$genes$ID, "hgu133plus2")

You can add as many other columns as you want:

 > fit2$genes$EG <- getEG(fit2$genes$ID, "hgu133plus2")

 > fit2$genes$GeneName <- unlist(mget(fit$genes$ID, hgu133plus2GENENAME))

etc.

Cheers,
Jenny

At 03:56 PM 10/6/2009, James W. MacDonald wrote:
>Please don't take things off-list.
>
>Marcos Pinho wrote:
>>Jim,
>>
>>thanks for your reply. I was already able to 
>>introduce gene symbols. What I was looking for 
>>was to somehow create a column with short gene names for my final report.
>
>There isn't a getShortName() in annotate, but it 
>isn't difficult to get these data directly:
>
>names <- unlist(mget(fit$genes$ID, hgu133plus2GENENAME))
>
>see ls("package:hgu133plus2.db") for all the annotation possibilities.
>
>Best,
>
>Jim
>
>
>>
>>cheers,
>>
>>Marcos
>>
>>*> library(limma)*
>>*> fit =lmFit (esetF, design)*
>>*> fit2=eBayes(fit)*
>>*> library(annotate)*
>>Loading required package: xtable
>>
>>Attaching package: 'xtable'
>>
>>
>>         The following object(s) are masked from package:widgetTools :
>>
>>          label
>>
>>*> fit2$genes$Symbol=getSYMBOL(fit2$genes$ID, "hgu133plus2")*
>>*> topTable(fit2, coef=2)*
>>* *
>>*fit2$genes$EG <- getEG(fit2$genes$ID, "hgu133plus2")***
>>               ID Symbol     logFC  AveExpr         t      P.Value
>>adj.P.Val
>>5253   209993_at  ABCB1  8.167898 6.410285  28.41936 1.387084e-06 0.008649451
>>3378  1553436_at  MUC19  7.512443 6.097913  25.70956 2.249499e-06 0.008649451
>>9828 206488_s_at   CD36  7.318355 6.081049  22.79634 4.015521e-06 0.008649451
>>6995 222392_x_at   PERP  6.047929 6.736659  20.28712 7.035669e-06 0.008649451
>>8973   210603_at  ARD1B  6.088776 5.773605  20.23547 7.122355e-06 0.008649451
>>1412   235683_at  SESN3  6.589716 5.620394  20.17661 7.222719e-06 0.008649451
>>7573 216191_s_at   TRD@ -6.153962 5.640399 -19.71517 8.071494e-06 0.008649451
>>4013   202948_at  IL1R1 -6.231328 5.580407 -19.61010 8.281240e-06 0.008649451
>>5302   205934_at  PLCL1  5.836512 5.809832  19.53496 8.435274e-06 0.008649451
>>4061 205786_s_at  ITGAM -6.788839 7.957034 -18.03301 1.238036e-05 0.008649451
>>On Tue, Oct 6, 2009 at 4:18 PM, James W. 
>>MacDonald <jmacdon at med.umich.edu <mailto:jmacdon at med.umich.edu>> wrote:
>>     Hi Marcos,
>>
>>     Marcos Pinho wrote:
>>         Dear list,
>>         can anybody suggest how could I insert gene names in additional
>>         to gene
>>         symbols on my topTable generated by limma with my differentially
>>         expressed
>>         genes?
>>
>>     It really depends on what annotation you have for your genes.
>>     However the general idea would be to use one of the org.Xx.eg.db
>>     packages to annotate your genes, or if you are using a commercial
>>     chip, use a chip-specific annotation package.
>>     Say your MArrayLM object is called 'fit'
>>     tabl <- topTable(fit) ##maybe need a coef argument as well.
>>     if you have Entrez Gene IDs
>>     tabl$Genes <- unlist(mget(tabl$ID, org.Xx.eg.dbSYMBOL))
>>     if you have an affy chip
>>     tabl$Genes <- unlist(mget(tabl$ID, theaffychipSYMBOL))
>>     or so.
>>     Does that help?
>>     Best,
>>     Jim
>>
>>         cheers,
>>         Marcos B. Pinho
>>         Programa de Engenharia Química - PEQ
>>         Laboratório de Engenharia de Cultivos Celulares- LECC
>>         Universidade Federal do Rio de Janeiro - UFRJ
>>         Instituto Nacional de Câncer - INCA
>>         Rio de Janeiro - Brasil
>>                [[alternative HTML version deleted]]
>>
>> 
>>------------------------------------------------------------------------
>>         _______________________________________________
>>         Bioconductor mailing list
>>         Bioconductor at stat.math.ethz.ch
>>         <mailto:Bioconductor at stat.math.ethz.ch>
>>         https://stat.ethz.ch/mailman/listinfo/bioconductor
>>         Search the archives:
>>         http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
>>     --     James W. MacDonald, M.S.
>>     Biostatistician
>>     Douglas Lab
>>     University of Michigan
>>     Department of Human Genetics
>>     5912 Buhl
>>     1241 E. Catherine St.
>>     Ann Arbor MI 48109-5618
>>     734-615-7826
>>
>>
>>--
>>Marcos B. Pinho
>>Programa de Engenharia Química - PEQ
>>Laboratório de Engenharia de Cultivos Celulares- LECC
>>Universidade Federal do Rio de Janeiro - UFRJ
>>Instituto Nacional de Câncer - INCA
>>Rio de Janeiro - Brasil
>
>--
>James W. MacDonald, M.S.
>Biostatistician
>Douglas Lab
>University of Michigan
>Department of Human Genetics
>5912 Buhl
>1241 E. Catherine St.
>Ann Arbor MI 48109-5618
>734-615-7826
>
>_______________________________________________
>Bioconductor mailing list
>Bioconductor at stat.math.ethz.ch
>https://stat.ethz.ch/mailman/listinfo/bioconductor
>Search the archives: 
>http://news.gmane.org/gmane.science.biology.informatics.conductor

Jenny Drnevich, Ph.D.

Functional Genomics Bioinformatics Specialist
W.M. Keck Center for Comparative and Functional Genomics
Roy J. Carver Biotechnology Center
University of Illinois, Urbana-Champaign

330 ERML
1201 W. Gregory Dr.
Urbana, IL 61801
USA

ph: 217-244-7355
fax: 217-265-5066
e-mail: drnevich at illinois.edu