[R] reshaping the data
    Jim Lemon 
    drjimlemon at gmail.com
       
    Mon Jul  3 11:38:49 CEST 2017
    
    
  
Hi Bogdan,
If you want something like this:
       22M    17M    11M
AEBP1  SNV    SNV    NA
ATR    INDEL  NA     NA
ATR    SNV    NA     NA
BTK    NA     NA     SNV
BTK    NA     NA     INDEL
you're in trouble with repeated row names (ATR and BTK each appear
twice). The same goes for column names if you add extra columns for
repeated values. You'll have to change something to make this consistent.
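
One possible way to make it consistent, offered only as a sketch and not something settled in this thread, is to collapse the repeated Gene/Sample pairs into a single cell such as "INDEL,SNV". The data frame x below simply retypes the six rows from the question, and the comma separator is an arbitrary choice. Base R's tapply is used so the example runs without extra packages.

```r
# Example data retyped from the question (six rows).
x <- data.frame(
  Sample = c("22M", "17M", "22M", "22M", "11M", "11M"),
  Gene   = c("AEBP1", "AEBP1", "ATR", "ATR", "BTK", "BTK"),
  Type   = c("SNV", "SNV", "INDEL", "SNV", "SNV", "INDEL"),
  stringsAsFactors = FALSE
)

# One row per Gene, one column per Sample. Where a Gene/Sample pair
# occurs more than once, its Types are pasted into one string.
# Pairs that never occur are left as NA.
z <- tapply(x$Type, list(x$Gene, x$Sample), paste, collapse = ",")

print(z)
```

With reshape2 loaded, acast(x, Gene ~ Sample, value.var = "Type", fun.aggregate = function(v) paste(v, collapse = ",")) should give a similar matrix, although empty cells may then come out as empty strings rather than NA unless fill is set.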
Jim
On Mon, Jul 3, 2017 at 5:22 PM, Bogdan Tanasa <tanasa at gmail.com> wrote:
> Dear all,
>
> I would appreciate some help with the acast/dcast functions in the
> reshape2 package.
>
> Specifically, I'm working with a data frame that has information about
> SAMPLE, GENE, and TYPE of MUTATION (as shown below):
>
> Sample    Gene  Type
> 22M       AEBP1   SNV
> 17M       AEBP1   SNV
> 22M         ATR   INDEL
> 22M         ATR   SNV
> 11M         BTK   SNV
> 11M         BTK INDEL
>
>
> I would like to transform this DATAFRAME into a MATRIX that has GENE on
> ROWS and SAMPLE on COLUMNS, and whose elements are SNV or INDEL
> (i.e. the types of mutations).
>
> The R code starts with:
>
> y <- data.frame(Sample = x$Sample, Gene = x$Gene, Type=x$Type)
>
> z <- acast(y, Gene ~ Sample)
>
> However, z does not contain the information on Type (i.e. SNV or INDEL).
>
> thanks a lot,
>
> -- bogdan