[R-sig-ME] Nested random effect or unbalanced design?
Mark J Brewer
M.Brewer at bioss.ac.uk
Thu Sep 4 16:05:54 CEST 2014
Dear Ilkka,
I don't see B as a random effect, even if it is a random stimulus. If I
understand correctly, in your toy example B has a fixed set of 5 levels,
changing the nature of which would change the experiment - and you are
directly interested in how B affects A (or more specifically, how means
change, rather than a variance summary).
Correlations are often high in the manner you describe for M4; this is a
complex model with many parameters - e.g. this is allowing a separate
relationship between A and B for every subject. You should still be able to
test formally for a difference between M2 and M4.
Best wishes,
Mark
-----Original Message-----
From: r-sig-mixed-models-bounces at r-project.org
[mailto:r-sig-mixed-models-bounces at r-project.org] On Behalf Of Leppänen
Ilkka
Sent: 04 September 2014 14:34
To: r-sig-mixed-models at r-project.org
Subject: [R-sig-ME] Nested random effect or unbalanced design?
Dear list,
I am using lmer to identify a mixed effects model, but I am puzzled by
whether my design has nested random effects or whether it is just
unbalanced.
I have an experiment where subjects are repeatedly measured a physiological
variable A. Each subject sees a specific level of a variable B once every
time they are measured. Theory goes that higher B gives higher A.
The trick is that levels of B are drawn randomly from some distribution for
each subject. These draws are not reproducible in other experiments, so B
should be incorporated somehow as a random effect.
Now if I want to know how B affects A as a random effect, I start from the
model
(M1) A ~ (1|B) + time + (1|subject)
But in this model the random effect B has a very low variance. So I specify
B as a fixed effect with the model
(M2) A ~ B + time + (1|subject)
This model is desirable because I am mostly interested in stating how B
affects A as a fixed effect, not as a nuisance.
However, if I understand nesting correctly, (M2) is not sufficient because
subjects are nested in B. As a toy example, B has levels 1,2,3,4,5 and there
are five repeated measurements. Subject 1 sees B={1,2,3,2,2}, subject 2 sees
B={4,2,1,1,2}, subject 3 sees B={1,2,3,4,5}, and subject 4 sees only one
level B={4,4,4,4,4}. A cross-tabulation of this looks like
> xtabs(~ B+subject, sparse=T)
5 x 4 sparse Matrix of class "dgCMatrix"
S1 S2 S3 S4
1 1 2 1 .
2 3 2 1 .
3 1 . 1 .
4 . 1 1 5
5 . . 1 .
Therefore, because each subject sees their unique set of levels of B, I
would use the model with nested random effects
(M3) A ~ B + time + (1|subject) + (1|B:subject)
Here I get the result that the variance of the random effect B:subject is
very small relative to variance of B or the residual. A likelihood ratio
test does not see (M3) different than the simpler model (M2).
If I decide to go with (M2), I try to keep randomness maximal by examining
the random slopes
(M4) A ~ B + time + (1+B|subject)
But here, the random intercept and slope are highly or perfectly correlated.
My interpretation of this situation is that the by-subject random slopes are
unidentifiable because of the nesting; for example, S4 in the toy example
cannot have a "slope" for B.
How should I proceed? Should I just forget about the fact that B is a random
effect and pretend that my design is unbalanced?
Thank you in advance,
Ilkka Leppänen
Aalto University
_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
--
Biomathematics and Statistics Scotland (BioSS) is formally part of The
James Hutton Institute (JHI), a registered Scottish charity No. SC041796
and a company limited by guarantee No. SC374831
More information about the R-sig-mixed-models
mailing list