Repeated Measures

Repeated Measures

I. Why do repeated measures experiments?

A. Power: subjects serve as their own controls; effects of treatments are measured against the mean effect of all the treatments combined on the subject.

B. Efficiency

C. Example: S^R(A^F) X B^F

N=30, a=3, b=3, n=10

(3 groups of 10 subjects receiving each of 3 treatments)

Source E(MS) df ___ Error Line

1. A nbs²_A + bs²_S(A) a-1 = 2 2

2. S(A) bs²_S(A) a(n-1)= 27

3. B nas²_B + s²_S(A)B b-1 = 2 5

4. A X B ns²_AB + s²_S(A)B a-1)(b-1)= 4 5

5. S(A) X B s²_S(AB) (b-1)a(n-1)=54

If different subjects were run in each treatment, one would need 3 X 30= 90 subjects.

S^R(A^F X B^F) Factorial N=90

Source E(MS) df _ Error Line

1. A nbs²_A + s²_S(AB) a-1=2 4

2. B nas²_B + s²_S(AB) b-1=2 4

3. A X B ns²_AB + s²_S(AB) (a-1)b-1)=4 4

4. S(A X B) s²_S(A)B ab(n-1)=81

For the test of B to reach significance in the repeated measures design one would need an F_.05(2,54)=3.19. For this test to reach significance in the factorial design, one would need an F_.05(2,81)=3.12. So, little power is lost at a great savings in subjects. Also, note that if a smaller number of subjects are run in the factorial design, there is a substantially larger loss of power. For example, if n=4 and so N=36, the test of B would need an F(2,27)value of 3.34 to reach significance at the .05 level.

II. Model for repeated measures with 1 repeated factor.

		Treatment
Subject	1		...	j	...	k
1	y₁₁			y_1j		y_1k
.
i	y_i1			y_ij		y_ik
.
N	y_ni			y_nj		y_nk	.
	_.1			_.j		_.k	..

SStotal= SS(y_ij-_..)²

SSb.people= kS(_i.- _..)²

SSw.people= SS(y_ij-_i.)²

SStotal=SSb.people+SSw.people

SStreat= nS(_.j-_..)²

SSresidual = SS[(y_ij-_..)-(_i.- _..)-( _.j-_..)]²

SSw.people=SStreat + SSresidual

Model: S^R X T^F

Source E(MS) df Error Line

1. S ts²_S n-1

2. T ns²_T + s²_ST k-1 3

3. S X T s²_ST (n-1)(k-1)

III. Assumptions of Univariate Repeated Measures

A. ANOVA assumptions of:

1. Homogeneity of variances

2. Errors are independently and identically distributed as N(0, s²).

3. Subject effects are independent and distributed as N(0, s²).

B. Additional assumption of sphericity.

1. Observations may covary, but the degree of covariance must remain the same across treatments( e.g. no carry-over effects).

2. If the covariances are heterogeneous, the error term will generally be an underestimate and F tests will generally be positively biased (cf. Box 1954).

IV. Comparisons between treatments.

A. Often the hypothesis of interest in a repeated measures design, like hypotheses in factorial designs, does not correspond to the tests for main effects. In this case use a contrast.

MScontrast =

where n=number of observations in the mean

1. MScontrast can be tested against the standard error term for the corresponding effect (e.g. MS_ST).

2. MScontrast can also be tested against deviations from the expected trend. This method treats all of the variance not attributed to the contrast(s) as error. For example, to test a linear contrast by the second method, calculate:

SSdev.linear = SS_ST + (SS_T-SS_linear),

df(dev.linear)= df_ST+df_T-1 = [(n-1)(k-1)]+(k-1)-1 = n(k-1)-1

MSdev.linear= SSdev.linear/df(dev.linear)

When the degrees of freedom are large (say over 30), then the two approaches do not differ. Using this procedure tends to negatively bias F-tests because the denominators are often too large.

3. One could also use a multivariate approach to testing the contrast. This approach is equivalent to creating a new variable that is a linear combination of the dependent variables using the contrast weights ( y* = c_i *y_ijk) and then performing an F-test to test whether the mean of this new variable is different from zero (this is what many computer programs report).

V. Example (Winer p268)

Person Drug 1 Drug 2 Drug 3 Drug 4 Mean

1 30 28 16 34 27

2 14 18 10 22 16

3 24 20 18 30 23

4 38 34 20 44 34

5 26 28 14 30 24.5

Mean 26.4 25.6 15.6 32 24.9

SS_{between S}= 4(170.2)=680.8

SS_{within S} = 811

SS_drugs = 5 (139.64)=698.2

SS_SXD = 112.8

SS_total = 1491.8

Source SS df MS F

Between S 680.8 4 170.2

Within S 811 15 54.07

Drugs 698.2 3 232.73 24.76

S X D 112.8 12 9.4

Total 1491.8 19

If we assume that drugs 1-4 are really 4 dosage levels of the same drug, it makes sense to test for trends.

Drug dosage

26.4 25.6 15.6 32

Contrast Weights

Linear -3 -1 1 3

Quadratic 1 -1 -1 1

Cubic -1 3 -3 1

Tests against SXD

F=MS_contrast/MS_SXD

MS_lin= 5(6.8)²= 11.56 1.23

MS_quad= 5(17.2)² = 369.8 39.34^**

MS_cubic = 5(35.6)² = 316.84 33.7 ^**

Total = 698.2

Tests against deviations from trend:

SS dev.lin = (SSresidual + SSdrugs)- SSlin

799.4 =112.8 + 698.2 - 11.56

SS dev.quad = (SSresidual + SSdrugs)- SSlin-SSquad

429.64 = 112.8 + 698.2 - 11.56-369.8

SS dev.cubic = (SSresidual + SSdrugs)- SSlin-SSquad-SScubic

= SSres=112.8

F_lin = MSlin = 11.56 = .20 df=(1,n(k-1)-1)=(1,14)

Msdev.lin 799.44/14

F_quad = MSquad = 369.8 = 11.19** df=(1,n(k-1)-1-1)=(1,13)

MSdev.quad 429.64/13

F_cubic= MScubic = 316.84 = 33.7** df=(1,n(k-1)-1-1-1)=(1,12)

MSdev.cubic 112.8/12

Multivariate approach

SS DF MS F P

linear 11.560 1 11.560 3.074 0.154

error 15.040 4 3.760

quadratic 369.800 1 369.800 26.797 0.007

error 55.200 4 13.800

cubic 316.840 1 316.840 29.778 0.005

error 42.560 4 10.640

VI. Violations of sphericity

A. Assumptions

For the typical F ratio used in univariate repeated measures analyses to be exact, the data must demonstrate homogeneity of treatment differences; that is, given any two treatments i and j,

s² (v_i-v_j) is constant. If the covariance matrix of the data is symmetrical (i.e., has compound symmetry) then this assumption will be met. For example, if the covariance matrix for a design with three treatments were to have compound symmetry, it would have the form:

T1 T2 T3

T1 s² ps² ps²

T2 ps² s² ps²

T3 ps² ps² s²

where p=constant. Although compound symmetry is not necessary for the assumptions of a repeated measures analysis F-test to be met, it is sufficient. Compound symmetry is violated when one treatment covaries differently with some treatments than with other treatments. This may occur when treatments are not presented in a counterbalanced order (e.g. a latin square is not used). If the covariances are heterogeneous, the error term will generally be an underestimate and the F tests will generally be positively biased (cf. Box 1954).

B. Testing for violations

Compound symmetry can be tested by inspection and by several summary measures, such as Box's (1950) extension of Bartlett's test for homogeneity of variance.

C. Box (1950) test of compound symmetry

1. Take sample variance/covariance matrix S (pooled over levels of any grouping factors).

2. Construct S'= var...cov...cov

. . . .

cov...var...cov

. . . .

cov...cov...var

where var=mean of s²_i (main diagonal) of S

cov=mean of s²ij (off diagonal elements) of S

3. Calculate:

M= -(N-p)ln(|S|/|S'|) | | = determinant of matrix

C= q(q+1)²(2q-3)

6(N-p)(q-1)(q²+q-4)

df= q²+q-4

where N= number of subjects

p= number of groups

q= number of repeated measures treatments

4. Evaluate C²(df)=(1-C)M

If this is significant, then the matrix violates the compound symmetry assumption.

D. Correcting for violations

1. Use multivariate techniques: MANOVA, Hotelling T². These are extensions of ANOVA and t-test for multivariate data.

2. Set bounds on the true significance of the F test (Box 1954).

a. In a repeated measures design, the F ratio for a within-subjects effect is distributed as F[df_numerator e,df_denominator e] where 1/df_numerator < e < 1. e measures the degree to which the covariance matrix deviates from compound symmetry.

b. If e=1, then F ratio has normal df -- this yields a lower bound on the alpha level.

c. When e=1/df_numerator, the numerator of the F ratio has 1 degree of freedom -- this yields an upper bound on the alpha level (maximum heterogeneity of covariances).

d. The true significance of the effect will lie between that provided by these tests.

3. Example for S X T design

a. F ratio s²_T/s²_ST distributed as F[(k-1) e,(k-1)(n-1) e] where 1/(k-1) < e < 1; k is the number of treatments.

b. If e=1, then F ratio has normal df -- this yields a lower bound on the alpha level.

c. If e=1/(k-1), then the F ratio has 1,n-1 degrees of freedom -- this yields an upper bound on the alpha level.

d. The true significance of the effect will lie between that provided by these tests. The F(k-1,(k-1)(n-1)) level is too liberal, while the F(1,n-1) level is too conservative.

4. Estimate the true significance of the F-test (estimate e).

a. Box's (1954) e^ aka Greenhouse-Geisser F

e^ =

where s_ij= is the i,j element in the sample variance/covariance matrix.

s_ii= mean of variances in sample (main diagonal in matrix)

s_..= mean of all variances and covariances in sample

s_i.= mean of all variances and covariances with variable i (row of matrix)

k = number of treatments

n = number of subjects

This procedure tends to underestimate the true e and hence to underestimate the significance of a result, but not by as much as the lower bound approximation of 1/k-1 above.

b. Huynh-Feldt e~

e~ = n(k-1) e^ -2

(k-1)[n-1-(k-1) e^]

In general, e~ tends to overestimate e. In fact, e~ can be larger than 1. When this occurs, e~ is set to 1. e~ will usually be larger than e^ so it will increase the power and the possibility of Type I errors in the analysis.

Treatment

Subject