The Durbin's ANOVA (missing data)

Durbin's analysis of variance of repeated measurements for ranks was proposed by Durbin (1951)1). This test is used when measurements of the variable under study are made several times – a similar situation in which Friedman'sANOVA is used. The original Durbin test and the Friedman test give the same result when we have a complete data set. However, Durbin's test has an advantage – it can also be calculated for an incomplete data set. At the same time, data deficiencies cannot be located arbitrarily, but the data must form a so-called balanced and incomplete block:

where:

$t$ – total number of considered measurements,

$b$ – total number of examined objects.

Basic assumptions:

Hypotheses involve equality of the sum of ranks for successive measurements ($R_{j}$) or are simplified to medians ($\theta_j$):

\begin{array}{cl}
\mathcal{H}_0: & \theta_1=\theta_2=...=\theta_k,\\
\mathcal{H}_1: & $not all $\theta_j$ are equal $(j=1,2,...,k)$,$
\end{array}

Two test statistics of the following form are determined:

\begin{displaymath}
T_1=\frac{(t-1)\left[\sum_{j=1}^tR_j^2-tC\right]}{A-C},
\end{displaymath}

\begin{displaymath}
T_2=\frac{T_1/(t-1)}{(b(k-1)-T_1)/(bk-b-t+1)},
\end{displaymath}

where:

$R_{j}$ – sum of ranks for successive measurements $(j=1,2,...t)$,

$R_{ij}$ – ranks assigned to successive measurements, separately for each of the studied objects $(i=1,2,...b)$,

$\displaystyle A=\sum_{i=1}^b\sum_{j=1}^tR_{ij}^2$ – sum of squared ranks,

$\displaystyle C=\frac{bk(k+1)^2}{4}$ – correction coefficient.

The formula for $T_1$ and $T_2$ statistics includes a correction for tied ranks.

For complete data, the $T_1$ statistic is the same as the Friedman test. It has asymptotically (for large sample sizes) Chi-square distribution with $df=t - 1$ degrees of freedom.

The $T_2$ statistic is the equivalent of Friedman's Iman-Davenport ANOVA adjustment, so it follows Snedecor's F distribution with $df_1=t-1$ i $df_2=bk-b-t+1$ degrees of freedom. It is now considered to be more precise than the $T_1$ statistic and is recommended for use with the $T_1$ statistic2).

The p-value, designated on the basis of the test statistic, is compared with the significance level $\alpha$:

\begin{array}{ccl}
$ if $ p \le \alpha & \Longrightarrow & $ reject $ \mathcal{H}_0 $ and accept $ 	\mathcal{H}_1, \\
$ if $ p > \alpha & \Longrightarrow & $ there is no reason to reject $ \mathcal{H}_0. \\
\end{array}

Testy POST-HOC

Introduction to the contrasts and the POST-HOC tests was performed in the unit, which relates to the one-way analysis of variance.

Conover-Inman test

Used for simple comparisons (the counts in each measurement are always the same).

Hypotheses:

Example - simple comparisons (comparing 2 selected medians / rank sums between each other):

\begin{array}{cc}
\mathcal{H}_0: & \theta_j=\theta_{j+1},\\
\mathcal{H}_1: & \theta_j \neq \theta_{j+1}.
\end{array}

\begin{displaymath}
CD=t_{1-\alpha /2, bk-b-t+1}\sqrt{\frac{(A-C)2r}{bk-b-t+1}\left(1-\frac{T_1}{b(k-1)}\right)},
\end{displaymath}

where:

$t_{1-\alpha /2, bk-b-t+1}$ – is the [wartosc_krytyczna|critical value (statistic) of the t-Student distribution for a given significance level $\alpha$ and $df=bk-b-t+1$ degrees of freedom.

\begin{displaymath}
t=\frac{\sum_{j=1}^k c_jR_j}{\sqrt{\frac{(A-C)2r}{bk-b-t+1}\left(1-\frac{T_1}{b(k-1)}\right)}},
\end{displaymath}

The test statistic has t-Student distribution with $df=bk-b-t+1$ degrees of freedom.

The settings window with the Durbin's ANOVA can be opened in Statistics menu→NonParametric tests Friedman ANOVA, trend test or in ''Wizard''

Note

For records with missing data to be taken into account, you must check the Accept missing data option. Empty cells and cells with non-numeric values are treated as missing data. Only records with more than one numeric value will be analyzed.

EXAMPLE (mirror.pqs file)

An experiment was conducted among 20 patients in a psychiatric hospital (Ogilvie 1965)3). This experiment involved drawing straight lines according to a presented pattern. The pattern represented 5 lines drawn at different angles ($0^o, 22.5^o, 45^o, 67.5^o, 90^o$) relative to the indicated center. The patients' task was to reproduce the lines while having their hand covered. The time at which the patient drew the line was recorded as the result of the experiment. Ideally, each patient would draw a line from all angles, but elapsed time and fatigue would have a significant impact on performance. In addition, it is difficult to keep the patient interested and willing to cooperate for an extended period of time. Therefore, the project was planned and conducted in balanced and incomplete blocks. Each of the 20 patients traced a line at two angles (there were five possible angles). Thus, each angle was drawn eight times. The time at which each patient drew a line at a given angle was recorded in the table.

\begin{tabular}{|c||c|c|c|c|c|}
\hline
patient number &$0^o$&$22.5^o$&$45^o$&$67.5^o$&$90^o$\\\hline
1&7&15&&&\\
2&20&&72&&\\
3&8&&&26&\\
4&33&&&&36\\
5&7&16&&&\\
6&&68&67&&\\
7&&33&&64&\\
8&&34&&&12\\
9&10&&96&&\\
10&&29&59&&\\
11&&&17&9&\\
12&&&100&&15\\
13&16&&&32&\\
14&&19&&32&\\
15&&&36&39&\\
16&&&&44&54\\
17&16&&&&38\\
18&&17&&&12\\
19&&&37&&11\\
20&&&&56&6\\
\hline
\end{tabular}

We want to see if the time taken to draw each line is completely random, or if there are lines that took more or less time to draw.

Hypotheses:

$\begin{array}{cl}
\mathcal{H}_0: & $there is no significant difference between the time  $\\
& $taken by patients to draw each line, $\\
\mathcal{H}_1: & $at least one line is drawn in shorter/longer time.$
\end{array}$

Comparing the p=0.0145 for the $T_2$ statistic (or the p=0.0342 for the $T_1$ statistic) with the $\alpha=0.05$ significance level, we find that the lines are not drawn at the same time. The POST-HOC analysis performed indicates that there is a difference in the time taken to draw the line at angle $0^o$. It is drawn faster than the lines at the angle of $22.5^o$, $45^o$ and $67.5^o$.

The graph shows homogeneous groups indicated by the post-hoc test.

1)
Conover W. J. (1999), Practical nonparametric statistics (3rd ed). John Wiley and Sons, New York
2)
Durbin J. (1951), Incomplete blocks in ranking experiments. British Journal of Statistical Psychology, 4: 85–90
3)
Ogilvie J. C. (1965), Paired comparison models with tests for interaction. Biometrics 21(3): 651-64