![]() |
|
| English - Español |
|
|
* These steps need not be in the sequence in this diagram. The sequence may be adjusted according to the needs of the research teams. ** These elements are optional and may be omitted if not relevant for research teams Module 30: DETERMINING DIFFERENCES BETWEEN GROUPS, PART II: |
|||||||||||||||||||||||||||||||||||||||||||||||||||
| t = | mean of the differences standard error |
The degrees of freedom is the number of paired observations (= sample size) minus 1.
To interpret result of the study the same table of t-values is used as for the t-test for unpaired observations (see Annex 29.1).
To illustrate how the paired t-test is used, it will be performed on the results of the nutritional survey referred to in Example 2 above. The results are:
Table 30.1: Results of quality control exercise during a nutritional survey

The null hypothesis in this study is that if observers A and B measured all the children in the population from which these 20 children were sampled, there would, on average, be no difference between their measurements. In other words, the mean difference between A and B would be zero.
We can regard this set of 20 differences (the A - B column) as a sample of all differences that would have been obtained if the observers had measured the whole population.
To perform the significance test, the value of t has to be calculated and compared to the t-table value to determine if there is a statistically significant difference between the two. This indicates the probability that the results of the study occurred by chance.
The significance test is done as follows:
1. Calculate the mean difference of the measurements between A and B in the sample. This is the sum of the differences divided by the number of measurements:
| Mean difference = | 21.1 20 | = 1.05 |
2. Calculate the standard deviation of the differences (Module 27):
Standard deviation = 1.77
Calculate the standard error (Module 27):
| Standard error = | | ||
3. The value of t is the mean difference divided by the standard error:
| t = | 1.05 0.40 | = 2.62 |
4. Refer to the table of t-values in Annex 29.1.
The degrees of freedom is the sample size (the number of pairs of observations) minus 1 which in this case is 20 – 1 = 19.
The probability from the table is < 0.05, which allows us to conclude that there is a significant difference between the observers. A and B definitely need more training and supervision on their measuring skills as the test does not show whether one or both have been inaccurate in their measurements.
The McNemar’s chi-square test is used with NOMINAL data to compare PROPORTIONS of paired observations. It is important to note that the layout of the table is different from that used with unpaired samples.
Table 30.2 shows the results of a case-control study that was conducted to determine causes of a cholera outbreak in Bombay. For each cholera case confirmed in the hospital, a subject was sought of the same sex, the same age decade and the same neighbourhood.
Table 30.2: Source of drinking water by cholera patient/control pairs in the 5 days preceding illness (incorrect layout).

However, the layout of Table 30.2 is not correct, as it does not take account of the fact that cases and controls were selected as pairs.
The correct layout is presented in Table 30.3.
Table 30.3: Source of drinking water by cholera patient/control pairs in the 5 days preceding illness (correct layout)

Adapted from Baine & Mazotti et al. (1973).
How should we interpret Table 30.3?
In 12 pairs both cases and controls drew water from the shallow well and 31 pairs drew water from the tap. These 43 pairs therefore give us no information whether drawing water from shallow wells is a risk factor for getting cholera or not. However, in 30 pairs (39%) the cases draw water from shallow wells, while the controls drew water from the tap, whereas in only 3 pairs (4%) the controls draw water from the shallow wells while the cases drew water from the tap. It would seem therefore that drawing water from shallow wells was a risk factor for getting cholera.
Before we accept that conclusion, we must perform a significance test to estimate the likelihood that these results are due to chance or sampling variation only. In this case the appropriate significance test is McNemar’s chi-square test (see Table 28.1):
| ?2 = | (|r – s| – 1)2 r + s | with 1 degree of freedom |
| Where: | r = | the number pairs where a control drew water from the tap and the case from the shallow well, |
| s = | the number pairs where a control drew water from the shallow well and the case from the tap, and | |
| |r – s| | means the difference between r and s as a positive number, irrespective of whether s is larger than r. |
To check for statistical significance we use ordinary chi-square tables (Annex 29.2).
Note:
McNemar’s ?2 test is only valid if (r + s) is larger than 10.
The test can be performed on the data in our example because r + s (30 + 3) is larger than 10.
The calculation of the chi-square value is as follows:
| ?2 = | (30 – 3 – 1)2 30 + 3 | = | 262 33 | = 20.5 with 1 degree of freedom | ||
Using an a-level of 0.01, the ?2 table value is equal to 6.63 (Annex 29.2). We can see that the calculated ?2 value of 20.5 is larger than the table value. This means that the p-value is less than 0.01. We therefore reject the null hypothesis and conclude that drawing water from the shallow wells was a risk factor for getting cholera.
GROUP WORK
If your data were collected by paired or matched observations, identify the appropriate statistical test and make the necessary calculations and analysis.
See epidemiological and statistics textbooks referred to in Modules 9 and 28.
Baine WB, Mazotti M, Greco D et al. (1974) Epidemiology of cholera in Italy in 1973. Lancet ii (Dec.): 1370–1374.
Trainer’s Notes
Timing and teaching methods
| 1 hour | Introduction and discussion |
| 2 hours | Group work |
| guest (Lire)heure de l'Est (É.-U. et Canada) Login | Accueil|Carrières|Droits d'auteurs et usage|Informations générales|Nous rejoindre|Basse vitesse |