![]() |
|
| français - Español |
|
|
NB: Development of a research process is a cyclical process. The double-headed arrows indicate that the process is never linear. Module 11: SAMPLINGOBJECTIVES At the end of the session you should be able to:
I. INTRODUCTIONWHAT is sampling?SAMPLING is the process of selecting a number of study units from a defined study population. Some studies involve only small numbers of people and thus all of them can be included. Often, however, research focuses on such a large population that, for practical reasons, it is only possible to include some of its members in the investigation. We then have to draw a SAMPLE from the total population. In such cases we must consider the following questions:
The study population has to be clearly defined (for example, according to age, sex, and residence.) Otherwise we cannot do the sampling. Apart from persons, a study population may consist of villages, institutions, records, etc. Each study population consists of STUDY UNITS. The way we define our study population and our study unit depends on the problem we want to investigate and on the objectives of the study. For example:
RepresentativenessIf researchers want to draw conclusions which are valid for the whole study population, which requires a quantitative study design, they should take care to draw a sample in such a way that it is representative of that population. A REPRESENTATIVE SAMPLE has all the important characteristics of the population from which it is drawn. For example:
When using qualitative research approaches, however, representativeness of the sample is NOT a primary concern. In exploratory studies which aim at getting a rough impression of how certain variables manifest themselves in a study population or at identifying and exploring thus far unknown variables, you may try to select study units which give you the richest possible information: you go for INFORMATION-RICH cases! For example:
II. SAMPLING METHODSAs the rationale for the use of specific sampling methods in qualitative study designs is very different from the rationale underlying sampling methods in quantitative studies, we will discuss them separately. Purposeful sampling strategies for qualitative studiesQualitative research methods are typically used when focusing on a limited number of informants, whom we select strategically so that their in-depth information will give optimal insight into an issue about which little is known. This is called purposeful sampling. There are several possible strategies from which a researcher can choose. Often different strategies are combined, depending on the topic under study, the type of information wanted and the resources of the investigator(s). This section owes much to Michael Quinn Patton who in his book Qualitative Evaluation and Research Methods (1990: 169-186) discusses a wide range of purposeful sampling techniques. (1) Extreme case sampling We have already discussed this type of sampling several times in Modules 9 and 10. Selection of extreme cases, such as good or very poor compliers to treatment, is a powerful and rapid strategy to identify contributing factors to poor compliance. In the same way, selection of well nourished children of the same age will help to identify contributing factors for malnutrition, and systematic comparison of a poorly and a well functioning district health team will give insight into factors that may contribute to the satisfactory functioning of DHTs. Also ‘thick’ (elaborate) description of single, deviant cases may be useful. In that way AIDS was discovered in California (USA) as a newly emerging disease. (2) Maximum variation sampling If a researcher wants to obtain as complete as possible insight in a certain issue in all its variations, maximum variation sampling will be used.
Note: Purposeful sampling should not be haphazard. Care should be taken that for different categories of informants, selection rules are developed to prevent the researcher from sampling according to personal preference. In the study on stigma it would, for example, be possible to select the categories of patients one would like to focus on (male/ female, on treatment and released from treatment) in a systematic way from patient registers in a number of selected clinics. One could e.g., take all, or every second or third patient in the desired categories. The clinics may have been selected purposefully (town/rural, specific ethnic areas, poor/rich areas) but if there is choice the researcher may make lists for each category and select at random (see next section) the desired number of clinics. For identifying social distance variation, selection rules will be much more pragmatic: spouses, relatives and neighbours who are available will be interviewed in a careful, indirect way in order not to hurt the patient’s interests. Still, however interesting data maximum variation sampling may generate, highlighting different factors and different perspectives, it does not provide representative data for the total population. (3) Homogeneous sampling Sometimes a researcher would like to have specific information about one particular group only, for example, a group that, for unclear reasons, is more at risk than others:
In focus group discussions (FGDs), we usually select homogeneous groups because participants discuss more freely when they are amongst people of similar social status. (See Module10C.) (4) Typical case sampling It is sometimes illustrative to describe in-depth some cases which are ‘typical’ for the group one is interested in. For example, one may describe a ‘typical’ family in a rural village in country A, or a ‘typical’ young school leaver who migrates from the rural areas to town in search of work, or ‘typical’ health problems of miners or malnourished children. Such descriptions are merely illustrative; they cannot be generalised for the whole group. Typical examples can either be selected with co-operation of key informants who know the study population well, or from a survey that helps to identify the normal distribution and the modus of the characteristics we are interested in. (5) Critical case sampling Critical cases are those who ‘can make the difference’ with respect to an intervention you want to introduce or to evaluate.
(6) Snowball or chain sampling This approach is particularly suitable for locating key informants or critical cases. You start with one or two information-rich key informants and ask them if they know persons who know a lot about your topic of interest. If a particular person is recommended to you by two or three different people you can be quite sure that he or she will be a valuable key informant. The same approach can be used if an in-depth interview leads to discoveries, which seem rewarding to follow-up by a number of interviews with an additional group of informants.
Patton (1990: 179) labels this kind of additional sampling during the study opportunistic sampling. Flexible sampling procedures, steered by the data one collects (in relation to the objectives) forms a major opportunity for qualitative researchers to optimally exploit the field situation and explore ‘in-depth’ interesting issues which present themselves. It is exactly the opposite of the random sampling techniques discussed in the next section of this module, which are used in quantitative research to ensure representativeness of the sample for the total population. Still, if qualitative researchers can choose from a group of seemingly similar informants they will also sample at random (see example 2). Note: Purposeful sampling is NOT the same as convenience sampling.* CONVENIENCE SAMPLING is a method in which for convenience sake the study units that happen to be available at the time of data collection are selected in the sample. This may happen at the beginning of a study when researchers are merely orienting themselves, or, when there are many similar informants and the researchers do not (yet) have a preference for specific categories. When there seems no other choice (no one else available for an interview) researchers may also sample conveniently. * In the earlier version of this module, following epidemiological tradition, all ‘non-random’ sampling methods were categorised under the headings of ‘convenience’ or ‘quota’ sampling. This injustice towards purposeful sampling techniques has been corrected in the present version. In HSR, purposeful and random sampling techniques are used equally. 2. Random sampling strategies to collect quantitative dataIf the aim of a study is to measure variables distributed in a population (e.g., diseases) or to test hypotheses about which factors are contributing significantly to a certain problem, we have to be sure that we can generalise the findings obtained from a sample to the total study population. Then, purposeful sampling methods are inadequate, and probability- or random sampling methods have to be used. PROBABILITY SAMPLING involves using random selection procedures to ensure that each unit of the sample is chosen on the basis of chance. All units of the study population should have an equal, or at least a known chance of being included in the sample. Probability sampling requires that a listing of all study units exists or can be compiled. This listing is called the sampling frame. The following probability sampling methods will be discussed:
(1) Simple random sampling This is the simplest form of probability sampling. To select a simple random sample you need to:
For example, a simple random sample of 50 students is to be selected from a school of 250 students. Using a list of all 250 students, each student is given a number (1 to 250), and these numbers are written on small pieces of paper. All the 250 papers are put in a box, after which the box is shaken vigorously, to ensure randomisation. Then, 50 papers are taken out of the box, and the numbers are recorded. The students belonging to these numbers will constitute the sample. (2) Systematic sampling In SYSTEMATIC SAMPLING individuals are chosen at regular intervals (for example every fifth) from the sampling frame. Ideally we randomly select a number to tell us where to start selecting individuals from the list.
Systematic sampling is usually less time consuming and easier to perform than simple random sampling. However, there is a risk of bias, as the sampling interval may coincide with a systematic variation in the sampling frame. For instance, if we want to select a random sample of days on which to count clinic attendance, systematic sampling with a sampling interval of 7 days would be inappropriate, as all study days would fall on the same day of the week (e.g., Tuesdays only, which might be a market day). (3) Stratified sampling The simple random sampling method described above has as disadvantage that small groups in which the researcher is interested may hardly appear in the sample. If it is important that the sample includes representative study units of small groups with specific characteristics (for example, residents from urban and rural areas, or different religious or ethnic groups), then the sampling frame must be divided into groups, or STRATA, according to these characteristics. Random or systematic samples of a pre-determined size will then have to be obtained from each group (stratum). This is called STRATIFIED SAMPLING. Stratified sampling is only possible when we know what proportion of the study population belongs to each group we are interested in. An advantage of stratified sampling is that we can take a relatively large sample from a small group in our study population. This allows us to get a sample that is big enough to enable us to draw valid conclusions about a relatively small group without having to collect an unnecessarily large (and hence expensive) sample of the other, larger groups. However, in doing so, we are using unequal sampling fractions and it is important to correct for this when generalising our findings to the whole study population.
(4) Cluster sampling It may be difficult or impossible to take a simple random sample of the units of the study population at random, because a complete sampling frame does not exist. Logistical difficulties may also discourage random sampling techniques (e.g., interviewing people who are scattered over a large area may be too time-consuming). However, when a list of groupings of study units is available (e.g., villages or schools) or can be easily compiled, a number of these groupings can be randomly selected. The selection of groups of study units (clusters) instead of the selection of study units individually is called CLUSTER SAMPLING. Clusters are often geographic units (e.g., districts, villages) or organisational units (e.g., clinics, training groups).
(5) Multi-stage sampling In very large and diverse populations sampling may be done in two or more stages. This is often the case in community-based studies, in which people are to be interviewed from different villages, and the villages have to be chosen from different areas. This type of sampling is frequently used in HSR. For example, in a study of utilisation of pit latrines in a district 150 homesteads are to be visited for interviews with family members as well as for observations on types and cleanliness of latrines. The district is composed of 6 wards and each ward has between 6 and 9 villages. The following four-stage sampling procedure could be performed*:
* This is an adaptation of the method developed by the EPI division in WHO Geneva to measure EPI coverage in districts. A MULTI-STAGE SAMPLING procedure is carried out in phases and it usually involves more than one sampling method. The main advantages of cluster and multi-stage sampling are that:
The main disadvantage of this type of sampling is that:
3. Bias in samplingBIAS in sampling is a systematic error in sampling procedures, which leads to a distortion in the results of the study. Module 10 discussed how the use of faulty data collection tools would lead to biased results. Bias can also be introduced as a consequence of improper sampling procedures, which result in the sample not being representative of the study population.
There are several possible sources of bias that may arise when sampling. The most well known source is non-response. Non-response can occur in any interview situation, but it is mostly encountered in large-scale surveys with self-administered questionnaires. Respondents may refuse or forget to fill in the questionnaire. The problem lies in the fact that non-respondents in a sample may exhibit characteristics that differ systematically from the characteristics of respondents. There are several ways to deal with this problem and reduce the possibility of bias:
Note: The bigger the non-response rate, the more necessary it becomes to take remedial action. It is important in any study to mention the non-response rate and to honestly discuss whether and how the non-response might have influenced the results. Other sources of bias in sampling may be less obvious, but at least as serious:
4. Ethical considerationsIf the recommendations from a study will be implemented in the entire study population, one has the ethical obligation to draw a sample from this population in a representative way. If during the research new evidence suggests that the sample was not representative, this should be mentioned in any publication concerning the study, and care must be taken not to draw conclusions or make recommendations that are not justified. GROUP WORK, PART I (2 hours)
III. SAMPLE SIZEHaving decided how to select our sample, we now have to determine our sample size. 1. Sample size in qualitative studiesThere are no fixed rules for sample size in qualitative research. The size of the sample depends on WHAT you try to find out, and from what different informants or perspectives you try to find that out.
In exploratory studies, the sample size is therefore estimated beforehand as precisely as possible, but not determined. Patton (1990: 183-186) stresses that richness of the data and analytical capability of the researcher determine the validity and meaningfulness of qualitative data more than sample size. Still, sampling procedures and sample size should always be carefully explained in order to avoid the allusion of haphazardness. Careful analysis of different complementing data sets can result in some plausible generalisations but without ‘proving’ them in a mathematical sense. * Lincoln and Guba, in Patton (1990: 185) 2. Sample size in quantitative studiesFor quantitative studies, calculations can be made which indicate the desirable sample size. The principles of such calculations will be discussed below. It is a widespread belief among researchers that the bigger the sample, the better the study becomes. This is not necessarily true. In general it is much better to increase the accuracy and richness of data collection (for example by improving the training of interviewers or by better pre-testing of the data collection tools) than to increase sample size after a certain point. Also, it is better to make extra efforts to get a representative sample rather than to get a very large sample. The following general rules may help to determine the desirable sample size of any given study:
Therefore the eventual sample size is usually a compromise between what is DESIRABLE and what is FEASIBLE. Sample size calculationsIn quantitative studies, researchers will perform sample size calculations before embarking on the project to find the desirable sample size. The formulae for calculating a desired sample size are listed in Annex 11.2. They are divided into two categories, depending on whether the study:
The formulae can only be used if you have a rough idea about the outcome of the study, which is not always the case. It is always advisable to call upon a statistician or an experienced researcher who can help you in choosing and using the appropriate formulae. We will look at a few examples to highlight some important issues. (1) Descriptive studies with one group
However, at present it is usually not necessary to calculate the desirable sample size by hand. There are computer programmes to assist us (for example Epi Info). In Annex 11.3 two tables are presented, which will help you to calculate the desired sample size in the most common quantitative HSR studies. Both take a confidence level of 95% (P<0.05) as point of departure. In the measles example, you have to go down the first column till you find the 80% for the estimated vaccination coverage. If you allow a margin of error of 10 + or – the estimated 80%, you look in that column and find a desired sample size of 64. However, if you would allow a margin of error of only + or - 5%, the sample size would increase to 256, and if you would like to be 95% sure that the measles vaccination coverage is between 79 and 81%, you would need to check 6400 children. The smaller your margin of error (also called confidence interval) the bigger your sample size needs to be. If you would like to increase your confidence level from 95 to 99%, the sample would have to increase even more. But then researchers of course start to look at feasibility: do we really need that precision? Most HSR studies will be satisfied with the 95% confidence level. Note also that in general you need more precision (or a smaller margin of error) if the estimated proportion is very small. This may be the case, for example, for the proportion of HIV* women or the maternal mortality rate in a population. Table 11.1: Required sample size for studies of HIV prevalence in pregnant women
The table shows that in district A, where HIV is less prevalent, a smaller margin of error is desired and, therefore, the required sample is larger. (Annex 11.4 explains how these sample sizes were calculated). Table (a) in Annex 11.3 will, however, help you again to identify the required sample sizes in a simple way. The first line of table (a) gives you the required sample sizes for an estimated prevalence of 1%. You would need a sample of 1584 women to be 95% sure that HIV prevalence in District A would be between 0.5 and 1.5% (so 1% plus or minus 0.5%). For district B, with 10% estimated HIV prevalence, and a margin of error plus or minus 5%, the required sample size would be 144. (2) Comparing two groups for a significant difference In comparative studies one usually wants to demonstrate that there is a significant difference between two groups. In this type of study the sample size depends primarily on the estimated size of the difference between the two groups that are compared. The larger the difference, the smaller will be the sample that is needed to show this difference. Second, the sample size depends on how large we want the probability to be that we indeed will find a significant difference. The larger the sample size, the larger will be the probability of finding a significant difference. In the case of many variables, the one with the smallest estimated difference between the groups should be used as the basis for calculating the sample size, as it requires the largest sample. Still, researchers will discuss whether that variable is important enough to maintain if the sample would become too big to handle. They may also decide to measure only some variables in a large sample while measuring variables that require a smaller sample in a sub-sample to save costs and ensure the feasibility of the study.
Table (b) in Annex 11.3 will again help you to identify the required sample size in a simple way. Compare 90% in the column with 50% in the row (or vice versa) and you will find a required sample size of 22. In this case a slightly different formula was used as the one presented in presented in Annex 11.4. Also the calculated sample size for the second example would be bigger (262) when using the table. Note that it may be useful to conduct sample size calculations for each of the objectives of the study. These calculations may reveal, for instance, that some but not all objectives can be met. Or they may indicate that some variables need only to be measured on a sub-sample. GROUP WORK, PART II (1 hour)
EXERCISE (½ hour)
REFERENCESAll epidemiological and social science research handbooks mentioned in the references in Module 9 are dealing with sampling procedures and sample size. Moreover you can consult: Swinscow TDV, Revised by MJ Campbell (1998) Statistics at Square One. London: BMJ Publishing Group. (9th ed.) Campbell MJ, Machin D (1993) Medical Statistics: A Common Sense Approach. Clichester: John Wiley. (2nd ed.) Annex 11.1: How to use a random number table*
*The random number table on the following page has been taken from Hill AB (1977) A Short Textbook of Medical Statistics. London: Hodder and Stoughton, 1977:306-7. Random sampling numbers
Annex 11.2: Formulae for calculating sample size*The formulae for calculating required sample size are divided in two categories:
1. Measuring one variableIn the formulae below the following abbreviations are used:
1.1 Single mean In a study the mean weight of newborn babies will be determined. The mean weight is expected to be 3000 grams. Weights are approximately normally distributed and 95% of the birth weights are probably between 2000 and 4000 grams; therefore the standard deviation would be 500 grams. The desired 95% confidence interval is 2950 to 3050 grams, so the standard error would be 25 grams. The required sample size would be:
1.2 Single rate The maternal mortality rate in a country is expected to be 70 per 10,000 live births. A survey is planned to determine the maternal mortality rate with a 95% confidence interval of 60 to 80 per 10,000 live births. The standard error would therefore be 5/10,000. The required sample size would be:
1.3 Single proportion The proportion of nurses leaving the health services within three years of graduation is estimated to be 30%. A study that aims to find causes for this, also aims to determine the percentage leaving the service with a confidence interval of 25% to 35%. The standard error would therefore be 2.5%. The required sample size would be:
* Modified from Kirkwood B (1988) Essentials of Medical Statistics. Oxford: Blackwell Scientific Publications, 1988. 1.4 Difference between two means (sample size in each group) The difference of the mean birth weights in district A and B will be determined. In district A the mean is expected to be 3000 grams with a standard deviation of 500 grams (as in 1.1). In district B the mean is expected to be 3200 grams with a standard deviation of 500 grams. The difference in mean birth weight between districts A and B is therefore expected to be 200 grams. The desired 95% confidence interval of this difference is 100 to 300 grams, giving a standard error of the difference of 50 grams. The required sample size would be:
1.5 Difference between two rates (sample size in each group) The difference in maternal mortality rates between urban and rural areas will be determined. In the rural areas the maternal mortality rate is expected to be 100 per 10,000 and in the urban areas 50 per 10,000 live births. The difference is therefore 50 per 10,000 live births. The desired 95% confidence interval is 30 to 70 per 10,000 live births giving a standard error of the difference of 10/10,000. The required sample size would be:
1.6 Difference between two proportions (sample size in each group) The difference in the proportion of nurses leaving the service is determined between two regions. In one region 30% of the nurses are estimated to leave the service within three years of graduation, in the other region 15%, giving a difference of 15%. The desired 95% confidence interval for this difference is 5% to 25%, giving a standard error of 5%. The sample size in each group would be:
2. Significant difference between two groupsIn the formulae below the following abbreviations are used:
2.1 Comparison of two means (sample size in each group) The birth weights in district A and B will be compared. In district A the mean birth weight is expected to be 3000 grams with a standard deviation of 500 grams. In district B the mean is expected to be 3200 grams with a standard deviation of 500 grams (see 1.4). The required sample size to demonstrate (with a likelihood of 90%) a significant difference between the mean birth weights in district A and B would be:
2.2 Comparison of two rates (sample size in each group) The maternal mortality rates in urban and rural areas will be compared. In the rural areas the maternal mortality rate is expected to be 100 per 10,000 and in the urban areas 50 per 10,000 live births (compare to 1.5). The required sample size to show (with a likelihood of 90%) a significant difference between the maternal mortality in the urban and rural areas would be:
2.3 Comparison of two proportions (sample size in each group) The proportion of nurses leaving the health service is compared between two regions. In one region 30% of nurses is estimated to leave the service within three years of graduation, in the other region it is probably 15%. The required sample size to show with a 90% likelihood that the percentage of nurses is different in these two regions would be:
(a) Sample size for measuring proportions in one group
The percentages in the column heading are 2 x standard error (e in formula) (b) Sample size for comparison of proportions in two groups
u = 1.28 Power = 90% v = 1.96 Significance: P<0.05 (u+v)2= 10.5 Annex 11.4: Explanation of sample size calculations given in the text1. Prevalence of HIV (p.13)
2. Feeding patterns in malnourished and well-nourished children (p.14)
If the power is 75%, u = 0.67 and (u + v)2 = 6.9; If the power is increased from 75% to 90%, the sample size is increased 10.5/6.9 (i.e. 1.5 times.) Trainer’s Notes Module 11: SAMPLINGTiming and teaching methods The topic on sampling has two major components (sampling procedures and sample size), which preferably should be presented in two separate sessions. These sessions will require 6½ hours in total. Materials
Introduction to Sampling Procedures (Part II of Module 11) Timing and teaching methods
Introduction and discussion
Group work, Part I
Introduction to Sample Size (Part III of Module 11) Timing and teaching methods
Introduction and discussion
Group work, Part II
Exercise
Plenary
|
||||||||||||||||||||||||
| guest (Read)(Ottawa) Login | Home|Careers|Copyright and Terms of Use|General Infomation|Contact Us|Low bandwidth |