| United States General Accounting Office |
| GAO | Program Evaluation and Methodology Division |
| June 1991 | Using Structured Interviewing Techniques |
Preface
GAO assists congressional decisionmakers in their deliberative process by furnishing analytical information on issues and options under consideration. Many diverse methodologies are needed to develop sound and timely answers to the questions that are posed by the Congress. To provide GAO evaluators with basic information about the more commonly used methodologies, GAO's policy guidance includes documents such as methodology transfer papers and technical guidelines.
This methodology transfer paper on using structured interviewing techniques discusses how GAO evaluators should incorporate structured interview techniques when appropriate to performing our work. It explains when these techniques should be used and what steps should be followed. Overall, it describes techniques for designing a structured interview, for pretesting, for training interviewers, and for conducting the interviews. The original report was authored by Erwin W. Bedarf in July 1985. This reissued version, prepared by Kenneth Litkowski, supersedes the version published in 1985.
Using Structured Interviewing Techniques is one of a series of papers issued by
the Program Evaluation and Methodology Division (PEMD). The purpose of the series is to
provide GAO evaluators with guides to various aspects of audit and evaluation methodology,
to illustrate applications, and to indicate where more detailed information is available.
We look forward to receiving comments from the readers of this paper. They should be
addressed to Eleanor Chelimsky at 202-275-1854.
Werner Grosshans
Assistant Comptroller General
Office of Policy
Eleanor Chelimsky
Assistant Comptroller General for Program Evaluation and Methodology
Contents
Preface
Chapter 1
The Role of Structured Interviews in GAO Evaluations
Chapter 2
What Is a Structured Interview and When Should It Be Used?
Chapter 3
Designing a Structured Interview
Identifying Variables and Developing Questions
Composing Appropriate Questions
Selecting a Question Format
Organizing Questions
Layout Considerations
Chapter 4
More on Interview Design: Avoiding Problems
Appropriateness of the Language
Level of the Language
Use of Qualifying Language
Clarity of Language
Bias Within Questions
Considerations for Telephone Interviewing Instruments
Chapter 5
Pretesting and Expert Review
Purpose of Pretest
Pretest Procedures
Purpose of Expert Review
Instrument Redesign
Chapter 6
Training Interviewers
Training Methods
Interviewer Qualifications
Chapter 7
Selecting and Contacting Interviewees
Selection of Interviewees
Contacting Potential Interviewees
Interview Arrangements
Protecting the Interviewee
Chapter 8
Conducting Interviews
Developing Rapport and Showing Interest
Giving the Interviewee a Reason to Participate
Helping the Interviewee to Be Responsive
Asking Questions in a Prescribed Order and Manner
Ensuring Understanding
Ensuring Nonbias
Obtaining Sufficient Answers
Showing Sensitivity to Interviewee Burden
Chapter 9
Analyzing the Data
Nonrespondent Problem
Data Analysis
Analysis of Open-Ended Questions
Chapter 10
The Role of Evaluators and Specialists on Each Task
Bibliography
Glossary
Papers in This Series
Tables
Table 1.1: Evaluation Questions and Strategies
Table 2.1: Comparison of Data-Collection Techniques
Table 3.1: Identifying, Developing, and Selecting Questions
Table 10.1 Functions and Responsibilities of Evaluators and Specialists
Figures
Figure 3.1: Question 1
Figure 3.2: Questions 2, 3, and 4
Figure 3.3: Question 5
Figure 3.4: Question 6
Figure 3.5: Question 7
Figure 3.6: Question 8
Figure 3.7: Question 9
Figure 3.8: Question 10
Figure 3.9: Question 11
Figure 3.10: Structured Interview Text
Figure 4.1: Question 1
Figure 4.2: Question 2
Figure 4.3: Question 3
Figure 4.4: Question 4
Figure 4.5: Question 5
Figure 7.1: Interviewee Contact
Procedures Example
Figure 7.2: Example of Telephone Contact With Potential Interviewee
Figure 7.3: Interviewee Contact Log
Figure 7.4: Interviewee Contact Log Filled In
Abbreviations
CATI Computer-assisted telephone interviewing
DCI Data-collection instrument
GAO General Accounting Office
PEMD Program Evaluation and Methodology Division
QPL Questionnaire Programming Language
QUEST A GAO computer program for formatting interviews and questionnaires
Chapter 1
The Role of Structured Interviews in GAO Evaluations
A major responsibility of the General Accounting Office (GAO) is to audit and evaluate the programs, activities, and financial operations of federal departments and agencies and to make recommendations toward more efficient and effective operations.
The broad questions that dictate the objectives of a GAO evaluation and that suggest the evaluation strategy can be categorized as descriptive, normative, or impact (cause-and-effect).1 A descriptive evaluation, as the name implies, provides descriptive information about specific conditions of a program or activity, while a normative evaluation compares an observed outcome to an expected level of performance. An impact (cause-and-effect) evaluation aims to determine whether observed conditions, events, or outcomes can be attributed to the operation of the program or activity. According to the type of evaluation questions to be answered, different evaluation strategies are used, as shown in table 1.1.
Table 1.1: Evaluation Questions and Strategies
| Type of question | Strategy |
| Descriptive | Sample survey Case study Available data |
| Normative | Sample survey Case study Available data |
| Impact (cause-and-effect) | Field experiment Available data |
| 1 We use the term "evaluation" throughout this paper; however, many of the interviewing concepts and procedures apply equally to GAO audits. The categories of questions are discussed fully in the methodology transfer paper entitled Designing Evaluations. See the bibliography at the end of this paper. |
In a sample survey, data are collected from a sample of a population to determine
the incidence, distribution, and interrelationship of events and conditions. The case
study is an analytic description of an event, process, institution, or program based
on either a single case or multiple cases. The field experiment compares outcomes
associated with program operations with estimates of what the outcomes would have been in
the absence of the program. Available data refers to previous studies or data bases
previously established and currently available.
The design of a GAO evaluation encompasses seven elements:
This paper focuses on the fourth design element- specifically, structured interviews.
Like self-administered questionnaires, structured interviews are often used when the
evaluation strategy calls for a sample survey. Structured interviews can also be used,
however, in field experiments where information must be obtained from program participants
or members of a comparison group. Similarly, when essentially the same information must be
obtained from numerous people for a multiple case-study evaluation or a single case-study
evaluation, it may be beneficial to use structured interviews.
Structured interviews (and other forms of structured data collection, such as the
self-administered questionnaire) are often used in conjunction with a design that employs
statistical sampling. This combination provides data that can be used to make projections
about the entire population from which the sample was drawn. We discuss sampling
methodology and generalization in depth in the methodology transfer paper entitled Using
Statistical Sampling.
It should be noted, however, that the steps in the evaluation design process-defining
the questions that dictate the objectives of the study, selecting the method of collecting
the information, and preparing an analysis plan for using the collected information to
answer the questions-are interrelated and iterative. If, for example, a
structured interview is used to collect information to answer an evaluation question, the
question will determine the contents or subject matter of the interview form. Any
constraints in identifying and selecting a sample (for example, the lack of a universe
listing of the target population) may make it necessary to refine the original evaluation
question. Many more examples could be given to demonstrate the iterative nature of this
process. The point to remember is that the use of structured interviewing to collect
information is not an isolated process and cannot be thought of as a sequential task
unrelated to or independent of other tasks in the process of answering an evaluation
question.
Chapter 2
What Is a Structured Interview and When Should It Be Used?
For years, GAO evaluators have collected data through various techniques such as reviewing records and interviewing government and contractor officials, employees, and program participants. Increasingly since 1972, we have used what have come to be called data-collection instruments (DCIs) on assignments that require the same or uniform information on numerous cases. A DCI is a document containing questions presented in a systematic, highly precise fashion; its purpose is to enable the evaluator to obtain uniform data that can be compared, summed, and, if it is quantitative, subjected to additional statistical analysis. The form of a DCI varies according to whether it is to be used in a structured interview, as a self-administered questionnaire (either mailed to individuals or organizations or completed by individuals in a group setting), or as a pro forma schedule to obtain information from records.
An interview that uses a DCI to gather data, either by telephone or face to face, is a structured interview, one in which evaluators ask the same questions of numerous individuals or individuals representing numerous organizations in a precise manner, offering each interviewee the same set of possible responses. In contrast, an unstructured interview contains many open-ended questions, which are not asked in a structured, precise manner. Different evaluators interpret questions and often offer different explanations when respondents ask for clarification.
Given the need to collect uniform data from numerous persons or organizations, when
should the evaluator use a structured interview rather than a mail questionnaire or a
questionnaire administered in a group setting? There is no hard-and-fast answer. We
discuss some of the advantages and disadvantages of interviews and questionnaires in the
following paragraphs. In addition, the characteristics of various data-collection
techniques are systematically compared in table 2.1.
Table 2.1: Comparison of Data-Collection Technique
Extent of advantage
| Structured Interview | Questionnaire | ||||
| Characteristic or advantage | By telephone | Face to face | By mail | Group | Audit of records |
| Methodology | |||||
| Allows use of probes | 3 | 5 | 1 | 2 | na |
| Controls bias of collector | 3 | 2 | 5 | 4 | 5 |
| Can overcome unexpected events in data collections | 4 | 5 | 2 | 3 | 4 |
| Facilitates feedback about instrument or collection procedures | 4 | 5 | 2 | 5 | 2 |
| Allows oral and visual inquiry | 1 | 5 | 2 | 5 | na |
| Allows oral and visual response | 1 | 5 | 2 | 2 | 2 |
| Evaluator can control collection procedures | 3 | 5 | 1 | 4 | 5 |
| Facilitates interchange with source | 4 | 5 | 2 | 5 | na |
| What contents allow | |||||
| Inclusion of most relevant variables | 3 | 5 | 4 | 4 | 3 |
| Complex subject matter to be presented or derived | 3 | 5 | 3 | 4 | 4 |
| Collection of real-time data | 5 | 5 | 4 | 5 | 3 |
| Acquisition of historical data | 4 | 4 | 4 | 4 | 5 |
| Universe or sample | |||||
| Relevant universe to be sampled can be identified | 4 | 5 | 4 | 5 | 4 |
| Facilitates contacting and getting sample | 3 | 2 | 4 | 4 | 5 |
| Allows use with large sample | 4 | 3 | 5 | 4 | 5 |
| Allows identity of source to be known | 4 | 5 | 3 | 5 | 3 |
| Reduces problems from respondent's illiteracy | 4 | 5 | 1 | 3 | na |
| What time, cost, and resources minimize | |||||
| Instrument-development time | 2 | 3 | 1 | 1 | 5 |
| Instrument-development cost | 3 | 1 | 1 | 1 | 5 |
| Number of field staff | 5 | ? | 5 | ? | ? |
| Travel by staff | 5 | ? | 5 | ? | ? |
| Staff training | 2 | 1 | 5 | 3 | 5 |
| Time required to carry out activities | ? | ? | 3 | ? | ? |
| Overall cost | 3 | 1 | 5 | 4 | 1 |
| Results, response, and quality of data | |||||
| Maximize rate of return of data after source is contacted | 4 | 5 | 3 | 5 | na |
| Minimize multiple contacts of sources | 2 | 2 | 3 | 4 | na |
| Minimize follow-up after initial response | 5 | 5 | 3 | 4 | 5 |
| Increase chance source will be accurate | 4 | 4 | 4 | 4 | 3 |
| Allow reliability to be checked | 5 | 5 | 3 | 4 | 4 |
| Allow validity to be checked | 4 | 4 | 2 | 4 | 5 |
| Facilitate recall of data by source | 4 | 5 | 3 | 4 | na |
a Key:
In the job design phase of an evaluation or in a one-of-a-kind interview during the data collection and analysis phase of an evaluation, the less-structured, less-guided type of interview may be more useful.
Face-to-face interviews and telephone interviews are generally more successful with respondents whose reading levels are low in comparison with the complexity of the questions. In this radio and television age, some respondent groups understand spoken words and sentences better than written ones.
The telephone interview and, even more, the face-to-face interview enable the interviewer to establish rapport with the respondents. Individuals who would ignore mail questionnaires entirely or who would not answer certain questions on them can be persuaded to provide truthful answers in a telephone or face-to-face interview. Also, a well-trained interviewer can recognize when a respondent is having a problem understanding or interpreting a question and can employ the proper techniques to assist the interviewee without jeopardizing the integrity of the interview.
In comparison to the telephone interview, the face-to-face interview gives the interviewer the opportunity to observe as well as listen. For example, if it is required or desired that the interviewee's living arrangements be noted, the face-to-face interview would be the choice. Also, more complex questions can be asked in a face-to-face interview than in a telephone interview. Respondents can be shown cards with the complete set of possible responses, making it easier for them to remember and consider all the choices. In addition, more questions can be asked. Twenty to 30 minutes is the usual limit for telephone interviews, while face-to-face interviews can last up to an hour.
Computer-assisted telephone interviewing (CATI) is one form of telephone interviewing. In CATI, the questionnaire or DCI is stored in a computer, questions are displayed on the computer screen during the interview, and the interviewer directly enters the responses into the computer. Telephone interview costs generally fall somewhere between the lower mail survey costs and the higher personal interviewing costs. Also, depending on the size of the sample, the number of interviewers available, the number of questions, and question complexity, telephone surveys can be completed quickly.
In comparison with mail questionnaires, face-to-face and telephone interviews are much faster methods of gathering data. The need to train interviewers and their time spent traveling and contacting and interviewing respondents, however, make the face-to-face interview much more expensive than telephone interviews or mail or group questionnaires. Both forms of questionnaire can be longer and can include more complex questions (if the respondent group is one that reads well) than is possible with the telephone interview.
To administer a questionnaire in a group setting requires that it be practical to assemble the respondents. Thus, it is normally used in situations in which the sample is an entire group or a large portion of it, such as an Army company or battalion or all or many agency employees in one location. Group questionnaires are faster than mail questionnaires and permit some clarification of questions (but not to the same extent as interviews). As with mail queries, however, the language complexity used in group questionnaires must be commensurate with the reading level of the respondents.
In the past, GAO has used structured, face-to-face interviews to study such topics as
We used face-to-face interviews in the first two cases because the respondent groups were not ones that tend to respond in large numbers to mail questionnaires, the subject matter was complex in relationship to their reading levels, and the interviews were too long to be done by telephone. In the Drug Enforcement Agency evaluation, the face-to-face interview was used because time did not permit a mail survey, the interview was too long for a telephone survey, and the agents could not be assembled in a group.
GAO used structured telephone interviews to study such topics as the satisfaction of
In both cases, telephone interviews were used because the number of questions to be asked was small and time precluded a mail questionnaire.
Questionnaires were administered in a group setting as part of GAO studies of
In general, GAO uses mail questionnaires much more frequently than group
questionnaires, telephone interviews, or face-to-face interviews combined.1 However, an
understanding of structured interviewing techniques is essential for situations in which a
mail questionnaire cannot be used. Additional discussion of structured interviews,
questionnaires, and other DCIs, with examples of GAO applications, appears in chapter 10.1
of the GAO Project Manual.
| 1 Questionnaires are discussed in the methodology transfer paper entitled Developing and Using Questionnaires. |
Chapter 3
Designing a Structured Interview
Designing a structured interview requires more than just writing down a set of questions
to be asked. In this chapter, we first examine the process by which the interview
questions are identified, developed, and selected; then we describe standard procedures
for composing and formatting the questions. These procedures aim to ensure that the data
collected are reliable and valid and to facilitate trouble-free editing and analysis of
data, while keeping the burden on the interviewee to a minimum.
Reading or even studying this transfer paper will not make anyone an expert in writing
questions for structured interviews. We suggest, therefore, that you work with measurement
specialists from the design, methodology, and technical assistance group in the division
programming the assignment when you are planning to use a structured interview.
The DCI for structured interviews should be reviewed by a design, methodology, and
technical assistance group if it involves 10 or more private citizens, private firms, or
local governments; 5 or more state governments; or 25 or more federal agency officials or
employees.1 In certain executive agencies, GAO has designated representatives and
established procedures that must be followed when using structured interviews and
questionnaires. 2
| 1 See GAO's Project Manual chapter 10.1 on methodology. 2 For example, see GAO's Operations Manual, order 0175.5 (A-91), "Coordination of General Accounting Work at the Department of Defense, Defense Agencies and the Military Departments/ Bureaus. " |
Identifying Variables and Developing Questions
The first step is to formulate the broad, overall questions to be answered by the
evaluation or audit. Why is the study being done? What do we hope to be able to say or
prove? Are we primarily describing what has taken place in a program? Do we want to
compare what has happened with some established or implied standard, a normative-type
question? Or do we want to determine if a program has made a difference, a
cause-and-effect type question? Examples of such questions are
The type of question asked will dictate the evaluation strategy. Also, certain strategies are more appropriate to answering certain questions. 3 However, structured interviews, being simply a method of data collection, can be used with several evaluation strategies and, thus, in a variety of GAO assignments.
After the broad overall questions are developed, they must be translated into measurable elements in the form of hypotheses or questions. For the example mentioned above, to evaluate how participants found jobs would require developing such measures as the sources through which participants learned of available jobs, the number of employers contacted, and the number of job interviews arranged. Next, the target population must be identified. The target population is the source level (individuals, groups, or organizations) at which the information is to be gathered. Thus, in the study of how program participants found jobs after leaving the program, the target population is the individual participants of the program who were trained. 4
Next, develop a pool of questions that attempt to measure the variables under consideration. The questions may include various ways of measuring the same variable. For example, for age, you might ask, "How old were you on your last birthday?" or "On what day, month, and year were you born?" Both questions help you determine the individual's age, but the second elicits much more information. Decide which to use. From the pool of questions, then, the most useful or appropriate are chosen.
The identification, development, and selection of questions for our example, a study of how program participants found jobs after leaving a job-training program, are illustrated in table 3.1.
| 3 Formulating overall evaluation questions and selecting evaluation
strategies that provide answers is discussed in the methodology transfer paper entitled Designing
Evaluations. 4 Later in the evaluations, data analyses may actually be done at a higher (more aggregated) level. In the example above, the XYZ program may be conducted at several locations in a city, in many cities in a state, and in many states. Thus, several levels of analysis would be possible. The objectives of the evaluation and the sampling plan devised to meet those objectives, however, dictate the level or levels of data analysis. |
Table 3.1: Identifying, Developing, and Selecting Questions
| Task | Example |
| Formulate overall questions | How do program participants find jobs after leaving the XYZ program? |
| Determine the kind of information needed |
|
| Identify target population | Program participants who have left the program (random sample) |
| Create a question pool | 1.1 How did you look for jobs?
1.2 About how many jobs that you were interested in did you find out about from
2.1 How many employers did you contact about a job since you left the program?
3.1 How many... |
| Select questions | 1.1... 2.1... 3.1... |
Composing Appropriate Questions
When composing interview questions, be sure they are appropriate-that is, relevant to the
study, directed to the proper persons, and easily answered.
Avoid questions that require the interviewee to perform "audit work" to answer-that is, to consult records or other information sources. If used at all, such questions should be reserved for mail questionnaires. For telephone interviews, the questions should be even less complex, because there is less of an opportunity to help the interviewee understand. It is possible to send the questionnaire beforehand to the person who will be interviewed, requesting that he or she gather the necessary information in preparation for the interview.
Other questions (or the manner in which they are presented) that cause the interviewee discomfort should be avoided or used with extreme care. The same is true of questions that would tend to incriminate or show the interviewee in a bad light, particularly since the interview might terminate if they were asked. Likewise, avoid personal questions about private matters that do not belong in a GAO study, as well as questions whose sole purpose is to embarrass the interviewee (such as testing or questioning the intelligence of the interviewee or seeking information about private habits).
If needed, ask sensitive questions in a mail questionnaire, where confidentiality or anonymity can be granted. 5 Also avoid questions that could cause unnecessary confrontation, causing the interviewer and interviewee to take sides and do battle. This detracts from the interview task, may cause bias, and can seriously affect the validity of the answers given.
Also avoid questions that have no answers and avoid questions that, if you attempt to ask them, produce unusable results. These are not to be confused, of course, with questions for which the legitimate answer might be "no basis to judge" or "no opinion" (presumably, some interviewees will not have a basis to make a judgment or give an opinion).
| 5 See our discussion on confidentiality and anonymity in chapter 7. |
Selecting a Question Format
Considerations in deciding on the format or type of question to use include how the
question is delivered or presented, what the interviewee is asked, and available response
alternatives. Among the types of questions we use are open-ended, fill-in-the-blank,
binary-choice, and scaled-response, as discussed below.
Open-Ended Questions
The open-ended question provides no structure for the answer, allowing the interviewee to
discuss what he or she wishes, not necessarily what the interviewer wants to know. By
sharpening the question, you can focus it. For example:
Open-ended questions are easy to write. For initial research, they can be used successfully to elicit answers that contribute to the formulation of more specific questions and response alternatives. For a small number of respondents and where analysis may be qualitative, rather than quantitative, open-ended questions may also be useful. If possible, avoid using open-ended questions with larger numbers of respondents, whose answers need to be tabulated. Under such circumstances, content analysis should be done before attempting to tabulate. 6
In CATI questionnaires, the questions should be designed in accordance with the guidelines established for structured telephone surveys. 7 In addition, other practices apply. For example, open-ended questions should be avoided as much as possible, primarily because of the time it takes to type the answer. If the topics addressed in the questionnaire are at the exploratory stage, a CATI is not recommended. A CATI requires some degree of maturity in the understanding of the issues under investigation. To the extent that open-ended questions are included in a CATI, they should be designed for easy typing. Such questions take up considerable space in the computer data files. To the extent possible, they should be moved to the end of the questionnaire and the interviewer should attempt to record the answers "off-line." These questions have the potential for interrupting the flow of the CATI and deflating the interview.
| 6 Discussed in the methodology transfer paper entitled Content Analysis:
A Methodology for Structuring and Analyzing Written Material. Also see chapter 9. 7 Discussed in the methodology transfer paper entitled Developing and Using Questionnaires. |
A question that actually is closed can be presented in such a way that to the interviewee it appears to be open-ended. Do this by preparing a list of potential answers and checking these off during the interview, as the interviewee mentions the various alternatives. Do not, however, read the choices to the interviewee. Such questions are more focused and specific than simple, open-ended questions and allow the range of possible answers to be narrowed. Question 1 in figure 3.1 illustrates the technique.
Figure 3.1: Question 1
1. Why weren't you satisfied with the plane? (DO NOT READ CHOICES) (Check
all that apply)
|
Fill-In-The-Blank Questions
This type of question has a simple answer, usually in the form of a name, frequency, or
amount. Again, you may prepare a list of alternative answers to check off during the
interview. Questions 2, 3, and 4 in figure 3.2 illustrate this type of question.
Figure 3.2: Questions 2, 3, and 4
|
Binary-Choice Questions
This is the typical yes-no, true-false type of question, a good format for obtaining
factual information but generally not opinions or feelings. Since the interviewee is asked
to make a commitment to one extreme or another, binary choice is considered a forced
choice. Figure 3.3 shows an example.
Figure 3.3: Question 5
5. Have you ever served in the U.S. military? (check one.)
|
Scaled-Response Questions
In the scaled-response question, you read or show to the interviewee a scale-a list of
alternative responses that increase or decrease in intensity in an ordered fashion. There
are three types: balanced, unbalanced, and rating and ranking scales.
Balanced Scales
The end points of the balanced scale are usually adjectives or phrases with opposite
meanings-for example, very satisfied and very dissatisfied. As its name implies, the
balanced scale contains an equal number of responses on each side of a reference point or
neutral response, as shown in question 6 in figure 3.4.
Figure 3.4: Question 6
6. How satisfied or dissatisfied are you with the typing ability of
the secretaries in your division? (check one.)
|
This scale expands the binary-choice answer discussed above, permitting a range of answers that better reflect the way people hold opinions.
Unbalanced Scales
The unbalanced scale is used when no negative response is possible. It has a reference
point (usually a "zero" point or "none") and the value of the
attribute increases for successive points on the scale. Intensity ranges from none to very
great.
Figure 3.5: Question 7
7. On your last assignment, how much opportunity, if any, were you
given to help develop staff working for you? (check one.)
|
Rating and Ranking Scales
In a rating question, the interviewee is asked to assign a rating to persons,
places, or things according to specified criteria. The points on the scale can be either
numeric or verbal. An example of a verbal scale is shown in figure 3.6.
Figure 3.6: Question 8
Whether verbal or numerical, a rating scale implies that the distance from one point to the next is the same on all parts of the scale.
In a ranking question, the interviewee is asked to place items in order according to a specified criterion, as shown in question 9 in figure 3.7.
Figure 3.7: Question 9
| 9. Rank the following individuals on their overall ability to do
Band II evaluator work. Use 1 for the best, 2 for the second best, 3 for third best,
4 for the fourth best, and 5 for the last. (Enter number for each) ____Brown |
Ranking questions may have several types of instructions. You can ask the interviewee
to rank all, as in the example, or to select the first (best) and the last (worst), the
top three, or some other combination.
In contrast to rating, ranking does not imply that the distance between points is the same
on all parts of the scale. For example, if Johnson, Green, and Smith were ranked 1, 2, and
3, respectively, the interviewee may not necessarily think that the gap between Johnson's
and Green's performance is the same as the gap between Green's and Smith's.
When it is necessary to obtain the interviewee's opinion as to the distance between
items (for example, how much better or worse one evaluator is than others), use a rating
question. While a rating question may also produce an ordering, a respondent may well give
two or more items the same rating. If you want the interviewee to choose between seven or
fewer items but you do not care how much better he or she believes one item is than the
others, a ranking question is likely to give you what you want. When a larger number of
items must be ordered, however, it will probably be easier for the interviewees to rate
them than to rank them. It is difficult to judge the order of a large number of items and
avoid ties between items, especially in interviews. A final order can be produced by
averaging the ratings over all respondents.
Number of Cues
The number of cues (alternative responses) for scaled-response questions depends on the
type of interviewee and type of analysis expected. There is a physical limit, generally,
to the number of cues to which an interviewee can react, probably around seven. GAO
usually uses five-point scales. Respondents with a keen interest in the study can be
expected to handle a greater number of cues. The more points on the scale, the better will
be the eventual analysis of the data, since more cues provide a more sensitive measure and
allow the analyst greater flexibility in selecting ways to analyze the data.
An even number of cues used in a balanced scale generally eliminates a middle or
neutral point on
the scale and forces the interviewee to commit to a positive or negative feeling. The use
of an odd-numbered scale permits a neutral answer and more closely approximates the range
of opinions or feeling that people can have.
When the possible responses do not include "no basis to judge," "can't recall," or "no opinion," the interviewee may feel forced to select an answer that is inaccurate. The point is that some people honestly may be unable to answer. If you have good reason to believe this is so for members of the respondent group, include in the list of cues read or shown to the interviewees the most applicable of the alternatives-"no basis to judge," "can't recall," or "no opinion." If you do not do this, the interviewee may guess, make up an answer, or ignore the question.
Order of Cues
The order in which the cues are presented can be used to help offset possible arguments
that the interviewees are biased toward answering the question in a particular way.
Consider a situation in which GAO had preliminary evidence that participants in a training
program were not getting job counseling. The following question could be asked:
"Job counseling involves someone talking to you about how to apply for a job, how to behave in an interview, etc. To what extent did you receive job counseling while you were in this program?"
The choices presented to the interviewee would be
In this example, the order of presentation biases the choice slightly in favor of the program. Some interviewees who did not take a strong interest in the question might select the first choice, indicating that they received job counseling to a very great extent. This would tend to give us an overall answer that was slightly biased toward receiving job counseling.
When the cues form a scale, only at great expense could we totally eliminate the bias inherent in the order in which the alternative responses are presented. 8 To repeat, the bias is slight. But when it does exist, we should use the logic of biasing the question against the hypothesis we are examining.
Wording of Cues
As indicated in the previous example, the scale used in the cues was the
"extent" to which some action was performed. When an action or process is being
assessed in a question, it is preferable to present the question and the cues in terms of
the action. The previous question would generally be rephrased as "How much job
counseling did you receive?" The cues could be rephrased as "A very great amount
of counseling," "A great amount of counseling," "A moderate amount of
counseling," and so on.
| 8 To totally eliminate this type of bias requires that half the sample be presented the cues in one order and the other half be presented the cues in the opposite order. In our example, half the sample would be presented a card on which "very great extent" was the first (or top) cue and "little or no extent" was the last (or bottom) cue. The other half of the sample would be presented a card on which "little or no extent" was the first cue and "very great extent" was the last cue. |
Unscaled-Response Questions
In an unscaled-response question, a list of cues is read or shown to the interviewee, who
is asked to choose one from the list or to select all that apply. The list should consist
of mutually exclusive categories. An "other" category is usually included as a
last alternative, either to provide for many possible (but thought to be rare) answers or
if it is thought that some interviewees will come up with unique answers. Question 10 in
figure 3.8 is an example of a question in which only one response is to be given; question
11 in figure 3.9 is a question in which the interviewee may check several responses.
Figure 3.8: Question 10
10. Educationally, what is the highest level that you have
achieved? (check one.)
|
Figure 3.9: Question 11
11. Please check the following possible eligibility requirements
that you will be using to determine when you will offer remedial education to youths in
your program. (check all that apply.)
|
Organizing Questions
In any DCI, the order in which the questions are presented is important. Early questions,
which set the tone for the collection procedure and can influence responses to later
questions, also help you get to know the interviewee and to establish the rapport
essential to a successful interview. 9 For example, in an interview with participants in
the XYZ program, the first few questions could review for accuracy data obtained from
agency files such as family composition, age, and education.
The next questions should also be ones that can be answered with some ease, as you are still developing rapport with the interviewee. Should these early questions be too difficult or too sensitive for the level of relationship developed, the interviewee might end the interview. Remember also that the questions should hold the interviewee's attention; thus, you must begin to introduce some "interesting" questions and the sensitive areas covering the attitudes of the interviewee.
Present the questions in a logical manner, keeping the flow of questions in chronological or reverse order, as appropriate. Avoid haphazardly jumping from one topic to another.
Also, avoid introducing bias in the ordering of questions. For example, to determine what the interviewee thinks a program's advantages and disadvantages are, do not mention the possible advantages or disadvantages earlier in the interview.
Generally, the set of questions asked varies from interviewee to interviewee. Many questions are asked only if there is a specific response to a particular question. As a result, several questions may be skipped. These interrelationships among the questions constitute the skip pattern of the DCI. For face-to-face interviews and telephone interviews that do not use a CATI system, the complexity of the DCI's skip pattern should be kept to a minimum. Otherwise, it becomes very difficult for the interviewer to find the next question to be asked.
| 9 Establishing rapport is covered in more detail in chapter 8. |
One of the important advantages of a CATI questionnaire is that it allows for considerable complexity in the skip pattern, since the branching is handled entirely by the computer. Any number of paths can be followed through the questionnaire. Usually, the computer displays the next question in sequence. Alternatively, conditional skips can be programmed to go from one specific question to another somewhat later in the questionnaire. These skips can be based on how the interviewee answers a single question or on the responses to several questions.
One drawback to a GATI questionnaire is that multiple-choice questions permitting several answers are not easily handled. It is difficult for an interviewee to remember all the options when several can be chosen. As a result, multiple-choice questions allowing the interviewee to check all that apply (as illustrated in figure 3.9) should be broken down into separate questions, each of which is an alternative response that is "checked" or "not checked."
Layout Considerations
The layout or form of a printed DCI (for non-CATI applications) (see figure 3.10 for an
example) is important; it is what you carry into the interview and use as a guide to
conducting it. It gives you on-the-spot instructions for each question and allows you to
record the answer. Later, the form is used to facilitate editing, keypunching, and the
subsequent computerized analysis.
Figure 3.10: Structured Interview Text
| Now I'd like to find out what you are doing. 23. Are you now receiving any AFDC? If, "Yes"-Is this a full grant or reduced grant? (Check one.) 01. __ Yes-Full grant (go to question 24) 24. What is your status with WIN? Are you registered in training or what? (Listen. Insert comments and try to determine what code to assign. If necessary check records or check with WIN staff afterwards. Check one.) Comments: 01. __ 10. Working registrant status 25. Are you looking for work (different work)? (check one)
26. How are you going about looking for work? (Do not read choices; indicate 1=mentioned, 2=not mentioned)
27. To what extent are you having difficulty finding a job? (Read choices; check one)
|
Here are some considerations when designing the DCI.
Typeface. Generally the text to be read to the interviewee is set off in a different typeface from the instructions that you do not read to the interviewee. In the example presented in figure 3.10, the text to be read to the interviewee is presented in upper and lowercase, the instructions in upper- and lowercase italics.
Continuation of questions. Generally, do not continue a question in the next column or on the next page, as you risk not having the entire question or all the response alternatives presented to the interviewee.
Boxes and lines. Provide open-top boxes for the interviewer to record answers to questions that require written responses. Place the box or line in a standard place beside each question to aid the interviewer and to facilitate editing, data entry, and subsequent analysis of completed questionnaires.
Keypunch numbers. These should be placed in a standard place beside each question to facilitate keypunching, when data are to be entered into computer files.
Skipping questions. If a certain response to a question means that interviewers are to skip the next question, specify this by placing a "GO TO" instruction beside the response.
Two computer programs are available to assist in the design and layout of structured
interviews and CATI questionnaires. They are QUEST (developed by GAO's Office of
Information Management and Communications) and QPL (Questionnaire Programming Language,
developed by the design, methodology, and technical assistance group in the Human
Resources Division).
QUEST makes it possible for an evaluator to create questionnaires using laser printers,
incorporating typographic and graphic elements (such as check boxes, arrows, and italic
type).10 QUEST automatically handles many of the layout considerations mentioned above.
Draft and pretest versions of typeset questionnaires, incorporating current desktop
publishing concepts, can be generated quickly. The designer develops a WordPerfect file
employing codes that identify questionnaire elements and control typographical layout on
the page. These codes make it possible to correct questionnaires easily, automatically
renumbering pages, questions, and choices and altering keypunch instructions.
QPL is designed to automate many of the activities involved in gathering and preparing survey data for analysis.11 It was developed primarily to implement CATI questionnaires within GAO; it can also be used as a data entry program for other DCIs. In this system, the questionnaire is first written in QPL, using a word processing program, and then compiled. The compiled version displays the questions on the computer screen, one at a time, and then waits for the interviewer to type a response. The interviewer can page forward and backward through the questionnaire to make corrections or review answers. The record of the interview is then added to a data file. The compiled version can also be converted into SAS and SPSS statistical analysis programs that can process QPL data files. One of the programs in the QPL system reformats the computer questionnaire into a written questionnaire, numbering all the questions, drawing open-top boxes for the answers, specifying card and column locations for each answer in the data file, and writing skip instructions. Unlike QUEST, the questionnaire does not incorporate typographic and graphic elements, but QPL makes it easy to review and revise a questionnaire.
| 10The question examples in this paper were prepared with QUEST. A manual
is available from the Office of Information Management and Communications, Publishing and
communications Center. As new technology becomes available, QUEST will be improved to make
it more powerful and even easier to use. More advanced desktop publishing software may
also facilitate questionnaire development. 11 See the QPL reference manuals: QPL Reference Manual, Version 2.0 (HRD Technical Reference Manual 1, March 1990); QPL Data Collection Program (HRD Technical Reference Manual 2, March 1990); and QPL Data Editing Program (HRD Technical Reference Manual 3, March 1990). |
Chapter 4
More on Interview Design: Avoiding Problems
In this chapter, we suggest further ways to compose good interview questions and to forestall problems with comprehension or bias. As an evaluator writing such questions, you need to consider the appropriateness and level of language used in the interview, the effects of qualifying language, and the importance of clarity. We also discuss the various kinds of bias that can creep into the wording of interview questions and their effect on the validity of the evaluation results.
Appropriateness of the Language
Whether interviewing language is appropriate or inappropriate may relate to what is said,
how it is said, or when it is said, as discussed below.
What is said in the interview is basically dictated by the written, structured data-collection instrument. The DCI is prepared in advance and pretested and the interviewers are trained to use it; thus, to some extent, the appropriateness of the language has been tested. It is the task of the interviewer to transmit faithfully to the interviewee the meaning of the questions. In addition to wording the questions precisely, you may include supplemental language in the DCI, to be used if the interviewee does not understand the original wording of a question. If, in the course of the interview, the interviewee still does not understand and different language must be improvised, such improvisations should be noted and considered before the data are analyzed.
How it is said concerns the speech and mannerisms of the interviewer who controls the "presentation" and whose delivery of questions may alter their intended meaning. More detailed information on this topic appears in chapter 8.
When it is said refers to the context of the interview in which each question is placed. Although, in designing the DCI, you should be precise about the order in which questions are asked, you may introduce some variation during the actual interview to clarify the questions, review information, or postpone potentially sensitive questions. Or, if the interviewee expresses concern or sensitivity to a given question, changing the language of a subsequent question might defuse the concern.
Level of the Language
When composing interview questions, consider the level of the language used. Seek to
communicate at the level the interviewee understands and to create a verbal setting that
is conducive to serious data-gathering yet one in which the interviewee is comfortable. In
chapter 3, we touched on some of the writing approaches to use; here we deal with how the
questions sound and the atmosphere the language creates. One problem often encountered is
maintaining a level of language that is neither above nor below the interviewee's level of
understanding.
Speaking over the interviewee's head includes the use of complex, rare, and foreign words and expressions, words of many syllables, abbreviations, acronyms, and certain jargon. Such language, while it may seem appropriate to the interviewer or evaluation team, may not be understood by the interviewee.
For example, when interviewing participants in a training program, the terms "OJT" and "PSE" in a question may be nothing but alphabet soup to the interviewees; even the words they represent, "on-the-job training" and "public service employment," may be over their heads. In conducting the actual interview, you would most likely have to give further definitions or examples of what was meant. When interviewing training program directors, however, the use of "OJT" or "PSE" would be appropriate if the interviewees use the terms daily.
Thus, to speak over the interviewee's head hinders communication. Interviewees who are embarrassed at their lack of understanding may either not answer or guess at the meaning, which can lead to incorrect answers. Or the interviewee may get the impression that you really do not care about the answer and lose interest in the interview.
Speaking down to an interviewee is just as bad. You can oversimplify the language in the DCI to the point where the interviewees feel you regard them as ignorant. This approach is demeaning. You have contacted these individuals because they have important information to impart. To treat a person condescendingly-or to let it appear that you do- negates that importance.
Likewise, take care in using slang, folksy expressions, and certain jargon. While such language may help you develop rapport with the interviewee, the exactness of the communication may be lessened.
To avoid error in either direction, pretest both the final wording of the DCI and the interview approach.1
| 1More detailed information on pretesting appears in chapter 5. |
Use of Qualifying Language
After composing an interview question, you may find it requires an adjective or qualifying
phrase added or a time specified to make the item complete or to give the interviewee
sufficient or complete information. For example, "How many employees do you
have?" might become "How many full-time-equivalent employees do you have?"
and "How many times have you gone to a physician?" might become "How many
times have you gone to a physician in the past 6 months?"
If feedback is possible in the actual interview, the interviewee can ask for further qualification, where needed. If you have not included the necessary qualifiers in the DCI, however, another interviewer may qualify in a different way. This could make the resulting data difficult to summarize and analyze.
Also, interviewees, not realizing that qualifying language is absent, may answer the question as they interpret it. Thus, different interviewees would be responding to different questions, based on their own interpretations.
Clarity of Language
The style in which a question is couched can affect its clarity of communication. We
discuss below such matters as question length, complexity, and clutter; double-barreled
questions; double negatives; extreme language; and defining terms.
Length, Complexity, and Clutter
A question that contains too many ideas or concepts may be too complex for the interviewee
to understand, especially if it is presented orally, which makes it difficult for the
interviewee to review parts of the question. While the interviewee might be responding to
one part of the question, the interviewer may be interpreting the response as a response
to the entire question. You should set up more than one thought in separate sentences and
give the interviewee the proper framework. For example, "How satisfied or
dissatisfied were you with the amount of time devoted to helping you get a job while you
were in the XYZ program?" becomes "Think about the training experiences you had
while in the XYZ program. How satisfied or dissatisfied were you with the amount of time
devoted to helping you get a job?"
Likewise, a sentence may contain clutter-words that do not clarify the message. Word questions concisely. Here are a few tricks to reduce sentence clutter:
Double-Barreled Questions
A double-barreled question is a classical example of an unclear question. Consider the
following: "Did you get skill training while in the program and a job after
completing the program?" This question attempts to determine if there is a
relationship between skill training and getting a job. But if the interviewee answers
"yes," this could mean "yes" to both parts, "yes" to the
training part only, or "yes" to the job part only. Other interviewees, finding
the question confusing, might not respond. You are presenting two questions but the
opportunity to record only one answer. Both interviewee and interviewer may see the need
for only one answer. State the questions separately.
Double Negatives
In phrasing a question, avoid the double negative, which is difficult to answer. For
example, "Indicate which of the organizational goals listed below are not considered
unattainable within the 2-year period" should be reworded to read "Indicate
which of the organizational goals listed below are considered attainable within the 2-year
period."
Extreme Words
Avoid such words as "all," "none," "everything,"
"never," and others that represent extreme values. Rarely is a statement using
such a word true, and the use of extreme words causes interviewees to avoid the end points
of a scale. There are cases when the use of "all" or "none" is
appropriate, but they are few. Where "yes" or "no" answers are
expected, the results can be misleading. For example, if one employee is not covered in a
question like "Are all of your employees covered by medical insurance?" a
"yes" answer is impossible. A better question would be "About what percent
of your employees are covered by medical insurance?" Alternatively, choices can
be provided, as in question 1 in figure 4.1.
Figure 4.1: Question 1
1. What portion of your employees are covered by medical insurance?
(READ THE CHOICES) (check one)
|
Defining Terms
Where possible, define key words and concepts used in questions. For example, when
speaking of "employees," define and clarify the term. Are we talking about
part-time, full-time, permanent, temporary, volunteer, white-collar, blue-collar? An
example of how this might be done is
"Consider people who work for your company, are paid directly by your company, work at least 35 hours per week, and are viewed as permanent employees. What percent of these employees . . . ?"
Of course, not all questions need be preceded by such a definition. As earlier questions are developed, definitions evolve. You may wish to list definitions in a separate section or on a card to hand to interviewees for reference.
Bias Within Questions
A question is biased when it causes interviewees to answer in a way that
does not reflect their true positions on an issue. An interviewee may or may not be aware
of the bias. Problems result when the interviewees are
Bias can appear in the stem (or statement) portion of the question or in the response-alternative portion. Bias may also result when a question carries an implied answer, choices of answer are unequal, "loaded" words are used, or a scaled question is unbalanced. These are discussed below.
Implied-Answer Bias
A question's wording can indicate the socially acceptable answer. An example is the
question "Most GAO employees have subscribed to the U.S. Savings Bond program. Have
you subscribed?" Interviewees who are concerned about being different from the norm
may answer "yes," even if they have not subscribed. The question could be
restated as "Have you subscribed to the U.S. Savings Bond program?"
Questions can be worded so as to impel some people to answer in one direction and others in another. Yet both types of interviewee could be unaware of any bias in the wording. Such bias usually occurs when additional qualifying or identifying information is added to the question. There is bias in the question "Which plan is more acceptable to you: the one designed by Pat Brown, our chief economist, or the one designed by Chris Green, the consultant we hired?" The interviewee who is not familiar with either plan may answer on the basis of whether the plan was generated internally or externally to the organization, although this may have little or nothing to do with the quality of the plan. A better presentation would be "Whose plan is more acceptable to you: Pat Brown's or Chris Green's?"
Bias Resulting From Unequal Choices
When response alternatives are created, it is important that they appear to be equal. If
undue emphasis is given to one, it may be easier for the interviewee to select that one.
Question 2 in figure 4.2 illustrates a question with unequal emphasis, and question 3 in
figure 4.3 corrects the unbalance. Alternative 3 in question 2 is isolated from the two
others because of the words "high-paid," which sets those individuals apart from
the others, and by the fact that alternative 3 is longer than the others.
Figure 4.2: Question 2
2. Who do you feel is most responsible for the poor quality of the
training program? (check all that apply)
|
Figure 4.3: Question 3
3. Who do you feel is most responsible for the poor quality of the
training program (check all that apply)
|
Bias From Specific Words
When used in almost any context, certain words can be considered "loaded,"
because they evoke strong emotional feelings. "American," "freedom,"
"equality," and "justice" generally evoke positive feelings, while
"communist," "socialist," "bureaucrat," and "nuclear
holocaust" may evoke negative feelings. Since it is difficult to control the
emotional connotations of such words, it is usually best to avoid them.
Bias From Lack of Balance
When using a scaled question, avoid bias in the stem as well as in the response
alternatives. A question that seeks to measure satisfaction with something should mention
both ends of the scale in a balanced fashion. For example, question 4 in figure 4.4 shows
unbalance in both the stem and the alternatives, while question 5 in figure 4.5 shows how
this bias is eliminated. 2
| 2 Proper of an unbalanced scale was discussed in chapter 8. |
Figure 4.4: Question 4
4. How satisfied were you with the answers you received? (check
one)
|
Figure 4.5: Question 5
5. How satisfied or dissatisfied were you with the answers you
received? (check one)
|
Considerations for Telephone Interviewing Instruments
In general, the same principles described above apply to the development of questions and
answers for telephone interviews. However, some additional considerations come into play.
The primary additional factor is that the cues available in face-to-face interviews are
absent. It is not possible to observe the interviewee's reactions (including confusion,
uncertainty, or hostility) and make allowable adjustments in conducting the interview.3
Making questions shorter, breaking multiple-choice questions into binary questions, and
conducting some pretests face-to-face will overcome some of these difficulties.
Another loss in telephone interviewing arises from the impersonal nature of the telephone. An interviewer has a tendency to become flatter in presentation. The interviewer must counter this tendency by being continually aware of the enunciation of questions. In the QPL system, some words are capitalized, underlined, or put into bold type to help the interviewer maintain appropriate pitch and emphasis.
In summary, designing a structured interview form is not simple. It involves many considerations and choices: the specific questions to be asked, their format, language order, and layout. In this chapter and chapter 3, we have covered briefly the basic principles that should be followed in making these choices.4
| 3see chapter 8 for details. 4For more information, consult Bradburn and Sudman (1981) or Sudman and Bradburn (1982), as listed in the bibliography. |
Chapter 5
Pretesting and Expert Review
Pretesting and expert review constitute perhaps the least appreciated phase in the development of a structured interview.' In the desire to meet deadlines for getting the job done, staff may ask "Why not eliminate the pretest?" or "Do we need outside opinions on the interview form?"
But these are perhaps the most important steps in the development of the interview, an iterative process that uses continuing input from evaluators and technical specialists to derive the final product. As Cannell et al. (1989) indicate, when the evaluator has little experience with a topic or when the interviewee has difficulty with a question, substantial work may be necessary to develop questions that will obtain the desired results. Research has shown that question formulation may alter results by as much as 50 percent. The pretest and expert review processes give the evaluators feedback as to whether its efforts stand a chance of doing what they are designed to do.
Following pretesting and expert review, the DCI is redesigned as needed-an iterative process that occurs after each pretest or group of pretests.
| 1The term "pretest" is not interchangeable with "pilot." "Pretest" is usually used in connection with the testing of a structured interview or questionnaire, while "pilot" implies a test of all or most of the complete study design at one field location before proceeding to implement the design at all selected locations. |
Purpose of Pretest
In pretesting, we test the DCI with respondents drawn from the universe of people who will eventually be considered for the study interviews to predict how well the DCI will work during actual data collection. The pretest seeks to determine whether
Research (Cannell et al., 1989) has shown the following to be among the types of problems that arise with survey questions:
Pretest Procedures
The number of pretests typically varies depending on the size of the survey and the range
of conditions that may affect the survey results. For structured interviewing of thousands
of respondents, 25 to 75 pretests might be conducted. Sometimes, when the sample is less
than 500, a dozen or fewer pretest cases are sufficient, provided they bracket the range
of data collection conditions. Discuss the exact number with the measurement specialist
who designed the DCI. To a great degree, the pretest procedures for the structured
interview simulate what would be done during actual data collection. It is important to
test as many of the procedures involved in conducting a structured interview as possible,
including the selection of and contact with the interviewees. In part, pretests should be
conducted in the same mode to be used in the actual interviews-that is, the face-to-face
interview pretested in person and telephone interviews over the telephone. However,
telephone and mail surveys should also be tested in part in face-to-face interviews. For
CATIs, which generally have fewer than 300 interviews, a dozen pretests might be
sufficient. These pretests should be conducted both in person and over the telephone.
Who Conducts the Pretest
Two types of staff should represent GAO at the pretest:
The measurement specialist acts as the interviewer-that is, asks the questions on the first and perhaps the second pretest-while the evaluator observes. On subsequent pretests, the evaluator asks the questions and the measurement specialist attends as observer.
Selecting and Contacting Pretest Interviewees
Pretest interviewees are drawn (not necessarily randomly) from the universe being
considered for the final study. If the universe is relatively homogeneous-for example,
welfare recipients-the pretest subjects need not be exactly balanced as to various
attributes. With a heterogeneous group, such as taxpayers or U.S. citizens, however, try
to obtain pretest interviews with high- and low-income people, old and young, the highly
educated and less educated, and women and men. Ideally, the DCI is pretested with several
of each of the different kinds or types of individuals in a heterogeneous group.
Contact pretest interviewees by telephone or in person to arrange a pretest session. If
possible, follow procedures similar to those proposed for actual data collection. Identify
yourself, describe what kind of agency GAO is and what it does, explain the nature of the
study, and indicate the importance of their participation. If this is a face-to-face
pretest, ask the interviewee to participate by arranging to meet in a place that is
convenient to the interviewee and free of distractions. If this is a pretest of a
telephone interview, arrange a time that is convenient for the interviewee. (For a more
detailed explanation and copies of text to be followed, see chapter 7.)
Conducting the Pretest
The initial steps of a pretest are the same as for actual data collection. Give the
interviewee any appropriate background information, even if you have covered this while
setting up the interview appointment. Since an interview is interactive, the interviewee
will probably provide a great deal of feedback in addition to answering the questions.
Problems with the DCI or procedures often become evident immediately and may be dealt with
then, so that the interview can proceed. Often, if an instruction, word, or concept is not
understood, the interview cannot continue.
Ideally, however, it is desirable to run through the entire interview without getting sidetracked. This way, you can examine the flow of the interview and estimate the total time needed to complete it.
During the pretest, then, your tasks as interviewer are to
With respect to the second item, providing explanations or alternative wording must be done carefully, since interviewer bias can occur. The interview is written as bias-free as possible. In deviating from the prescribed text, you may not have time to rephrase the question adequately and can make a slip in wording that favors or is slanted toward your approach to the situation.
For telephone interviews, it may be easier to conduct the pretests and they may be more informative. The interviewee should be informed that a measurement specialist will be listening for purposes of refining the instrument. It may be possible to use a speaker phone to allow more members of the team to listen, take notes, and record answers without intruding. With the interviewee's permission, the interview may be taped to allow for more detailed examination of problems. With these possibilities, pretesting telephone interviews may be a lot smoother than pretesting face-to-face. However, as mentioned above, remember to include some face-to-face interviews.
Identifying Problems
After a pretest, the evaluator and the measurement specialist review the interview process
and attempt to identify any problems that the interviewer has in asking the questions or
the interviewees appear to have in answering questions. If the pretests disclose problems
such as ambiguous interpretation, or other difficulties (discussed below), you must revise
the interview and continue the tests until the problems are resolved, even if this
requires unplanned extra time. Premature termination of pretests can result in
questionable data. Major indicators of problems include the following:
The problems fall into two basic categories-those related to instrument design or administration and those concerning the interviewee's lack of knowledge or reluctance to answer. The first type can be controlled by the staff designing the instrument and are covered in chapters 3 and 4, while the second is merely recorded as observed behavior.
Research has found (Cannell et al., 1989) that pretest interviewers are not consistent in identifying problems with the questions or providing guidance for their revision. Responses can vary by as much as 50 percent when there are no adequate controls over the quality of the questions and procedures. Two techniques (categorization of respondent behavior and use of probe questions) that have been developed are useful particularly when the number of interviewers is large. The first method simply involves tabulating for each question how often each one of the problems mentioned above occurred across all interviews. A small percentage of interviews is expected to have some problem for each question. If, however, for a given question, a high percentage of interviews has a specific problem, this suggests that a question needs revision.
The second method, use of probe questions, can be used by itself or to clarify the nature of the problems identified from the first method. Special probe questions may be included in the interview or may be used at the end to ask interviewees to elaborate an answer, explain how they interpreted the questions or answers, or describe any difficulties. There are three types of probes:
| 2For more information on conducting a pretest debriefing, see the methodology transfer paper entitled Developing and Using Questionnaires. |
Purpose of Expert Review
Because no instrument is perfect, it is generally useful to seek outside commentary on our
approach. We seek expert review on assignments using structured interviews to help us
determine whether
In many instances, officials from the agency whose program is under review serve in this capacity. By obtaining agency input at this stage, we avoid potential problems after data collection, when time and money have already been spent. In other cases, staff in other design, methodology, and technical assistance groups, PEMD staff, or individuals with subject-area or evaluation expertise can provide expert review. In particular, subject-matter experts in membership associations who provide us with lists of the respondent universe or sample can provide expert review.
Persons providing expert review are not acting as interviewees. They do not answer the questions but instead provide a critique.
Instrument Redesign
The evaluator and the measurement specialist consider the results of the pretest and
expert review and make appropriate changes to the DCI. If changes are minor, the
instrument can be used without further pretests; if extensive, another series of pretests
may be necessary.
If pretesting can be spread over a longer period of time, more versions of the
instrument can be tested and a smaller number of interviewees can be used with each
version. Changes that are obviously needed can be made and the revised version can be used
in the next pretest. This allows us to use a relatively more perfect version on each round
of pretests.
Chapter 6
Training Interviewers
In most cases, our own evaluators conduct structured interviews for GAO studies, but
occasionally we use employees of other agencies or contractors. Regardless, the
interviewers must be trained in the purpose of the evaluation and the procedures for
conducting the interview.
Training Methods
GAO uses various ways of training its interviewers and helping them maintain their skills
throughout the data-collection period: a job kickoff conference, an interview booklet,
role-playing and field practice, and supervisory field visits and telephone contacts. In
addition to the items discussed below, interviewer training should emphasize the skills
described in chapter 8 for conducting the interviews, with particular attention to
structured interview tips, probing techniques, and reinforcements. These are also
discussed below.
Kickoff Conference
For most projects of any size, a GAO division holds a kickoff conference to tell the staff
from the regions and other divisions the purpose of the evaluation, to make assignments,
and to answer questions. When a project is to include structured interviewing in the
data-collection phase, the conference is usually extended so the interviewers can be given
detailed instructions on the use of the DCI. Preferably, all potential interviewers should
attend.
If a region sends only one representative to the kickoff conference, for example, it
should be an individual who will be conducting interviews for the study. Not all aspects
of the training can be written into the interview booklet (discussed in the next section);
thus, practice sessions must involve, along with the measurement specialist, those who
will actually conduct interviews and possibly will train others in the region to do so.
The training begins with the evaluator in charge and the measurement specialist reviewing
the purpose of the study and how the interview data will fit into its overall objectives.
Then, the data-collection procedures are covered in detail, using the interview booklet.
The trainers discuss the interview form, question by question, including the need for the
data, possible rephrasing to be used if a question is not understood by the interviewee,
how to record the answers, and other matters they think could arise. The trainees can ask
questions, clarify items, catch typographical errors in the DCI, and suggest possible
changes from their experience. Even at such a late date as the kickoff conference, changes
can be made in the DCI to preclude problems being carried into the actual interviews.
Among the potential problems that the trainers usually make special efforts to address is making sure that the interviewers
Interview Booklet
Where the interview questions are limited in number and not very complex or difficult and
the staff members who will conduct the interviews helped develop the DCI, we use the
kickoff conference alone to inform the interviewers in detail how each question should be
handled.
If, however, a large-scale interview effort is undertaken, GAO project staff may prepare a booklet that discusses in detail each question in the DCI. (The booklet is similar to that issued by the Bureau of the Census to its enumerators.) Typically, GAO's booklets cover not only the interview questions but also other matters such as sampling procedures, contacts with interviewees, and coding procedures. These are discussed below.
Sampling Procedures
Where statistical sampling procedures are to be used to select interviewees, the booklet
shows the interviewer how to identify the universe and select the sample. The booklet may
include a random number table, when necessary, and describe both simple random samples and
more complex two stage procedures.
Interviewee-Contact Procedures
Rules are provided for contacting the potential interviewee and deciding what to do if the
person refuses or cannot be located. An example is given of a phone conversation to set up
the interview. Also covered is the log interviewers must keep of all interview contacts to
ensure that proper sampling is maintained. The log makes it possible to adjust the
universe later and to examine possible effects of nonresponse. For CATIs, many of the
contact and logging procedures are handled automatically by the computer. How this is to
be accomplished should be described to the interviewers during training.
Coding Procedures
The booklet shows interviewers how to code the various types of question to facilitate
editing and keypunching the answers and reviews different types of questions. This is
handled automatically for CATIs.
Role-Playing Practice
This is nothing more than two staff members taking turns "playing" interviewer
and interviewee, a training method that should start at the kickoff conference as a group
session with the measurement specialist observing and critiquing. The role playing can
continue when the staff members return to their regions, particularly if regional staff
members who did not attend the conference will also be conducting interviews.
Such role-playing gives staff members the chance to become familiar with the instrument from both sides of the interview. The person playing the interviewee should challenge the interviewer by giving him a "hard time," perhaps refusing to answer questions or pretending not to understand them. Sometimes this serves to show the weaknesses of questions that are unclear or lack sufficient response alternatives. If so, the evaluator in charge or measurement specialist should be notified so the items can be changed or clarification can be given to all interviewers.
For CATIs, the interviewers must also be trained in the software requirements. This should be done after the training in the details of a paper version of the DCI. The computer training first focuses on the mechanics of using the computer for a CATI, showing the interviewers how to start the CATI, move the cursor and step through each screen, move back and forth between questions, and identify particular situations that may arise.
After the essentials of the DCI and the computer have been covered, the interviewers can proceed to role-playing, this time using the computer set up for office-to-office mock interviews. The evaluator in charge or measurement specialist should observe these sessions to identify not only weaknesses in the DCI but also any difficulties in using the computer. This role-playing should be practiced for a half to a full day.
Field Practice
Once evaluators are in the field at the first site, they should oversample the number of
interviewees needed for that site and use some for field-practice interviews. These
interviews are planned as throwaway cases, identified as such in advance of the interview.
The data derived from an interview are not used in the final analysis, regardless of
whether the interview went well or poorly. Interviewing real interviewees who do not count
gives interviewers a chance to get rid of any anxiety and test out their approach. The
interviewees, however, should not be told that this is a practice session. To them, this
is the real thing; they will, therefore, exhibit all the cautions and concerns of any
interviewee.
Obviously, field practice takes some time and should be built into the project
schedule. After practice, the interviewers should discuss any problems they had and decide
where they need to change their approach or learn more. Any lasting concerns should be
relayed to the evaluator in charge or the measurement specialist.
Supervisory Field Visits
Normally, the evaluator in charge makes field visits during the course of an evaluation. A
visit early in the data-collection phase, when interviewing has just begun, is valuable,
allowing the evaluator in charge to review the procedures being used to conduct the
interviews and observe some interviews firsthand. This quality-assurance checking enables
the evaluator in charge to ascertain that interviewers are carrying out the standard
practices designed into the structured-interview procedures. If possible, the measurement
specialist should participate in some of the visits.
For CATIs, it may be more difficult to maintain supervisory controls. To the extent possible, each interview should be recorded on paper as entries are made into the computer, so that the accuracy of the computer input can be verified. Large organizations that conduct CATIs frequently provide the capability of a supervisor to monitor calls by interviewers, usually at random or when the interviewer experiences problems. This is not usually possible for GAO CATIs. In some instances, it may be useful to tape initial interviews, with the interviewee's permission, in order to remove any final problems associated with the interview administration.
Supervisory Telephone Contacts
The evaluator in charge and measurement specialist form a team that keeps interviewers
informed of changes in procedure and receives comments from the field on progress and
problems encountered. These telephone contacts serve as the final step in training
interviewers.
Interviewer Qualifications
Many GAO interviews are highly sensitive, and the data to be obtained can be influenced by
subtle elements that are in the control of the interviewer. When GAO uses outside sources
to supply interviewers, it usually retains the right to examine the work of interviewers
and, if there is cause, suggest that some be replaced. The same applies to GAO evaluators
whom the region or division assigns to the project. Staff members who are reluctant to
conduct the necessary interviews or exhibit some bias may not be right for the job and
could jeopardize the data-collection effort.
For CATIs, the skill level and content knowledge of interviewers can be lower than for face-to-face interviews because the questions are generally simpler and fewer probes need to be used. As a result, GAO contracts for CATIs or the use of short-term, part-time staff have been quite successful and provide alternatives to the use of evaluators.
The qualifications that interviewers exhibit during the various training opportunities
should be evaluated by supervisors. If there are any problems that cannot be corrected
through retraining, these interviewers should be replaced.
Chapter 7
Selecting and Contacting Interviewees
This chapter touches briefly on the selection of interviewees and then discusses in some
detail contacting the prospective interviewees, arranging the interview, and protecting
the interviewee (through informed consent and guarantees of confidentiality or anonymity).
Selection of Interviewees
For some structured interviews, because there is only one person who fits the category of interviewee (for example, state officials responsible for welfare programs), no selection process is needed. More-complex selection procedures that are required-for example, when the sampling plan calls for a random sample of program participants or other respondent groups-are covered in some depth in the methodology transfer paper entitled Using Statistical Sampling. When complex sampling techniques are used and a list of interviewees is generated by computer, control over the selection and contact of interviewees can be automated, as described in more detail below.
Contacting Potential Interviewees
Once the potential interviewees have been selected, you must contact them, explain what
GAO is doing and why you need their assistance, and arrange an appointment. The interview
booklet sets out rules to be followed in contacting the interviewees.
Frequently, when structured interviews are used, interviewees are program participants or beneficiaries of federal programs. The universe list is developed for a given point in time and a sample is drawn. By the time the sample is contacted for interviews, months may have passed. This means some of the people selected for initial telephone contact will have moved away, died, or otherwise become inaccessible to GAO interviewers. Thus, we oversample and set up rules for replacing individuals who cannot be located. Such provisions are illustrated in figure 7.1, which contains rules that GAO used to review a nationwide program requiring the interviewing of program participants.
Figure 7.1: Interviewee Contact Procedures Example
| In order to provide a comprehensive assessment of the XYZ program
nationwide, a complicated sampling plan has been devised to select participants to interview. The sampling plan will allow us to interview as few as 16 participants at each selected location, thereby alleviating limitations on our staff and time. However, since only 16 participants will be representing all participants at a site, the sampling and interviewing rules for selecting the participants must be strictly adhered to. Failure to follow the rules will seriously jeopardize the validity of our review. The rules for randomly selecting the participants for possible interview should be closely followed to yield the 16 planned interviews. Log sheets will be provided for you to record your attempts to contact potential interviewees. The rules for random selection require that the people interviewed must be the first ones selected. Only if you absolutely cannot reach one of the first ones can you move down the list to try the next participant for possible interview. We have set up some rules to follow which allow you to drop a participant from the list. You may drop a potential interviewee
Other rules and suggestions will be discussed at the kick-off conference. |
When contacting the interviewee by phone, use a standardized approach. This ensures that you do not omit any important information. An example of such an approach is presented in figure 7.2. For CATIs, the introductory script can be put onto the computer. Naturally, if unexpected events occur, you may have to deviate from this guide. The interview booklet may contain some samples of unexpected events and provide some guidance on how to deal with them.
Maintain a log of all attempted contacts, with a record of each interviewee's name and
address, telephone number, date and time of the attempted contact, and the result. This
information will be of use later in determining the possible effects of nonrespondents on
the results. Also, it gives the analyst a means of adjusting the universe and plays a role
when response-weighing is used. An example of such a log appears in figure 7.3 and how it
looks partially completed is in figure 7.4.
Figure 7.2: Example of Telephone Contact With Potential Interviewee
| WHEN YOU GET THE INTERVIEWEE ON THE PHONE, YOU SHOULD SAY SOMETHING LIKE: 'Hello, (name of interviewee), my name is (give your name). I work for the U.S. General Accounting Office. We work for the U.S. Congress. Currently, we are doing a study of services provided under the XYZ program. That is the program that provides (briefly describe the program). When can we set up an appointment for you to spend 30 minutes or so with me to answer some questions about the program and your experiences with it? ' IF THE INTERVIEWEE AGREES, SET UP THE APPOINTMENT. IF HE OR SHE REFUSES, EXPLAIN THE IMPORTANCE OF THE INTERVIEW BOTH TO THOSE WHO
PARTICIPATE IN THE PROGRAM AND TO THE GOVERNMENT. YOU CAN SAY: 'Well, the government is providing the money for the program. If it is a good program,
they should know that; if it is not doing the job, it should be changed. Our report will
help the government decide what should be done. That's why we need to talk to people who
have been in the program and really know what is going on. ' IF ALL EFFORTS FAIL, RECORD THE REASON FOR THE REFUSAL IN THE RESULT COLUMN OF THE LOG SHEET. |
Figure 7.3: Interviewee Contact Log
*** insert 7.3
Figure 7.4: Interviewee Contact Log Filled In
***Insert 7.4
For CATIs, particularly those using the QPL system, a data base of respondents can easily
be generated and used to provide automated call sheets. Certain information, such as the
time at which an interview was conducted, can be entered automatically by the computer. In
addition, if information about the interviewee is already available from other sources, it
can be entered directly into the record being generated without the need to ask the
interviewee (unless some verification is wanted). Finally, when the interview is
completed, selected information can be transferred to the automated call sheets to record
the progress in administering the DCI.
A main objective when selecting and contacting interviewees is to avoid bias. By following set procedures, you can minimize wrong selections made by mistake or because of ease in contacting them.
Interview Arrangements
When you interview an individual for a GAO evaluation or audit, the interviewee usually is
doing GAO a favor. You should, therefore, make the interview arrangements, including time
and site, as convenient as possible for the interviewee.
This may mean conducting the interview at what is for you, the interviewer, an inconvenient hour, such as early morning or late evening. The location might be a GAO office, an audit site, space provided by the agency under review, or some other public place. If this is not convenient for the interviewee, you may have to travel to his or her home or place of employment or to some other such location. For example, if you must interview farmers, you cannot expect them to take time from their work routine to travel to a place to meet you; you would need to go to the farms.
If the interview contains sensitive questions, holding the interview in certain locations might create difficulties. For example, if you are questioning participants of a welfare program about the services and treatment they are receiving, it would be unwise to conduct the interview in the welfare office. Such a setting might cause interviewees to omit negative comments about the office and its personnel out of fear that this information would be overheard and affect their benefits.
When interviewing people in their homes, you may encounter frequent interruptions from other family members, neighbors, and telephone calls. Television and radio programs can also be distracting. Interruptions and distractions also occur when people are interviewed at work. Nevertheless, there are advantages to interviewing people in their own settings: they generally feel more comfortable, they have not been inconvenienced by having to travel to the interview, and they may have records and other sources of information, including other people, at their disposal. Thus, choose the interview setting carefully. On balance, it is more important to conduct the interview in a setting in which the interviewee feels comfortable than to insist on a setting that offers no distractions.
For CATIs, the same general principles are used to se