2.2 Designing an Experiment

We have previously discussed the importance of using experiments to establish cause-and-effect relationships between variables. That is the “why” of experiments. In this section, we investigate the “how” of experiments: designing and conducting experiments so that the results are meaningful.

2.2.1 What Is a Randomized, Comparative Experiment?

On November 18, 1976, the California division of the American Cancer Society got nearly one million smokers to quit for the day. That was the beginning of the Great American Smokeout, now held every year on the third Thursday in November. While some people are able to quit smoking “cold turkey,” research has shown that individuals are more successful when they use resources such as support from family and friends, stop-smoking groups, counseling, nicotine patches, or prescription medication.

Photo Credit: Doug Martin / Science Source

Researchers in Denmark conducted a double-blind, randomized study to investigate whether nicotine patches were safe and effective aids for quitting smoking. The experiment involved 289 smokers, 82 men and 207 women. Subjects were treated for 16 weeks; 145 individuals received 16-hour nicotine patches and 144 received placebo patches. The results are summarized in the table below.

                     Percent Abstinent
Type of Treatment    After 6 Weeks    After 12 Weeks    After 26 Weeks    After 52 Weeks
Nicotine Patch       53               41                24                17
Placebo Patch        17               10                5                 4

Table 2.2: Nicotine Patch Study Results

Data from The New England Journal of Medicine

The explanatory variable for this study was the type of patch received, and the response variable was whether the individual continued to abstain from smoking, as measured at four intervals up to one year.

This study was an experiment, because there were two treatments involved (the nicotine patch and the placebo patch), and comparative, because the results from the nicotine patch group were compared with those of the placebo group. A placebo is an inactive treatment that has no medical effect, but resembles the active treatment in all other aspects. Here the placebo served as the control treatment, a treatment used to establish the effect of not treating the individuals medically. A study that uses a placebo as a treatment is called placebo-controlled.

In this experiment, the placebo was used in order to determine whether the nicotine in the patch, rather than just having a patch, caused the increase in abstinence from smoking. In a phenomenon known as the placebo effect, some people improve just because they are part of a study or are receiving a treatment, regardless of whether the treatment is real or not.

The researchers described their study as double-blind. This means that none of those directly involved with the experiment knows which individuals are receiving the active treatment and which are receiving the placebo. This includes not only the subjects of the experiment, but also those administering treatments or interpreting the results. When those conducting the experiment know who is receiving what treatment, they may unintentionally treat the subjects differently in some way.

An experiment is single-blind if either the subjects or the researchers, but not both, do not know who is receiving what treatment. Blinding minimizes the chance of bias, but is not possible in all situations.

The final characteristic of this study is that it is randomized. Recall that randomization was also an important aspect of sample surveys. For sample surveys, randomization involves using impersonal chance to select the sample from the sampling frame. For experiments, the randomization occurs not in sample selection, but rather in the assignment of individuals to treatment groups. This randomization is the best method for creating treatment groups that are, as much as possible, similar in all regards except for the treatment. This allows us to conclude that differences in outcome are due to differences in treatment.
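As an illustration of random assignment, here is a minimal Python sketch. It is not the procedure the Danish researchers used, which the text does not describe. The function name `randomize` and the numeric subject IDs are hypothetical; the only details taken from the study are the 289 subjects and the two treatments.

```python
import random

def randomize(subjects, group_names, seed=None):
    """Randomly split subjects into len(group_names) groups of near-equal size."""
    rng = random.Random(seed)      # a seed makes the assignment reproducible
    shuffled = list(subjects)
    rng.shuffle(shuffled)          # impersonal chance decides the order
    k = len(group_names)
    # Deal the shuffled subjects out like cards: group i gets positions i, i+k, ...
    return {name: shuffled[i::k] for i, name in enumerate(group_names)}

subject_ids = range(1, 290)        # hypothetical IDs for the 289 subjects
groups = randomize(subject_ids, ["nicotine patch", "placebo patch"], seed=2024)
print({name: len(members) for name, members in groups.items()})
# prints {'nicotine patch': 145, 'placebo patch': 144}
```

Because the shuffle, not the researcher, decides who lands in which group, characteristics such as age or smoking history tend to balance out between the groups. Passing three group names would split the subjects three ways with no other change.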

In the results of the study, the researchers state, “Rates of sustained abstinence were significantly better with active treatment than with placebo.” What does “significantly” mean here? The researchers are stating that their results are statistically significant; that is, the differences between treatment groups are so large that they are unlikely to have occurred merely by chance. And if it weren’t chance, then it was probably the difference in treatment—the nicotine in the patch—that caused the differences in the results.
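To make “unlikely to have occurred merely by chance” concrete, the following Python sketch runs a two-proportion z-test on the 52-week results. This is one common way to compare two percentages, not necessarily the analysis the researchers performed. The counts 25 and 6 are assumptions: they are rounded reconstructions from the table (17% of 145 and 4% of 144), not figures reported in the text.

```python
from math import sqrt, erfc

def two_proportion_z(x1, n1, x2, n2):
    """Two-sided z-test for the difference between two proportions."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)       # common rate if treatment made no difference
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    p_value = erfc(abs(z) / sqrt(2))     # two-sided normal tail probability
    return z, p_value

# ASSUMED counts, reconstructed from rounded percentages in Table 2.2:
# about 25 of 145 abstinent with the nicotine patch, about 6 of 144 with placebo.
z, p_value = two_proportion_z(25, 145, 6, 144)
print(f"z = {z:.2f}, p-value = {p_value:.4f}")
```

Under these assumed counts, z is well above 3 and the p-value is well below 0.01. A difference this large would rarely arise by chance alone if the two patches were equally effective.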

When we looked at sample surveys in the previous section, we wanted to see the nonresponse rate. In the case of experiments, we have similar questions. How many of the individuals who began the study actually completed it? Why did individuals drop out? In this study, two subjects who received the nicotine patch and one who received the placebo dropped out because of side effects.

An easy way to see how an experiment is to proceed is to draw a picture of the design. The picture below shows the design of the nicotine patch experiment.

Figure 2.1: Design of the Nicotine Patch Experiment

This study is an example of the simplest form of a randomized comparative experiment. The participants are divided into only two treatment groups; if there were three treatments, the random assignment would require dividing the subjects into three groups. The middle part of our picture would then have three lines rather than two, but the random assignment that begins the process, and the comparison of results that ends it, would remain the same. In any well-designed experiment, we always begin with random assignment of individuals to treatment groups, and end by comparing the values of the response variable.

The video Snapshots: Experimental Design details how researchers designed an experiment to determine whether nutritional supplements improved knee pain in arthritis sufferers.

Question 2.12

In a study conducted by the Harvard School of Public Health, 227 workers identified as abusing alcohol were randomly assigned to one of three rehabilitation programs sponsored by their employers: compulsory inpatient treatment, compulsory attendance at Alcoholics Anonymous meetings, or the participant’s choice from a set of options. Workers were then compared on job performance and subsequent alcohol and drug use.

Answer: (1) "Type of treatment" is the explanatory variable for this study. (2) The response variables for this study were both job performance and subsequent alcohol and drug use. (3) There were three treatments: compulsory inpatient care, compulsory attendance at AA meetings, and the participant’s choice of options. (4) This was not a placebo-controlled study. All three treatments were active. In this case it would not be possible to administer an inactive treatment; the subjects would know they were not being treated.

It has been shown that some medications work differently in men than in women, so the researchers in the nicotine patch study might have grouped all the women together and all the men together before dividing each sex into treatment groups. We say that such a study has a block design. A block is a group of individuals that share one or more characteristics. Using a block design helps to eliminate the variations caused by the differences between the blocks, in this case the differences due to sex. The picture below shows a block design for the nicotine patch experiment.

Figure 2.2: Block Design for the Nicotine Patch Experiment

Human subjects can be assigned to blocks based on traits such as age, ethnicity, educational level or a particular health condition. Block designs are also used in experiments not involving human subjects. In agricultural studies, for example, test plots can be selected based on characteristics like location, soil fertility or precipitation. Two or more treatments are then applied within each test plot, so that differences in outcomes can be attributed to the treatments rather than differences between plots.
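A block design changes only where the randomization happens: subjects are first sorted into blocks, and random assignment is then carried out separately inside each block. The Python sketch below illustrates this for the hypothetical blocked version of the nicotine patch experiment (the actual study was not blocked). The function name `block_randomize` and the subject labels are assumptions; the block sizes, 82 men and 207 women, come from the study description.

```python
import random

def block_randomize(blocks, treatments, seed=None):
    """Randomly assign treatments separately within each block."""
    rng = random.Random(seed)
    k = len(treatments)
    assignment = {}
    for block_name, members in blocks.items():
        shuffled = list(members)
        rng.shuffle(shuffled)            # randomize within this block only
        for i, treatment in enumerate(treatments):
            assignment[(block_name, treatment)] = shuffled[i::k]
    return assignment

blocks = {
    "men": [f"M{i}" for i in range(1, 83)],      # 82 men (labels are hypothetical)
    "women": [f"W{i}" for i in range(1, 208)],   # 207 women
}
design = block_randomize(blocks, ["nicotine patch", "placebo patch"], seed=7)
for (block, treatment), members in design.items():
    print(block, treatment, len(members))
```

Each sex is split almost evenly between the two patches, so a difference between men and women cannot masquerade as a difference between treatments.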

As seen in these examples, a well-designed experiment has three critical properties:

  1. Control of extraneous influences on the response variable by comparing two or more treatments;
  2. Randomization of individuals to treatment groups;
  3. Replication of treatment to a number of individuals.

Experiments that lack these properties have conclusions that are open to question.

2.2.2 Matched-Pair Experiments

In order to control for variables not studied, researchers, particularly in the social sciences and education, use matched-pair designs in their experiments. Two individuals, matched according to important characteristics, are paired, with one individual from each pair randomly assigned to the first treatment, and the other to the second treatment. Typically, one of the groups receives the control treatment (a placebo or no treatment), while the second receives the experimental treatment, although two experimental treatments may be administered in some circumstances.
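The random step in a matched-pair design amounts to a coin flip within each pair. The following Python sketch shows one way to carry it out; the function name `assign_matched_pairs` and the subject names are hypothetical, and the matching itself is assumed to have been done already.

```python
import random

def assign_matched_pairs(pairs, seed=None):
    """For each matched pair, randomly choose which member gets which treatment."""
    rng = random.Random(seed)
    assignment = []
    for first, second in pairs:
        if rng.random() < 0.5:           # the coin flip: swap roles half the time
            first, second = second, first
        assignment.append({"control": first, "experimental": second})
    return assignment

# Hypothetical pairs, already matched on important characteristics.
pairs = [("Ana", "Bea"), ("Carl", "Dev"), ("Eli", "Finn")]
for row in assign_matched_pairs(pairs, seed=1):
    print(row)
```

Every pair contributes exactly one individual to each treatment, so the two treatment groups are balanced on whatever characteristics were used for matching.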

A number of years ago, researchers investigated the question “Does Transcendental Meditation affect grades?” by using sex, college, year, grades, and first letter of last name to match 70 students who took Transcendental Meditation training with students who had not had the training. They compared GPAs one and two quarters after the training, and found no significant difference between those who had the training and those who did not. This is a classic example of a matched-pair design, although in this case the study is an observational one, not an experiment.

Perhaps no individuals appeal more to researchers, as subjects for an observational study or an experiment, than identical twins. Identical twins, having come originally from a single fertilized egg, are genetically the same, and form a natural matched pair. A BBC News report detailed an experiment in which five-year-old identical twin boys were given different diets for a two-week period. One twin continued to eat his normal diet, while the second consumed only food with no additives. The children were given IQ tests before and after the experiment. While the scores beforehand were identical, after the experiment, the child who had no additives outscored his brother. While this experiment lacked replication (and perhaps randomization as well), it demonstrates the use of matched-pair design as applied to twins: one twin served as the control subject and the other as the experimental subject.

The Vietnam Era Twin Registry is composed of approximately 7,000 male-male twin pairs in which both twins served in the military during the time of the Vietnam conflict (1964-1975). A number of studies have been conducted using these pairs of individuals, investigating health issues such as substance abuse, cardiovascular disease, and cognitive function. An ongoing study at Tufts University compares brain function in identical twin pairs in which only one twin was exposed to combat during the Vietnam War. The design of the study involved pairs where the combat-exposed twin experienced post-traumatic stress disorder and control pairs where the combat-exposed twin was healthy.

The final way in which to conduct a matched-pair study is to use two measurements on the same individual as the pair. A researcher might pair a measurement before a treatment with one after treatment, or measure the same characteristic by two different methods. Recording a student’s scores on the SAT before and after taking a prep course would be an example of the first type of study, while an individual’s scores from two different IQ tests would be an example of the second.
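When the pair consists of two measurements on the same individual, the analysis works with the difference within each pair rather than with the two sets of scores separately. The Python sketch below illustrates the idea for a before-and-after study. The scores are invented purely for illustration and do not come from any real prep course; `paired_differences` is a hypothetical helper name.

```python
from statistics import mean

def paired_differences(before, after):
    """Return after-minus-before for each individual (same order in both lists)."""
    if len(before) != len(after):
        raise ValueError("each individual needs both measurements")
    return [a - b for b, a in zip(before, after)]

# HYPOTHETICAL scores for four students, for illustration only.
before = [1010, 1150, 980, 1230]
after = [1040, 1140, 1030, 1260]

diffs = paired_differences(before, after)
print("differences:", diffs)
print("mean difference:", mean(diffs))
```

Each student serves as his or her own control, so differences between students, such as prior preparation, drop out of the comparison.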

Question 2.13

The Test of English for International Communication (TOEIC) is used to assess workplace English speaking and writing skills for nonnative speakers. A study was conducted at a Japanese university to determine the effect of direct test preparation on TOEIC gain score. Two groups of students (English majors and non-majors) were enrolled in one of three classes: TOEIC Preparation, Business English, or General English. The majors and non-majors groups were treated separately because of the differences between them. The results showed that test score gains were statistically significant only for the non-majors’ reading component of the test.

Answer: This study compared three treatments. The students who took the standard courses (Business English and General English) were the control group. The experimental group was the students enrolled in the test preparation course. The experiment was a block design, because two different groups of students (majors and non-majors) were treated separately. This was not a matched-pair design.

Question 2.14

About 80% of humans develop contact dermatitis when exposed to the urushiol in poison ivy. Carbon dioxide (CO2) has been shown to promote growth in other vines, so researchers at Duke University wondered about its effect on poison ivy. They selected 6 unmanaged plots in the Duke Forest. Three received the usual atmospheric CO2 and three received elevated CO2. At the end of five growing seasons, they determined that the poison ivy growth in the elevated CO2 plots had an average increase of 149% over the untreated plants. In addition, the high-CO2 plants produced a more allergenic form of urushiol.

Answer: The control group was the three plots receiving the usual atmospheric CO2. The experimental group was the three plots receiving the elevated CO2. Neither a block design nor a matched-pair design was used.

2.2.3 Ethical Considerations in Studies

Whenever a study involves living subjects, there are questions about the ethics of the design. Are animals being treated humanely, or are human subjects receiving appropriate medical treatment? What level of risk is acceptable in the pursuit of scientific or medical advances?

A medical or health-related study on human subjects is generally referred to as a clinical trial. The federal government has strict guidelines about clinical trials; these are designed to protect the participants in the studies. The website clinicaltrials.gov, a service of the National Institutes of Health, gives a wealth of information about clinical trials in general and about trials in progress in the U.S. and abroad.

One important feature of a clinical trial is informed consent, the process by which the details of the study are explained to a potential participant. These details include the purpose of the study, its length, required procedures, potential benefits, and risks. A person agreeing to participate in the clinical trial signs an informed consent document. The subject is also entitled to further information throughout the study and may withdraw at any time.

But difficulties can arise even from such a beneficial policy. How are improvements made in emergency or trauma care when patients are unable to provide informed consent? In 1996, the U.S. Food and Drug Administration adopted 21 CFR §50.24 (revised in 2006), which regulates exceptions from informed consent for emergency research when neither the subject, a family member, nor a legal representative can provide consent. This regulation requires that the patient’s condition be life-threatening, that available treatments be unproven or unsatisfactory, that evidence support the potential benefit to the patient, and that the risks be reasonable with regard to the patient’s condition and both standard and experimental treatments. A further requirement is that the public be notified that such a trial will be taking place in their community, and that they be given a specific procedure for opting out of the trial in advance of any emergency situation. A press release from the University of Michigan Health System explained the federal regulation and described two studies proposed by UMHS researchers.

Difficulties may also arise when investigators compare an experimental treatment to a placebo, rather than to a standard treatment. Researchers at the University of Chicago examined all clinical asthma trials involving children conducted between 1998 and 2001. They found that 45 of the 70 studies placed children in a placebo group in which they did not receive the established treatment (anti-inflammatory medication). The researchers concluded that because the children were exposed to unnecessary risk, such trials were unethical. In cases in which an accepted treatment exists, this standard treatment should be used as a control rather than a placebo.

Sometimes as a clinical trial proceeds, researchers find evidence that the experimental treatment is causing serious adverse effects, and terminate the study early. In 2002 the National Institutes of Health stopped a study involving 16,000 women taking hormone replacement therapy. The clinical trial was scheduled to continue three more years, but scientists discovered increased risk of breast cancer, strokes, and heart attacks from long-term use of the therapy.

Question 2.15

What should be the role of placebos in ordinary health care?

Read “Internists say they prescribe placebos on occasion,” and then answer the following question:

Answer: A majority of internists believed that placebos can have therapeutic benefits for patients.

Question 2.16

Read “Experts Question Placebo Pill for Children,” and then answer the following questions:

Answer: It is designed to trick children into thinking they are taking something.

Both experiments and observational studies provide us with data that we can use to investigate questions about the world around us. Each type of study has its advantages and disadvantages. We have noted that experiments allow us to draw conclusions about cause-and-effect relationships, while observational studies yield only conclusions concerning the association between two variables. This would suggest that experiments are preferable, but they are not always possible due to ethical or practical considerations. Whatever the study, the results we obtain are always determined to some degree by chance.