Activity 1: Anthropometrics (Prelab)
Activity 2: Working with measurement data from human and stickleback populations
Activity 3: Working with continuous and categorical data from Isopod Behavior experiments
For a long time, marine resources have been taken for granted and exploited without regard to the consequences of over-fishing and habitat destruction. In the last 25 years, there have been precipitous changes in our marine ecosystems and these changes are reflected in the abundance of our once vital fisheries. How do we distinguish between the changes in animal populations that humans can control vs. changes that are part of “natural cycles” that are not under our control?
Scientists are asked to investigate a huge range of biological issues including ecosystem diversity, health and disease, food production and they seek to develop innovative products and services to address these issues. What are some basic approaches to scientific investigation? How are experiments designed? How are the data produced by these investigations used to draw cause and effect conclusions as well as solutions to our problems?
Science in Action
Conover and Munch, two professors from Stony Brook’s School of Marine and Atmospheric Sciences (SoMAS), have recently contributed answers to these important questions. It is well known that over- fished species have a reduced number of older and larger individuals. It was thought that if only larger fish were taken and smaller individuals and juveniles are spared, that the population would remain relatively undisturbed or rebound easily. Therefore, local, state, and federal governments imposed fishing regulations which restricted the minimum size of many game fish. To test this assumption, Conover and Munch (2002) compared three populations of fish in which 1) only large individuals were removed, 2) only small individuals were removed, and 3) individuals were removed at random.
After four generations of this type of selection pressure, an interesting trend emerged—“the mean weight of the harvested individuals from the small-harvested lines was larger than that of the large-harvested lines.” In the small-harvested populations, the individuals who grew fastest as juveniles were more likely to survive whereas in the large-harvested group, individuals that grew most slowly were more likely to survive (Figure 2). Hence, by Generation 4, the mean individual weight at spawning was 1.05g for the large-selected line, and 6.47g for the small-selected line. Does this mean our policies should require us to take only small fish? What happens if we deplete the juveniles who have not yet reproduced?
The authors suggest that effective management plans must consider evolutionary consequences of fishing regulations to preserve natural genetic variation and healthy ecosystem dynamics. “No take zones” and maximum size restrictions (all captured fish must be below a certain size) are two forms of management that might preserve genetic variation in fish so that over-fished populations have the potential to rebound and recover.
In today’s lab, you will record data from simple measurements of human and fish anatomy. Although this may seem like a trivial exercise, the ability to take measurements that are accurate (as close to the true value as possible) and precise (reproducibility of the measurement) is not as simple as it appears. If ten students measure the length of the same student’s ear lobe even if they use the same instrument (a Vernier caliper), there will be variability in the values due to human measuring error. Thus, if you were measuring two variables to determine if there are significant differences between them, your measurements will include a “real” difference component and an “error” component. Statistics provide standard measures of variability and measurement characteristics to help us identify significant differences.
When you think of human body parts, you may think of the Vitruvian Man—the rendition by Leonardo da Vinci of the “proportioned man.” Was Leonardo da Vinci’s drawing meant to illustrate natural relationships between parts of the human body? Or was it a recursive geometrical algorithm meant to approximate the squaring of a circle (a geometric solution to the problem of drawing a circle and a square of equal areas)?
Although the meaning of da Vinci’s Vitruvian man may not interest you, does the field of forensics interest you? How would you determine the height of a suspect from their shoe print? Does shopping for clothes interest you? What information does a clothing manufacturer use to produce properly proportioned clothing for women versus men? Does anthropology interest you? How can you estimate the height of a Neanderthal from the width of their vertebra?
What do all of these questions have in common? Anthropometry: the study of human proportions or body dimensions.
Perhaps you will become a nurse one day What if you need to accurately measure the height of a patient confined to a wheelchair, and they are unable to assume an upright position, even with help. What would you do? Take another look at da Vinci’s drawing. Does it hint at a solution to your problem? Are the outstretched arms of the Vitruvian man the same length as his height? Is this true for you? What about other body parts? Is your hand size proportional to your height? What about the size of your ear? In today’s lab you will collect data to test such hypotheses.
Scientists who study form and shape of living organisms work in a research area known as morphometrics. In morphometrics, geometric equations are used to characterize features and patterns in organism body plans including those that may be dimorphic (i.e., sex-dependent) or ecomorphic (i.e., environment-dependent). Scientists take reproducible measurements of specimens and can use them for their descriptive value and possibly use their analyses to develop phylogenetic hypotheses about the relationships among the specimens. This information can take on further significance if the measured traits are known to have changed over time and the traits are linked to specific genes. Thus morphometrics can be used to link anatomical changes to changes in gene expression.
The three-spined stickleback fish (Gasterosteus aculeatus) is one of many organisms well studied using morphometrics. Three-spined stickleback is a highly variable species that evolves rapidly and exists in a wide range of habitats. It is an extensively researched animal model system and its study allows scientists to make connections between genetics, ecology, evolution, and behavior. These fish can be monitored for evolutionary changes over decades...yes, decades. For example, sticklebacks have a pelvis supported by a pelvic girdle with two pelvic fins attached. There are marine, freshwater, and anadromous populations. In some populations, this pelvic girdle and the fins are reduced or at least much smaller than other populations of sticklebacks. The gene responsible for this reduction is called Pitx1. Scientists have shown that changes in expression of Pitx1 in mice leads to morphological asymmetry—the hind limb on the right side of affected mice are slightly more reduced than the hind limb on their left side (Shapiro et al. 2004). A similar pattern of Pitx1 expression has been found in sticklebacks, the left pelvic spine length was longer than the right.
Three-spined sticklebacks may also have armored plates on their sides. The degree to which armor is present depends on the presence of particular communities of predators in their environment. Stickleback populations that live in lakes where fish and bird predators are prevalent tend to contain a full set (complete morph) of these lateral plates, from 32–36, that stretch from the head to the tail of the fish. In other areas where there are either no predators or insect predators, the plate morph can be medium to low (<9) in number. The variability in plate morphology is dependent on a gene known as Ectodysplasin or EDA (Colosimo et al. 2005).
The stickleback specimens that you will use in today’s lab were collected from Walby Lake and Morvro Lake in Alaska in June 2009 by Dr. Michael Bell of Stony Brook University and former graduate student Dr. Peter Park. The stickleback fish that inhabit these lakes are known as bottom (benthic) feeders. With the exception of Morvro lake populations, in which the lateral plate number is polymorphic (varies considerably within the population), other lakes contain three-spined sticklebacks with low plate numbers. In today’s lab you will use stickleback measurement data to test possible relationships between fish body parts. We will test whether our stickleback population exhibits pelvic spine asymmetry. While you are making your measurements remember that modern scientists can now use a combination of morphometric and molecular genetic information to further our understanding of vertebrate skeletal development and their highly conserved associated genes. Lessons learned from our distant relatives, the fish, are amazingly relevant to mice and other mammals such as humans.
Using statistics to characterize groups of organisms
Scientists are often interested in the degree of correspondence or differences in measured characteristics among groups of organisms. Descriptive statistics such as means and standard deviations are used to characterize a feature of a population that is normally distributed. For example, the entire population of Stony Brook students have an average height called the mean height of the population. If we measure the heights of 25 Stony Brook students, this would be a sample of student heights from this population. But what if the heights were only taken on members of the basketball team? Would that sample be representative of the true population mean? In this case the average heights of Stony Brook students would probably be overestimated because, in general varsity basketball players are quite tall. So if your goal is to determine the mean heights of SBU students, you would want to randomly sample a sufficiently large sample size of the population, so that the mean of your sample is not biased.
However, differences in sample means can be very useful in detecting real differences between groups when there is an underlying cause and effect resulting in measurable differences between groups. For example, mean body length of a species of fish can be used to compare different groups treated with different diets but the experimental design used to compare mean differences in diets is very important.
If one treatment group is composed of larger individuals while another group is composed of smaller individuals before being given the experimental diets, the two groups are not really comparable.
Explain the statement above and discuss what you would do to design an experiment that was unbiased.
Bell MA and Foster SA, editors. 1994.The Evolutionary Biology of the Threespine Stickleback. New York: Oxford University Press. 571p.
Colosimo PF, Hosemann KE, Balabhadra S, Guadalupe VJ, Dickson M, Grimwood J, Schmutz J, Myers RM, Schluter D, and Kingsley DM. 2005.Widespread parallel evolution in sticklebacks by repeated fixation of Ectodysplasin alleles. Science 307:1928–1933.
Conover DO and Munch SB. 2002. Sustaining fisheries yields over evolutionary time scales. Science 297: 94 – 96.
Harris M, Taylor G, Taylor J. 2008. Math and Statistics for the Life Sciences. New York: W. H. Freeman.
Knisely K. 2008. A Student Handbook for Writing in Biology, SBU edition. New York:
W. H. Freeman. 221 p.
Reimchen TE and Nosil P. 2001. Lateral plate asymmetry, diet and parasitism in threespine stickleback. Journal of Evolutionary Biology 14:632–645.
Shapiro MD, Marks ME, Peichel CL, Blackman BK, Nereng KS, Jo ́nsson B, Schluter D and Kingsley DM. 2004. Genetic and developmental basis of evolutionary pelvic reduction in threespine sticklebacks. Nature 428: 717–723.
Complete the BioPortal Quiz, which is designed to gauge your understanding of the prerequisites for this course, as well as your knowledge of the required content. It is your responsibility to review this material, if necessary, then watch the vodcast and read this lab. Place all notes in your lab notebook, which can be used during the Pre-Lab Quiz.
Purpose
In this activity you will focus on human body part data collection by proposing a question or possible relationship you want to test. Your group will collect data from a sample size of at least 10 adult individuals (including your group members) on 3 (or more) body dimensions, one of which must be left hand area. You can collect data on any of the listed body dimensions, or dimensions of your choosing if approved by your instructor. We recommended that you collect demographics (sex, age, etc.) for each subject. If you plan to measure parts like fingers, ears, and nose, then Vernier calipers will be most appropriate. You will apply statistics to describe your sample data and use graphical analysis to test for possible correlations between body measurements.
Dimensions (many of these are common anthropometric measurements)
Materials
ImageJ software
Measuring Tape
Ruler
Vernier Caliper
Scanner
Note: If you have difficulty tracing your hand, copy your hand, and then trace the outline of the copy of the hand with a permanent marker that bleeds through to the other side of the paper. Trace the marks on the opposite side of the paper to produce your hand for scanning. Remember to place the two scales (inches and centimeters) on the paper.
Table 1 Human Proportions Data
Thus far you have worked with human measurement data. In this activity, we will introduce you to three-spined stickleback data collection and then you will proceed to analyze both human and fish data sets and learn when and how to apply different statistical tests.
Purpose
In this activity, each group will analyze sample data from two stickleback populations, one from Walby Lake and one from Morvro Lake. The stickleback measurements (continuous data) include body dimensions of standard length (SL), right and left pelvic spine length (rPLV and lPLV), eye diameter (ED), and caudal peduncle (CP). The number of lateral plates (LPN) for each fish will serve as a categorical data trait. You will use statistics to describe the data, to test for possible relationships between body dimensions, and to test a hypothesis about the asymmetry of right and left pelvic spines. So that you are familiar with how the data you analyze was collected, your group will collect stickleback measurements on one fish from each of the two lakes.
Procedure
Safety: Wear gloves at all times when handling the fish. The fish were fixed using a toxic preservative, rinsed free of preservative, and then stored in isopropyl alcohol. Although there should be undetectable amounts of preservative present on the fish, please exercise proper caution by always handling the fish with gloves. Wear goggles to protect your eyes from splashing alcohol.
How difficult was it to measure the stickleback body dimensions? Do you think that you could design a method to minimize measurement error if you were doing a morphometric-based study on stickleback? Would you have more than one individual collect the data?
Table 2 Measurements of body parts of three-spined stickleback from __________ Lake
Describe your Data
Test for possible relationships between your variables
Remember: a high correlation suggests a relationship between two variables, but does not determine the cause and effect relationship. For this, you would have to know a great deal more about the nature of these relationships.
Purpose
Scientists very commonly compare means (averages) of independent data such as before and after, control and treatment, or changes over time of the same sample for data such as growth. When sample sizes are small, data means can be compared using a common kind of statistical test called a t-test. There are many times when we must make comparisons between means of data in which there is dependence between them—this comparison is called a paired t-test. An example of paired measurements are “before and after exercise” heart rate measurements on the same individual. However, the simplest t-test called an unpaired t-test is used to compare the means of independent samples. An example of independent samples would be average heart rates for two different individuals taken over the same period of time.
We will apply an unpaired t-test to study left-right asymmetry in stickleback pelvic spines and human feet. We are doing the fish fin–human foot comparison because we know that asymmetry in sticklebacks correlate with Pitx1 gene expression and that this gene-specific asymmetry also exists in mouse hindlimbs. This leads to the question: Do asymmetries naturally exist in human populations? We are able to speculate that there are differences between the right and left foot in humans since these body parts are homologous to mouse hindlimbs.
We will use an unpaired t-test to test the hypothesis that there are significant differences in the means for left minus right anatomical asymmetry in human populations and stickleback populations. If we wanted to compare left versus right spine directly or left versus right foot directly, we would not use a t-test because pelvic spine length is dependent on the fish size (SL) just as foot size is dependent on human height. We must compensate for variability that is due solely to this size dependence otherwise our analysis is meaningless—the data may show differences when there really isn’t any, only differences due to organism size. We could do a paired comparison (e.g., rPLV vs. lPLV) using analysis of variance (ANOVA) to gauge whether there might be a significant difference, but this would not take into account the dependence of PLV on SL or dependence of foot length on height. If we include SL or height, we need to do an analysis of covariance (ANCOVA). Because both of these statistics are somewhat complicated for an introductory student, we will look at differences between populations for which unpaired t-tests can be applied.
Refer to the Math Stats Catchup guide for information about this test and the Knisely Appendix for steps on how to use Excel to do the t-test calculations. Make sure you understand the rationale for comparing t-values and determining the critical t-value based on particular significance levels as provided in the Math and Stats Catchup guide.
Scientific research involves first making observations, then developing hypotheses (“educated guesses” about phenomena of interest) based on these observations, and finally collecting and analyzing data to test the hypotheses. Good examples of this process often inspire a new set of hypotheses and the cycle continues to develop our understanding of the natural world. One valuable and widely used approach to scientific research (but by no means the only approach) is the use of manipulative experiments. In a manipulative experiment, the researcher varies a specific factor or condition to determine how it affects the phenomenon of interest. This contrasts with the approach of correlation studies, in which the researcher does not control specific factors or conditions, but instead looks at relations between variables as they are found in nature.
In today’s isopod exercise, you will perform a manipulative experiment, monitoring the effects of moisture levels and types on the behavior of this small crustacean. You will test whether the isopods move toward wet (treatment) or dry (control) areas.
Learning Objectives
After successful completion of this activity, you should be able to:
Materials
Isopod arena (8” culture dish with sandpaper bottom)
85W flood light on ring stand
4 sponges
Isopods
RO water
Transfer pipettes
Light meter (or use light meter app)
Vernier caliper
Permanent Marker
Ruler and Vernier calipers
Thermometer, pH meter, and balance
Wear goggles! It is important that you protect your eyes from the potential broken glass. If the flood light is on, wear your goggles.
Purpose
Background
The isopod behavior experiment can be simplified to one comparison: Are the isopods randomly assorting into categories OR are the isopods distinguishing between the microhabitats and selecting categories based on preference? We will use statistics to determine whether isopods are evenly distributed with respect to the different sites.
Analysis is a Guessing Game without the use of Statistics:
We based our expected counts on the null hypothesis, that the treatment had no effect on isopod behavior thus, isopods should be distributed evenly with respect to the different microhabitats. For example, if there were 20 animals in our arena and 2 possible sites where they could hide, we would expect to find about 10 animals in each site if they showed no preference for the different habitats. However, if the animals display a preference for one site over another, we might observe an uneven distribution of isopods with respect to habitat. But how do we know if this distribution is meaningful and not due to random chance?
If all 20 isopods in the example above were found in one site and 0 were found in the second site, the isopods would be unevenly distributed, and we would conclude that they were displaying a habitat preference. Similarly, if 10 were found in one site and 10 at the second site, the isopods clearly would be evenly distributed, and we would conclude that they had no preference for the two microhabitats offered in the experiment. But what if we found 9 isopods at one site and 11 at another? Would we conclude this to be an equal or an unequal distribution with respect to microhabitat? What about 5 and 15? Do the results differ from the expected results simply due to chance, or due to the treatment effect that we were testing (e.g., wet vs. dry)? Statistics can help us with this question. But what statistical test should we use to analyze our data? Follow this series of questions to determine the right statistical test to select based on our data and experiment.
Before we use any statistical test to analyze our data, we should always ask the following very important question: What are the assumptions of our statistical test?
Assumptions for Chi-Square:
Chi-Square Example (also refer to Appendix C and MathStats CatchUp Guide chapter 40): Given the roughly 1:1 ratio of males and females in the human population, we might expect the number of males and females on a bus to be equal at any given time. Suppose we census a bus load of people and find the following:
There seem to be many more females than males on the bus. But, is this deviation from a 1:1 ratio in the number of males to females due to chance or some other cause? To find out, we perform a chi-square test.
Calculating χ2: The general formula for calculating the χ2 statistic is as follows:
χ2 = Σ (observed - expected)2 /expected
For each possible class (in our example there are 2: males or females), you first calculate the difference between the observed and expected values and square this difference. You then divide this squared difference by the expected value, and then sum the values for all possible classes. The larger the difference between the observed and expected values, the larger the χ2 value. In our example:
χ2 = Σ(obs - exp)2 /exp = (30 - 41)2/41 + (52 - 41)2/41 = 2.95 + 2.95 = 5.90
Calculating Probability: The calculated χ2 statistic for these data is 5.90. To determine the probability level (P value) for this test statistic, we first determine the degrees of freedom (d.f.) for the test, then look up the Χ2 value in the probability table (Table 7).
Degrees of Freedom (d.f.): The degrees of freedom are determined by the total number of classes minus 1. Because there are 2 classes in our example (males and females) the degree of freedom is 1 (# of classes - 1 = 2 - 1 = 1 d.f.).
P Values: To determine the probability associated with a given χ2 statistic, look in the row of the table for the appropriate degrees of freedom (in our example d.f.=1, so you look in the first row) and look for your χ2 statistic. You will not find your exact χ2 value, but you can locate which columns, and therefore P values, match your results. If the P value is greater than 5% (to the left of the P=0.05 column), you would conclude that the difference between the observed and expected results is due to chance. If the P value is less than 5% (to the right of the P=0.05 column), then you conclude that the observed distribution is most likely not caused by chance, but rather is caused by some other effect.
Interpreting statistical results: For our bus-rider sample, (χ2=5.90), the number of classes is 2, so d.f.=1 and we look in the first row of the table where 5.90 is found between P=0.05 and P=0.01 (fifth and sixth columns). Thus, the probability that the difference between the observed and expected results is due to chance is very low, between 5% and 1% (0.01 < P <0.05). Because P < 0.05, we can conclude that the greater number of females compared to males on the bus is not due to chance alone, but due to some other effect. In this example, we were not testing a specific treatment effect, so we don’t know the exact cause of the deviation from a 1:1 ratio. However, there may be many possible causes – for example, a Girl Scout Troop might have jumped on the bus just before we took the census.
You will have an opportunity to repeat this study next lab testing your own hypothesis. You should plan to bring in your own supplies if you want to make changes to the isopod arena design (i.e. colored cellophane, filter paper, leaves, bark, etc.). You are welcome to bring in reagents that are innocuous to humans and isopods. It is recommended that you refer to the literature for background on isopods. If you have time at the end of this lab, discuss some questions with your group about isopod behavior that you would like to answer. Be creative with your experimental design.
Table 7 χ2 Test Probability Table
These questions were taken from previous exams and are meant to represent a sample, not a complete study guide. The questions in these examples are designed to test your understanding of the concepts and skills presented in this lab, and your ability to apply what you have learned to novel problems.
Computation: The table below contains categorical data of a sample of women with and without breast cancer compared with the number of servings of fish that they consume per week. What percent of women surveyed had breast cancer?WHLP4lt+nrx3wi3k How many categories would you have if you wanted to do chi square analysis on fish consumption?h4XZagboIgc=