Biology 483G Assignments

Homework/Exam Data Sets (click to download)

transfrm.syd, transform.xls - Assorted data for normality testing and (if necessary) transformation
Second Exam (Due by

Complete all questions. Limit your answers to 2 single-spaced pages per question (i.e. 1A). Construct your answers in manuscript format, as Materials and Methods and/or Results section(s). For those questions requiring statistical analysis, support your conclusions with appropriate statistics incorporated in the text or summarized in Table or Figure form. DO NOT
include raw SYSTAT/

1. Prior to the 1980s, most states required couples filing for divorce to identify specific reasons for terminating the marriage contract. Each state maintained a list of acceptable grounds for divorce. In some cases these were quite obscure or bizarre. We might expect there to be related sets of causes that co-occur in groups of states; that is, if a state accepted desertion as grounds for divorce, it might also tend to accept lack of support. In addition, we might imagine (or hypothesize) that the set of acceptable grounds for divorce shows some sort of regional pattern, reflecting the unique cultural or demographic characteristics of an area. The data set divorce.xls identifies the acceptable grounds for divorce in each state in 1971. In this data set, ‘1’ indicates that the state recognized a particular cause as acceptable grounds for divorce, while ‘0’ indicates that it did not. Use these data to address the following issues in a statistically appropriate manner:

a. Is there any evidence to suggest that there are related sets of grounds for divorce that show patterns of co-occurrence among states? That is, do certain types of grounds for divorce tend to show the same distribution among states, and is there any consistent relationship between them (e.g., all are moral causes vs. financial causes)? What is the significance and/or strength of your conclusion? Justify your conclusions with appropriate numerical or graphical results. (25 points)

b. Is there any evidence to suggest that there are regional or demographic patterns in acceptable grounds for divorce? That is, are there sets of states that have similar patterns of acceptable causes, and is there any consistent relationship between them (e.g., southern vs. northern, urban vs. rural)? What is the significance and/or strength of your conclusion? Justify your conclusions with appropriate numerical or graphical results. (25 points)

2. While it is often a financial necessity,
parents and child development experts have long debated the advantages and disadvantages of placing children in a day care setting. Some might argue that the socialization aspect encourages development of interpersonal skills in children, while others might suggest that those same interactions foster aggression and/or out-of-control behavior. We could potentially address this question by comparing the behavior of children exposed to different childcare settings along certain developmental vectors. The data set daycare.xls provides such a study. Here, a set of children was characterized on the basis of 4 variables: (1) whether they were cared for by parents, a private sitter, or in a day care setting; (2) the behavioral skills they expressed at the dinner table; (3) their behavioral skills upon encountering a stranger; and (4) their social problem-solving skills as measured through a cognitive test. Use these data to address the following questions in a statistically appropriate manner:

a. What are the patterns of variation among the behavioral variables across all groups? That is, are there associations (positive or negative) between behavioral skills? How well do these associations account for the variation among children within the data set? Support your conclusions with appropriate numerical and/or graphical results.

b. Is there any evidence that the childcare setting affects children’s behavioral skills? That is, are there significant differences in behavioral skill characteristics among groups of children? Describe the nature of the patterns of variation as fully as possible. How consistent or strong are these differences? Support your conclusions with appropriate numerical and/or graphical results. (25 points)
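As a starting point for question 2a, pairwise associations among the three behavioral variables can be summarized with Pearson correlations. The sketch below is only an illustration in Python (the course itself works in SYSTAT); the variable names `dinner`, `stranger`, and `problem` and all of the scores are hypothetical and are not taken from daycare.xls.

```python
import math

def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical scores for six children (assumed values, NOT from daycare.xls).
scores = {
    "dinner":   [3, 5, 4, 6, 8, 7],
    "stranger": [2, 4, 5, 5, 7, 8],
    "problem":  [9, 7, 8, 5, 4, 3],
}

# Print the upper triangle of the correlation matrix.
names = list(scores)
for i, a in enumerate(names):
    for b in names[i + 1:]:
        print(f"r({a}, {b}) = {pearson_r(scores[a], scores[b]):+.2f}")
```

A correlation matrix like this only describes pairwise associations; how much of the variation among children those associations account for is a separate question that calls for a multivariate summary.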
First Exam (Due by

Complete all questions. Limit your answers to 2 single-spaced pages per question (i.e. 1A). Construct your answers in manuscript format, as Materials and Methods and/or Results section(s). For those questions requiring statistical analysis, support your conclusions with appropriate statistics incorporated in the text or summarized in Table or Figure form. DO NOT include raw SYSTAT/
a. Is there a relationship between age and/or smoking status and predicted residual lung volume? What is the nature and strength of that relationship? Describe the patterns of variation as fully as possible. Ignore the potential effect of sex. Justify your conclusions with appropriate numerical results. (25 points)

b. Is there a relationship between smoking status and/or sex and predicted residual lung volume? What is the nature and strength of that relationship? Describe the patterns of variation as fully as possible. Ignore the potential effect of age. Justify your conclusions with appropriate numerical results. (25 points)

2. As a borderline (my wife might argue over this choice of adjective) obsessive-compulsive, I can relate to this question. Imagine that you administer the Multidimensional Perfectionism Scale (

a. Is there any evidence that a combination of simple variables (WASHING, CHECKING,

b. What are the biological relationships among the variables (WASHING, CHECKING,
Homework #4

The data set cluster.syd provided above represents data on the presence or absence of certain artifacts in graves from a cemetery in northern Thailand. There are 38 different types of artifacts, and the bodies in the graves are classified as either adult males (1), adult females (2), or children (3). Carry out a cluster analysis to examine the relationship among the 47 burials. Is there any evidence to suggest that the type of body in the grave is related to the nature of artifacts associated with that grave? How strong is that evidence and why? Provide BRIEF and annotated SYSTAT output to support your conclusions, as well as a summary of your results and conclusions in paragraph form suitable as a manuscript Results and Discussion section.
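Clustering presence/absence data starts from a distance between burials. The following is a minimal sketch, in Python rather than SYSTAT, of one possible approach: Jaccard distance followed by single-linkage agglomeration. Both choices are assumptions made for illustration, not a prescribed method, and the four graves and five artifact columns are invented rather than taken from cluster.syd.

```python
def jaccard_distance(a, b):
    """1 - (shared presences / presences in either); joint absences are ignored."""
    both = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    return 0.0 if either == 0 else 1.0 - both / either

def single_linkage(items):
    """Agglomerative clustering; returns merges as (cluster_a, cluster_b, distance)."""
    clusters = [(name,) for name in items]
    merges = []
    while len(clusters) > 1:
        best = None
        # Find the pair of clusters whose closest members are nearest.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(jaccard_distance(items[p], items[q])
                        for p in clusters[i] for q in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return merges

# Hypothetical burials (NOT from cluster.syd): 1 = artifact present, 0 = absent.
burials = {
    "grave_A": [1, 1, 0, 0, 1],
    "grave_B": [1, 1, 0, 0, 0],
    "grave_C": [0, 0, 1, 1, 0],
    "grave_D": [0, 0, 1, 1, 1],
}

for left, right, dist in single_linkage(burials):
    print(left, "+", right, f"at distance {dist:.2f}")
```

Jaccard distance ignores artifacts that are absent from both graves, which may or may not be appropriate here; a simple matching coefficient would count shared absences as similarity. Different distance and linkage choices can produce different trees, so that choice is worth justifying in the write-up.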
Homework #3

The file 'pcdfa.syd' contains data used in an attempt to differentiate populations of the endangered cyprinid fish Gila cypha. 148 individuals from 6 isolated populations in the upper Colorado River basin were sampled for 56 morphological characteristics; size differences among individuals have been factored out, so the information remaining is thought to reflect patterns of shape variation within and among populations. The populations are as follows: 1 - Black Rocks; 2 - Cataract Canyon; 3 - Desolation Canyon; 4 - Grand Canyon; 5 - Westwater Canyon; 6 - Yampa River.

Carry out both a principal components and discriminant function analysis on these data, and from each analysis consider the following question: is there any evidence to suggest that isolated populations are morphologically distinct? What accounts for the very different picture that emerges from the two analyses and why?
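When comparing the two analyses, it can help to keep separate how much a variable varies overall and how well it separates groups. The sketch below is a univariate analogue only, written in Python with invented numbers (not from pcdfa.syd): it computes total variance with group labels ignored, and a one-way ANOVA F ratio that uses the labels. The real analyses operate on covariance matrices of all 56 characters, so this is an intuition aid under those simplifying assumptions, not a substitute.

```python
def mean(xs):
    return sum(xs) / len(xs)

def total_variance(groups):
    """Sample variance of all values pooled, ignoring group labels."""
    pooled = [v for g in groups.values() for v in g]
    m = mean(pooled)
    return sum((v - m) ** 2 for v in pooled) / (len(pooled) - 1)

def f_ratio(groups):
    """One-way ANOVA F: among-group mean square / within-group mean square."""
    pooled = [v for g in groups.values() for v in g]
    grand = mean(pooled)
    k, n = len(groups), len(pooled)
    ss_among = sum(len(g) * (mean(g) - grand) ** 2 for g in groups.values())
    ss_within = sum((v - mean(g)) ** 2 for g in groups.values() for v in g)
    return (ss_among / (k - 1)) / (ss_within / (n - k))

# Hypothetical traits for two populations (assumed values, NOT from pcdfa.syd).
# trait_x varies widely, but the populations overlap almost completely.
trait_x = {"pop1": [2.0, 6.0, 10.0, 14.0], "pop2": [3.0, 7.0, 11.0, 15.0]}
# trait_y varies little overall, but the populations do not overlap.
trait_y = {"pop1": [1.0, 1.1, 0.9, 1.0], "pop2": [1.5, 1.6, 1.4, 1.5]}

for name, trait in (("trait_x", trait_x), ("trait_y", trait_y)):
    print(f"{name}: total variance = {total_variance(trait):.3f}, "
          f"F = {f_ratio(trait):.2f}")
```

In this toy case the trait with the larger total variance is the poorer discriminator, and the trait with the smaller total variance separates the groups cleanly.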
Homework #2

1. Concerted evolution is a common phenomenon among repetitive DNA sequences such as rDNA, mtDNA, and immunoglobulin gene families. In concerted evolution, multiple copies are homogenized to some extent (presumably) through unequal crossing-over and gene conversion during DNA replication; the result is that the individual copies of the repeat sequences do not accumulate divergent mutations as quickly as one might expect. Li (1997) has reviewed the data on concerted evolution and has proposed that the rate of sequence divergence may be affected by a number of biological factors, including but not limited to:

a. Functional requirements - the need for a large amount of identical gene product for normal functioning of

Using the 'concevol.syd' data set provided above, examine the dependence of the degree of sequence divergence among repeat copies on these four factors. What is the 'best' model of the relationship between independent and dependent variables and why? Provide BRIEF and annotated SYSTAT output to support your conclusions, as well as a summary of your methods and conclusions in paragraph form suitable as manuscript Methods and Results sections.

2. A medical researcher was investigating levels of a protein whose excess is thought to be related to onset of a particular disease syndrome more prevalent in males than in females. This researcher examined protein concentrations in male and female mice from 3 inbred strains that differ in their susceptibility to the disease; the E strain is most susceptible, while the I strain is least so. These data are given below. Test the hypothesis that high protein concentration is associated with disease onset. Provide BRIEF and annotated SYSTAT output to support your conclusions, as well as a summary of your methods and conclusions in paragraph form suitable as manuscript Methods and Results sections.
Homework #1

Using the file transfrm.syd above, as well as the data provided on fecundity and body size in fish, test each variable for normality, and apply data transformations to restore normality where needed. In the style of a Materials and Methods section, provide a written summary of the statistical approach taken for each variable (approximately 1 paragraph each). Do not describe all the steps you took, only the protocol for getting from the original data to the appropriately transformed data. Given that normality testing represents preliminary (or exploratory) analysis done in advance of the tests of hypotheses central to the study, it is appropriate to provide statistical support for your conclusions within this part of a Materials and Methods section (and by extension, within the context of your answers); however, this would typically not involve use of tables or figures.
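The transformation step can be illustrated with a small sketch: compute a skewness statistic for a variable, apply a candidate transformation, and compare. The example below is in Python rather than SYSTAT, uses a log10 transformation as one assumed candidate among several, and runs on invented right-skewed values that are not taken from transfrm.syd. Skewness is a descriptive check only; a formal normality test would still be needed to support the written summary.

```python
import math

def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5; 0 for a symmetric sample."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Hypothetical right-skewed measurements (assumed values, NOT from transfrm.syd).
raw = [1.2, 1.5, 1.9, 2.4, 3.1, 4.0, 5.6, 8.3, 13.0, 27.5]

# Candidate transformation: log10 (requires strictly positive values).
logged = [math.log10(v) for v in raw]

print(f"skewness, raw data:   {skewness(raw):.2f}")
print(f"skewness, log10 data: {skewness(logged):.2f}")
```

If the transformed variable is still clearly skewed, a different transformation (for example a square root for counts) may be a better candidate; the choice should follow from the data rather than from habit.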