9. Chapter 1
Introduction
“I have been impressed with the urgency of doing. Knowing is not enough; we
must apply. Being willing is not enough; we must do.”
-Leonardo da Vinci
1
10. Balka ISE 1.10 1.1. INTRODUCTION 2
1.1 Introduction
When first encountering the study of statistics, students often have a preconceived—
and incorrect—notion of what the field of statistics is all about. Some people
think that statisticians are able to quote all sorts of unusual statistics, such as
32% of undergraduate students report patterns of harmful drinking behaviour,
or that 55% of undergraduates do not understand what the field of statistics is
all about. But the field of statistics has little to do with quoting obscure percent-
ages or other numerical summaries. In statistics, we often use data to answer
questions like:
• Is a newly developed drug more effective than one currently in use?
• Is there a still a sex effect in salaries? After accounting for other relevant
variables, is there a difference in salaries between men and women? Can
we estimate the size of this effect for different occupations?
• Can post-menopausal women lower their heart attack risk by undergoing
hormone replacement therapy?
To answer these types of questions, we will first need to find or collect appropriate
data. We must be careful in the planning and data collection process, as unfortu-
nately sometimes the data a researcher collects is not appropriate for answering
the questions of interest. Once appropriate data has been collected, we summa-
rize and illustrate it with plots and numerical summaries. Then—ideally—we use
the data in the most effective way possible to answer our question or questions
of interest.
Before we move on to answering these types of questions using statistical in-
ference techniques, we will first explore the basics of descriptive statistics.
1.2 Descriptive Statistics
In descriptive statistics, plots and numerical summaries are used to describe a
data set.
Example 1.1 Consider a data set representing final grades in a large introduc-
tory statistics course. We may wish to illustrate the grades using a histogram or
a boxplot,1 as in Figure 1.1.
1
You may not have encountered boxplots before. Boxplots will be discussed in detail in Sec-
tion 3.4. In their simplest form, they are plots of the five-number summary: minimum, 25th
percentile, median, 75th percentile, and the maximum. Extreme values are plotted in indi-
vidually. They are most useful for comparing two or more distributions.
11. Balka ISE 1.10 1.3. INFERENTIAL STATISTICS 3
20 40 60 80 100
0
40
80
120
Final Grade
Frequency
(a) Histogram of final grades.
25
50
75
100
Final
Grade
(b) Boxplot of final grades.
Figure 1.1: Boxplot and histogram of final grades in an introductory statistics
course.
Plots like these can give an effective visual summary of the data. But we are
also interested in numerical summary statistics, such as the mean, median, and
variance. We will investigate descriptive statistics in greater detail in Chapter 3.
But the main purpose of this text is to introduce statistical inference concepts
and techniques.
1.3 Inferential Statistics
The most interesting statistical techniques involve investigating the relationship
between variables. Let’s look at a few examples of the types of problems we will
be investigating.
Example 1.2 Do traffic police officers in Cairo have higher levels of lead in their
blood than that of other police officers? A study2 investigated this question by
drawing random samples of 126 Cairo traffic officers and 50 officers from the
suburbs. Lead levels in the blood (µg/dL) were measured.
The boxplots in Figure 1.2 illustrate the data. Boxplots are very useful for
comparing the distributions of two or more groups.
The boxplots show a difference in the distributions—it appears as though the
distribution of lead in the blood of Cairo traffic officers is shifted higher than that
of officers in the suburbs. In other words, it appears as though the traffic officers
have a higher mean blood lead level. The summary statistics are illustrated in
Table 1.1.3
2
Kamal, A., Eldamaty, S., and Faris, R. (1991). Blood level of Cairo traffic policemen. Science
of the Total Environment, 105:165–170. The data used in this text is simulated data based on
the summary statistics from that study.
3
The standard deviation is a measure of the variability of the data. We will discuss it in detail
in Section 3.3.3.
12. Balka ISE 1.10 1.3. INFERENTIAL STATISTICS 4
10
20
30
40
Lead
Concentration
Cairo Traffic Suburbs
Figure 1.2: Lead levels in the blood of Egyptian police officers.
Cairo Suburbs
Number of observations 126 50
Mean 29.2 18.2
Standard deviation 7.5 5.8
Table 1.1: Summary statistics for the blood lead level data.
In this scenario there are two main points of interest:
1. Estimating the difference in mean blood lead levels between the two groups.
2. Testing if the observed difference in blood lead levels is statistically signifi-
cant. (A statistically significant difference means it would be very unlikely
to observe a difference of that size, if in reality the groups had the same
true mean, thus giving strong evidence the observed effect is a real one.
We will discuss this notion in much greater detail later.)
Later in this text we will learn about confidence intervals and hypothesis
tests—statistical inference techniques that will allow us to properly address these
points of interest.
Example 1.3 Can self-control be restored during intoxication? Researchers in-
vestigated this in an experiment with 44 male undergraduate student volunteers.4
The males were randomly assigned to one of 4 treatment groups (11 to each
group):
1. An alcohol group, receiving drinks containing a total of 0.62 mg/kg alcohol.
(Group A)
2. A group receiving drinks with the same alcohol content as Group A, but
also containing 4.4 mg/kg of caffeine. (Group AC)
4
Grattan-Miscio, K. and Vogel-Sprott, M. (2005). Alcohol, intentional control, and inappropri-
ate behavior: Regulation by caffeine or an incentive. Experimental and Clinical Psychophar-
macology, 13:48–55.
13. Balka ISE 1.10 1.3. INFERENTIAL STATISTICS 5
3. A group receiving drinks with the same alcohol content as Group A, but
also receiving a monetary reward for success on the task. (Group AR)
4. A group told they would receive alcoholic drinks, but instead given a
placebo (a drink containing a few drops of alcohol on the surface, and
misted to give a strong alcoholic scent). (Group P)
After consuming the drinks and resting for a few minutes, the participants car-
ried out a word stem completion task involving “controlled (effortful) memory
processes”. Figure 1.3 shows the boxplots for the four treatment groups. Higher
scores are indicative of greater self-control.
A AC AR P
-0.4
-0.2
0.0
0.2
0.4
0.6
Score
on
Task
Figure 1.3: Boxplots of word stem completion task scores.
The plot seems to show a systematic difference between the groups in their task
scores. Are these differences statistically significant? How unlikely is it to see
differences of this size, if in reality there isn’t a real effect of the different treat-
ments? Does this data give evidence that self-control can in fact be restored by
the use of caffeine or a monetary reward? In Chapter 14 we will use a statistical
inference technique called ANOVA to help answer these questions.
Example 1.4 A study5 investigated a possible relationship between eggshell
thickness and environmental contaminants in brown pelican eggs. It was sus-
pected that higher levels of contaminants would result in thinner eggshells. This
study looked at the relationship of several environmental contaminants on the
thickness of shells. One contaminant was DDT, measured in parts per million
of the yolk lipid. Figure 1.4 shows a scatterplot of shell thickness vs. DDT in a
sample of 65 brown pelican eggs from Anacapa Island, California.
There appears to be a decreasing relationship. Does this data give strong ev-
5
Risebrough, R. (1972). Effects of environmental pollutants upon animals other than man. In
Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability.
14. Balka ISE 1.10 1.3. INFERENTIAL STATISTICS 6
0 500 1000 1500 2000 2500 3000
0.1
0.2
0.3
0.4
0.5
Shell
Thickness
(mm)
DDT (ppm)
Figure 1.4: Shell thickness (mm) vs. DDT (ppm) for 65 brown pelican eggs.
idence of a relationship between DDT contamination and eggshell thickness?
Can we use the relationship to help predict eggshell thickness for a given level
of DDT contamination? In Chapter 15 we will use a statistical method called
linear regression analysis to help answer these questions.
In the world of statistics—and our world at large for that matter—we rarely if
ever know anything with certainty. Our state