To determine the accuracy and precision of pupil measurements made with the Tobii 1750 remote video eye tracker, we performed a formal metrological study with respect to a calibrated reference instrument, a medical pupillometer. We found that the eye tracker measures mean binocular pupil diameter with precision 0.10 mm and mean binocular pupil
dilations with precision 0.15 mm.
2. where π is the diameter of the pupil, Π is the measurement
of that diameter, and is the measurement error. π and
are random variables that take on new values for each mea-
surement. Each instrument’s bias is the the fixed component
of the measurement error μ, its accuracy is the magnitude of
the bias |μ|, and its precision is the standard deviation of the
measurement error σ. For the reference pupillometer (pm),
μ[ pm ] = 0 mm and σ[ pm ] = 0.05 mm, according to infor-
mation provided by its manufacturer. For the eye tracker
(et), the parameters of the measurement error distribution
μ[ et ] (bias) and σ[ et ] (precision) are what we are trying to
determine.
We can estimate these parameters by analyzing the differ-
ences between simultaneous measurements made with the
eye tracker and the pupillometer:
Figure 2: Metrology study arrangement. An investigator
is measuring the participant’s left pupil using the reference Πet − Πpm = (πet + et ) − (πpm + pm )
pupillometer while the eye tracker simultaneously measures = πet − πpm + et − pm
his right pupil.
This is an equation of random variables. Considering the
variance of each side:
measurements in which we measured participants’ pupils us-
ing the eye tracker and the pupillometer simultaneously. Be-
cause the pupillometer covers the eye it measures, we could
not conduct simultaneous measurements of the same eye us- σ 2 [Πet − Πpm ] = σ 2 [πet − πpm + et − pm ]
2
ing both instruments, so for each double measurement, the σ [Πet − Πpm ] = σ 2 [πet − πpm ] + σ 2 [ 2
et ] + σ [ pm ]
pupillometer measured one of the participant’s pupils while
o
:
the eye tracker measured the other (Figure 2). The metro- σ 2 [Πet − Πpm ] σ 2 [π
= et − πpm ] + σ 2 [ et ] + σ2 [ pm ](1)
logical validity of this study is therefore based on the strong 2 2 2
correlation between the diameters of the left and right pupils σ [Πet − Πpm ] = σ [ et ] + σ [ pm ]
p
[Loewenfeld 1999]. Measurements taken with the eye tracker σ[ et ] = σ 2 [Πet − Πpm ] − σ 2 [ pm ] (2)
were averages over the 100 camera frames gathered in the
same two-second measurement window used by the reference
pupillometer. The relationship in Equation 2 gives us a way to estimate
the precision of the eye tracker based on the known precision
The measurements were conducted under various lighting of the reference pupillometer σ[ pm ] and the variance in the
conditions so that our measurements would span a variety differences between the simultaneous measurements σ 2 [Πet −
of pupil states: half under normal room lighting and half Πpm ]. Similarly, we can compute the bias of the eye tracker
under dim lighting, where a third of the time we switched based on the mean of those differences:
the lights on or off during the few seconds between succes-
sive double measurements. In all trials, subjects looked at a
small fixation target at the center of the eye tracker’s screen,
which was otherwise filled with 64 cd/m2 medium gray. We μ[Πet − Πpm ] = μ[πet − πpm + et − pm ]
excluded 120 measurements in which we did not get a clean μ[Πet − Πpm ] = μ[πet − πpm ] + μ[ et ] − μ[ pm ]
reading with the pupillometer and 10 measurements in which o
:
o
:
we did not get a clean reading with the eye tracker, leaving μ[Πet − Πpm ] = pm ] + μ[ et ] + (3)
μ[πet − π μ[ pm ]
206 successful double measurements, analyzed below. μ[ et ] = μ[Πet − Πpm ] (4)
3 Metrology Substituting the mean and variance of the actually observed
differences Πet − Πpm in Equations 4 and 2, the eye tracker’s
We present two different metrological analyses of these dou- pupillometric bias is 0.11 mm, and its precision is 0.38 mm.
ble measurements: the first, based on pupil diameters, is
simpler and can use all of the data but is limited by strong These figures are misleading, however, because the bias and
assumptions. The second analysis, based on dilations, uses precision of the eye tracker varied substantially between the
weaker assumptions but is restricted to a subset of the avail- three participants and between the two eyes of each partic-
able data. ipant. Figure 3 shows the results of all 206 successful si-
multaneous measurements and illustrates this inter-subject
3.1 Pupil Diameter Metrology variation. For each eye individually, the measurement error
has a much narrower spread, but the average is wrong by as
For both instruments, we model the measurement error as much as 0.67 mm. The accuracy and precision varies from
being additive and normally distributed: eye to eye like this because the eye tracker’s pupil measure-
ments depend on its estimate of the camera-pupil distance,
which is affected by errors in the eye tracker’s calibration to
Π=π+ ∼ N (μ, σ) each eye’s corneal shape.
260
3. 5.0 1.0 o o
o o o
o oooo o o
ooo
difference in measurements (mm)
o o o oooo oo
eye tracker measurement (mm)
ooo
oo
ooo o o
o
oo
o
4.5 o oo o
ooo
o oo
o
oo ooo
ooo
o ooo oooo
oo o
oo
oo
oo oo
oo
oooo o
o o oo
o
o oo ooooo ooo
oo
o 0.5 ooo
oo o oo o o
oo ooooooo o o
oo o
o
oo ooooo oo
o o o o o
o
4.0 oo
ooo o
ooooo oo o
o
oo
oo o o
ooooo
ooo oo
oo
o oo o
oo
oo
oooo
oo o oooo o
o
oo ooooo
o o
oooo
oo
ooo
oo
o
oo
oo
oo oo
ooo o
o oo o
oo
o ooooo
oooo
oo
ooo
oo oo
oo
ooo
o
3.5 ooo
oo
oooo
ooooo ooo
oooo
oooo
oo
o o
oo
o
ooo
oo
o
o o
o
ooo o o 0.0 oooo
oooo o oo
ooo
oo oooo
o o o o
o o
oo
oo
oo oo o oo
o
o
ooo o
oo o
oo oo
oo oo
ooo o o
o ooo
o o oo ooo
oo
o o o
3.0 oo
ooo o
ooo
o oo
oo oo
ooo
oo o
o oo
oo
o
o oo oo o o o
oo
oo
o
ooo
o
o
oo
o
oo
o oo
o
o
o o
−0.5 oo
oo
oo
o oo
oo
oo o oo
2.5 oo o o
all data
Participant 1
Participant 1
Participant 2
Participant 2
Participant 3
Participant 3
Left Eye
Right Eye
Left Eye
Right Eye
Left Eye
Right Eye
2.5 3.0 3.5 4.0 4.5 5.0
pupillometer measurement (mm)
Figure 3: The left graph shows the raw data of the metrology study, with each point representing a double measurement
(Πpm , Πet ). Data from each participant and each subject are plotted in a different color. The right chart shows the differences
between the eye tracker and pupillometer measurements, Πet − Πpm , broken down by study participant and eye, showing how
the eye tracker’s pupillometric bias varies for each eye.
When Equations 4 and 2 are applied to the data from each
eye separately, we find an average bias of 0.34 mm (worse
than the overall 0.11 mm) and an average precision of 0.12 Δet − Δpm = (Πet2 − Πet1 ) − (Πpm2 − Πpm1 )
mm (better than the overall 0.38 mm). Because it is differ- = [(πet2 + et2 ) − (πet1 + et1 )]
ences in measurements for the same eye (dilations) that form − [(πpm2 + pm2 ) − (πpm1 + pm1 )]
the basis of most experimental use of pupillometry [Beatty
and Lucero-Wagoner 2000], and because pupillometric ex- = (πet2 − πet1 ) − (πpm2 − πpm1 )
periments are usually conducted with several participants, + et2 − et1 + pm1 − pm2
these per-eye results for the eye tracker’s bias and precision = (δet − δpm ) + − + − (5)
et2 et1 pm1 pm2
are the most relevant and are the ones summarized in Ta-
ble 1.
As before, now considering the variance of the random vari-
ables on each side of Equation 5:
In Equation 3 of the derivation for accuracy, the term μ[πet −
πpm ] was assumed to be zero. We ensured this zero mean
left-right difference in pupil size by counterbalancing which
of the two eyes was measured with which instrument within σ 2 [Δet − Δpm ] = σ 2 [(δet − δpm ) + et2 − et1 + pm1 − pm2 ]
the trials for each participant. o
:
= pm ] + σ 2 [ et2 ]
σ 2 [δet − δ (6)
Similarly, the cancelation in Equation 1 of the derivation for + σ2 [ 2
et1 ] + σ [
2
pm1 ] + σ [ pm2 ]
precision assumes that term σ 2 [πet − πpm ] is zero. This as-
sumption, that the difference in size between participants’ = σ2 [ et2 ] + σ2 [ et1 ] + σ2 [ pm1 ] + σ2 [ pm2 ]
left and right pupils is constant throughout the study, is 2 2
= 2σ [ et ] + 2σ [ pm ] (7)
much stronger. Judging from pupil data we’ve recorded in a 2 1 2 2
variety of studies, it is true over short periods of time (a few σ [ et ] = 2
σ [Δet − Δpm ] − σ [ pm ]
minutes) but can sometimes drift over the 15–20 minutes it q
1 2
takes to make the measurements of each participant. Viola- σ[ et ] = 2
σ [Δet − Δpm ] − σ 2 [ pm ] (8)
tions of this assumption would lead to an underestimate of
the average error the eye tracker. A more conservative anal- The cancellation in Equation 6 is based on the assumption
ysis, based on differences in short-term dilations measured that the difference between the left eye’s dilation and the
by each instrument, provides an alternative estimate of the right eye’s dilation is constant over a short period of time.
eye tracker’s precision. We observed this fact in an earlier study we conducted on lat-
eralized pupillary responses, in which we tried several stim-
ulus based ways of inducing different dilations in subjects’
3.2 Pupil Dilation Metrology two pupils but never succeeded in causing any significant left-
right differences. We abandoned the effort after learning that
We can determine the pupillometric precision of the eye the neuroanatomy of pupil size regulation renders such dif-
tracker using differences in measurements of dilations rather ferences extremely unlikely [Loewenfeld 1999]. Step 7 relies
than absolute pupil diameters. Using δ = π2 − π1 to de- on the assumption that the bias of the measurement error
note the dilation of the pupil from time 1 to time 2 and is stable over time for both instruments (σ[ pm1 ] = σ[ pm2 ]
Δ = Π2 − Π1 to denote the measurement of that dilation, and σ[ et1 ] = σ[ et2 ]).
261
4. eye tracker eye tracker precision (mm)
diameter
pupil diameter dilation magnitude
accuracy
assumption data per-eye (mm) monocular binocular monocular binocular
mean mean
The difference in size be- 206 double 0.34 0.12 0.08 0.17 0.12
tween the left and right measurements
pupils is constant over the
study.
The difference between the 84 pairs NA 0.15 0.10 0.21 0.15
left eye’s dilation and the of double
right eye’s dilation is con- measurements
stant over 30 sec.
Table 1: Summary of the Tobii 1750’s pupillometric performance. Figures for monocular diameter accuracy and precision are
the results of the metrological analysis above. Other figures in the table were then derived from these primary results. Dilation
√
measurement precision is larger (worse) by a factor of 2, because it is based on the difference of two diameter measurements.
When both eyes are measured and averaged, the precision in the estimate of their mean dilation (or diameter) improves by a
√
factor of 2 over the monocular case.
Among the 206 successful double measurements, there are 84 References
pairs of double measurements that took place within 30 sec-
onds of each other. That is, there were 84 dilations with du- Andreassi, J. L. 2006. Pupillary response and behavior.
ration less than 30 seconds with starting diameters and end- In Psychophysiology: Human Behavior and Physological
ing diameters that were both measured with the two instru- Response, 5th ed. Routledge, ch. 12, 289–307.
ments simultaneously. Substituting the observed Δet − Δpm
in Equation 8 gives pupillometric precision of the eye tracker Beatty, J., and Lucero-Wagoner, B. 2000. The pupil-
as 0.15 mm, slightly worse than the diameter-based precision lary system. In Handbook of Psychophysiology, J. T. Ca-
of 0.12 mm. cioppo, L. G. Tassinary, and G. Berntson, Eds., 2nd ed.
Cambridge University Press, 142–162.
Chatham, C. H., Frank, M. J., and Munakata, Y. 2009.
4 Summary Pupillometric and behavioral markers of a developmen-
tal shift in the temporal dynamics of cognitive control.
We made many measurements of pupil size using a remote Proceedings of the National Academy of Sciences 106, 14,
video eye tracker and medical pupillometer simultaneously. 5529–5533.
We analyzed these simultaneous measurements in two ways Loewenfeld, I. 1999. The Pupil: Anatomy, Physiology,
to assess the pupillometric performance of the Tobii 1750 and Clinical Applications, 2nd ed., vol. 1. Butterworth-
remote video eye tracker. Heinemann, Oxford, UK.
The first analysis, diameter-based metrology (Section 3.1), Neuroptics, Inc., 2008. Instruction manual, VIP-200
provided an estimate of the eye tracker’s pupillometric ac- pupillometer, revision A.
curacy and—via a relatively strong assumption—a lower Paas, F., and Van Merri¨nboer, J. 1994. Instructional
e
bound on the eye tracker’s pupillometric precision. The control of cognitive load in the training of complex cog-
second analysis, dilation-based metrology (Section 3.2), pro- nitive tasks. Educational Psychology Review 6, 4 (Dec.),
vided an alternative estimate of precision relying on fewer as- 351–371.
sumptions but also with less applicable data. The results of
both analyses are summarized in Table 1, together with the Pomplun, M., and Sunkara, S. 2003. Pupil dilation as
resultant derived precision for binocular and dilation mea- an indicator of cognitive workload in human-computer in-
surements. teraction. In Proceedings of the International Conference
on HCI.
Porter, G., Troscianko, T., and Gilchrist, I. D. 2007.
Acknowledgements Effort during visual search and counting: Insights from
pupillometry. The Quarterly Journal of Experimental Psy-
This work was funded by the Stanford Regional Visual Ana- chology 60, 2, 211.
lytics Center, through the U.S. Department of Energy’s Pa-
Steinhauer, S. R., and Hakerem, G. 1992. The pupillary
cific Northwest National Laboratory. Our eye tracker was
response in cognitive psychophysiology and schizophre-
funded by the Stanford MediaX project and the Stanford
nia. Annals of the New York Academy of Sciences 658,
School of Engineering.
Psychophysiology and Experimental Psychopathology: A
Tribute to Samuel Sutton, 182–204.
Tobii Technologies, Inc., 2007. Tobii 1750.
262