Test Reliability and Validity

Test Reliability and Validity
Dr. Brian D. Ebie

Considerations for Test Makers
• We need to make sure that our test will
gather the appropriate data.
– Does it relate to content covered?
– Will it adequately measure evidence of
student learning?
• We need to make sure that our test will
consistently work.
– What assumptions are in place?
– Materials, content, conditions, subjects?

Validity
• Validity is the degree to which a test
measures what it is supposed to
measure
• A test is not valid per se; it is valid for a
particular purpose and for a particular
group

Face Validity
• If a test appears to be designed
correctly, it has good face validity.
• “Looks good.”
• Appears that it should do what it’s
supposed to do.
– Ex. Algebra questions on an algebra test.
• Uses the very scientific “Biocular
Rejection Test.” 

Did what it was supposed to do!

Validity
• Construct Validity
– A construct is a non-observable trait, such
as intelligence, which explains behavior
– The degree to which a test measures an
intended hypothetical construct
• Content Validity
– The degree to which a test measures an
intended content area

Validity
• Concurrent Validity
– The degree to which the scores on a test are
related to the scores on another, already
established test administered at the same
time.
– Use a measure of reliability: Pearson r,
Spearman rho
• Predictive Validity
– The degree to which a test can predict how
well an individual will do in a future situation

Reliability
• Reliability is the degree to which a test
consistently measures whatever it
measures.
• If you give the test over and over it will
consistently return the same results.

Reliability
• Test-Retest
– Degree to which scores are consistent over time
• Equivalent forms
– Two tests identical in every way except item order
• Split-half
– Give a test to group. Score ½ of test with ½ of
class, score other ½ of test with other ½ of class
• Interscorer/interrater
– A measure of the agreements and disagreements
of 2 or more judges

Can a measure be both
valid and reliable?
• Does it measure what it is supposed to
measure?
– Did you find what you were looking for?
• Does it measure what it is supposed to
measure and does it consistently
measure it, time after time?
– If you administered the same measure
many times would you get similar results?

Reliability and Validity
• How do they work together?
• Get your bow and arrow ready!

Reliable but not valid.
Arrows on target, and grouped, indicating consistency. Yet not in
the bullseye—assumed to be the goal of the archer.

Neither reliable or valid.
Arrows not on target and not grouped, indicating inconsistency.

Both reliable and valid.
Arrows on target, and grouped in the bullseye, demonstrating both
accuracy (validity) and consistency (reliability).

Being both reliable and valid
• A test can be reliable, meaning test-
takers will get the same score no
matter when or where they take it.
– This does not mean that the test is valid or
measuring what it is supposed to
measure.
• A test can be reliable without being
valid, however, a test cannot be valid
unless it is reliable.

Consideration of the test format
• What will best test evidence of
learning?
– Multiple choice
– Matching
– Essay
– Open ended?
• Try to use questions that stimulate
higher-order thinking

Summary
• Validity is the degree to which a test
measures what it is supposed to measure
• Reliability is the degree to which a test
consistently measures whatever it measures.
• A test can be reliable without being valid,
however, a test cannot be valid unless it is
reliable.

Test Reliability and Validity

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (7)

Similar to Test Reliability and Validity

Similar to Test Reliability and Validity (20)

More from Brian Ebie

More from Brian Ebie (9)

Recently uploaded

Recently uploaded (20)

Test Reliability and Validity