SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
Item Analysis,
Test Validity and Reliability
Prepared by:
Rovel A. Aparicio
Mathematics Teacher
THERE IS always A
better WAY
Stages in Test Construction
I. Planning the Test
A. Determining the Objectives
B. Preparing the Table of Specifications
C. Selecting the Appropriate Item Format
D. Writing the Test items
E. Editing the Test items
Stages in Test Construction
II. Trying Out the Test

A. Administering the test
Item analysis
C. Preparing the Final Form of the Test
Stages in Test Construction
III. Establishing Test Validity

IV. Establishing Test Reliability

V. Interpreting the Test Scores
Item Analysis

GOAL: Improve the test.
IMPORTANCE: Measure the effectiveness of individual test item.

DIFFICULTY INDEX

DISCRIMINATION INDEX

 the percentage

 refers to the degree

of the pupils who
got the items rigth.
interpreted as
how easy or how
difficult an item is.

to which success or
failure of an item
indicates possession of
the acheivement being
measured.
ACTIVITY NO.1

COMPUTE THE DIFFICULTY
INDEX AND
DISCRIMINATION INDEX OF
PERIODICAL TEST.
U-L INDEX METHOD
(STEPS)

1. Score the papers and rank them from
highest
to lowest according to the total score.
2. Separate the top 27% and the bottom 27%
of the papers.
3. Tally the responses made to each test item
by each individual in the upper 27% group.
4.Tally the responses made to each test item
by each individual in the lower 27% group.
U-L INDEX METHOD
(STEPS)

5. Compute the difficulty index.
[d= (U+L)/(nu+nl)]
6. Compute the discrimination index.
[D=(U-L)/nu] or [D=(U-L)/nl]
Item
no.

Upper
27%

1

Lower
27%

Difficulty
Index

14

12

0.81

0.13

Revised

2

10

6

0.50

0.25

3

11

7

0.56

0.25

Retained
Retained

4

9

2

0.34

0.44

Retained

5

12

0.56

0.38

Retained

6

6

6
14

0.63

-0.50

Rejected

7

13

4

0.53

0.56

Retained

8

3

10

0.41

-0.44

Rejected

9

13

12

0.78

0.06

Rejected

10

8

6

0.44

0.13

Revised

No. of pupils tested- 60

Discrimination
Index

Remarks
Item Analysis

DIFFICULTY INDEX
.00-.20

Very Difficult

.21-.80

Moderately
Difficult

.81-1.00

DISCRIMINATION INDEX
< .09

Poor items
(Reject)

.10-.39

Reasonably
Good (Revise)

.40-1.00

Very Good
items (Retain)

Very Easy
Establishing Test Validity

Criterionrelated
Validity

Content
Validity
Types of Test
Validity

Construct
Validity
Establishing Test Validity
Types of Validity
Types of Validity

1.Content
Validity

Meaning
Meaning

Procedure
Procedure

Compare test tasks
How well the
with test
sample test bar specifications
tasks represent describing the task
the domain of
domain under
tasks to be
consideration
measured.
(non-statistical)
Establishing Test Validity
Types of Validity
Types of Validity

2. Construct
Validity

Meaning
Meaning

Procedure
Procedure

Experimentally
determine what factors
How test
influence scores on test.
performance can
The procedure may be
be described
psychologically. logical and statistical
using correlations and
other statistical
methods.
Establishing Test Validity
Types of Validity
Types of Validity

3.

Meaning
Meaning

Procedure
Procedure

Compare test scores
How well test
with measure of
performance
performance(grade)
obtain on later date(for
Criterion- predicts
future performance prediction).or another
related or estimates current
measure of performance
Validity
performance on obtain concurrently(for
some valued
estimating present
measures other
status.( Primarily
than the test
Statistical). Correlate
itself.
test results with
outside criterion.
Establishing Test Reliability
Measure of
Stability and
Equivalence

Measure of
Stability

Types of
Reliability
Measure

Measure of
Internal
Consistency

Measure of
Equivalence
Establishing Test Reliability
Types of Reliability
Types of Reliability
Measures
Measures

1. Measure of
Stability

Methods of
Methods of
Estimating Reliability
Estimating Reliability

Test- retest
method

Procedure
Procedure

Give a test twice to
the same group with
any time interval
between tests from
several minutes to
several years.
(Pearson r)
Establishing Test Reliability
Types of Reliability
Types of Reliability
Measures
Measures

2. Measure of
Equivalence

Methods of
Methods of
Estimating Reliability
Estimating Reliability

Procedure
Procedure

Give two forms of a
Equivalent formstest to the same
group in close
method
succession
(Pearson r)
Establishing Test Reliability
Types of Reliability
Types of Reliability
Measures
Measures

3. Measure of
Stability

Methods of
Methods of
Estimating Reliability
Estimating Reliability

Procedure
Procedure

Give two forms of
a test to the same
Test- retest
with equivalentgroup with increased
time intervals
forms
between forms.
(Pearson r)
Establishing Test Reliability
Types of Reliability
Types of Reliability
Measures
Measures

4. Measure of
internal
consistency

Methods of
Methods of
Estimating Reliability
Estimating Reliability

Procedure
Procedure

Give a test once.
Kuder-Richarson
Score the total test
method
and apply the Kuder
Richardson formula.
Establishing Test Reliability
Types of Reliability
Types of Reliability
Measures
Measures

4. Measure of
internal
consistency

Methods of
Methods of
Estimating Reliability
Estimating Reliability

Split half
method

Procedure
Procedure

Give a test once.
Score equivalent
halves of the test.
(e.g. odd and even
numbered items.
(Pearson r and
Spearman- Brown
formula)
ACTIVITY NO.2

TEST THE RELIABILITY OF
PERIODICAL TEST.
Pearson r Standard Scores
(Directions)

1. Begin by writing the pairs of scores to be
studied in two columns. Be sure that the pair of
scores for each pupils is in the same row.
Label one set of scores X , the other Y.
2.Get the sum (∑) of the scores for each
column. Divide the sum by the number of
scores (N) in each column to get the mean.
3.Subtract each score in column X from the
mean x. Write the difference in column x. Be
sure to put an algebraic sign.
Pearson r Standard Scores
(Directions)

4. Subtract each score in column Y from the
mean y. Write the difference in column y. Don't
forget the sign.
5. Square each score in column X. Enter each
result under X2 .
6. Square each score in column Y. Enter each
result under Y2 .
7. Compute the standard deviation of X and Y
and enter the result under the column of SDx
and SDy respectively .
Pearson r Standard Scores
(Directions)

8. Divide each entry in column x and y by the
standard deviation SDx and SDy respectively
and enter the result under Zx and Zy
respectively.
9. Multiply Zx and Zy and enter the result
under ZxZy.
10. Get the sum (∑) ZxZy.
11. Apply the formula r=∑ZxZy
N
Interpretation of
Coefficient of Correlation

Correlation is a measure of relationship
between two variables.
Magnitude or size of
Relationship
0.8 and above means
high correlation
0.5 means moderate
correlation
0.3 and below means
low correlation

Direction of Relationship
Negative coefficient
means, as one variable
increases, the other
decreases.
Positive Coefficient
means, as one variable
increases, the other also
increases
Interpretation of
Coefficient of Variation

Coeffecient of Variation is defined as the
ratio of the standard deviation and the mean
and usually expressed in percent.

Criteria:
c.v. = (mean/s.d.)x100

less than 10%Homogenous
greater than 10%- Heterogenous
REMEMBER:

1. Use item analysis procedures to check the quality
of the test. The item analysis should be interpreted
with care and caution
2. A test is valid when it measures what it is supposed
to measure
3. A test is reliable when it is consistent .
Test validity

Contenu connexe

Tendances (20)

Item analysis
Item analysisItem analysis
Item analysis
 
Reliability for testing and assessment
Reliability for testing and assessmentReliability for testing and assessment
Reliability for testing and assessment
 
Characteristics of a good test
Characteristics of a good testCharacteristics of a good test
Characteristics of a good test
 
ITEM ANALYSIS
ITEM ANALYSISITEM ANALYSIS
ITEM ANALYSIS
 
Item analysis
Item analysis Item analysis
Item analysis
 
Causal comparative research
Causal comparative researchCausal comparative research
Causal comparative research
 
Classical Test Theory (CTT)- By Dr. Jai Singh
Classical Test Theory (CTT)- By Dr. Jai SinghClassical Test Theory (CTT)- By Dr. Jai Singh
Classical Test Theory (CTT)- By Dr. Jai Singh
 
Item and Distracter Analysis
Item and Distracter AnalysisItem and Distracter Analysis
Item and Distracter Analysis
 
Developing instruments for research
Developing instruments for researchDeveloping instruments for research
Developing instruments for research
 
7. the t distribution
7. the t distribution7. the t distribution
7. the t distribution
 
Item analysis.pptx du
Item analysis.pptx duItem analysis.pptx du
Item analysis.pptx du
 
Item analysis
Item analysisItem analysis
Item analysis
 
Table of specification dr shafqat ali
Table of specification  dr shafqat aliTable of specification  dr shafqat ali
Table of specification dr shafqat ali
 
Standard Scores
Standard ScoresStandard Scores
Standard Scores
 
Interpreting Assessment and Reporting
Interpreting Assessment and ReportingInterpreting Assessment and Reporting
Interpreting Assessment and Reporting
 
Kendall coefficient of concordance
Kendall coefficient of concordance Kendall coefficient of concordance
Kendall coefficient of concordance
 
Types of Tests
Types of TestsTypes of Tests
Types of Tests
 
Correlational Research
Correlational ResearchCorrelational Research
Correlational Research
 
Item analysis
Item analysisItem analysis
Item analysis
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 

Similaire à Test validity

Criteria to Consider when Constructing Good tests
Criteria to Consider when Constructing Good testsCriteria to Consider when Constructing Good tests
Criteria to Consider when Constructing Good testsShimmy Tolentino
 
Criteria to consider when constructing good tests
Criteria to consider when constructing good testsCriteria to consider when constructing good tests
Criteria to consider when constructing good testsshimmy ct
 
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptxLESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptxMarjoriAnneDelosReye
 
Chp8 170419081335-converted
Chp8 170419081335-convertedChp8 170419081335-converted
Chp8 170419081335-convertedMishal Tahir
 
Measurement and scaling
Measurement and scalingMeasurement and scaling
Measurement and scalingBalaji P
 
Practical Language Testing by Fulcher (2010)
Practical Language Testing by Fulcher (2010)Practical Language Testing by Fulcher (2010)
Practical Language Testing by Fulcher (2010)Mahsa Farahanynia
 
STEP IN DEVELOPMENT ASSESSMENT TOOLS
STEP IN DEVELOPMENT ASSESSMENT TOOLSSTEP IN DEVELOPMENT ASSESSMENT TOOLS
STEP IN DEVELOPMENT ASSESSMENT TOOLSLoradelLegaspi
 
Chp8 - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp8  - Research Methods for Business By Authors Uma Sekaran and Roger BougieChp8  - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp8 - Research Methods for Business By Authors Uma Sekaran and Roger BougieHassan Usman
 
educatiinar.pptx
educatiinar.pptxeducatiinar.pptx
educatiinar.pptxNithuNithu7
 
Measures of variability to grading and reporting
Measures of variability to grading and reportingMeasures of variability to grading and reporting
Measures of variability to grading and reportingApril Gealene Alera
 
Cot 2 demonstration teaching.pptx
Cot 2 demonstration teaching.pptxCot 2 demonstration teaching.pptx
Cot 2 demonstration teaching.pptxHarbsOrHar
 
Validity and reliability of questionnaires
Validity and reliability of questionnairesValidity and reliability of questionnaires
Validity and reliability of questionnairesVenkitachalam R
 
Principles of design of experiments (doe)20 5-2014
Principles of  design of experiments (doe)20 5-2014Principles of  design of experiments (doe)20 5-2014
Principles of design of experiments (doe)20 5-2014Awad Albalwi
 
5. Validity Test & Reliability Testand Sampling Design.docx
5. Validity Test & Reliability Testand Sampling Design.docx5. Validity Test & Reliability Testand Sampling Design.docx
5. Validity Test & Reliability Testand Sampling Design.docxalinainglis
 
Ag Extn.504 :- RESEARCH METHODS IN BEHAVIOURAL SCIENCE
Ag Extn.504 :-  RESEARCH METHODS IN BEHAVIOURAL SCIENCE  Ag Extn.504 :-  RESEARCH METHODS IN BEHAVIOURAL SCIENCE
Ag Extn.504 :- RESEARCH METHODS IN BEHAVIOURAL SCIENCE Pradip Limbani
 
AS Week 7 Observation and Levels of Measurement.
AS Week 7 Observation and Levels of Measurement. AS Week 7 Observation and Levels of Measurement.
AS Week 7 Observation and Levels of Measurement. Jamie Davies
 
Aminullah assagaf model regresi lengkap 10 agustus 2021_(sobel, path, outlier)
Aminullah assagaf model regresi lengkap  10 agustus 2021_(sobel, path, outlier)Aminullah assagaf model regresi lengkap  10 agustus 2021_(sobel, path, outlier)
Aminullah assagaf model regresi lengkap 10 agustus 2021_(sobel, path, outlier)Aminullah Assagaf
 
Education Assessment in Learnings 1.pptx
Education Assessment in Learnings 1.pptxEducation Assessment in Learnings 1.pptx
Education Assessment in Learnings 1.pptxRayLorenzOrtega
 
Establishing Validity-and-Reliability-Test ppt.pptx
Establishing Validity-and-Reliability-Test ppt.pptxEstablishing Validity-and-Reliability-Test ppt.pptx
Establishing Validity-and-Reliability-Test ppt.pptxRayLorenzOrtega
 

Similaire à Test validity (20)

Criteria to Consider when Constructing Good tests
Criteria to Consider when Constructing Good testsCriteria to Consider when Constructing Good tests
Criteria to Consider when Constructing Good tests
 
Criteria to consider when constructing good tests
Criteria to consider when constructing good testsCriteria to consider when constructing good tests
Criteria to consider when constructing good tests
 
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptxLESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
 
Chp8 170419081335-converted
Chp8 170419081335-convertedChp8 170419081335-converted
Chp8 170419081335-converted
 
Measurement and scaling
Measurement and scalingMeasurement and scaling
Measurement and scaling
 
Practical Language Testing by Fulcher (2010)
Practical Language Testing by Fulcher (2010)Practical Language Testing by Fulcher (2010)
Practical Language Testing by Fulcher (2010)
 
STEP IN DEVELOPMENT ASSESSMENT TOOLS
STEP IN DEVELOPMENT ASSESSMENT TOOLSSTEP IN DEVELOPMENT ASSESSMENT TOOLS
STEP IN DEVELOPMENT ASSESSMENT TOOLS
 
Chp8 - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp8  - Research Methods for Business By Authors Uma Sekaran and Roger BougieChp8  - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
Chp8 - Research Methods for Business By Authors Uma Sekaran and Roger Bougie
 
Reliability and validity
Reliability and validityReliability and validity
Reliability and validity
 
educatiinar.pptx
educatiinar.pptxeducatiinar.pptx
educatiinar.pptx
 
Measures of variability to grading and reporting
Measures of variability to grading and reportingMeasures of variability to grading and reporting
Measures of variability to grading and reporting
 
Cot 2 demonstration teaching.pptx
Cot 2 demonstration teaching.pptxCot 2 demonstration teaching.pptx
Cot 2 demonstration teaching.pptx
 
Validity and reliability of questionnaires
Validity and reliability of questionnairesValidity and reliability of questionnaires
Validity and reliability of questionnaires
 
Principles of design of experiments (doe)20 5-2014
Principles of  design of experiments (doe)20 5-2014Principles of  design of experiments (doe)20 5-2014
Principles of design of experiments (doe)20 5-2014
 
5. Validity Test & Reliability Testand Sampling Design.docx
5. Validity Test & Reliability Testand Sampling Design.docx5. Validity Test & Reliability Testand Sampling Design.docx
5. Validity Test & Reliability Testand Sampling Design.docx
 
Ag Extn.504 :- RESEARCH METHODS IN BEHAVIOURAL SCIENCE
Ag Extn.504 :-  RESEARCH METHODS IN BEHAVIOURAL SCIENCE  Ag Extn.504 :-  RESEARCH METHODS IN BEHAVIOURAL SCIENCE
Ag Extn.504 :- RESEARCH METHODS IN BEHAVIOURAL SCIENCE
 
AS Week 7 Observation and Levels of Measurement.
AS Week 7 Observation and Levels of Measurement. AS Week 7 Observation and Levels of Measurement.
AS Week 7 Observation and Levels of Measurement.
 
Aminullah assagaf model regresi lengkap 10 agustus 2021_(sobel, path, outlier)
Aminullah assagaf model regresi lengkap  10 agustus 2021_(sobel, path, outlier)Aminullah assagaf model regresi lengkap  10 agustus 2021_(sobel, path, outlier)
Aminullah assagaf model regresi lengkap 10 agustus 2021_(sobel, path, outlier)
 
Education Assessment in Learnings 1.pptx
Education Assessment in Learnings 1.pptxEducation Assessment in Learnings 1.pptx
Education Assessment in Learnings 1.pptx
 
Establishing Validity-and-Reliability-Test ppt.pptx
Establishing Validity-and-Reliability-Test ppt.pptxEstablishing Validity-and-Reliability-Test ppt.pptx
Establishing Validity-and-Reliability-Test ppt.pptx
 

Dernier

QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Jeffrey Haguewood
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Mark Simos
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 

Dernier (20)

QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 

Test validity

  • 1. Item Analysis, Test Validity and Reliability Prepared by: Rovel A. Aparicio Mathematics Teacher
  • 2. THERE IS always A better WAY
  • 3. Stages in Test Construction I. Planning the Test A. Determining the Objectives B. Preparing the Table of Specifications C. Selecting the Appropriate Item Format D. Writing the Test items E. Editing the Test items
  • 4. Stages in Test Construction II. Trying Out the Test A. Administering the test Item analysis C. Preparing the Final Form of the Test
  • 5. Stages in Test Construction III. Establishing Test Validity IV. Establishing Test Reliability V. Interpreting the Test Scores
  • 6. Item Analysis GOAL: Improve the test. IMPORTANCE: Measure the effectiveness of individual test item. DIFFICULTY INDEX DISCRIMINATION INDEX  the percentage  refers to the degree of the pupils who got the items rigth. interpreted as how easy or how difficult an item is. to which success or failure of an item indicates possession of the acheivement being measured.
  • 7. ACTIVITY NO.1 COMPUTE THE DIFFICULTY INDEX AND DISCRIMINATION INDEX OF PERIODICAL TEST.
  • 8. U-L INDEX METHOD (STEPS) 1. Score the papers and rank them from highest to lowest according to the total score. 2. Separate the top 27% and the bottom 27% of the papers. 3. Tally the responses made to each test item by each individual in the upper 27% group. 4.Tally the responses made to each test item by each individual in the lower 27% group.
  • 9. U-L INDEX METHOD (STEPS) 5. Compute the difficulty index. [d= (U+L)/(nu+nl)] 6. Compute the discrimination index. [D=(U-L)/nu] or [D=(U-L)/nl]
  • 11. Item Analysis DIFFICULTY INDEX .00-.20 Very Difficult .21-.80 Moderately Difficult .81-1.00 DISCRIMINATION INDEX < .09 Poor items (Reject) .10-.39 Reasonably Good (Revise) .40-1.00 Very Good items (Retain) Very Easy
  • 13. Establishing Test Validity Types of Validity Types of Validity 1.Content Validity Meaning Meaning Procedure Procedure Compare test tasks How well the with test sample test bar specifications tasks represent describing the task the domain of domain under tasks to be consideration measured. (non-statistical)
  • 14. Establishing Test Validity Types of Validity Types of Validity 2. Construct Validity Meaning Meaning Procedure Procedure Experimentally determine what factors How test influence scores on test. performance can The procedure may be be described psychologically. logical and statistical using correlations and other statistical methods.
  • 15. Establishing Test Validity Types of Validity Types of Validity 3. Meaning Meaning Procedure Procedure Compare test scores How well test with measure of performance performance(grade) obtain on later date(for Criterion- predicts future performance prediction).or another related or estimates current measure of performance Validity performance on obtain concurrently(for some valued estimating present measures other status.( Primarily than the test Statistical). Correlate itself. test results with outside criterion.
  • 16. Establishing Test Reliability Measure of Stability and Equivalence Measure of Stability Types of Reliability Measure Measure of Internal Consistency Measure of Equivalence
  • 17. Establishing Test Reliability Types of Reliability Types of Reliability Measures Measures 1. Measure of Stability Methods of Methods of Estimating Reliability Estimating Reliability Test- retest method Procedure Procedure Give a test twice to the same group with any time interval between tests from several minutes to several years. (Pearson r)
  • 18. Establishing Test Reliability Types of Reliability Types of Reliability Measures Measures 2. Measure of Equivalence Methods of Methods of Estimating Reliability Estimating Reliability Procedure Procedure Give two forms of a Equivalent formstest to the same group in close method succession (Pearson r)
  • 19. Establishing Test Reliability Types of Reliability Types of Reliability Measures Measures 3. Measure of Stability Methods of Methods of Estimating Reliability Estimating Reliability Procedure Procedure Give two forms of a test to the same Test- retest with equivalentgroup with increased time intervals forms between forms. (Pearson r)
  • 20. Establishing Test Reliability Types of Reliability Types of Reliability Measures Measures 4. Measure of internal consistency Methods of Methods of Estimating Reliability Estimating Reliability Procedure Procedure Give a test once. Kuder-Richarson Score the total test method and apply the Kuder Richardson formula.
  • 21. Establishing Test Reliability Types of Reliability Types of Reliability Measures Measures 4. Measure of internal consistency Methods of Methods of Estimating Reliability Estimating Reliability Split half method Procedure Procedure Give a test once. Score equivalent halves of the test. (e.g. odd and even numbered items. (Pearson r and Spearman- Brown formula)
  • 22. ACTIVITY NO.2 TEST THE RELIABILITY OF PERIODICAL TEST.
  • 23. Pearson r Standard Scores (Directions) 1. Begin by writing the pairs of scores to be studied in two columns. Be sure that the pair of scores for each pupils is in the same row. Label one set of scores X , the other Y. 2.Get the sum (∑) of the scores for each column. Divide the sum by the number of scores (N) in each column to get the mean. 3.Subtract each score in column X from the mean x. Write the difference in column x. Be sure to put an algebraic sign.
  • 24. Pearson r Standard Scores (Directions) 4. Subtract each score in column Y from the mean y. Write the difference in column y. Don't forget the sign. 5. Square each score in column X. Enter each result under X2 . 6. Square each score in column Y. Enter each result under Y2 . 7. Compute the standard deviation of X and Y and enter the result under the column of SDx and SDy respectively .
  • 25. Pearson r Standard Scores (Directions) 8. Divide each entry in column x and y by the standard deviation SDx and SDy respectively and enter the result under Zx and Zy respectively. 9. Multiply Zx and Zy and enter the result under ZxZy. 10. Get the sum (∑) ZxZy. 11. Apply the formula r=∑ZxZy N
  • 26. Interpretation of Coefficient of Correlation Correlation is a measure of relationship between two variables. Magnitude or size of Relationship 0.8 and above means high correlation 0.5 means moderate correlation 0.3 and below means low correlation Direction of Relationship Negative coefficient means, as one variable increases, the other decreases. Positive Coefficient means, as one variable increases, the other also increases
  • 27. Interpretation of Coefficient of Variation Coeffecient of Variation is defined as the ratio of the standard deviation and the mean and usually expressed in percent. Criteria: c.v. = (mean/s.d.)x100 less than 10%Homogenous greater than 10%- Heterogenous
  • 28. REMEMBER: 1. Use item analysis procedures to check the quality of the test. The item analysis should be interpreted with care and caution 2. A test is valid when it measures what it is supposed to measure 3. A test is reliable when it is consistent .