SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
CALCULATING
COHEN’S KAPPA
A MEASURE OF INTER-RATER RELIABILITY
FOR QUALITATIVE RESEARCH INVOLVING NOMINAL CODING
WHAT IS COHEN’S KAPPA?
COHEN’S KAPPA IS A STATISTICAL MEASURE CREATED
BY JACOB COHEN IN 1960 TO BE A MORE ACCURATE
MEASURE OF RELIABILITY BETWEEN TWO RATERS
MAKING DECISONS ABOUT HOW A PARTICULAR UNIT OF
ANALYSIS SHOULD BE CATEGORIZED.
KAPPA MEASURES NOT ONLY THE % OF AGREEMENT
BETWEEN TWO RATERS, IT ALSO CALCULATES THE
DEGREE TO WHICH AGREEMENT CAN BE ATTRIBUTED TO
CHANCE.
JACOB COHEN, A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES, EDUCATIONAL AND
PSYCHOLOGICAL MEASUREMENT 20: 37–46, 1960.
THE EQUATION FOR K
K = Pr(a) - Pr(e)
N-Pr(e)
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
N = TOTAL NUMBER
OF RATED ITEMS,
ALSO CALLED
“CASES”
THE EQUATION FOR K
K = Pr(a) - Pr(e)
N-Pr(e)
THE FANCY “K” STANDS FOR KAPPA
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
N = TOTAL NUMBER
OF RATED ITEMS,
ALSO CALLED
“CASES”
THE EQUATION FOR K
K = Pr(a) - Pr(e)
N-Pr(e)
THE FANCY “K” STANDS FOR KAPPA
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
N = TOTAL NUMBER
OF RATED ITEMS,
ALSO CALLED
“CASES”
CALCULATING K BY HAND USING
A CONTINGENCY TABLE
RATER 1
R
A
T
E
R
2
THE SIZE OF
THE TABLE IS
DETERMINED BY
HOW MANY
CODING
CATEGORIES
YOU HAVE
THIS EXAMPLE
ASSUMES THAT
YOUR UNITS
CAN BE SORTED
INTO THREE
CATEGORIES,
HENCE A 3X3
GRID
A B C
A
B
C
CALCULATING K BY HAND USING
A CONTINGENCY TABLE
# of agreements on A disagreement disagreement
disagreement # of agreements on B disagreement
disagreement disagreement # of agreements on C
RATER 1
R
A
T
E
R
2
THE DIAGONAL
HIGHLIGHTED
HERE
REPRESENTS
AGREEMENT
(WHERE THE
TWO RATERS
BOTH MARK THE
SAME THING)
A B C
A
B
C
DATA: RATING BLOG COMMENTS
USING A RANDOM NUMBER TABLE, I PULLED COMMENTS
FROM ENGLISH LANGUAGE BLOGS ON BLOGGER.COM
UNTIL I HAD A SAMPLE OF 10 COMMENTS
I ASKED R&W COLLEAGUES TO RATE EACH COMMENT:
“PLEASE CATEGORIZE EACH USING THE FOLLOWING
CHOICES: RELEVANT, SPAM, OR OTHER.”
WE CAN NOW CALCULATE AGREEMENT BETWEEN ANY
TWO RATERS
DATA: RATERS 1-5
Item # 1 2 3 4 5 6 7 8 9 10
Rater 1 R R R R R R R R R S
Rater 2 S R R O R R R R O S
Rater 3 R R R O R R O O R S
Rater 4 R R R R R R R R R S
Rater 5 S R R O R O O R R S
CALCULATING K FOR
RATERS 1 & 2
6
(Item #2,3, 4-8)
0 0
1
(Item #1)
1
(Item #10)
0
2
(Item #4 & 9)
0 0
RATER 1
R
A
T
E
R
2
R S O
R
S
O
6
2
2
9 1 0 10
ADD
ROWS &
COLUMNS
SINCE WE
HAVE 10
ITEMS,
THE
TOTALS
SHOULD
ADD UP
TO 10 FOR
EACH
CALCULATING K
COMPUTING SIMPLE AGREEMENT
6
(Item #2,3, 4-8)
0 0
1
(Item #1)
1
(Item #10)
0
2
(Item #4 & 9)
0 0
RATER 1
R
A
T
E
R
2
R S O
R
S
O
(6+1)/10
ADD VALUES
OF DIAGONAL
CELLS &
DIVIDE BY
TOTAL
NUMBER OF
CASES TO
COMPUTE
SIMPLE
AGREEMENT
OR
“PR(A)”
THE EQUATION FOR K:
RATERS 1 & 2
K = 7 - Pr(e)
10 -Pr(e)
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
RATERS 1 & 2 AGREED ON 70%
OF THE CASES. BUT HOW
MUCH OF THAT AGREEMENT
WAS BY CHANCE?
WE ALSO SUBSTITUTE 10 AS
THE VALUE OF N
THE EQUATION FOR K:
RATERS 1 & 2
K = 7 - Pr(e)
10 -Pr(e)
WE CAN NOW ENTER THE VALUE OF PR(A)
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
RATERS 1 & 2 AGREED ON 70%
OF THE CASES. BUT HOW
MUCH OF THAT AGREEMENT
WAS BY CHANCE?
WE ALSO SUBSTITUTE 10 AS
THE VALUE OF N
THE EQUATION FOR K:
RATERS 1 & 2
K = 7 - Pr(e)
10 -Pr(e)
WE CAN NOW ENTER THE VALUE OF PR(A)
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
RATERS 1 & 2 AGREED ON 70%
OF THE CASES. BUT HOW
MUCH OF THAT AGREEMENT
WAS BY CHANCE?
WE ALSO SUBSTITUTE 10 AS
THE VALUE OF N
CALCULATING K
EXPECTED FREQUENCY OF CHANCE AGREEMENT
6
(5.4)
0 0
1
(Item #1)
1
(.2)
0
2
(Item #4 & 9)
0
0
(0)
RATER 1
R
A
T
E
R
2
R S O
R
S
O
FOR EACH
DIAGONAL
CELL, WE
COMPUTE
EXPECTED
FREQUENCY OF
CHANCE (EF)
EF =
ROW TOTAL X COL TOTAL
TOTAL # OF CASES
EF FOR “RELEVANT” = (6*9)/10 = 5.4
CALCULATING K
EXPECTED FREQUENCY OF CHANCE AGREEMENT
6
(5.4)
0 0
1
(Item #1)
1
(.2)
0
2
(Item #4 & 9)
0
0
(0)
RATER 1
R
A
T
E
R
2
R S O
R
S
O
ADD ALL
VALUES OF(EF)
TO GET
“PR(E”
PR(E)=
5.4 + .2 + 0 =
5.6
THE EQUATION FOR K:
RATERS 1 & 2
K = 7 - 5.6
10 - 5.6
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
K = .3182
THIS IS FAR BELOW THE ACCEPTABLE
LEVEL OF AGREEMENT, WHICH SHOULD
BE AT LEAST .70
THE EQUATION FOR K:
RATERS 1 & 2
K = 7 - 5.6
10 - 5.6
WE CAN NOW ENTER THE VALUE OF PR(E)
& COMPUTE KAPPA
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
K = .3182
THIS IS FAR BELOW THE ACCEPTABLE
LEVEL OF AGREEMENT, WHICH SHOULD
BE AT LEAST .70
THE EQUATION FOR K:
RATERS 1 & 2
K = 7 - 5.6
10 - 5.6
WE CAN NOW ENTER THE VALUE OF PR(E)
& COMPUTE KAPPA
PR(A) = SIMPLE
AGREEMENT AMONG
RATERS
PR(E) = LIKLIHOOD
THAT AGREEMENT IS
ATTRIBUTABLE TO
CHANCE
K = .3182
THIS IS FAR BELOW THE ACCEPTABLE
LEVEL OF AGREEMENT, WHICH SHOULD
BE AT LEAST .70
DATA: RATERS 1& 2
Item # 1 2 3 4 5 6 7 8 9 10
Rater 1 R R R R R R R R R S
Rater 2 S R R O R R R R O S
K = .3182 How can we improve?
•LOOK FOR THE PATTERN IN DISAGREEMENTS CAN SOMETHING
ABOUT THE CODING SCHEME BE CLARIFIED?
•TOTAL # OF CASES IS LOW, COULD BE ALLOWING A FEW STICKY
CASES TO DISPROPORTIONALLY INFLUENCE AGREEMENT
DATA: RATERS 1-5
Item # 1 2 3 4 5 6 7 8 9 10
Rater 1 R R R R R R R R R S
Rater 2 S R R O R R R R O S
Rater 3 R R R O R R O O R S
Rater 4 R R R R R R R R R S
Rater 5 S R R O R O O R R S
CASE 1 SHOWS A PATTERN OF DISAGREEMENT BETWEEN
“SPAM” & “RELEVANT,” WHILE CASE 4 SHOWS A PATTERN OF
DISAGREEMENT BETWEEN RELEVANT & OTHER
EXERCISES & QUESTIONS
Item # 1 2 3 4 5 6 7 8 9 10
Rater 1 R R R R R R R R R S
Rater 2 S R R O R R R R O S
Rater 3 R R R O R R O O R S
Rater 4 R R R R R R R R R S
Rater 5 S R R O R O O R R S
1. COMPUTE COHEN’S K FOR RATERS 3 & 5
2. REVISE THE CODING PROMPT TO ADDRESS PROBLEMS YOU
DETECT; GIVE YOUR NEW CODING SCHEME TO TWO RATERS AND
COMPUTE K TO SEE IF YOUR REVISIONS WORKED; BE PREPARED
TO TALK ABOUT WHAT CHANGES YOU MADE
3. COHEN’S KAPPA IS SAID TO BE A VERY CONSERVATIVE
MEASURE OF INTER-RATER RELIABILITY...CAN YOU EXPLAIN
WHY? WHAT ARE ITS LIMITATIONS AS YOU SEE THEM?
DO I HAVE TO DO THIS BY HAND?
NO, YOU COULD GO HERE:
http://faculty.vassar.edu/lowry/kappa.html

Contenu connexe

Dernier

Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinojohnmickonozaleda
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 

Dernier (20)

Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipino
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 

En vedette

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 

En vedette (20)

Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 

Using Cohen's Kappa to Gauge Interrater Reliability

  • 1. CALCULATING COHEN’S KAPPA A MEASURE OF INTER-RATER RELIABILITY FOR QUALITATIVE RESEARCH INVOLVING NOMINAL CODING
  • 2. WHAT IS COHEN’S KAPPA? COHEN’S KAPPA IS A STATISTICAL MEASURE CREATED BY JACOB COHEN IN 1960 TO BE A MORE ACCURATE MEASURE OF RELIABILITY BETWEEN TWO RATERS MAKING DECISONS ABOUT HOW A PARTICULAR UNIT OF ANALYSIS SHOULD BE CATEGORIZED. KAPPA MEASURES NOT ONLY THE % OF AGREEMENT BETWEEN TWO RATERS, IT ALSO CALCULATES THE DEGREE TO WHICH AGREEMENT CAN BE ATTRIBUTED TO CHANCE. JACOB COHEN, A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES, EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 20: 37–46, 1960.
  • 3. THE EQUATION FOR K K = Pr(a) - Pr(e) N-Pr(e) PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE N = TOTAL NUMBER OF RATED ITEMS, ALSO CALLED “CASES”
  • 4. THE EQUATION FOR K K = Pr(a) - Pr(e) N-Pr(e) THE FANCY “K” STANDS FOR KAPPA PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE N = TOTAL NUMBER OF RATED ITEMS, ALSO CALLED “CASES”
  • 5. THE EQUATION FOR K K = Pr(a) - Pr(e) N-Pr(e) THE FANCY “K” STANDS FOR KAPPA PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE N = TOTAL NUMBER OF RATED ITEMS, ALSO CALLED “CASES”
  • 6. CALCULATING K BY HAND USING A CONTINGENCY TABLE RATER 1 R A T E R 2 THE SIZE OF THE TABLE IS DETERMINED BY HOW MANY CODING CATEGORIES YOU HAVE THIS EXAMPLE ASSUMES THAT YOUR UNITS CAN BE SORTED INTO THREE CATEGORIES, HENCE A 3X3 GRID A B C A B C
  • 7. CALCULATING K BY HAND USING A CONTINGENCY TABLE # of agreements on A disagreement disagreement disagreement # of agreements on B disagreement disagreement disagreement # of agreements on C RATER 1 R A T E R 2 THE DIAGONAL HIGHLIGHTED HERE REPRESENTS AGREEMENT (WHERE THE TWO RATERS BOTH MARK THE SAME THING) A B C A B C
  • 8. DATA: RATING BLOG COMMENTS USING A RANDOM NUMBER TABLE, I PULLED COMMENTS FROM ENGLISH LANGUAGE BLOGS ON BLOGGER.COM UNTIL I HAD A SAMPLE OF 10 COMMENTS I ASKED R&W COLLEAGUES TO RATE EACH COMMENT: “PLEASE CATEGORIZE EACH USING THE FOLLOWING CHOICES: RELEVANT, SPAM, OR OTHER.” WE CAN NOW CALCULATE AGREEMENT BETWEEN ANY TWO RATERS
  • 9. DATA: RATERS 1-5 Item # 1 2 3 4 5 6 7 8 9 10 Rater 1 R R R R R R R R R S Rater 2 S R R O R R R R O S Rater 3 R R R O R R O O R S Rater 4 R R R R R R R R R S Rater 5 S R R O R O O R R S
  • 10. CALCULATING K FOR RATERS 1 & 2 6 (Item #2,3, 4-8) 0 0 1 (Item #1) 1 (Item #10) 0 2 (Item #4 & 9) 0 0 RATER 1 R A T E R 2 R S O R S O 6 2 2 9 1 0 10 ADD ROWS & COLUMNS SINCE WE HAVE 10 ITEMS, THE TOTALS SHOULD ADD UP TO 10 FOR EACH
  • 11. CALCULATING K COMPUTING SIMPLE AGREEMENT 6 (Item #2,3, 4-8) 0 0 1 (Item #1) 1 (Item #10) 0 2 (Item #4 & 9) 0 0 RATER 1 R A T E R 2 R S O R S O (6+1)/10 ADD VALUES OF DIAGONAL CELLS & DIVIDE BY TOTAL NUMBER OF CASES TO COMPUTE SIMPLE AGREEMENT OR “PR(A)”
  • 12. THE EQUATION FOR K: RATERS 1 & 2 K = 7 - Pr(e) 10 -Pr(e) PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE RATERS 1 & 2 AGREED ON 70% OF THE CASES. BUT HOW MUCH OF THAT AGREEMENT WAS BY CHANCE? WE ALSO SUBSTITUTE 10 AS THE VALUE OF N
  • 13. THE EQUATION FOR K: RATERS 1 & 2 K = 7 - Pr(e) 10 -Pr(e) WE CAN NOW ENTER THE VALUE OF PR(A) PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE RATERS 1 & 2 AGREED ON 70% OF THE CASES. BUT HOW MUCH OF THAT AGREEMENT WAS BY CHANCE? WE ALSO SUBSTITUTE 10 AS THE VALUE OF N
  • 14. THE EQUATION FOR K: RATERS 1 & 2 K = 7 - Pr(e) 10 -Pr(e) WE CAN NOW ENTER THE VALUE OF PR(A) PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE RATERS 1 & 2 AGREED ON 70% OF THE CASES. BUT HOW MUCH OF THAT AGREEMENT WAS BY CHANCE? WE ALSO SUBSTITUTE 10 AS THE VALUE OF N
  • 15. CALCULATING K EXPECTED FREQUENCY OF CHANCE AGREEMENT 6 (5.4) 0 0 1 (Item #1) 1 (.2) 0 2 (Item #4 & 9) 0 0 (0) RATER 1 R A T E R 2 R S O R S O FOR EACH DIAGONAL CELL, WE COMPUTE EXPECTED FREQUENCY OF CHANCE (EF) EF = ROW TOTAL X COL TOTAL TOTAL # OF CASES EF FOR “RELEVANT” = (6*9)/10 = 5.4
  • 16. CALCULATING K EXPECTED FREQUENCY OF CHANCE AGREEMENT 6 (5.4) 0 0 1 (Item #1) 1 (.2) 0 2 (Item #4 & 9) 0 0 (0) RATER 1 R A T E R 2 R S O R S O ADD ALL VALUES OF(EF) TO GET “PR(E” PR(E)= 5.4 + .2 + 0 = 5.6
  • 17. THE EQUATION FOR K: RATERS 1 & 2 K = 7 - 5.6 10 - 5.6 PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE K = .3182 THIS IS FAR BELOW THE ACCEPTABLE LEVEL OF AGREEMENT, WHICH SHOULD BE AT LEAST .70
  • 18. THE EQUATION FOR K: RATERS 1 & 2 K = 7 - 5.6 10 - 5.6 WE CAN NOW ENTER THE VALUE OF PR(E) & COMPUTE KAPPA PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE K = .3182 THIS IS FAR BELOW THE ACCEPTABLE LEVEL OF AGREEMENT, WHICH SHOULD BE AT LEAST .70
  • 19. THE EQUATION FOR K: RATERS 1 & 2 K = 7 - 5.6 10 - 5.6 WE CAN NOW ENTER THE VALUE OF PR(E) & COMPUTE KAPPA PR(A) = SIMPLE AGREEMENT AMONG RATERS PR(E) = LIKLIHOOD THAT AGREEMENT IS ATTRIBUTABLE TO CHANCE K = .3182 THIS IS FAR BELOW THE ACCEPTABLE LEVEL OF AGREEMENT, WHICH SHOULD BE AT LEAST .70
  • 20. DATA: RATERS 1& 2 Item # 1 2 3 4 5 6 7 8 9 10 Rater 1 R R R R R R R R R S Rater 2 S R R O R R R R O S K = .3182 How can we improve? •LOOK FOR THE PATTERN IN DISAGREEMENTS CAN SOMETHING ABOUT THE CODING SCHEME BE CLARIFIED? •TOTAL # OF CASES IS LOW, COULD BE ALLOWING A FEW STICKY CASES TO DISPROPORTIONALLY INFLUENCE AGREEMENT
  • 21. DATA: RATERS 1-5 Item # 1 2 3 4 5 6 7 8 9 10 Rater 1 R R R R R R R R R S Rater 2 S R R O R R R R O S Rater 3 R R R O R R O O R S Rater 4 R R R R R R R R R S Rater 5 S R R O R O O R R S CASE 1 SHOWS A PATTERN OF DISAGREEMENT BETWEEN “SPAM” & “RELEVANT,” WHILE CASE 4 SHOWS A PATTERN OF DISAGREEMENT BETWEEN RELEVANT & OTHER
  • 22. EXERCISES & QUESTIONS Item # 1 2 3 4 5 6 7 8 9 10 Rater 1 R R R R R R R R R S Rater 2 S R R O R R R R O S Rater 3 R R R O R R O O R S Rater 4 R R R R R R R R R S Rater 5 S R R O R O O R R S 1. COMPUTE COHEN’S K FOR RATERS 3 & 5 2. REVISE THE CODING PROMPT TO ADDRESS PROBLEMS YOU DETECT; GIVE YOUR NEW CODING SCHEME TO TWO RATERS AND COMPUTE K TO SEE IF YOUR REVISIONS WORKED; BE PREPARED TO TALK ABOUT WHAT CHANGES YOU MADE 3. COHEN’S KAPPA IS SAID TO BE A VERY CONSERVATIVE MEASURE OF INTER-RATER RELIABILITY...CAN YOU EXPLAIN WHY? WHAT ARE ITS LIMITATIONS AS YOU SEE THEM?
  • 23. DO I HAVE TO DO THIS BY HAND? NO, YOU COULD GO HERE: http://faculty.vassar.edu/lowry/kappa.html