SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
Automating CIRI Ratings of
Human Rights Reports Using GATE
Joshua Joiner and Karthikeyan Umapathy
School of Computing,
University of North Florida,
Jacksonville, FL USA 32224
R E S E A R C H C O N T E X T
This project involves parsing human rights reports
produced by the U.S Government and rating the human
practices for various countries. The U.S Human Rights
Reports are annual reports that cover internationally
recognized human rights practices in regards to individual,
civil, political, and worker rights.
T E X T M I N I N G T O O L
GATE is an open source text mining platform used for
developing custom text processing solutions.
G E N E R A T I N G C I R I R A T I N G U S I N G G A T E
C O N C L U S I O N S
In conclusion, I believe the automated process will not
provide a high accuracy when comparing to the CIRI
dataset because the dataset was compiled by humans. I do,
however, believe that processes involved in creating the
automated process can create a more objective standard
when analyzing country report text and producing ratings
for the human practices. There also needs to be more
patterns implemented within the automated process to
more accurately match with the qualitative text from the
U.S Human Rights Reports.
Text Mining of Human Rights Reports
Project Objective:
Denmark Physical Integrity
U.S. Department of
State
CIRI coders rely on a manual process of reading through
the Human Rights Reports and then applying ratings to
each human rights practice for each country.
• The objective of this project is to automate the process
of scouring the human rights country reports.
CIRI (Cingranelli-Richards) Human Rights Data Project
rates the human rights practices of the U.S. Human Rights
country reports. Students, scholars, policymakers, and
analysts use the CIRI ratings for practical and research
purposes.
CIRI Rating of Human Rights Reports
Standard ANNIE process flow:
C I R I R A T I N G S C O M P A R I S O N
Denmark Empowerment Rights
E V A L U A T I O N P L A N C O N T R I B U T I O N S
• CIRI Coding Annotation Processing Resource
• Custom JAPE patterns for keywords and
phrases.
• Custom annotations for entity extraction.
• Custom implementation of sentiment
analysis.
• Database Storage
• CIRI Dataset Source Ratings.
• Automatically generated CIRI Ratings.
The F-Measure scores in the tables above show which
type of ratings the automated system correctly matches.
KILL TORT POLPRIS DISAP
FORMOV DOMMOV ELECSD ASSN
WORKER SPEECH NEW_RELFRE

Contenu connexe

Plus de Karthikeyan Umapathy

2021 Florida Data Science for Social Good Big Reveal
2021 Florida Data Science for Social Good Big Reveal2021 Florida Data Science for Social Good Big Reveal
2021 Florida Data Science for Social Good Big RevealKarthikeyan Umapathy
 
2020 Florida Data Science for Social Good Big Reveal
2020 Florida Data Science for Social Good Big Reveal2020 Florida Data Science for Social Good Big Reveal
2020 Florida Data Science for Social Good Big RevealKarthikeyan Umapathy
 
Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...
Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...
Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...Karthikeyan Umapathy
 
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...Karthikeyan Umapathy
 
Collaborative Community Engagement: Bringing Data Science to Societal Challen...
Collaborative Community Engagement: Bringing Data Science to Societal Challen...Collaborative Community Engagement: Bringing Data Science to Societal Challen...
Collaborative Community Engagement: Bringing Data Science to Societal Challen...Karthikeyan Umapathy
 
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big RevealKarthikeyan Umapathy
 
2018 Academy Health Annual Research Meeting Poster
2018 Academy Health Annual Research Meeting Poster2018 Academy Health Annual Research Meeting Poster
2018 Academy Health Annual Research Meeting PosterKarthikeyan Umapathy
 
2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal Presentation
2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal Presentation2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal Presentation
2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal PresentationKarthikeyan Umapathy
 
Security and User Experience: A Holistic Model for CAPTCHA Usability Issues
Security and User Experience: A Holistic Model for CAPTCHA Usability IssuesSecurity and User Experience: A Holistic Model for CAPTCHA Usability Issues
Security and User Experience: A Holistic Model for CAPTCHA Usability IssuesKarthikeyan Umapathy
 
2017 Florida Data Science for Social Good Big Reveal
2017 Florida Data Science for Social Good Big Reveal2017 Florida Data Science for Social Good Big Reveal
2017 Florida Data Science for Social Good Big RevealKarthikeyan Umapathy
 
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...Karthikeyan Umapathy
 
UNF Computing Senior Capstone Project
UNF Computing Senior Capstone ProjectUNF Computing Senior Capstone Project
UNF Computing Senior Capstone ProjectKarthikeyan Umapathy
 
Leveraging Service Computing and Big Data Analytics for E-Commerce
Leveraging Service Computing and Big Data Analytics for E-CommerceLeveraging Service Computing and Big Data Analytics for E-Commerce
Leveraging Service Computing and Big Data Analytics for E-CommerceKarthikeyan Umapathy
 

Plus de Karthikeyan Umapathy (13)

2021 Florida Data Science for Social Good Big Reveal
2021 Florida Data Science for Social Good Big Reveal2021 Florida Data Science for Social Good Big Reveal
2021 Florida Data Science for Social Good Big Reveal
 
2020 Florida Data Science for Social Good Big Reveal
2020 Florida Data Science for Social Good Big Reveal2020 Florida Data Science for Social Good Big Reveal
2020 Florida Data Science for Social Good Big Reveal
 
Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...
Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...
Dashboard for Extracting Regional Insights and Ranking Food Deserts in Northe...
 
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
Developing a GIS Dashboard Tool to Inform Non-Profit Hospitals of Community H...
 
Collaborative Community Engagement: Bringing Data Science to Societal Challen...
Collaborative Community Engagement: Bringing Data Science to Societal Challen...Collaborative Community Engagement: Bringing Data Science to Societal Challen...
Collaborative Community Engagement: Bringing Data Science to Societal Challen...
 
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
2019 Florida Data Science for Social Good (FL-DSSG) Big Reveal
 
2018 Academy Health Annual Research Meeting Poster
2018 Academy Health Annual Research Meeting Poster2018 Academy Health Annual Research Meeting Poster
2018 Academy Health Annual Research Meeting Poster
 
2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal Presentation
2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal Presentation2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal Presentation
2018 Florida Data Science for Social Good (FL-DSSG) Big Reveal Presentation
 
Security and User Experience: A Holistic Model for CAPTCHA Usability Issues
Security and User Experience: A Holistic Model for CAPTCHA Usability IssuesSecurity and User Experience: A Holistic Model for CAPTCHA Usability Issues
Security and User Experience: A Holistic Model for CAPTCHA Usability Issues
 
2017 Florida Data Science for Social Good Big Reveal
2017 Florida Data Science for Social Good Big Reveal2017 Florida Data Science for Social Good Big Reveal
2017 Florida Data Science for Social Good Big Reveal
 
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
A Research Plan to Study Impact of a Collaborative Web Search Tool on Novice'...
 
UNF Computing Senior Capstone Project
UNF Computing Senior Capstone ProjectUNF Computing Senior Capstone Project
UNF Computing Senior Capstone Project
 
Leveraging Service Computing and Big Data Analytics for E-Commerce
Leveraging Service Computing and Big Data Analytics for E-CommerceLeveraging Service Computing and Big Data Analytics for E-Commerce
Leveraging Service Computing and Big Data Analytics for E-Commerce
 

Dernier

Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...boychatmate1
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligencePriyadharshiniG41
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 

Dernier (20)

Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligence
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 

Automating CIRI Ratings of Human Rights Reports Using GATE: Evaluation Results

  • 1. Automating CIRI Ratings of Human Rights Reports Using GATE Joshua Joiner and Karthikeyan Umapathy School of Computing, University of North Florida, Jacksonville, FL USA 32224 R E S E A R C H C O N T E X T This project involves parsing human rights reports produced by the U.S Government and rating the human practices for various countries. The U.S Human Rights Reports are annual reports that cover internationally recognized human rights practices in regards to individual, civil, political, and worker rights. T E X T M I N I N G T O O L GATE is an open source text mining platform used for developing custom text processing solutions. G E N E R A T I N G C I R I R A T I N G U S I N G G A T E C O N C L U S I O N S In conclusion, I believe the automated process will not provide a high accuracy when comparing to the CIRI dataset because the dataset was compiled by humans. I do, however, believe that processes involved in creating the automated process can create a more objective standard when analyzing country report text and producing ratings for the human practices. There also needs to be more patterns implemented within the automated process to more accurately match with the qualitative text from the U.S Human Rights Reports. Text Mining of Human Rights Reports Project Objective: Denmark Physical Integrity U.S. Department of State CIRI coders rely on a manual process of reading through the Human Rights Reports and then applying ratings to each human rights practice for each country. • The objective of this project is to automate the process of scouring the human rights country reports. CIRI (Cingranelli-Richards) Human Rights Data Project rates the human rights practices of the U.S. Human Rights country reports. Students, scholars, policymakers, and analysts use the CIRI ratings for practical and research purposes. CIRI Rating of Human Rights Reports Standard ANNIE process flow: C I R I R A T I N G S C O M P A R I S O N Denmark Empowerment Rights E V A L U A T I O N P L A N C O N T R I B U T I O N S • CIRI Coding Annotation Processing Resource • Custom JAPE patterns for keywords and phrases. • Custom annotations for entity extraction. • Custom implementation of sentiment analysis. • Database Storage • CIRI Dataset Source Ratings. • Automatically generated CIRI Ratings. The F-Measure scores in the tables above show which type of ratings the automated system correctly matches. KILL TORT POLPRIS DISAP FORMOV DOMMOV ELECSD ASSN WORKER SPEECH NEW_RELFRE