SlideShare une entreprise Scribd logo
1  sur  33
Télécharger pour lire hors ligne
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Bringing Medieval Occitan to Life:
Visualization Analytics
July 6, 2017
AIEO, Albi 2017
Olga Scrivner and Sandra K¨ubler
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Digital Humanities - Transformation
The “epic transformation of archives” - shifting from print to
digital archival form (Folsom, 2007)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Digital Humanities
“As our collective knowledge continues to be digitized and
stored (...) it becomes more difficult to find and discover
what we are looking for.” (Blei 2012)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Visual Analytics in Literature
“The science of analytical reasoning facilitated by
visual interactive interfaces”
(Thomas et al., 2005)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Close Reading
Concept Micro-analysis (Jockers, 2013)
Close textual analysis of individual texts to
“unveil words, verbal images, elements of style,
sentences, argument patterns” (Jasinski, 2001)
Methods Color coding, marginal comments, underlining
Tools Poem Viewer, PRISM, Juxta, eMargin
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Close Reading Visualization: eMargin and
JUXTA
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Distant Reading
Concept Macro-analysis (Jockers, 2013)
“the construction of abstract models”
(Jasinski, 2001)
Methods Tag clouds, heat maps, clusters, topics,
network graphs
Tools GUI: Voyant, Papermachine
TUI: Mallet, Meta, R and Python packages
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Visualization Methods in Literature
Computer-assisted methods for text analysis can “offer new
and unexpected insights and knowledge to the literary
scholar” (Oelke et al., 2012)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Visualization Methods in Literature
Computer-assisted methods for text analysis can “offer new
and unexpected insights and knowledge to the literary
scholar” (Oelke et al., 2012)
Word clouds to analyze a novel (Vuillemot et al., 2009)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Visualization Methods in Literature
Social network graphs of characters in Greek tragedies
(Rydberg-Cox, 2011)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Visualization Methods in Literature
Literary fingerprint and summaries (Oelke et al., 2012)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Visualization Methods in Literature
Tracking emotion and sentiment in fairy tales
(Mohammad, 2012)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Topic Modeling
Discovering underlying theme of text collections (Blei, 2012)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Technological and Methodological Obstacles
Many tools require some programming skills (Mallet,
Meta, R and Python libraries)
GUI tools are limited to certain formats and functions
(Voyant, PaperMachine)
Lack of active control by users
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Our Goals - Interactive Text Mining Suite
A user-friendly interactive tool for quantitative and
visualization analysis
Designed for linguistic and literary analysis
Incorporation of annotated corpora in macro-analysis
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Background
1. R - a free programming language for statistical
computing and graphics
2. RStudio - Integrated Development Environment: a
source code editor, an executor and a debugger
3. Shiny App - a web application framework for R
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
ITMS - Interactive Text Mining Suite
Platform-independent, user-friendly and interactive
State-of-the-art statistical and graphical tools (R
libraries)
http://www.interactivetextminingsuite.com
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Multi-Functional
1. Import txt, pdf, rdf and Google books API
2. Metadata extraction
3. Interactive data pre-processing
4. Dynamic visualization
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Case Study - Medieval Occitan
Occitan (Proven¸cal) constitutes an important element of the
literary, linguistic, and cultural heritage in the history of
Romance languages
Interactive online database and linguistically annotated
corpus (Scrivner et al., 2014)
http://www.oldoccitancorpus.org
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Comparative Analysis: Original and Translation
Lexical level
Grammatical level (part-of-speech)
Stylistic level (sentence length, punctuation)
Document level (cluster, topic analysis)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Bigrams
Bigrams are occurrences of two consecutive words observed
in the text (genre, text classification, discourse features)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Bigram - Lexical Comparison: HE / SHE
went
left
found
had
could
took
were
did
wanted
would
gave
was
said
will
does
saw
knew
might
should
can
has
is
who
asked
0.25x 0.5x Same 2x 4x
Relative appearance after 'she' compared to 'he'
More 'she'
More 'he'
Women asked while men went
Words paired with 'he' and 'she'
She - asked and modal verbs: could, might, can
He - action verbs: went, left, found
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Bigram - Archimbaut / Flamenca
distressed
jealous
lady
lord
order
left
leave
marguerite
dear
giving
head
knew
troubled
replied
asked
found
gave
heard
heart
lay
0.5x Same 2x
logratio
More 'Flamenca'
More 'Archambaut'
Words paired with 'Archambaut' and 'Flamenca'
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Bigram - Archimbaut / Flamenca
estet
cor
demanda
venc
dis
0.25x 0.5x Same 2x
Relative appearance after 'Flamenca' compared to 'Archimbaut(z)'
More 'Flamenca'
More 'Archimbaut'
Words paired with 'Archimbaut(z)' and 'Flamenca'
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Grammatical Level - Part-of-Speech
Occitan corpus English translation
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Stylistic Similarities - Sentence Length
Occitan Corpus English Translation
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Stylistic Comparison - Punctuation
Occitan Corpus English Translation
Question marks and exclamation marks - red; quotation marks, hyphens and parenthesis - green;
semicolons, colons, commas, periods - blue
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Document Level - Network Analyis
Network - “resembling a net (..) to capture the notion of
elements in a system and their interconnectedness”
(Kolaczyk, 2009)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Document Level - Cluster Analysis
Cluster analysis - groups documents into subgroups. These
subgroups “are coherent internally, but clearly different from
each other”
(Manning, 2009)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Document Level - Topic Analysis
Text collections - “represented as random mixtures over
latent topics, where each topic is characterized by a
distribution over words”
(Blei, 2003)
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
Conclusion
1. There is a need for text mining tools designed for
linguists and literary scholars
2. Interactive user-friendly applications bridge the gap
between data visualization and digital humanities
3. Shiny framework can be incorporated in any digital
corpora to exhibit, search or visualize written collections
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
ITMS
Browser and Smart Phone
Questions, comments
https:
//languagevariationsuite.wordpress.com/
Bringing Medieval
Occitan to Life:
Visualization
Analytics
Introduction
Visualization
Methods
ITMS
Medieval Corpus
Conclusion
References
Mohammad, Saif. 2013. From Once Upon a Time to Happily Ever After:
Tracking Emotions in Novels and Fairy Tales. In Proceedings of the ACL
Workshop on Language Technology for Cultural Heritage, Social Sciences, and
Humanities (LaTeCH), 2011, Portland, OR.
Moretti, Franco. 2005. Graphs, maps, trees: abstract models for a literary
history. R.R. Donnelley & Sons.
Oelke, Daniela, Dimitrios Kokkinakis and Mats Malm. 2012. Advanced Visual
Analytics Methods for Literature Analysis. In Proceedings of the 6th EACL
Workshop, 35-44.
Rydberg-Cox, Jeff. 2011. Social Networks and the Language of Greek
Tragedy. Journal of the Chicago Colloquium on Digital Humanities and
Computer Science. 1(3): 1-11.
Thomas, James and Kristin Cook. 2005. Illuminating the Path: the Research
and Development Agenda for Visual Analytics. National Visualization and
Analytics Center.
Vuillemot, Romain, Tanya Clement, Catherine Plaisant and Amit Kumar.
2009. What’s Being Near “Martha”? Exploring Name Entities in Literary Text
Collections. In Proceedings if the IEEE Symposium. Atlantic City, New Jersey.
107-114.
http://www.clipartbest.com/clipart-9i4A55xiE

Contenu connexe

Similaire à Data Visualization for Literary Analysis

EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageMax Kaiser
 
20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanities20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanitiesStefan Gradmann
 
"some crauen scruple/ Of thinking too precisely": democratization, dialogue, ...
"some crauen scruple/ Of thinking too precisely": democratization, dialogue, ..."some crauen scruple/ Of thinking too precisely": democratization, dialogue, ...
"some crauen scruple/ Of thinking too precisely": democratization, dialogue, ...Pip Willcox
 
Electronic literature and its place in digital library
Electronic literature and its place in digital libraryElectronic literature and its place in digital library
Electronic literature and its place in digital libraryAlexandr Belov
 
A Semantic Multimedia Web (Part 3)
A Semantic Multimedia Web (Part 3)A Semantic Multimedia Web (Part 3)
A Semantic Multimedia Web (Part 3)Raphael Troncy
 
Linked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the CitizenLinked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the CitizenStefan Gradmann
 
Wikibhasha by Dr A Kumaran
Wikibhasha by  Dr A KumaranWikibhasha by  Dr A Kumaran
Wikibhasha by Dr A KumaranNIFT
 
E lit Research
E lit ResearchE lit Research
E lit Researchsusiswo
 
Electronic Literature
Electronic LiteratureElectronic Literature
Electronic LiteratureSiswo Harsono
 
Recontextualizing Audiovisual Archives: Immigrants and Remixing practices
Recontextualizing Audiovisual Archives: Immigrants and Remixing practicesRecontextualizing Audiovisual Archives: Immigrants and Remixing practices
Recontextualizing Audiovisual Archives: Immigrants and Remixing practicesMariana Salgado
 
Seville2000
Seville2000Seville2000
Seville2000behem0t
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for EuropeanaAntoine Isaac
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - OntologiesSerge Linckels
 
chinchor_nvac_may06
chinchor_nvac_may06chinchor_nvac_may06
chinchor_nvac_may06webuploader
 
Bryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 MbBryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 Mbguestd67478
 
MarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meetingMarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meetingmdabrowski
 
Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries mdabrowski
 
Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...Olaf Janssen
 
Building Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic WebBuilding Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic WebSebastian Ryszard Kruk
 
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHAN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHgerogepatton
 

Similaire à Data Visualization for Literary Analysis (20)

EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital Heritage
 
20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanities20110324 linked openeuropeanahumanities
20110324 linked openeuropeanahumanities
 
"some crauen scruple/ Of thinking too precisely": democratization, dialogue, ...
"some crauen scruple/ Of thinking too precisely": democratization, dialogue, ..."some crauen scruple/ Of thinking too precisely": democratization, dialogue, ...
"some crauen scruple/ Of thinking too precisely": democratization, dialogue, ...
 
Electronic literature and its place in digital library
Electronic literature and its place in digital libraryElectronic literature and its place in digital library
Electronic literature and its place in digital library
 
A Semantic Multimedia Web (Part 3)
A Semantic Multimedia Web (Part 3)A Semantic Multimedia Web (Part 3)
A Semantic Multimedia Web (Part 3)
 
Linked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the CitizenLinked Open Europeana: Semantics for the Citizen
Linked Open Europeana: Semantics for the Citizen
 
Wikibhasha by Dr A Kumaran
Wikibhasha by  Dr A KumaranWikibhasha by  Dr A Kumaran
Wikibhasha by Dr A Kumaran
 
E lit Research
E lit ResearchE lit Research
E lit Research
 
Electronic Literature
Electronic LiteratureElectronic Literature
Electronic Literature
 
Recontextualizing Audiovisual Archives: Immigrants and Remixing practices
Recontextualizing Audiovisual Archives: Immigrants and Remixing practicesRecontextualizing Audiovisual Archives: Immigrants and Remixing practices
Recontextualizing Audiovisual Archives: Immigrants and Remixing practices
 
Seville2000
Seville2000Seville2000
Seville2000
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - Ontologies
 
chinchor_nvac_may06
chinchor_nvac_may06chinchor_nvac_may06
chinchor_nvac_may06
 
Bryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 MbBryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 Mb
 
MarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meetingMarcOnt Initiative - Protege meeting
MarcOnt Initiative - Protege meeting
 
Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries
 
Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...
 
Building Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic WebBuilding Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic Web
 
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECHAN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
AN ONTOLOGICAL ANALYSIS AND NATURAL LANGUAGE PROCESSING OF FIGURES OF SPEECH
 

Plus de Olga Scrivner

Engaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptxEngaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptxOlga Scrivner
 
HICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning TechnologiesHICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning TechnologiesOlga Scrivner
 
The power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systemsThe power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systemsOlga Scrivner
 
Cognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use DisorderCognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use DisorderOlga Scrivner
 
Introduction to Web Scraping with Python
Introduction to Web Scraping with PythonIntroduction to Web Scraping with Python
Introduction to Web Scraping with PythonOlga Scrivner
 
Call for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and TechnologyCall for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and TechnologyOlga Scrivner
 
Jupyter machine learning crash course
Jupyter machine learning crash courseJupyter machine learning crash course
Jupyter machine learning crash courseOlga Scrivner
 
R and RMarkdown crash course
R and RMarkdown crash courseR and RMarkdown crash course
R and RMarkdown crash courseOlga Scrivner
 
The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...Olga Scrivner
 
If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...Olga Scrivner
 
Introduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web ApplicationIntroduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web ApplicationOlga Scrivner
 
Introduction to Overleaf Workshop
Introduction to Overleaf WorkshopIntroduction to Overleaf Workshop
Introduction to Overleaf WorkshopOlga Scrivner
 
R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303Olga Scrivner
 
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisWorkshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisOlga Scrivner
 
Gender Disparity in Employment and Education
Gender Disparity in Employment and EducationGender Disparity in Employment and Education
Gender Disparity in Employment and EducationOlga Scrivner
 
CrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for BeginnersCrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for BeginnersOlga Scrivner
 
Optimizing Data Analysis: Web application with Shiny
Optimizing Data Analysis: Web application with ShinyOptimizing Data Analysis: Web application with Shiny
Optimizing Data Analysis: Web application with ShinyOlga Scrivner
 
Data Analysis and Visualization: R Workflow
Data Analysis and Visualization: R WorkflowData Analysis and Visualization: R Workflow
Data Analysis and Visualization: R WorkflowOlga Scrivner
 
Reproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid dataReproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid dataOlga Scrivner
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFOlga Scrivner
 

Plus de Olga Scrivner (20)

Engaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptxEngaging Students Competition and Polls.pptx
Engaging Students Competition and Polls.pptx
 
HICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning TechnologiesHICSS ATLT: Advances in Teaching and Learning Technologies
HICSS ATLT: Advances in Teaching and Learning Technologies
 
The power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systemsThe power of unstructured data: Recommendation systems
The power of unstructured data: Recommendation systems
 
Cognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use DisorderCognitive executive functions and Opioid Use Disorder
Cognitive executive functions and Opioid Use Disorder
 
Introduction to Web Scraping with Python
Introduction to Web Scraping with PythonIntroduction to Web Scraping with Python
Introduction to Web Scraping with Python
 
Call for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and TechnologyCall for paper Collaboration Systems and Technology
Call for paper Collaboration Systems and Technology
 
Jupyter machine learning crash course
Jupyter machine learning crash courseJupyter machine learning crash course
Jupyter machine learning crash course
 
R and RMarkdown crash course
R and RMarkdown crash courseR and RMarkdown crash course
R and RMarkdown crash course
 
The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...The Impact of Language Requirement on Students' Performance, Retention, and M...
The Impact of Language Requirement on Students' Performance, Retention, and M...
 
If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...
 
Introduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web ApplicationIntroduction to Interactive Shiny Web Application
Introduction to Interactive Shiny Web Application
 
Introduction to Overleaf Workshop
Introduction to Overleaf WorkshopIntroduction to Overleaf Workshop
Introduction to Overleaf Workshop
 
R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303R crash course for Business Analytics Course K303
R crash course for Business Analytics Course K303
 
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisWorkshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
 
Gender Disparity in Employment and Education
Gender Disparity in Employment and EducationGender Disparity in Employment and Education
Gender Disparity in Employment and Education
 
CrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for BeginnersCrashCourse: Python with DataCamp and Jupyter for Beginners
CrashCourse: Python with DataCamp and Jupyter for Beginners
 
Optimizing Data Analysis: Web application with Shiny
Optimizing Data Analysis: Web application with ShinyOptimizing Data Analysis: Web application with Shiny
Optimizing Data Analysis: Web application with Shiny
 
Data Analysis and Visualization: R Workflow
Data Analysis and Visualization: R WorkflowData Analysis and Visualization: R Workflow
Data Analysis and Visualization: R Workflow
 
Reproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid dataReproducible visual analytics of public opioid data
Reproducible visual analytics of public opioid data
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVF
 

Dernier

Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Valters Lauzums
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...ssuserf63bd7
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证ppy8zfkfm
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024patrickdtherriault
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证acoha1
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationmuqadasqasim10
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证pwgnohujw
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjadimosmejiaslendon
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证zifhagzkk
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444saurabvyas476
 
obat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di Bontang
obat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di  Bontangobat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di  Bontang
obat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di Bontangsiskavia95
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.pptRachmaGhifari
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...ThinkInnovation
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshareraiaryan448
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsBrainSell Technologies
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证pwgnohujw
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeBoston Institute of Analytics
 
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...siskavia95
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunksgmuir1066
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...yulianti213969
 

Dernier (20)

Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic information
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444
 
obat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di Bontang
obat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di  Bontangobat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di  Bontang
obat aborsi Bontang wa 082135199655 jual obat aborsi cytotec asli di Bontang
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data Analytics
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
 

Data Visualization for Literary Analysis

  • 1. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Bringing Medieval Occitan to Life: Visualization Analytics July 6, 2017 AIEO, Albi 2017 Olga Scrivner and Sandra K¨ubler
  • 2. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Digital Humanities - Transformation The “epic transformation of archives” - shifting from print to digital archival form (Folsom, 2007)
  • 3. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Digital Humanities “As our collective knowledge continues to be digitized and stored (...) it becomes more difficult to find and discover what we are looking for.” (Blei 2012)
  • 4. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Visual Analytics in Literature “The science of analytical reasoning facilitated by visual interactive interfaces” (Thomas et al., 2005)
  • 5. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Close Reading Concept Micro-analysis (Jockers, 2013) Close textual analysis of individual texts to “unveil words, verbal images, elements of style, sentences, argument patterns” (Jasinski, 2001) Methods Color coding, marginal comments, underlining Tools Poem Viewer, PRISM, Juxta, eMargin
  • 6. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Close Reading Visualization: eMargin and JUXTA
  • 7. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Distant Reading Concept Macro-analysis (Jockers, 2013) “the construction of abstract models” (Jasinski, 2001) Methods Tag clouds, heat maps, clusters, topics, network graphs Tools GUI: Voyant, Papermachine TUI: Mallet, Meta, R and Python packages
  • 8. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Visualization Methods in Literature Computer-assisted methods for text analysis can “offer new and unexpected insights and knowledge to the literary scholar” (Oelke et al., 2012)
  • 9. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Visualization Methods in Literature Computer-assisted methods for text analysis can “offer new and unexpected insights and knowledge to the literary scholar” (Oelke et al., 2012) Word clouds to analyze a novel (Vuillemot et al., 2009)
  • 10. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Visualization Methods in Literature Social network graphs of characters in Greek tragedies (Rydberg-Cox, 2011)
  • 11. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Visualization Methods in Literature Literary fingerprint and summaries (Oelke et al., 2012)
  • 12. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Visualization Methods in Literature Tracking emotion and sentiment in fairy tales (Mohammad, 2012)
  • 13. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Topic Modeling Discovering underlying theme of text collections (Blei, 2012)
  • 14. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Technological and Methodological Obstacles Many tools require some programming skills (Mallet, Meta, R and Python libraries) GUI tools are limited to certain formats and functions (Voyant, PaperMachine) Lack of active control by users
  • 15. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Our Goals - Interactive Text Mining Suite A user-friendly interactive tool for quantitative and visualization analysis Designed for linguistic and literary analysis Incorporation of annotated corpora in macro-analysis
  • 16. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Background 1. R - a free programming language for statistical computing and graphics 2. RStudio - Integrated Development Environment: a source code editor, an executor and a debugger 3. Shiny App - a web application framework for R
  • 17. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion ITMS - Interactive Text Mining Suite Platform-independent, user-friendly and interactive State-of-the-art statistical and graphical tools (R libraries) http://www.interactivetextminingsuite.com
  • 18. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Multi-Functional 1. Import txt, pdf, rdf and Google books API 2. Metadata extraction 3. Interactive data pre-processing 4. Dynamic visualization
  • 19. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Case Study - Medieval Occitan Occitan (Proven¸cal) constitutes an important element of the literary, linguistic, and cultural heritage in the history of Romance languages Interactive online database and linguistically annotated corpus (Scrivner et al., 2014) http://www.oldoccitancorpus.org
  • 20. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Comparative Analysis: Original and Translation Lexical level Grammatical level (part-of-speech) Stylistic level (sentence length, punctuation) Document level (cluster, topic analysis)
  • 21. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Bigrams Bigrams are occurrences of two consecutive words observed in the text (genre, text classification, discourse features)
  • 22. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Bigram - Lexical Comparison: HE / SHE went left found had could took were did wanted would gave was said will does saw knew might should can has is who asked 0.25x 0.5x Same 2x 4x Relative appearance after 'she' compared to 'he' More 'she' More 'he' Women asked while men went Words paired with 'he' and 'she' She - asked and modal verbs: could, might, can He - action verbs: went, left, found
  • 23. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Bigram - Archimbaut / Flamenca distressed jealous lady lord order left leave marguerite dear giving head knew troubled replied asked found gave heard heart lay 0.5x Same 2x logratio More 'Flamenca' More 'Archambaut' Words paired with 'Archambaut' and 'Flamenca'
  • 24. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Bigram - Archimbaut / Flamenca estet cor demanda venc dis 0.25x 0.5x Same 2x Relative appearance after 'Flamenca' compared to 'Archimbaut(z)' More 'Flamenca' More 'Archimbaut' Words paired with 'Archimbaut(z)' and 'Flamenca'
  • 25. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Grammatical Level - Part-of-Speech Occitan corpus English translation
  • 26. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Stylistic Similarities - Sentence Length Occitan Corpus English Translation
  • 27. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Stylistic Comparison - Punctuation Occitan Corpus English Translation Question marks and exclamation marks - red; quotation marks, hyphens and parenthesis - green; semicolons, colons, commas, periods - blue
  • 28. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Document Level - Network Analyis Network - “resembling a net (..) to capture the notion of elements in a system and their interconnectedness” (Kolaczyk, 2009)
  • 29. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Document Level - Cluster Analysis Cluster analysis - groups documents into subgroups. These subgroups “are coherent internally, but clearly different from each other” (Manning, 2009)
  • 30. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Document Level - Topic Analysis Text collections - “represented as random mixtures over latent topics, where each topic is characterized by a distribution over words” (Blei, 2003)
  • 31. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion Conclusion 1. There is a need for text mining tools designed for linguists and literary scholars 2. Interactive user-friendly applications bridge the gap between data visualization and digital humanities 3. Shiny framework can be incorporated in any digital corpora to exhibit, search or visualize written collections
  • 32. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion ITMS Browser and Smart Phone Questions, comments https: //languagevariationsuite.wordpress.com/
  • 33. Bringing Medieval Occitan to Life: Visualization Analytics Introduction Visualization Methods ITMS Medieval Corpus Conclusion References Mohammad, Saif. 2013. From Once Upon a Time to Happily Ever After: Tracking Emotions in Novels and Fairy Tales. In Proceedings of the ACL Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH), 2011, Portland, OR. Moretti, Franco. 2005. Graphs, maps, trees: abstract models for a literary history. R.R. Donnelley & Sons. Oelke, Daniela, Dimitrios Kokkinakis and Mats Malm. 2012. Advanced Visual Analytics Methods for Literature Analysis. In Proceedings of the 6th EACL Workshop, 35-44. Rydberg-Cox, Jeff. 2011. Social Networks and the Language of Greek Tragedy. Journal of the Chicago Colloquium on Digital Humanities and Computer Science. 1(3): 1-11. Thomas, James and Kristin Cook. 2005. Illuminating the Path: the Research and Development Agenda for Visual Analytics. National Visualization and Analytics Center. Vuillemot, Romain, Tanya Clement, Catherine Plaisant and Amit Kumar. 2009. What’s Being Near “Martha”? Exploring Name Entities in Literary Text Collections. In Proceedings if the IEEE Symposium. Atlantic City, New Jersey. 107-114. http://www.clipartbest.com/clipart-9i4A55xiE