Open Data in
Data Journalists' Workflow
Institute of Mathematics and Computer
Science, University of Latvia
National Library of Latvia
Uldis Bojārs (@CaptSolo)
ODW-2013 – 24-Apr-2013
National Library of Latvia (NLL)
• Digital Library “Lettonica”
– http://www.lnb.lv/en/digital-library
• Linked Open Data [Publishing]
– being added into NLL’s systems
• Examples:
– authority data
– digital object management system
– digital text corpus + named entity database
IMCS, University of Latvia
• Institute of Mathematics and
Computer Science (IMCS)
– http://www.lumii.lv/resource/show/170
• Open Data
– making it easier for people to work with data
(discover, transform, visualize, ...)
– interested in collaboration on open data projects
Make it simpler
• working with data must be
as easy as possible
– *frictionless* (as Rufus says)
• need a data eco-system
– [work with] data → more useful data
= motivation for the Data Journalism /
Data Processing Tool [proposal]
Mirko Lorenz, 2010 – CC BY 2.0 license
http://en.wikipedia.org/wiki/File:Data_driven_journalism_process.jpg
Data Visualization Pipeline (Ben Fry)
via “Speculative Maps & Open Data” talk @ ODW-2013
by Benedikt Groß
Data Processing Tool
• The Idea:
– a tool (or set of tools) covering the whole workflow
• repeatability, provenance, data publishing
– make it easy for people to use open data
• graphical modeling, visualization, natural language
• Data Journalism (one of the use cases)
– discovery
– transformation (clean, filter, integrate, ...)
– interpretation (visualization, ...)
– developing a story
– publishing
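The repeatability and provenance goals above can be illustrated with a minimal sketch. It is not the proposed tool: `run_pipeline`, `clean`, `keep_complete` and the sample rows are all hypothetical names and data invented for this example. Each transformation step is a plain function, and the runner records what every step did so the same result can be reproduced and checked later.

```python
import hashlib
import json


def run_pipeline(rows, steps):
    """Apply each (name, function) step in order and log what it did."""
    provenance = []
    for name, func in steps:
        rows_in = len(rows)
        rows = func(rows)
        # A content hash lets a later run confirm it produced the same data.
        digest = hashlib.sha256(
            json.dumps(rows, sort_keys=True).encode("utf-8")
        ).hexdigest()
        provenance.append(
            {"step": name, "rows_in": rows_in, "rows_out": len(rows), "sha256": digest}
        )
    return rows, provenance


# Hypothetical transformation steps (clean, then filter).
def clean(rows):
    """Normalise city names without modifying the input rows."""
    return [{**r, "city": r["city"].strip().title()} for r in rows]


def keep_complete(rows):
    """Drop rows that have no amount."""
    return [r for r in rows if r["amount"] is not None]


# Made-up sample data, for illustration only.
raw = [
    {"city": " riga ", "amount": 10},
    {"city": "Liepaja", "amount": None},
    {"city": "DAUGAVPILS", "amount": 7},
]

result, provenance = run_pipeline(raw, [("clean", clean), ("filter", keep_complete)])
```

Running the same steps on the same input yields the same provenance log, which is one simple way to make a workflow repeatable.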
Research @ IMCS
• semantic web
– data modeling, mapping RDBMS data to RDF, ...
• network analysis and visualization [tools]
– http://www.slideshare.net/CaptSolo/exploring-the-networks-in-open-public-data-13391338
• computational linguistics
– named entity and relationship extraction
– natural language interfaces
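As a rough illustration of mapping RDBMS data to RDF, the sketch below turns rows of a relational table into N-Triples lines. It is a simplified, direct-style mapping and not necessarily the approach used at IMCS. The `http://example.org/` namespace, the `person` table, its single row, and the function names are assumptions made for this example.

```python
import sqlite3

BASE = "http://example.org/"  # hypothetical namespace for generated IRIs


def escape_literal(text):
    """Escape the characters that must not appear raw in an N-Triples literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def rows_to_ntriples(conn, table, key):
    """Emit one triple per non-null, non-key column of every row.

    The table name is interpolated into the SQL, so it must come from
    trusted code, never from user input.
    """
    conn.row_factory = sqlite3.Row
    triples = []
    for row in conn.execute(f"SELECT * FROM {table}"):
        subject = f"<{BASE}{table}/{row[key]}>"
        for column in row.keys():
            if column == key or row[column] is None:
                continue
            value = escape_literal(str(row[column]))
            triples.append(f'{subject} <{BASE}{table}#{column}> "{value}" .')
    return triples


# Made-up table and row, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT, born INTEGER)")
conn.execute("INSERT INTO person VALUES (1, 'Example Author', 1900)")

triples = rows_to_ntriples(conn, "person", "id")
for line in triples:
    print(line)
```

All values are written as plain string literals here. A fuller mapping would assign datatypes and turn foreign keys into links between resources.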
in the context of Data Web
• important [for the web]:
– data discovery
– data publishing
• publish the data along with the story
– make it easy to publish data as a part of the data
journalism workflow
– make data discoverable for re-use
– [automatically] maintain provenance info
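One way to picture "publish the data along with the story" is a dataset shipped together with a small machine-readable descriptor. The sketch below is only an assumption about what that could look like: `package_dataset`, the descriptor fields, the example URL and the sample rows are hypothetical and do not follow any particular packaging standard.

```python
import csv
import hashlib
import io
import json


def package_dataset(rows, title, source_url, steps):
    """Return the data as CSV text plus a JSON descriptor describing it."""
    fields = list(rows[0].keys())
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    csv_text = buffer.getvalue()

    descriptor = {
        "title": title,        # helps others discover the data
        "source": source_url,  # where the raw data came from
        "fields": fields,
        "rows": len(rows),
        "sha256": hashlib.sha256(csv_text.encode("utf-8")).hexdigest(),
        "steps": steps,        # provenance: what was done to the raw data
    }
    return csv_text, json.dumps(descriptor, indent=2, ensure_ascii=False)


# Made-up data and source URL, for illustration only.
csv_text, descriptor_json = package_dataset(
    [{"city": "Riga", "amount": 10}, {"city": "Daugavpils", "amount": 7}],
    title="Example figures behind a story",
    source_url="http://example.org/raw-data",
    steps=["clean", "filter"],
)
```

Because the descriptor is generated by the same code that writes the data, the provenance information stays in sync with what is published, without extra effort from the journalist.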
More info
• Uldis Bojārs - @CaptSolo
uldis.bojars@gmail.com
• National Library of Latvia
http://www.lnb.lv/en/digital-library
• IMCS: Exploring the Networks in Open Public Data
http://www.slideshare.net/CaptSolo/exploring-the-networks-in-open-public-data-13391338
Data Journalism Tool proposal in progress,
get in touch for more info

Editor's Notes

  1. Data-driven journalism is a journalistic process based on analyzing and filtering large data sets for the purpose of creating a new story. Data-driven journalism deals with open data that is freely available online and analyzed with open source tools. DDJ is one of the motivating use cases for the tool.
  2. We know data needs to be published along with the story, but [almost] nobody is doing that.