SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
5/31/2011




Consuming Open and
Linked Data with Open
    Source Tools
Or, how to do something useful
     with all this free data



    Jo Cook, Geospatial Consultant, Astun Technology
Talk Outline

 ● The data and the distribution technologies
   explained
 ● The software stack explained
 ● Actually getting some data and doing something
   with it
 ● Gotchas
 ● Further resources and links
The Data and Technologies
The Data and Technologies
The Data and Technologies




           Point in Latitude/Longitude format (WGS84)
The Data and Technologies
The Data and Technologies
The Data and Technologies
PREFIX naptan: <http://transport.data.gov.uk/def/naptan/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#>
SELECT DISTINCT ?stopPoint_airport ?name ?iataRef ?lat ?long
WHERE {
  ?item a naptan:StopArea ;
   naptan:administrativeArea <http://transport.data.gov.uk/id/administrative-
area/145> ;
   skos:prefLabel ?name ;
   geo:lat ?lat ;
   geo:long ?long ;
   naptan:stopPoint [ naptan:airport ?stopPoint_airport ] .
  ?stopPoint_airport naptan:iataRef ?iataRef .
}
The Data and Technologies
The Data and Technologies
The Data and Technologies




http://wallpapertree.com/wallpaper/Sad-Kitten/
The Data and Technologies
The Data and Technologies
The Data and Technologies
The Data and Technologies




http://freeimagesarchive.com
The Data and Technologies
The Data and Technologies




1m4ge.blogspot.com
The Data and Technologies
The Data and Technologies
Actually doing something with the data
Actually doing something with the data
Gotchas




smashingmagazine.com
Gotchas
Mitigation



                    Keep your data here so you're
                    not relying directly on the service
                    for updates to customers




(Maybe even
keep it here too)
Conclusions
A dash of all these, some readily
available python libraries, and
some relatively simple scripting
gets you useful data for your own
applications...

... and by using PostgreSQL to
store your data, you can provide
a buffer in case of api changes or
outages...

... and even do some nifty
manipulation to analyse and
present the data differently!
Conclusions




        Simples!
Conclusions
Not *that* simple, but...
Resources
Linked Data Resources and Example Queries

Best API for transport data: http://openuplabs.tso.co.uk/
O'Reilly Book: http://www.learningsparql.com/
Blog: http://talis-systems.com/blog/
Airport Query: http://getthedata.org/questions/137/where-can-i-get-a-list-of-all-airports-codes-names-locations
Beginner Guide: http://www.pezholio.co.uk/2011/01/a-beginners-guide-to-sparqling-linked-data-part-1/

NHS Choices- RSS

http://www.nhs.uk/servicedirectories/Pages/ServiceSearch.aspx

Police Data- JSON/REST

API: http://policeapi2.rkh.co.uk/api/docs/
CSV/SHP: http://www.police.uk/data

Basic Software Resources

Mapserver dynamic charting: http://mapserver.org/output/dynamic_charting.html
Python SparQLWrapper: http://sparql-wrapper.sourceforge.net/
Python SimpleJSON: https://github.com/simplejson/simplejson
Python PygreSQL: http://www.pygresql.org/
Loader: https://github.com/AstunTechnology/Loader
Thank You!




         www.archaeogeek.com
         @archaeogeek
         jocook@astuntechnology.com

Contenu connexe

Similaire à Consuming open and linked data with open source tools

Flightlogg.in'
Flightlogg.in'Flightlogg.in'
Flightlogg.in'
xx xx
 
Osgis2011 edina addy_pope
Osgis2011 edina addy_popeOsgis2011 edina addy_pope
Osgis2011 edina addy_pope
Addy Pope
 
PowerPoint
PowerPointPowerPoint
PowerPoint
Videoguy
 
«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...
«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...
«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...
it-people
 

Similaire à Consuming open and linked data with open source tools (20)

SplunkLive! Frankfurt 2018 - Data Onboarding Overview
SplunkLive! Frankfurt 2018 - Data Onboarding OverviewSplunkLive! Frankfurt 2018 - Data Onboarding Overview
SplunkLive! Frankfurt 2018 - Data Onboarding Overview
 
SplunkLive! Munich 2018: Data Onboarding Overview
SplunkLive! Munich 2018: Data Onboarding OverviewSplunkLive! Munich 2018: Data Onboarding Overview
SplunkLive! Munich 2018: Data Onboarding Overview
 
Lyft talks #4 Orchestrating big data and ML pipelines at Lyft
Lyft talks #4 Orchestrating big data and ML pipelines at LyftLyft talks #4 Orchestrating big data and ML pipelines at Lyft
Lyft talks #4 Orchestrating big data and ML pipelines at Lyft
 
Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...
 
Data Science with Spark
Data Science with SparkData Science with Spark
Data Science with Spark
 
Accessibility, Automation and Metadata
Accessibility, Automation and MetadataAccessibility, Automation and Metadata
Accessibility, Automation and Metadata
 
Organizing the Data Chaos of Scientists
Organizing the Data Chaos of ScientistsOrganizing the Data Chaos of Scientists
Organizing the Data Chaos of Scientists
 
Big Data Meetup #7
Big Data Meetup #7Big Data Meetup #7
Big Data Meetup #7
 
ITS World Congress 2014 - Open Data in Public Transport: Challenges and Oppor...
ITS World Congress 2014 - Open Data in Public Transport: Challenges and Oppor...ITS World Congress 2014 - Open Data in Public Transport: Challenges and Oppor...
ITS World Congress 2014 - Open Data in Public Transport: Challenges and Oppor...
 
DataFinder: A Python Application for Scientific Data Management
DataFinder: A Python Application for Scientific Data ManagementDataFinder: A Python Application for Scientific Data Management
DataFinder: A Python Application for Scientific Data Management
 
Samsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of PythonSamsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of Python
 
Flightlogg.in'
Flightlogg.in'Flightlogg.in'
Flightlogg.in'
 
Maximizing Your Data’s Potential: DOTs & DPWs Edition
Maximizing Your Data’s Potential: DOTs & DPWs EditionMaximizing Your Data’s Potential: DOTs & DPWs Edition
Maximizing Your Data’s Potential: DOTs & DPWs Edition
 
Osgis2011 edina addy_pope
Osgis2011 edina addy_popeOsgis2011 edina addy_pope
Osgis2011 edina addy_pope
 
Osgis2011 edina addy_pope
Osgis2011 edina addy_popeOsgis2011 edina addy_pope
Osgis2011 edina addy_pope
 
PowerPoint
PowerPointPowerPoint
PowerPoint
 
Enabling Data centric Teams
Enabling Data centric TeamsEnabling Data centric Teams
Enabling Data centric Teams
 
FIWARE Wednesday Webinars - Machine Learning with Cosmos and Spark
FIWARE Wednesday Webinars - Machine Learning with Cosmos and SparkFIWARE Wednesday Webinars - Machine Learning with Cosmos and Spark
FIWARE Wednesday Webinars - Machine Learning with Cosmos and Spark
 
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
 
«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...
«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...
«Что такое serverless-архитектура и как с ней жить?» Николай Марков, Aligned ...
 

Plus de Joanne Cook

Introduction to OSGeo:UK
Introduction to OSGeo:UKIntroduction to OSGeo:UK
Introduction to OSGeo:UK
Joanne Cook
 
The Business Case for Open Source GIS
The Business Case for Open Source GISThe Business Case for Open Source GIS
The Business Case for Open Source GIS
Joanne Cook
 
Open Source and Open Data
Open Source and Open DataOpen Source and Open Data
Open Source and Open Data
Joanne Cook
 
Osgis 2010 notes
Osgis 2010 notesOsgis 2010 notes
Osgis 2010 notes
Joanne Cook
 
Lancaster University GIS Course 2010
Lancaster University GIS Course 2010Lancaster University GIS Course 2010
Lancaster University GIS Course 2010
Joanne Cook
 
Databases and web mapping the Open Source way
Databases and web mapping the Open Source wayDatabases and web mapping the Open Source way
Databases and web mapping the Open Source way
Joanne Cook
 
The Impact of Open Source
The Impact of Open SourceThe Impact of Open Source
The Impact of Open Source
Joanne Cook
 
Open Source GIS for Local Government
Open Source GIS for Local GovernmentOpen Source GIS for Local Government
Open Source GIS for Local Government
Joanne Cook
 

Plus de Joanne Cook (20)

Intro to the Open Source Geospatial Foundation
Intro to the Open Source Geospatial FoundationIntro to the Open Source Geospatial Foundation
Intro to the Open Source Geospatial Foundation
 
Intro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GISIntro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GIS
 
Introduction to OSGeo:UK
Introduction to OSGeo:UKIntroduction to OSGeo:UK
Introduction to OSGeo:UK
 
The Business Case for Open Source GIS
The Business Case for Open Source GISThe Business Case for Open Source GIS
The Business Case for Open Source GIS
 
Open Source and Open Data
Open Source and Open DataOpen Source and Open Data
Open Source and Open Data
 
AGI 2010 Notes
AGI 2010 NotesAGI 2010 Notes
AGI 2010 Notes
 
AGI 2010: It's all one big opportunity
AGI 2010: It's all one big opportunityAGI 2010: It's all one big opportunity
AGI 2010: It's all one big opportunity
 
Worskhop OSGIS2010
Worskhop OSGIS2010Worskhop OSGIS2010
Worskhop OSGIS2010
 
Osgis 2010 notes
Osgis 2010 notesOsgis 2010 notes
Osgis 2010 notes
 
Worskhop Leicester 2010
Worskhop Leicester 2010Worskhop Leicester 2010
Worskhop Leicester 2010
 
Lancaster University GIS Course 2010
Lancaster University GIS Course 2010Lancaster University GIS Course 2010
Lancaster University GIS Course 2010
 
Databases and web mapping the Open Source way
Databases and web mapping the Open Source wayDatabases and web mapping the Open Source way
Databases and web mapping the Open Source way
 
How archaeologists use GIS
How archaeologists use GISHow archaeologists use GIS
How archaeologists use GIS
 
The Impact of Open Source
The Impact of Open SourceThe Impact of Open Source
The Impact of Open Source
 
Open Source GIS for Local Government
Open Source GIS for Local GovernmentOpen Source GIS for Local Government
Open Source GIS for Local Government
 
Bcs Talk Notes
Bcs Talk NotesBcs Talk Notes
Bcs Talk Notes
 
Gateway Seminar
Gateway SeminarGateway Seminar
Gateway Seminar
 
Foss4g Portable Gis
Foss4g Portable GisFoss4g Portable Gis
Foss4g Portable Gis
 
Bcs Talk Notes
Bcs Talk NotesBcs Talk Notes
Bcs Talk Notes
 
Agi 2008
Agi 2008Agi 2008
Agi 2008
 

Dernier

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Consuming open and linked data with open source tools

  • 1. 5/31/2011 Consuming Open and Linked Data with Open Source Tools Or, how to do something useful with all this free data Jo Cook, Geospatial Consultant, Astun Technology
  • 2. Talk Outline ● The data and the distribution technologies explained ● The software stack explained ● Actually getting some data and doing something with it ● Gotchas ● Further resources and links
  • 3. The Data and Technologies
  • 4. The Data and Technologies
  • 5. The Data and Technologies Point in Latitude/Longitude format (WGS84)
  • 6. The Data and Technologies
  • 7. The Data and Technologies
  • 8. The Data and Technologies PREFIX naptan: <http://transport.data.gov.uk/def/naptan/> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX skos: <http://www.w3.org/2004/02/skos/core#> PREFIX geo: <http://www.w3.org/2003/01/geo/wgs84_pos#> SELECT DISTINCT ?stopPoint_airport ?name ?iataRef ?lat ?long WHERE { ?item a naptan:StopArea ; naptan:administrativeArea <http://transport.data.gov.uk/id/administrative- area/145> ; skos:prefLabel ?name ; geo:lat ?lat ; geo:long ?long ; naptan:stopPoint [ naptan:airport ?stopPoint_airport ] . ?stopPoint_airport naptan:iataRef ?iataRef . }
  • 9. The Data and Technologies
  • 10. The Data and Technologies
  • 11. The Data and Technologies http://wallpapertree.com/wallpaper/Sad-Kitten/
  • 12. The Data and Technologies
  • 13. The Data and Technologies
  • 14. The Data and Technologies
  • 15. The Data and Technologies http://freeimagesarchive.com
  • 16. The Data and Technologies
  • 17. The Data and Technologies 1m4ge.blogspot.com
  • 18. The Data and Technologies
  • 19. The Data and Technologies
  • 20. Actually doing something with the data
  • 21. Actually doing something with the data
  • 24. Mitigation Keep your data here so you're not relying directly on the service for updates to customers (Maybe even keep it here too)
  • 25. Conclusions A dash of all these, some readily available python libraries, and some relatively simple scripting gets you useful data for your own applications... ... and by using PostgreSQL to store your data, you can provide a buffer in case of api changes or outages... ... and even do some nifty manipulation to analyse and present the data differently!
  • 26. Conclusions Simples!
  • 28. Resources Linked Data Resources and Example Queries Best API for transport data: http://openuplabs.tso.co.uk/ O'Reilly Book: http://www.learningsparql.com/ Blog: http://talis-systems.com/blog/ Airport Query: http://getthedata.org/questions/137/where-can-i-get-a-list-of-all-airports-codes-names-locations Beginner Guide: http://www.pezholio.co.uk/2011/01/a-beginners-guide-to-sparqling-linked-data-part-1/ NHS Choices- RSS http://www.nhs.uk/servicedirectories/Pages/ServiceSearch.aspx Police Data- JSON/REST API: http://policeapi2.rkh.co.uk/api/docs/ CSV/SHP: http://www.police.uk/data Basic Software Resources Mapserver dynamic charting: http://mapserver.org/output/dynamic_charting.html Python SparQLWrapper: http://sparql-wrapper.sourceforge.net/ Python SimpleJSON: https://github.com/simplejson/simplejson Python PygreSQL: http://www.pygresql.org/ Loader: https://github.com/AstunTechnology/Loader
  • 29. Thank You! www.archaeogeek.com @archaeogeek jocook@astuntechnology.com