Open Spatial Data
Progress towards a reusable gazetteer
        Open Data Group – 16th April 2012
                   @ianibbo



This work is licensed under a Creative Commons Attribution 3.0 Unported License.
Overview
Original Problem
  How to transition a central-government-funded aggregation of
   childcare and positive activities, with a budget of
   >2m / year, to an open data* model running on £60/month
   hardware
     Retaining security (of a certain level)
     Retaining functionality
(See http://www.madwdata.org.uk/blog/id/394)
2 Major Costs To Mitigate
Large cluster of proprietary OS hosts, ~12 front-end
 web servers, hot backup SQL server
  Migrated to 1 × Pound Host server, ~£60/month; server
   has 2 hard drives, hot backup, off-site rsync
Data costs – BPH Address-Point data – Used for
 geocoding incoming records and lookups on
 search terms. OS Boundary Line
  ???
Some Noise
Open Spatial Data Consultation...
Open Spatial Data
Ordnance Survey Open Data
  http://www.ordnancesurvey.co.uk/oswebsite/products/os-lo
Code-Point Open
  Postcodes to Northing/Easting
OS Locator
  Gazetteer of road names (And other features)
Obtained by registering on the website, requesting the data,
 getting an email, following a link, ...
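
As a minimal sketch of what a local postcode lookup over this kind of data could look like, the snippet below builds an in-memory postcode → (easting, northing) map from CSV text. The column order (postcode, easting, northing) and the sample rows are assumptions made for illustration; the real files carry more columns and should be checked against the documentation shipped with the download.

```python
import csv
import io


def normalise_postcode(raw):
    """Upper-case and strip all whitespace so 'zz1 1aa' and 'ZZ11AA' match."""
    return "".join(raw.split()).upper()


def load_postcodes(csv_text):
    """Build {postcode: (easting, northing)} from CSV text.

    Assumed (hypothetical) column order: postcode, easting, northing.
    """
    lookup = {}
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) < 3:
            continue  # skip blank or malformed rows
        postcode, easting, northing = row[0], row[1], row[2]
        lookup[normalise_postcode(postcode)] = (int(easting), int(northing))
    return lookup


# Made-up sample rows, not real postcode data.
SAMPLE = "ZZ1 1AA,435400,387300\nZZ2 2BB,433800,387200\n"

postcodes = load_postcodes(SAMPLE)
print(postcodes.get(normalise_postcode("zz1 1aa")))
```

Normalising the postcode on both load and lookup avoids misses caused by spacing or case differences in search terms.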
The reality of Code-Point Open
The core data is “Open”
Missing the one vital link between Code-Point
 Open and OS Locator – Postcode → Road
 Names / Identifiers.
If you're happy to display postcodes without road
   names, it's ideal.
Last Mile Problem.
Finding an automated way to link the two is hard!
Licensed data is now open, but out of date
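
One naive way to approximate the missing postcode → road link is a nearest-neighbour match on grid coordinates: for each postcode point, pick the closest road. This is only a sketch, assuming each road is reduced to a single representative point (the records below are hypothetical). Real roads are long, so the nearest centre point is often the wrong road, which is part of why the automated link is hard.

```python
import math


def nearest_road(postcode_point, roads):
    """Return (road_name, distance_in_metres) of the road closest to the point.

    postcode_point: (easting, northing) in metres.
    roads: list of (name, easting, northing); one point per road is an
           assumption made for this sketch.
    """
    e, n = postcode_point
    best = min(roads, key=lambda r: math.hypot(r[1] - e, r[2] - n))
    return best[0], math.hypot(best[1] - e, best[2] - n)


# Hypothetical road centre points, not real gazetteer records.
ROADS = [
    ("HIGH STREET", 1000, 1000),
    ("CHURCH LANE", 1300, 1400),
    ("STATION ROAD", 2000, 1000),
]

name, dist = nearest_road((1290, 1390), ROADS)
print(name, round(dist, 1))
```

A linear scan like this is fine for a handful of roads; a full national dataset would likely need a spatial index.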
Address Point
Still Licensed
Expensive
Probably not that useful anyway for most projects
Problem with focus on “Open Data”
Everyone ends up implementing their own
 gazetteer
Large scale providers have rate limits and
 introduce external dependencies / Speed
 issues
People want local geo-coding (for lots of different
 reasons).
Having rolled your own gazetteer, you discover
 you need to handle updates (Full replacements)
It's not an end in itself
Vision
A stand-alone gazetteer web app designed for
  local network use with features for importing
  updates from OS, reconciling multiple data
  sources and performing geo-coding lookups.
Available Tools
Apache SOLR
  Long-standing stalwart of the open data and search
    community
  Schemas slightly clunky
  Several spatial options, all with different strengths /
   weaknesses. Multiple points per record are a problem in some.
ElasticSearch
  Schema-free, apparently solid spatial support, multiple points
  Good integration with Mongo via Rivers
Problems / Issues
ES spatial search is hard to do directly via a “cool”
 URL
  Spatial query syntax is expressive, but complex, and
   needs JSON sub-documents
Need service wrappers
But that's easily done
Updates!
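
A service wrapper of the kind mentioned above could be as small as a function that turns flat URL parameters into the nested JSON body. The sketch below only builds the request body and does not talk to a server. The field name `location`, the parameter names and the example URL are hypothetical, and the bool/filter layout follows the current ElasticSearch query DSL, which may differ from the version in use at the time of the talk.

```python
import json


def geo_distance_query(lat, lon, distance="1km", field="location"):
    """Build an ElasticSearch-style geo_distance request body.

    `field` is a hypothetical name for a geo_point field.
    """
    return {
        "query": {
            "bool": {
                "must": {"match_all": {}},
                "filter": {
                    "geo_distance": {
                        "distance": distance,
                        field: {"lat": lat, "lon": lon},
                    }
                },
            }
        }
    }


def query_from_params(params):
    """Translate flat URL-style parameters into the nested JSON body."""
    return geo_distance_query(
        lat=float(params["lat"]),
        lon=float(params["lon"]),
        distance=params.get("within", "1km"),
    )


# e.g. a request such as /search?lat=53.38&lon=-1.47&within=500m (hypothetical URL)
body = query_from_params({"lat": "53.38", "lon": "-1.47", "within": "500m"})
print(json.dumps(body, indent=2))
```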
Missed Level of Abstraction
(Common to many open data sets?)

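[Diagram: boxes labelled Source, Compare and Processing, with Local Copy above Compare. Annotations: "NoSQL like Mongo is ideal for this" (beneath Source / Compare) and "ES ideal for this" (beneath Processing).]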
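
The compare step in the diagram can be sketched as a diff between the local copy and a fresh full-replacement release, so that only added, changed or removed records are passed on for processing. The record ids and fields below are hypothetical, and plain dictionaries stand in for whatever store holds the local copy.

```python
def diff_snapshots(local, incoming):
    """Compare two {record_id: record} snapshots.

    Returns (added, changed, removed) as sorted lists of record ids.
    """
    local_ids, incoming_ids = set(local), set(incoming)
    added = sorted(incoming_ids - local_ids)
    removed = sorted(local_ids - incoming_ids)
    changed = sorted(
        rid for rid in local_ids & incoming_ids if local[rid] != incoming[rid]
    )
    return added, changed, removed


# Hypothetical records keyed by a made-up id.
LOCAL = {
    "r1": {"name": "HIGH STREET", "e": 1000, "n": 1000},
    "r2": {"name": "CHURCH LANE", "e": 1300, "n": 1400},
}
INCOMING = {
    "r2": {"name": "CHURCH LANE", "e": 1305, "n": 1400},  # moved slightly
    "r3": {"name": "STATION ROAD", "e": 2000, "n": 1000},  # new record
}

added, changed, removed = diff_snapshots(LOCAL, INCOMING)
print(added, changed, removed)
```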
Progress
Starting to extract code from existing services
 into a generic spatial app
https://github.com/ianibo/AnOpenGazetteerFramewo
Work progressing under aegis of GIST Mobile
 group / Open Data group
Workable gazetteer now, but only a command-line interface
 for importing.
Questions / Comments?
Some supporting info
Original Project – FOI request to DfE
[Chart: Total costs - First 3 years (2008-09, 2009-10, 2010-11); vertical axis 0 to 7,000,000. Series: Local Authority Revenue, Local Authority Capital, Central Office of Information, Qi Consulting, Redhouse, DfE Staff Costs, Consultation seminars, Methods Consulting, Engine Group, Digital Public, Tribal Education.]
[Chart: First 3 years - Non LA costs (2008-09, 2009-10, 2010-11); vertical axis 0 to 2,500,000. Series: Central Office of Information, Qi Consulting, Redhouse, DfE Staff Costs, Consultation seminars, Methods Consulting, Engine Group, Digital Public, Tribal Education.]
