SlideShare une entreprise Scribd logo
1  sur  68
Get Started With Open Data

             Tony Hirst
Dept of Communication and Systems,
        The Open University
So what do
 we mean by
“OPEN DATA”
You are free to:
- copy, publish, distribute and transmit
the Information;
- adapt the Information;
- exploit the Information commercially
for example, by combining it with other
Information, or by including it in your
own product or application
You must:
- acknowledge the source of the Information by including
any attribution statement specified by the Information
Provider(s) and, where possible, provide a link to this
licence;
- ensure that you do not use the Information in a way that
suggests any official status;
- ensure that you do not mislead others or misrepresent
the Information or its source;
- ensure that your use of the Information does not breach
the Data Protection Act 1998 or the Privacy and
Electronic Communications (EC Directive) Regs 2003.
Exemptions:
- personal data;
- Information that has neither been published
nor disclosed under information access
legislation (FOI) by or with the consent of the
Information Provider;
- departmental or public sector organisation
logos, crests etc;
- third party rights the Information Provider is
not authorised to license;
- Information subject to other IPR
Availability and Access

Reuse and Redistribution

Universal Participation
      The Open Knowledge Foundation
Availability and Access: the data must
be available as a whole and at no more
than a reasonable reproduction
cost, preferably by downloading over
the internet. The data must also be
available in a convenient and
modifiable form.

          The Open Knowledge Foundation
Reuse and Redistribution: the data
must be provided under terms that
permit reuse and redistribution
including the intermixing with other
datasets.


         The Open Knowledge Foundation
Universal Participation: everyone must be able
to use, reuse and redistribute – there should
be no discrimination against fields of
endeavour or against persons or groups. For
example, ‘non-commercial’ restrictions that
would prevent ‘commercial’ use, or restrictions
of use for certain purposes (e.g. only in
education), are not allowed.


            The Open Knowledge Foundation
/via http://antictrl.com/chapter-3-2-regulability-of-the-internet/
FOI
         Licensing      exemptions

                                   Data
    Paywalls                  protection Act

Authentication
                     DATA        “Privacy”

       Crappy                      Closed
    spreadsheets                 standards


            PDFs        Messy Data
Right to access data
So where’s
 the data?
“First” generation:
 data catalogues
Breathing life
 into data…
=importData(“CSV_URL”)
the spreadsheet becomes

A DATABASE
“Second” generation:
 data management
      systems
Digging for
  data…
There’s lots more
data that’s locked
up in web pages…
Scraping…
“grabbing web content
in a machine readable
   format and then
 processing it for your
    own purposes”
Original      Extract
                          Accessible
HTML web    Information
                          web page
  page         -> data
Recreating the
database that was used
     to populate a
   (templated) page
“Creating”
   Data
[Disruptive
Innovation?]
Company




          Director
           Director
             Director
               Director




                          Company
                           Company
                            Company
                             Company
Barriers
 to Use
- Character string dates
             - Erratic whitespace
             - Arbitrary separators
             - Excel Dates
Also:
- month overflows at week end
- year overflows
Open
is as open
  does… DATA
@psychemedia

blog.ouseful.info

Contenu connexe

Tendances

The Knowledge Discovery Quest
The Knowledge Discovery Quest The Knowledge Discovery Quest
The Knowledge Discovery Quest Ontotext
 
Designing and developing vocabularies in RDF
Designing and developing vocabularies in RDFDesigning and developing vocabularies in RDF
Designing and developing vocabularies in RDFOpen Data Support
 
Open Data Support - Service Description
Open Data Support - Service DescriptionOpen Data Support - Service Description
Open Data Support - Service DescriptionOpen Data Support
 
Cooking up the Semantic Web
Cooking up the Semantic WebCooking up the Semantic Web
Cooking up the Semantic WebOntotext
 
Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...
Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...
Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...Richard Wallis
 
Data Liberation - Tony Hirst
Data Liberation - Tony HirstData Liberation - Tony Hirst
Data Liberation - Tony HirstIncisive_Events
 
The magical world of dde
The magical world of ddeThe magical world of dde
The magical world of ddeJane Finnis
 
Design and manage persistent URIs
Design and manage persistent URIsDesign and manage persistent URIs
Design and manage persistent URIsOpen Data Support
 
TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...
TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...
TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...TSO
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in LibrariesCarl Hess
 
RDF: what and why plus a SPARQL tutorial
RDF: what and why plus a SPARQL tutorialRDF: what and why plus a SPARQL tutorial
RDF: what and why plus a SPARQL tutorialJerven Bolleman
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web dataGeorg Guentner
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital librariesSören Auer
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveJanifer Gatenby
 

Tendances (20)

Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
The Knowledge Discovery Quest
The Knowledge Discovery Quest The Knowledge Discovery Quest
The Knowledge Discovery Quest
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
Designing and developing vocabularies in RDF
Designing and developing vocabularies in RDFDesigning and developing vocabularies in RDF
Designing and developing vocabularies in RDF
 
Open Data Support - Service Description
Open Data Support - Service DescriptionOpen Data Support - Service Description
Open Data Support - Service Description
 
Cooking up the Semantic Web
Cooking up the Semantic WebCooking up the Semantic Web
Cooking up the Semantic Web
 
Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...
Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...
Web 3.0 / Semantic Web: What it means for academic users, libraries and publi...
 
Presentation
PresentationPresentation
Presentation
 
Data Liberation - Tony Hirst
Data Liberation - Tony HirstData Liberation - Tony Hirst
Data Liberation - Tony Hirst
 
The magical world of dde
The magical world of ddeThe magical world of dde
The magical world of dde
 
Design and manage persistent URIs
Design and manage persistent URIsDesign and manage persistent URIs
Design and manage persistent URIs
 
TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...
TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...
TSO Semantic Discoverability - at UK Gov Linked Data - by Richard Goodwin TSO...
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in Libraries
 
Data & metadata licensing
Data & metadata licensingData & metadata licensing
Data & metadata licensing
 
RDF: what and why plus a SPARQL tutorial
RDF: what and why plus a SPARQL tutorialRDF: what and why plus a SPARQL tutorial
RDF: what and why plus a SPARQL tutorial
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
The open semantic enterprise enterprise data meets web data
The open semantic enterprise   enterprise data meets web dataThe open semantic enterprise   enterprise data meets web data
The open semantic enterprise enterprise data meets web data
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspective
 

Similaire à Get Started With Open Data: Availability, Reuse and Universal Participation

Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information CommonsData Sharing and the Polar Information Commons
Data Sharing and the Polar Information CommonsKaitlin Thaney
 
Slide share cloudx_counsel ppt
Slide share cloudx_counsel pptSlide share cloudx_counsel ppt
Slide share cloudx_counsel pptMark Sanders
 
Open data 4 startups (2°edition)
Open data 4 startups (2°edition)Open data 4 startups (2°edition)
Open data 4 startups (2°edition)TOP-IX Consortium
 
Watson data platform_sofia_20171017
Watson data platform_sofia_20171017Watson data platform_sofia_20171017
Watson data platform_sofia_20171017Mladen Jovanovski
 
Data Sharing: Social and Normative - ISWC
Data Sharing: Social and Normative - ISWCData Sharing: Social and Normative - ISWC
Data Sharing: Social and Normative - ISWCKaitlin Thaney
 
Data Science: Harnessing Open Data for High Impact Solutions
Data Science: Harnessing Open Data for High Impact SolutionsData Science: Harnessing Open Data for High Impact Solutions
Data Science: Harnessing Open Data for High Impact SolutionsMohd Izhar Firdaus Ismail
 
The challenges of building a strong data infrastructure
The challenges of building a strong data infrastructureThe challenges of building a strong data infrastructure
The challenges of building a strong data infrastructureJeni Tennison
 
Open Data and Artificial Intelligence
Open Data and Artificial IntelligenceOpen Data and Artificial Intelligence
Open Data and Artificial IntelligenceOpen Knowledge Nepal
 
Unit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfUnit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfRanjeet Bhalshankar
 
Milwaukee data initiative gis_day
Milwaukee data initiative gis_dayMilwaukee data initiative gis_day
Milwaukee data initiative gis_dayMatt Richardson
 
Calhoun Data Sharing Panel IFLA Aug 2008
Calhoun Data Sharing Panel IFLA  Aug 2008Calhoun Data Sharing Panel IFLA  Aug 2008
Calhoun Data Sharing Panel IFLA Aug 2008Karen S Calhoun
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
Von Open Data zu Linked Open Data, M. Kaltenböck, SWC
Von Open Data zu Linked Open Data, M. Kaltenböck, SWCVon Open Data zu Linked Open Data, M. Kaltenböck, SWC
Von Open Data zu Linked Open Data, M. Kaltenböck, SWCMartin Kaltenböck
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data Shallote Dsouza
 
Data Mining Challenges
Data Mining ChallengesData Mining Challenges
Data Mining ChallengesRepustate
 
Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014
Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014
Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014kevintsmith
 
Governmental Linked Data
Governmental Linked DataGovernmental Linked Data
Governmental Linked DataHaklae Kim
 
Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And FootballAmanda Gray
 

Similaire à Get Started With Open Data: Availability, Reuse and Universal Participation (20)

Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information CommonsData Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
 
Slide share cloudx_counsel ppt
Slide share cloudx_counsel pptSlide share cloudx_counsel ppt
Slide share cloudx_counsel ppt
 
Open data 4 startups (2°edition)
Open data 4 startups (2°edition)Open data 4 startups (2°edition)
Open data 4 startups (2°edition)
 
Watson data platform_sofia_20171017
Watson data platform_sofia_20171017Watson data platform_sofia_20171017
Watson data platform_sofia_20171017
 
Data Sharing: Social and Normative - ISWC
Data Sharing: Social and Normative - ISWCData Sharing: Social and Normative - ISWC
Data Sharing: Social and Normative - ISWC
 
Data Science: Harnessing Open Data for High Impact Solutions
Data Science: Harnessing Open Data for High Impact SolutionsData Science: Harnessing Open Data for High Impact Solutions
Data Science: Harnessing Open Data for High Impact Solutions
 
The challenges of building a strong data infrastructure
The challenges of building a strong data infrastructureThe challenges of building a strong data infrastructure
The challenges of building a strong data infrastructure
 
Open Data and Artificial Intelligence
Open Data and Artificial IntelligenceOpen Data and Artificial Intelligence
Open Data and Artificial Intelligence
 
Unit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfUnit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdf
 
Milwaukee data initiative gis_day
Milwaukee data initiative gis_dayMilwaukee data initiative gis_day
Milwaukee data initiative gis_day
 
Calhoun Data Sharing Panel IFLA Aug 2008
Calhoun Data Sharing Panel IFLA  Aug 2008Calhoun Data Sharing Panel IFLA  Aug 2008
Calhoun Data Sharing Panel IFLA Aug 2008
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
Von Open Data zu Linked Open Data, M. Kaltenböck, SWC
Von Open Data zu Linked Open Data, M. Kaltenböck, SWCVon Open Data zu Linked Open Data, M. Kaltenböck, SWC
Von Open Data zu Linked Open Data, M. Kaltenböck, SWC
 
Broad Data
Broad DataBroad Data
Broad Data
 
Big Data: Big Issues for IP
Big Data: Big Issues for IPBig Data: Big Issues for IP
Big Data: Big Issues for IP
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data
 
Data Mining Challenges
Data Mining ChallengesData Mining Challenges
Data Mining Challenges
 
Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014
Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014
Big Data Security and Privacy - Presentation to AFCEA Cyber Symposium 2014
 
Governmental Linked Data
Governmental Linked DataGovernmental Linked Data
Governmental Linked Data
 
Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And Football
 

Plus de Tony Hirst

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiestaTony Hirst
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptxTony Hirst
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptxTony Hirst
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacksTony Hirst
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriateTony Hirst
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriateTony Hirst
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyterTony Hirst
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2Tony Hirst
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopTony Hirst
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireTony Hirst
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interestTony Hirst
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXTony Hirst
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefineTony Hirst
 
Conversations with data
Conversations with dataConversations with data
Conversations with dataTony Hirst
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingoTony Hirst
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Tony Hirst
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalismTony Hirst
 

Plus de Tony Hirst (20)

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiesta
 
Dev8d jupyter
Dev8d jupyterDev8d jupyter
Dev8d jupyter
 
Ili 16 robot
Ili 16 robotIli 16 robot
Ili 16 robot
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptx
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptx
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacks
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyter
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 Workshop
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wire
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interest
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKX
 
Week4
Week4Week4
Week4
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefine
 
Conversations with data
Conversations with dataConversations with data
Conversations with data
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingo
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalism
 

Get Started With Open Data: Availability, Reuse and Universal Participation

  • 1. Get Started With Open Data Tony Hirst Dept of Communication and Systems, The Open University
  • 2. So what do we mean by “OPEN DATA”
  • 3.
  • 4. You are free to: - copy, publish, distribute and transmit the Information; - adapt the Information; - exploit the Information commercially for example, by combining it with other Information, or by including it in your own product or application
  • 5. You must: - acknowledge the source of the Information by including any attribution statement specified by the Information Provider(s) and, where possible, provide a link to this licence; - ensure that you do not use the Information in a way that suggests any official status; - ensure that you do not mislead others or misrepresent the Information or its source; - ensure that your use of the Information does not breach the Data Protection Act 1998 or the Privacy and Electronic Communications (EC Directive) Regs 2003.
  • 6. Exemptions: - personal data; - Information that has neither been published nor disclosed under information access legislation (FOI) by or with the consent of the Information Provider; - departmental or public sector organisation logos, crests etc; - third party rights the Information Provider is not authorised to license; - Information subject to other IPR
  • 7. Availability and Access Reuse and Redistribution Universal Participation The Open Knowledge Foundation
  • 8. Availability and Access: the data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet. The data must also be available in a convenient and modifiable form. The Open Knowledge Foundation
  • 9. Reuse and Redistribution: the data must be provided under terms that permit reuse and redistribution including the intermixing with other datasets. The Open Knowledge Foundation
  • 10. Universal Participation: everyone must be able to use, reuse and redistribute – there should be no discrimination against fields of endeavour or against persons or groups. For example, ‘non-commercial’ restrictions that would prevent ‘commercial’ use, or restrictions of use for certain purposes (e.g. only in education), are not allowed. The Open Knowledge Foundation
  • 12. FOI Licensing exemptions Data Paywalls protection Act Authentication DATA “Privacy” Crappy Closed spreadsheets standards PDFs Messy Data
  • 13.
  • 14.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 33.
  • 34.
  • 35.
  • 36. “Second” generation: data management systems
  • 37.
  • 38. Digging for data…
  • 39.
  • 40. There’s lots more data that’s locked up in web pages…
  • 42. “grabbing web content in a machine readable format and then processing it for your own purposes”
  • 43. Original Extract Accessible HTML web Information web page page -> data
  • 44.
  • 45.
  • 46.
  • 47. Recreating the database that was used to populate a (templated) page
  • 48.
  • 49.
  • 50.
  • 51.
  • 53.
  • 55.
  • 56. Company Director Director Director Director Company Company Company Company
  • 57.
  • 59.
  • 60.
  • 61.
  • 62.
  • 63. - Character string dates - Erratic whitespace - Arbitrary separators - Excel Dates Also: - month overflows at week end - year overflows
  • 64.
  • 65.
  • 66.
  • 67. Open is as open does… DATA

Notes de l'éditeur

  1. A great example of timely data is data relating to roadworks. This data is often released in an impenetrable form, screeds of text detailing roadnames nobody uses and identifying in arcane language where roadworks are to take place, and what diversions have been put in place. Why is it so hard to just publish the data as KML that can be rendered trivially in an online map?!
  2. Another example that demonstrates how CSV can be used to help data flow is demonstrated by Google Spreadsheets. The =importData formula allows a user to specify a source data URL, and pull the CSV data found at that location in to the spreadsheet. Unlike Many Eyes Wikified, if the source data at the URL is updated, the updated will (eventually) be pulled into the spreadsheet automatically.
  3. One of the really good reasons for getting data into a data processing environment such as a spreadsheet is that you can start to work it. In the case of Google Spreadsheets, the spreadsheet environment can also be used as a database environment. That is, we can treat one or more data containing sheets in a spreadsheet as a database, and generate new views over the data, as well as running queries over that data.
  4. Another way of using a Google Spreadsheet as a database is via the Google Spreadsheets API. The GoogleVisualisation API (?) provides a way of passing queries written using the Google ???viz query language from an arbitrary web page or web application, and receiving the resulting data in a standard JSON based format, which also happens to play nicely with the Google Visualisation API???The Guardian Datastore explorer is a crude demonstration for 2009(??) demonstrating how data from the Guardian datastore, data that is stored across a range of Google spreadsheets, can be explored , queried and visualised via these APIs. Users can select a dataset from a drop down menu, fed from a delicious account to which various datastore spreadsheets have been bookmarked using a particular set of tags, or by pasting in the URL of an arbitrary (public) Google spreadsheet. The first row/headings of the data can then be previewed (a simple spreadsheet is assumed, in which column headings appear In the first row of the spreadsheet).
  5. A series of list boxes are then populated with the column labels and there names, and provide a certain amount of help for the creation of a query over the spreadsheet data. A range of output formats can also be selected, from simple HTML data tables, to a range of charts. URLs are also generated for HTML and CSV representations of the data returned from the query.
  6. One of the nice things about the data table widget (a standard GoogleVisualisation API component in this case, though similar examples exist for YUI, the Yahoo User Interface Libraries, or frameworks such as JQuery), is that is supports things like row sorting by column, (for free – no programming required!), allowing even further manipulation of the data, albeit at a simplistic level.(It’s probably worth pointing out here that it may be worth providing a preview of the column headings and first few rows (or a sample of random rows) of data when datasets are published, just so that users can see what sort of data is on offer without having to download the whole data set?)
  7. If you’re in the business of selling information as data, you are under threat where that information is published in an openly licensed way.
  8. Do we have a hashtag for the workshop?