SlideShare une entreprise Scribd logo
1  sur  64
Télécharger pour lire hors ligne
DATAVIZ: VISUAL REPRESENTATION OF COMPLEX
                    PHENOMENA
    data visualization & computational design
        @ Better Nouveau Workshop
                14/12/2011



What is Open Data?
   Lorenzo Benussi, TOP-IX Consotium
       lorenzo.benussi@top-ix.org




                     1
About me

           Research & Business
                 Development
      TOP-IX Consortium



       Fellow, NEXA Centre
           Polytechnic of Turin


    Fellow, Department of
Economics University of Turin


                                  2
agenda
1. Background
2. Definitions
  I. Open Knowledge Definition
  II. Open Data Licenses
  III. Pricing models
  IV. Formats
3. Examples


                 3
Did you take the bus today?
             4
Ref: National Geographic http://ngm.nationalgeographic.com/big-idea/14/augmented-reality




                                                            Background
                                                                                     5
BIG DATA stylized facts 1
• $600 to buy a disk drive that can store all the
    world's music.
•   5 billion mobile phone in use in 2010.
•   30 billion pieces of content shared on Facebook
    every month.
•   40% of projected growth in global data generated
    per year VS 5% growth in global IT spending.
•   235 terabytes data collected by US Library of
    Congress in April 2011.
•   15 out of 17 sectors in the United States have more
    data stored per company than the US Library of
    Congress
       McKinsey: Big Data:The next frontier of innovation, competition and productivity. (may 2011)
                                                    6
BIG DATA stylized facts 2
    $300 billion potential annual value to US health care - more
    than X 2 total annual health care spending in Spain.
•    €250 billion potential annual value to Europe's public sector
     administration - more than GDP of Greece.
•    $600 billion potential annual consumer surplus from using
     personal location data globally.
•    60% potential increase in retailers' operating margins
     possible with big data.
•    140.000-190.000 more deep analytical talent position and
     1.5 million more data-savvy managers needed to take full
     advantage of big data in the USA.
            McKinsey: Big Data:The next frontier of innovation, competition and productivity. (may 2011)
                                                         7
WEB(squared)
1.Redefining Collective Intelligence:
New Sensory Input
2.Cooperating Data Subsystems
3.How the Web Learns: Explicit vs.
Implicit Meaning
4.Web Meets World: The
"Information Shadow" and the
Internet of Things
5.The Rise of Real Time: A Collective
Mind
Ref: Tim O’Reilly and John Battelle (2009), Web Squared: Web 2.0 Five Years On.
http://www.web2summit.com/web2009/public/schedule/detail/10194



                                                              8
Digital technology could enable an extraordinary range of
ordinary people to become part of a creative process.
 (The future of ideas, Lawrence Lessig)




                             9
When I say that innovation is being democratized, I mean
that users of products and services—both firms and individual
consumers—are increasingly able to innovate for themselves.
(Democratizing Innovation, Eric Von Hippel)




                            10
The value of metrics

              • Data     Hal Varian, Google’s Chief Economist




              • Information
              • Knowledge
              • Value
         11
12
DATA as a SERVICE




Data are not closed inside applications but they are consumed on-demand as
a service
RESTful API make possible to access data as a web resource (trough URI)

                                    13
Business Models
A. Data owner: paid to publish / revenue share.
B. Data user: pay for data delivery/trasformation/
   analysis services.

     New Generation Marketplace
3.   Works with open and not-open data
4.   Provide data on-the-fly through API (evan custom).
5.   Sometime the community of data curators in
     involved to maintain and expand the data crowd-
     sourcing (e.g. Factual).
6.   Provide tools (web based) to explore the data

                            14
What open data means?
 Open Data is a model to extract value from
 public sector information by using the data
 to build new tools and to create innovative
 services


                       15
PSI (public sector information) mines

• The Public Sector produces
  and manages huge amount of
  data, opening PSI information
  in EU produces economic
  growth 140 billion € / year
  (aggregate)

• Public Data are the raw
  material to create new
  products and services
                                       COURTESY/RON WHEELER. The 8,000-foot deep Homestake Gold
                                       Mine in South Dakota is the site where scientists, including UC
                                       Berkeley researchers, plan to construct the world's deepest research
                                       center.

                                  16
data.gov
                                “Openness will strengthen our democracy and
                                      promote efficiency and effectiveness in
                                                               Government”
                                          Transparency and Open Government
                                      Memorandum for the Heads of Executive
                                             Departments and Agencies (2009)




[…] As you know, transparency is at the
heart of our agenda for Government. We
recognise that transparency and open data
can be a powerful tool to help reform public
services, foster innovation and empower
citizens.
David Cameron - Letter to Cabinet Ministers
(2011)
                                      17
Information is the currency of democracy
Benjamin Franklin (attribution)




                                  18
Raw data now!




"... give us the unadulterated data, we want the data, we want
unadulterated data. We have to ask for raw data now."
Tim Berners-Lee, advisor data.gov.uk
                               19
data.gov: leading examples




USA - data.gov                                UK - data.gov.uk




                    Australia - data.gov.au
                                20
Legislation in EU, Italy and
                                                      Piedmont
                       EUROPA
                       Direttiva 2003/98/CE del 17 novembre 2003
  The evolution towards an information and knowledge society influences the life of every citizen in
the Com-munity, inter alia, by enabling them to gain new ways of accessing and acquiring knowledge.
      DIRECTIVE 2003/98/EC OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 17
                                          November 2003 on the re-use of public sector information

                        ITALY
                        Decreto Legislativo n. 36 January, 24 2006 and 
                        L. 96/2010.

                        PIEDMONT
                        Delibera di Giunta regionale 36 - 1109 November
                        2010

                                                21
WHY : civil society


• Accountability
• Tansparency
• Collaboration
• Participation

                   22
WHY : (digital) market

•Innovation
• Cooperation
• Competition
• Digital
 commons
                23
The first example in Italy - dati.piemonte.it
                     24
apps4italy
•   All EU citizens can participate (!!) & 40K€
    in cash prizes

•   Building useful, innovative projects based on italian
    public data (not only open data)

•   Four main categories (growing):
       1. Ideas
       2. Apps                         Ref: appsforitaly.org
       3. Visualization
       4. Datasets
                             25
Open Data: definitions
          26
Open Knowledge Definition v.1.1 by OKF
  A work is open if its manner of distribution satisfies the
                   following conditions:
1. Access
2. Redistribution                  8. No discrimination (fields
                                   or endeavor)
3. Reuse
                                   9. Distribution of license
4. Absence of technological
restriction                        10. License must not be
                                   specific to a package
5. Attribution
                                   11. License must not
6. Integrity                       restrict the distribution of
                                   other works
7. No discrimination
(persons or groups)
                              27
Open Definition - http://opendefinition.org/okd/
Version 1.1

Terminology

The term knowledge is taken to include:

# 1.# Content such as music, films, books
# 2.# Data be it scientific, historical, geographic or otherwise
# 3.# Government and other administrative information


Software is excluded [...]

The term work will be used to denote the item or piece of knowledge
which is being transferred.

The term package may also be used to denote a collection of works. [...]

The term license refers to the legal license under which the work is made
available. Where no license has been made this should be interpreted as
referring to the resulting default legal conditions under which the work is
available (for example copyright).
                                    28
The Definition - A work is open if its manner of distribution
satisfies the following conditions:


1. ACCESS
The work shall be available as a whole and at no more than a reasonable
reproduction cost, preferably downloading via the Internet without charge. The
work must also be available in a convenient and modifiable form.

2. REDISTRIBUTION
The license shall not restrict any party from selling or giving away the work either
on its own or as part of a package made from works from many different sources.
The license shall not require a royalty or other fee for such sale or distribution.

3. REUSE
The license must allow for modifications and derivative works and must allow
them to be distributed under the terms of the original work.



                                        29
4. ABSENCE OF TECHNOLOGICAL RESTRICTION
The work must be provided in such a form that there are no technological
obstacles to the performance of the above activities. This can be achieved by the
provision of the work in an open data format, i.e. one whose specification is publicly
and freely available and which places no restrictions monetary or otherwise upon
its use.

5. ATTRIBUTION
The license may require as a condition for redistribution and re-use the attribution
of the contributors and creators to the work. If this condition is imposed it must
not be onerous. For example if attribution is required a list of those requiring
attribution should accompany the work.

6. INTEGRITY
The license may require as a condition for the work being distributed in modified
form that the resulting work carry a different name or version number from the
original work.


                                        30
7. NO DISCRIMINATION AGAINST PERSONS OR GROUPS
The license must not discriminate against any person or group of persons.

8. NO DISCRIMINATION AGAINST FIELDS OF ENDEAVOR
The license must not restrict anyone from making use of the work in a specific
field of endeavor. For example, it may not restrict the work from being used in a
business, or from being used for genetic research.

9. DISTRIBUTION OF LICENSE
The rights attached to the work must apply to all to whom it is redistributed
without the need for execution of an additional license by those parties.

10. LICENSE MUST NOT BE SPECIFIC TO A PACKAGE
The rights attached to the work must not depend on the work being part of a
particular package. If the work is extracted from that package and used or
distributed within the terms of the work’s license, all parties to whom the work is
redistributed should have the same rights as those that are granted in conjunction
with the original package.

11. LICENSE MUST NOT RESTRICT THE DISTRIBUTION OF OTHER WORKS
The license must not place restrictions on other works that are distributed along
with the licensed work. For example, the license must not insist that all other
works distributed on the same medium are open.
                                       31
Open Data: prices
        32
A paradigmatic shift:
         information economy
• The transition from a physically-based to a knowledge-based
  economic environment made information a primary
  wealth-creating asset.

• Digital access to information seems to have changed the
  structure of many industries, promoting services-oriented
  business models based on disclosure and sharing of
  information and knowledge.




                             33
A paradigmatic shift:
            PSI data mines
• The Public Sector holds and manages huge amounts of
  data and information. Fostering access to those repositories
  enables new business opportunities that can broaden
  market volumes in such sectors.

• PSI represents the raw material from which value added
  products and services can be designed.




                             34
The use/value of PSI
  PSI can be used and reused in
      many ways (non rivalry in
                                             Several supply chain
           consumption):
                                                 configurations.
1.Broad range of sectors
                                       1.Linear models (private re-users
2.Different sets of actors             add value)
3.PSI holders                          2.User generated contents
4.Private re-users                     3.Information sharing between
5.Regulatory bodies                    public bodies
6.Citizens




                                  35
The price of PSI:
         the “free data” approach
• The peculiar cost structure of digital data collecting, processing
  and delivering (high fixed costs, zero marginal cost) strongly
  influences the possible pricing strategies to be adopted by PSI
  holders.

• Pollock (2008): a price that equals marginal costs (i.e. PSI free of
  charge) is socially optimal provided that elasticity of demand
  and positive externalities overcome a given threshold.
    ✓ Empirics: those conditions are likely to be verified in most of
         the PSI domains.


                                 36
The price of PSI:
          cost recovery approach
• Although a cost recovery regime may bound potential demand
  and distort competition, several critical issues could trigger its
  adoption.

• Underestimation of downstream demand and network
  externalities.
 ✓Lack of long-run commitment in subsidizing PSI collection.
 ✓Short-term decision making.
 ✓Moral hazard (?).

                                   37
The price of PSI: possible scenarios
Directive 2003/98/EC is aimed at fostering PSI reuse mainly by promoting:
1.PSI availability in digital format
2.Transparency of reuse conditions and pricing
3.Non discrimination

                   Which market configurations are likely to emerge?
MEPSIR (2006)

                             Directive impact                                   Main condition                            Example



                                                                 Information is strongly liked with the functioning        Cadastral
  Closed shop      Minor. Public Sector bodies continue to
                                                                                  of public bodies.                       information
                         control the supply chain.


                 Non-negligible. New entrants step into the      Information is important while not strategic for
   Battlefield                                                                                                         Meteorological data
                          downstream market.                                           PA.

                  Strong. Public Sector enlarges its influence    Digitalization offers new opportunities for value
                                                                                                                       Legal information
                       over the downstream stages.                                   extraction.
   Playground
                 Non-negligible. Public Sector has the only          Information reuse generates high demand          Traffic and transport
                       role of information holder.                        volumes from citizens and firms                  information


                                                                38
The price of PSI:
        Externalities & Policy
All pricing strategies encompass potential risks of inefficiency
 for PSI holders (due to lack of incentives in reducing costs
                   and/or improving quality).
  The importance of the regulatory framework


  The Central Role of Externalities


                             39
Open Data: formats
           40
Linked open data and Semantic web

             The Semantic Web isn't just about putting data on
             the web. It is about making links, so that a person
             or machine can explore the web of data. With
             linked data, when you have some of it, you can find
             other, related, data. (by Tim Berners-Lee)

          1. Use URIs as names for things

          2. Use HTTP URIs so that people can look up those
             names.

          3. When someone looks up a URI, provide useful
             information, using the standards (RDF*, SPARQL)

          4. Include links to other URIs. so that they can
             discover more things.

             Ref: http://www.w3.org/DesignIssues/
             LinkedData.html
                    41
42
Linked open data: basic
              principles
1. Everything has a name (people, locations,
   etc.)
1. Every name starts with http://

3. All data are described by using RDF
   (Resource Description Framework is a W3C
   standard).
  Tim Berners Lee talk on linked data:
  http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
                                  43
Data as a RDF graph




         44
The Vision - A global
interconnected database




           45
The Vision - Mix data on-the-fly




               46
Linked data - hands on
DBPedia provide information of wikipedia as Linked Data.
Example, Turin airport: http://dbpedia.org/page/
Turin_Caselle_Airport




                             47
Open Data: license

            48
Open Data license 1 (OKF)

  Open Knowledge foundation licences
1. Public Domain Dedication and License (PDDL) —
   “Public Domain for data/databases”
2. Open Data Commons Attribution License (ODC-
   By) — “Attribution for data/databases”
3. Open Data Commons Open Database License
   (ODC-ODbL) — “Attribution Share-Alike for data/
   databases”
  Ref: http://www.opendatacommons.org/licenses/

                         49
Open Data licenses 2 (CC e IODL)
    Creative Commons Licenses (http://creativecommons.org/
    licenses/)
1. CC Zero
2. CC by - Atribution
3. CC SA - Share alike
4. CC BY-SA - Attribution and Share alike


    Italian open data license (http://www.formez.it/iodl/)

•   IODL - Italian Open Data License (BY-SA)

                                50
examples
   51
2 groups
I. Transparency
II. Information services


                     52
Transparency
• Public assembly
  (parliament,
  councils)

• Public Budget and
  expenses

• Public
  procurement
                      53
Ref: http://traintimes.org.uk/map/tube/




Info services
• Transportation
• Environment
• Cultural
  heritage


                   54
food




 55
kids




 56
environment




     57
Ref: http://traintimes.org.uk/map/tube/




transportation

58
Ref: http://webdesignledger.com/inspiration/
                                       15-stunning-examples-of-data-visualization




Ref: http: //www.gapminder.org/




                                   Data VIZ
                                  59
Where to find open data
Open (and not open) data archive
http://ckan.net/
http://it.ckan.net/

Example of italian datasets:
Dati.gov.it: http://www.dati.gov.it/
5T: http://biennaledemocrazia.it/dataset/
Dati Piemonte: http://dati.piemonte.it
ISTAT: http://dati.istat.it/
Enel: http://data.enel.com/

                     60
Tools and links
ONLINE DATA VISUALIZATION
G visualization Api: http://code.google.com/intl/it-IT/apis/chart/
Tableau Public: http://www.tableausoftware.com/public
Open Heat Map: http://www.openheatmap.com/

ONLINE STORAGE+VISUALIZATION
Google Public Data explorer: http://www.google.com/publicdata/home
IBM Many Eyes: http://www-958.ibm.com/software/data/cognos/manyeyes/
Google Fusion tables: http://www.google.com/fusiontables/Home
Impure: http://www.impure.com/

CURATION & LINKING
Google Refine
Data Wrangler: http://vis.stanford.edu/wrangler/

OFFLINE TOOLS
R: http://www.r-project.org/
Jscript Library for data viz: http://thejit.org/
Anche questa: http://vis.stanford.edu/protovis/
Network / graph analysis / visualization: http://gephi.org/
Language turing complete for dataviz for visual artist: http://processing.org/

                                               61
wrap-up
1. Not all public data are open data
2. Public data and gov data are
   often “broken” (strange formats
   and ambiguous IP)
3. Open Data make sense if we put
   it in perspective - the rise of Big
   Data

      62
everything is changing



          63
thanks
lorenzo.benussi@top-ix.org




            64

Contenu connexe

Tendances

PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長
PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長
PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長PunNode 科技創業新聞網
 
Infoactivism - Michł Mach
Infoactivism - Michł MachInfoactivism - Michł Mach
Infoactivism - Michł Machcentrumcyfrowe
 
Privacy impact assessment
Privacy impact assessmentPrivacy impact assessment
Privacy impact assessmentSpringer
 
The use of Digital Tools and Geoinformation for Development
The use of Digital Tools and Geoinformation for DevelopmentThe use of Digital Tools and Geoinformation for Development
The use of Digital Tools and Geoinformation for Developmentbfnd
 
ICT4D: Tecnologie digitali per lo sviluppo
ICT4D: Tecnologie digitali per lo sviluppoICT4D: Tecnologie digitali per lo sviluppo
ICT4D: Tecnologie digitali per lo sviluppoRoberto Polillo
 
Hong Kong Knowledge Management Conference 2013
Hong Kong Knowledge Management Conference 2013Hong Kong Knowledge Management Conference 2013
Hong Kong Knowledge Management Conference 20132016
 
Angry birds view of open data v6 public
Angry birds view of open data v6   publicAngry birds view of open data v6   public
Angry birds view of open data v6 publicsnewell4
 
Moving to a read-write government
Moving to a read-write governmentMoving to a read-write government
Moving to a read-write governmentPatrick McCormick
 
Osimopolitika20v2
Osimopolitika20v2Osimopolitika20v2
Osimopolitika20v2osimod
 
Abc gov2 presentation melb
Abc gov2 presentation melbAbc gov2 presentation melb
Abc gov2 presentation melbNicholas Gruen
 
Open Data - Challenges and Opportunities for the GEO and Citizen Community
Open Data - Challenges and Opportunities for the GEO and Citizen CommunityOpen Data - Challenges and Opportunities for the GEO and Citizen Community
Open Data - Challenges and Opportunities for the GEO and Citizen CommunityJury Konga
 
W3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked DataW3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked Data3 Round Stones
 
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...Matthias Stürmer
 
Digital innovation v8
Digital innovation v8Digital innovation v8
Digital innovation v8Verinote
 
KRDB2010-GoodRelations
KRDB2010-GoodRelationsKRDB2010-GoodRelations
KRDB2010-GoodRelationsMartin Hepp
 

Tendances (20)

PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長
PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長
PunProbe x NII 協進會《網路治理 Internet Governance》——吳國維執行長
 
Infoactivism - Michł Mach
Infoactivism - Michł MachInfoactivism - Michł Mach
Infoactivism - Michł Mach
 
Privacy impact assessment
Privacy impact assessmentPrivacy impact assessment
Privacy impact assessment
 
The use of Digital Tools and Geoinformation for Development
The use of Digital Tools and Geoinformation for DevelopmentThe use of Digital Tools and Geoinformation for Development
The use of Digital Tools and Geoinformation for Development
 
Changing media landscape 08022011
Changing media landscape 08022011Changing media landscape 08022011
Changing media landscape 08022011
 
ICT4D: Tecnologie digitali per lo sviluppo
ICT4D: Tecnologie digitali per lo sviluppoICT4D: Tecnologie digitali per lo sviluppo
ICT4D: Tecnologie digitali per lo sviluppo
 
Hong Kong Knowledge Management Conference 2013
Hong Kong Knowledge Management Conference 2013Hong Kong Knowledge Management Conference 2013
Hong Kong Knowledge Management Conference 2013
 
cscw
cscwcscw
cscw
 
110 koenig
110 koenig110 koenig
110 koenig
 
Angry birds view of open data v6 public
Angry birds view of open data v6   publicAngry birds view of open data v6   public
Angry birds view of open data v6 public
 
Moving to a read-write government
Moving to a read-write governmentMoving to a read-write government
Moving to a read-write government
 
Osimopolitika20v2
Osimopolitika20v2Osimopolitika20v2
Osimopolitika20v2
 
Free Software in Government
Free Software in GovernmentFree Software in Government
Free Software in Government
 
Abc gov2 presentation melb
Abc gov2 presentation melbAbc gov2 presentation melb
Abc gov2 presentation melb
 
Open Data - Challenges and Opportunities for the GEO and Citizen Community
Open Data - Challenges and Opportunities for the GEO and Citizen CommunityOpen Data - Challenges and Opportunities for the GEO and Citizen Community
Open Data - Challenges and Opportunities for the GEO and Citizen Community
 
W3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked DataW3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked Data
 
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
Data Colonialism and Digital Sustainability: Problems and Solutions to Curren...
 
Polinter08
Polinter08Polinter08
Polinter08
 
Digital innovation v8
Digital innovation v8Digital innovation v8
Digital innovation v8
 
KRDB2010-GoodRelations
KRDB2010-GoodRelationsKRDB2010-GoodRelations
KRDB2010-GoodRelations
 

Similaire à What is opendata

Lorenzo Benussi - DataGov
Lorenzo Benussi - DataGovLorenzo Benussi - DataGov
Lorenzo Benussi - DataGovSegnalazionIT
 
dati.piemonte.it
dati.piemonte.itdati.piemonte.it
dati.piemonte.itFPA
 
Behind the Scenes with Data.gov
Behind the Scenes with Data.govBehind the Scenes with Data.gov
Behind the Scenes with Data.govJeanne Holm
 
Introduction: Open Data Business
Introduction: Open Data BusinessIntroduction: Open Data Business
Introduction: Open Data BusinessMartin Kaltenböck
 
Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea Haklae Kim
 
COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)Comit Projects Ltd
 
Open Data per il riuso della PSI: l'Europa spinge sull'economia del futuro
Open Data per il riuso della PSI: l'Europa spinge sull'economia del futuroOpen Data per il riuso della PSI: l'Europa spinge sull'economia del futuro
Open Data per il riuso della PSI: l'Europa spinge sull'economia del futuroMatteo Brunati
 
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...AnthonyOtuonye
 
Eurocham PSI seminar Hong Kong
Eurocham PSI seminar Hong KongEurocham PSI seminar Hong Kong
Eurocham PSI seminar Hong Kongvalrit
 
Innovating through public sector information
Innovating through public sector informationInnovating through public sector information
Innovating through public sector informationJerry Fishenden
 
Raimondo Iemma - Open Government Data in Italy - may 2012
Raimondo Iemma - Open Government Data in Italy - may 2012Raimondo Iemma - Open Government Data in Italy - may 2012
Raimondo Iemma - Open Government Data in Italy - may 2012RaimondoIemma
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceGSDI Association
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the futureSlim Turki, Dr.
 
Open Data Movement around the Globe
Open Data Movement around the GlobeOpen Data Movement around the Globe
Open Data Movement around the GlobeChingteng Hsiao
 
Open Knowledge Regime for an Innovation Economy
Open Knowledge Regime for an Innovation EconomyOpen Knowledge Regime for an Innovation Economy
Open Knowledge Regime for an Innovation EconomyLinuxmalaysia Malaysia
 
Open Data Institute presentation of european context
Open Data Institute presentation of european contextOpen Data Institute presentation of european context
Open Data Institute presentation of european contextliberTIC
 

Similaire à What is opendata (20)

Lorenzo Benussi - DataGov
Lorenzo Benussi - DataGovLorenzo Benussi - DataGov
Lorenzo Benussi - DataGov
 
Data gov and data(reg)
Data gov and data(reg)   Data gov and data(reg)
Data gov and data(reg)
 
dati.piemonte.it
dati.piemonte.itdati.piemonte.it
dati.piemonte.it
 
dati.piemonte.it
dati.piemonte.itdati.piemonte.it
dati.piemonte.it
 
Behind the Scenes with Data.gov
Behind the Scenes with Data.govBehind the Scenes with Data.gov
Behind the Scenes with Data.gov
 
Introduction: Open Data Business
Introduction: Open Data BusinessIntroduction: Open Data Business
Introduction: Open Data Business
 
Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea
 
COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)
 
EISCO opendata euskadi
EISCO opendata euskadiEISCO opendata euskadi
EISCO opendata euskadi
 
Open Data per il riuso della PSI: l'Europa spinge sull'economia del futuro
Open Data per il riuso della PSI: l'Europa spinge sull'economia del futuroOpen Data per il riuso della PSI: l'Europa spinge sull'economia del futuro
Open Data per il riuso della PSI: l'Europa spinge sull'economia del futuro
 
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
 
Eurocham PSI seminar Hong Kong
Eurocham PSI seminar Hong KongEurocham PSI seminar Hong Kong
Eurocham PSI seminar Hong Kong
 
Innovating through public sector information
Innovating through public sector informationInnovating through public sector information
Innovating through public sector information
 
CO3 - Open Data
CO3 - Open DataCO3 - Open Data
CO3 - Open Data
 
Raimondo Iemma - Open Government Data in Italy - may 2012
Raimondo Iemma - Open Government Data in Italy - may 2012Raimondo Iemma - Open Government Data in Italy - may 2012
Raimondo Iemma - Open Government Data in Italy - may 2012
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 Conference
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the future
 
Open Data Movement around the Globe
Open Data Movement around the GlobeOpen Data Movement around the Globe
Open Data Movement around the Globe
 
Open Knowledge Regime for an Innovation Economy
Open Knowledge Regime for an Innovation EconomyOpen Knowledge Regime for an Innovation Economy
Open Knowledge Regime for an Innovation Economy
 
Open Data Institute presentation of european context
Open Data Institute presentation of european contextOpen Data Institute presentation of european context
Open Data Institute presentation of european context
 

Plus de Lorenzo Benussi

Small/Big/Open Data for Public Empowerment and Freedom
Small/Big/Open Data for Public Empowerment and FreedomSmall/Big/Open Data for Public Empowerment and Freedom
Small/Big/Open Data for Public Empowerment and FreedomLorenzo Benussi
 
Open data e Comunità Intelligenti in italia
Open data e Comunità Intelligenti in italiaOpen data e Comunità Intelligenti in italia
Open data e Comunità Intelligenti in italiaLorenzo Benussi
 
Open Data nell'agenda digitale italiana
Open Data nell'agenda digitale italianaOpen Data nell'agenda digitale italiana
Open Data nell'agenda digitale italianaLorenzo Benussi
 
Dati di tipo aperto: cosa cambia con la nuova Agenda Digitale italiana
Dati di tipo aperto: cosa cambia con la nuova Agenda Digitale italianaDati di tipo aperto: cosa cambia con la nuova Agenda Digitale italiana
Dati di tipo aperto: cosa cambia con la nuova Agenda Digitale italianaLorenzo Benussi
 
Strategia open data - long version
Strategia open data - long versionStrategia open data - long version
Strategia open data - long versionLorenzo Benussi
 
How the Open Source model adapts to the cloud computing environment
How the Open Source model adapts to the cloud computing environmentHow the Open Source model adapts to the cloud computing environment
How the Open Source model adapts to the cloud computing environmentLorenzo Benussi
 
Elementi di proprietà intellettuale digitale
Elementi di proprietà intellettuale digitaleElementi di proprietà intellettuale digitale
Elementi di proprietà intellettuale digitaleLorenzo Benussi
 
Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...
Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...
Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...Lorenzo Benussi
 

Plus de Lorenzo Benussi (15)

Small/Big/Open Data for Public Empowerment and Freedom
Small/Big/Open Data for Public Empowerment and FreedomSmall/Big/Open Data for Public Empowerment and Freedom
Small/Big/Open Data for Public Empowerment and Freedom
 
Big data & smart city 1
Big data & smart city 1Big data & smart city 1
Big data & smart city 1
 
Open, big, small data
Open, big, small dataOpen, big, small data
Open, big, small data
 
Open data e Comunità Intelligenti in italia
Open data e Comunità Intelligenti in italiaOpen data e Comunità Intelligenti in italia
Open data e Comunità Intelligenti in italia
 
Open Data nell'agenda digitale italiana
Open Data nell'agenda digitale italianaOpen Data nell'agenda digitale italiana
Open Data nell'agenda digitale italiana
 
Dati di tipo aperto: cosa cambia con la nuova Agenda Digitale italiana
Dati di tipo aperto: cosa cambia con la nuova Agenda Digitale italianaDati di tipo aperto: cosa cambia con la nuova Agenda Digitale italiana
Dati di tipo aperto: cosa cambia con la nuova Agenda Digitale italiana
 
Italia open data
Italia open dataItalia open data
Italia open data
 
Apps for Italy - a4i
Apps for Italy - a4iApps for Italy - a4i
Apps for Italy - a4i
 
Strategia open data - long version
Strategia open data - long versionStrategia open data - long version
Strategia open data - long version
 
Open Data
Open DataOpen Data
Open Data
 
How the Open Source model adapts to the cloud computing environment
How the Open Source model adapts to the cloud computing environmentHow the Open Source model adapts to the cloud computing environment
How the Open Source model adapts to the cloud computing environment
 
Elementi di proprietà intellettuale digitale
Elementi di proprietà intellettuale digitaleElementi di proprietà intellettuale digitale
Elementi di proprietà intellettuale digitale
 
Mind your own business
Mind your own businessMind your own business
Mind your own business
 
Media 2.0
Media 2.0Media 2.0
Media 2.0
 
Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...
Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...
Il ruolo dei social network per la consapevolezza dell'impatto ambientale del...
 

Dernier

Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 

Dernier (20)

Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 

What is opendata

  • 1. DATAVIZ: VISUAL REPRESENTATION OF COMPLEX PHENOMENA data visualization & computational design @ Better Nouveau Workshop 14/12/2011 What is Open Data? Lorenzo Benussi, TOP-IX Consotium lorenzo.benussi@top-ix.org 1
  • 2. About me Research & Business Development TOP-IX Consortium Fellow, NEXA Centre Polytechnic of Turin Fellow, Department of Economics University of Turin 2
  • 3. agenda 1. Background 2. Definitions I. Open Knowledge Definition II. Open Data Licenses III. Pricing models IV. Formats 3. Examples 3
  • 4. Did you take the bus today? 4
  • 5. Ref: National Geographic http://ngm.nationalgeographic.com/big-idea/14/augmented-reality Background 5
  • 6. BIG DATA stylized facts 1 • $600 to buy a disk drive that can store all the world's music. • 5 billion mobile phone in use in 2010. • 30 billion pieces of content shared on Facebook every month. • 40% of projected growth in global data generated per year VS 5% growth in global IT spending. • 235 terabytes data collected by US Library of Congress in April 2011. • 15 out of 17 sectors in the United States have more data stored per company than the US Library of Congress McKinsey: Big Data:The next frontier of innovation, competition and productivity. (may 2011) 6
  • 7. BIG DATA stylized facts 2 $300 billion potential annual value to US health care - more than X 2 total annual health care spending in Spain. • €250 billion potential annual value to Europe's public sector administration - more than GDP of Greece. • $600 billion potential annual consumer surplus from using personal location data globally. • 60% potential increase in retailers' operating margins possible with big data. • 140.000-190.000 more deep analytical talent position and 1.5 million more data-savvy managers needed to take full advantage of big data in the USA. McKinsey: Big Data:The next frontier of innovation, competition and productivity. (may 2011) 7
  • 8. WEB(squared) 1.Redefining Collective Intelligence: New Sensory Input 2.Cooperating Data Subsystems 3.How the Web Learns: Explicit vs. Implicit Meaning 4.Web Meets World: The "Information Shadow" and the Internet of Things 5.The Rise of Real Time: A Collective Mind Ref: Tim O’Reilly and John Battelle (2009), Web Squared: Web 2.0 Five Years On. http://www.web2summit.com/web2009/public/schedule/detail/10194 8
  • 9. Digital technology could enable an extraordinary range of ordinary people to become part of a creative process. (The future of ideas, Lawrence Lessig) 9
  • 10. When I say that innovation is being democratized, I mean that users of products and services—both firms and individual consumers—are increasingly able to innovate for themselves. (Democratizing Innovation, Eric Von Hippel) 10
  • 11. The value of metrics • Data Hal Varian, Google’s Chief Economist • Information • Knowledge • Value 11
  • 12. 12
  • 13. DATA as a SERVICE Data are not closed inside applications but they are consumed on-demand as a service RESTful API make possible to access data as a web resource (trough URI) 13
  • 14. Business Models A. Data owner: paid to publish / revenue share. B. Data user: pay for data delivery/trasformation/ analysis services. New Generation Marketplace 3. Works with open and not-open data 4. Provide data on-the-fly through API (evan custom). 5. Sometime the community of data curators in involved to maintain and expand the data crowd- sourcing (e.g. Factual). 6. Provide tools (web based) to explore the data 14
  • 15. What open data means? Open Data is a model to extract value from public sector information by using the data to build new tools and to create innovative services 15
  • 16. PSI (public sector information) mines • The Public Sector produces and manages huge amount of data, opening PSI information in EU produces economic growth 140 billion € / year (aggregate) • Public Data are the raw material to create new products and services COURTESY/RON WHEELER. The 8,000-foot deep Homestake Gold Mine in South Dakota is the site where scientists, including UC Berkeley researchers, plan to construct the world's deepest research center. 16
  • 17. data.gov “Openness will strengthen our democracy and promote efficiency and effectiveness in Government” Transparency and Open Government Memorandum for the Heads of Executive Departments and Agencies (2009) […] As you know, transparency is at the heart of our agenda for Government. We recognise that transparency and open data can be a powerful tool to help reform public services, foster innovation and empower citizens. David Cameron - Letter to Cabinet Ministers (2011) 17
  • 18. Information is the currency of democracy Benjamin Franklin (attribution) 18
  • 19. Raw data now! "... give us the unadulterated data, we want the data, we want unadulterated data. We have to ask for raw data now." Tim Berners-Lee, advisor data.gov.uk 19
  • 20. data.gov: leading examples USA - data.gov UK - data.gov.uk Australia - data.gov.au 20
  • 21. Legislation in EU, Italy and Piedmont EUROPA Direttiva 2003/98/CE del 17 novembre 2003 The evolution towards an information and knowledge society influences the life of every citizen in the Com-munity, inter alia, by enabling them to gain new ways of accessing and acquiring knowledge. DIRECTIVE 2003/98/EC OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 17 November 2003 on the re-use of public sector information ITALY Decreto Legislativo n. 36 January, 24 2006 and  L. 96/2010. PIEDMONT Delibera di Giunta regionale 36 - 1109 November 2010 21
  • 22. WHY : civil society • Accountability • Tansparency • Collaboration • Participation 22
  • 23. WHY : (digital) market •Innovation • Cooperation • Competition • Digital commons 23
  • 24. The first example in Italy - dati.piemonte.it 24
  • 25. apps4italy • All EU citizens can participate (!!) & 40K€ in cash prizes • Building useful, innovative projects based on italian public data (not only open data) • Four main categories (growing): 1. Ideas 2. Apps Ref: appsforitaly.org 3. Visualization 4. Datasets 25
  • 27. Open Knowledge Definition v.1.1 by OKF A work is open if its manner of distribution satisfies the following conditions: 1. Access 2. Redistribution 8. No discrimination (fields or endeavor) 3. Reuse 9. Distribution of license 4. Absence of technological restriction 10. License must not be specific to a package 5. Attribution 11. License must not 6. Integrity restrict the distribution of other works 7. No discrimination (persons or groups) 27
  • 28. Open Definition - http://opendefinition.org/okd/ Version 1.1 Terminology The term knowledge is taken to include: # 1.# Content such as music, films, books # 2.# Data be it scientific, historical, geographic or otherwise # 3.# Government and other administrative information Software is excluded [...] The term work will be used to denote the item or piece of knowledge which is being transferred. The term package may also be used to denote a collection of works. [...] The term license refers to the legal license under which the work is made available. Where no license has been made this should be interpreted as referring to the resulting default legal conditions under which the work is available (for example copyright). 28
  • 29. The Definition - A work is open if its manner of distribution satisfies the following conditions: 1. ACCESS The work shall be available as a whole and at no more than a reasonable reproduction cost, preferably downloading via the Internet without charge. The work must also be available in a convenient and modifiable form. 2. REDISTRIBUTION The license shall not restrict any party from selling or giving away the work either on its own or as part of a package made from works from many different sources. The license shall not require a royalty or other fee for such sale or distribution. 3. REUSE The license must allow for modifications and derivative works and must allow them to be distributed under the terms of the original work. 29
  • 30. 4. ABSENCE OF TECHNOLOGICAL RESTRICTION The work must be provided in such a form that there are no technological obstacles to the performance of the above activities. This can be achieved by the provision of the work in an open data format, i.e. one whose specification is publicly and freely available and which places no restrictions monetary or otherwise upon its use. 5. ATTRIBUTION The license may require as a condition for redistribution and re-use the attribution of the contributors and creators to the work. If this condition is imposed it must not be onerous. For example if attribution is required a list of those requiring attribution should accompany the work. 6. INTEGRITY The license may require as a condition for the work being distributed in modified form that the resulting work carry a different name or version number from the original work. 30
  • 31. 7. NO DISCRIMINATION AGAINST PERSONS OR GROUPS The license must not discriminate against any person or group of persons. 8. NO DISCRIMINATION AGAINST FIELDS OF ENDEAVOR The license must not restrict anyone from making use of the work in a specific field of endeavor. For example, it may not restrict the work from being used in a business, or from being used for genetic research. 9. DISTRIBUTION OF LICENSE The rights attached to the work must apply to all to whom it is redistributed without the need for execution of an additional license by those parties. 10. LICENSE MUST NOT BE SPECIFIC TO A PACKAGE The rights attached to the work must not depend on the work being part of a particular package. If the work is extracted from that package and used or distributed within the terms of the work’s license, all parties to whom the work is redistributed should have the same rights as those that are granted in conjunction with the original package. 11. LICENSE MUST NOT RESTRICT THE DISTRIBUTION OF OTHER WORKS The license must not place restrictions on other works that are distributed along with the licensed work. For example, the license must not insist that all other works distributed on the same medium are open. 31
  • 33. A paradigmatic shift: information economy • The transition from a physically-based to a knowledge-based economic environment made information a primary wealth-creating asset. • Digital access to information seems to have changed the structure of many industries, promoting services-oriented business models based on disclosure and sharing of information and knowledge. 33
  • 34. A paradigmatic shift: PSI data mines • The Public Sector holds and manages huge amounts of data and information. Fostering access to those repositories enables new business opportunities that can broaden market volumes in such sectors. • PSI represents the raw material from which value added products and services can be designed. 34
  • 35. The use/value of PSI PSI can be used and reused in many ways (non rivalry in Several supply chain consumption): configurations. 1.Broad range of sectors 1.Linear models (private re-users 2.Different sets of actors add value) 3.PSI holders 2.User generated contents 4.Private re-users 3.Information sharing between 5.Regulatory bodies public bodies 6.Citizens 35
  • 36. The price of PSI: the “free data” approach • The peculiar cost structure of digital data collecting, processing and delivering (high fixed costs, zero marginal cost) strongly influences the possible pricing strategies to be adopted by PSI holders. • Pollock (2008): a price that equals marginal costs (i.e. PSI free of charge) is socially optimal provided that elasticity of demand and positive externalities overcome a given threshold. ✓ Empirics: those conditions are likely to be verified in most of the PSI domains. 36
  • 37. The price of PSI: cost recovery approach • Although a cost recovery regime may bound potential demand and distort competition, several critical issues could trigger its adoption. • Underestimation of downstream demand and network externalities. ✓Lack of long-run commitment in subsidizing PSI collection. ✓Short-term decision making. ✓Moral hazard (?). 37
  • 38. The price of PSI: possible scenarios Directive 2003/98/EC is aimed at fostering PSI reuse mainly by promoting: 1.PSI availability in digital format 2.Transparency of reuse conditions and pricing 3.Non discrimination Which market configurations are likely to emerge? MEPSIR (2006) Directive impact Main condition Example Information is strongly liked with the functioning Cadastral Closed shop Minor. Public Sector bodies continue to of public bodies. information control the supply chain. Non-negligible. New entrants step into the Information is important while not strategic for Battlefield Meteorological data downstream market. PA. Strong. Public Sector enlarges its influence Digitalization offers new opportunities for value Legal information over the downstream stages. extraction. Playground Non-negligible. Public Sector has the only Information reuse generates high demand Traffic and transport role of information holder. volumes from citizens and firms information 38
  • 39. The price of PSI: Externalities & Policy All pricing strategies encompass potential risks of inefficiency for PSI holders (due to lack of incentives in reducing costs and/or improving quality). The importance of the regulatory framework The Central Role of Externalities 39
  • 41. Linked open data and Semantic web The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. With linked data, when you have some of it, you can find other, related, data. (by Tim Berners-Lee) 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. Ref: http://www.w3.org/DesignIssues/ LinkedData.html 41
  • 42. 42
  • 43. Linked open data: basic principles 1. Everything has a name (people, locations, etc.) 1. Every name starts with http:// 3. All data are described by using RDF (Resource Description Framework is a W3C standard). Tim Berners Lee talk on linked data: http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html 43
  • 44. Data as a RDF graph 44
  • 45. The Vision - A global interconnected database 45
  • 46. The Vision - Mix data on-the-fly 46
  • 47. Linked data - hands on DBPedia provide information of wikipedia as Linked Data. Example, Turin airport: http://dbpedia.org/page/ Turin_Caselle_Airport 47
  • 49. Open Data license 1 (OKF) Open Knowledge foundation licences 1. Public Domain Dedication and License (PDDL) — “Public Domain for data/databases” 2. Open Data Commons Attribution License (ODC- By) — “Attribution for data/databases” 3. Open Data Commons Open Database License (ODC-ODbL) — “Attribution Share-Alike for data/ databases” Ref: http://www.opendatacommons.org/licenses/ 49
  • 50. Open Data licenses 2 (CC e IODL) Creative Commons Licenses (http://creativecommons.org/ licenses/) 1. CC Zero 2. CC by - Atribution 3. CC SA - Share alike 4. CC BY-SA - Attribution and Share alike Italian open data license (http://www.formez.it/iodl/) • IODL - Italian Open Data License (BY-SA) 50
  • 51. examples 51
  • 52. 2 groups I. Transparency II. Information services 52
  • 53. Transparency • Public assembly (parliament, councils) • Public Budget and expenses • Public procurement 53
  • 54. Ref: http://traintimes.org.uk/map/tube/ Info services • Transportation • Environment • Cultural heritage 54
  • 59. Ref: http://webdesignledger.com/inspiration/ 15-stunning-examples-of-data-visualization Ref: http: //www.gapminder.org/ Data VIZ 59
  • 60. Where to find open data Open (and not open) data archive http://ckan.net/ http://it.ckan.net/ Example of italian datasets: Dati.gov.it: http://www.dati.gov.it/ 5T: http://biennaledemocrazia.it/dataset/ Dati Piemonte: http://dati.piemonte.it ISTAT: http://dati.istat.it/ Enel: http://data.enel.com/ 60
  • 61. Tools and links ONLINE DATA VISUALIZATION G visualization Api: http://code.google.com/intl/it-IT/apis/chart/ Tableau Public: http://www.tableausoftware.com/public Open Heat Map: http://www.openheatmap.com/ ONLINE STORAGE+VISUALIZATION Google Public Data explorer: http://www.google.com/publicdata/home IBM Many Eyes: http://www-958.ibm.com/software/data/cognos/manyeyes/ Google Fusion tables: http://www.google.com/fusiontables/Home Impure: http://www.impure.com/ CURATION & LINKING Google Refine Data Wrangler: http://vis.stanford.edu/wrangler/ OFFLINE TOOLS R: http://www.r-project.org/ Jscript Library for data viz: http://thejit.org/ Anche questa: http://vis.stanford.edu/protovis/ Network / graph analysis / visualization: http://gephi.org/ Language turing complete for dataviz for visual artist: http://processing.org/ 61
  • 62. wrap-up 1. Not all public data are open data 2. Public data and gov data are often “broken” (strange formats and ambiguous IP) 3. Open Data make sense if we put it in perspective - the rise of Big Data 62