Find and understand data

Best Practices for Publishing Data


Hjalmar Gislason, founder & CEO - hg@datamarket.com   October, 2012
Hjalmar Gislason
Founder and CEO

Twitter: @datamarket
Slides: http://blog.datamarket.com/
Heavy Data Consumers

Providers of Data Delivery Technology
Computers
  • Structure

Humans
  • Search
  • Visualization
  • Download

| Best Practices for Publishing Data | Hjalmar Gislason, hg@datamarket.com | October 2012
Publishing for Computers


1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
"Don't anthropomorphize computers - they hate it."
  - Unknown
Simple Formats
Simple Formats:
Tim Berners-Lee’s Five Stars

Simple formats:
You lost me at “Semantics”

Standards will emerge and there will be more and more of them

  • RDF
  • OData vs. GData
  • DSPL

Indexes, unique IDs and meta-data

Indexes, unique IDs and meta-data

  • Must: Unique ID, Title, Last updated
  • Should: Meta-data


  • Why?
   • No need for scraping
       • Less load on your end
   • Ensures full coverage
   • Ensures content removal and updates
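
The points above can be illustrated with a minimal sketch of how a consumer might use such an index instead of scraping. The index layout (entries with `id`, `title`, `last_updated`), the sample entries, and the function name `plan_sync` are all assumptions made up for illustration, not a format described in the talk.

```python
from datetime import datetime


def plan_sync(index, local):
    """Compare a publisher's index with what a consumer already holds.

    index: list of dicts with "id", "title" and "last_updated" (ISO 8601 strings).
    local: dict mapping dataset id -> "last_updated" value seen at the last sync.
    Returns (to_fetch, to_remove) as sorted lists of dataset ids.
    """
    published = {entry["id"]: entry for entry in index}
    to_fetch = []
    for dataset_id, entry in published.items():
        seen = local.get(dataset_id)
        # Fetch datasets that are new or have been updated since the last sync.
        if seen is None or (
            datetime.fromisoformat(entry["last_updated"])
            > datetime.fromisoformat(seen)
        ):
            to_fetch.append(dataset_id)
    # Anything held locally but absent from the index has been withdrawn.
    to_remove = [dataset_id for dataset_id in local if dataset_id not in published]
    return sorted(to_fetch), sorted(to_remove)


# Made-up sample data, for illustration only.
index = [
    {"id": "ds-001", "title": "Dataset A", "last_updated": "2012-10-01T00:00:00"},
    {"id": "ds-002", "title": "Dataset B", "last_updated": "2012-09-15T00:00:00"},
    {"id": "ds-004", "title": "Dataset D", "last_updated": "2012-10-05T00:00:00"},
]
local = {
    "ds-001": "2012-10-01T00:00:00",  # unchanged since last sync
    "ds-002": "2012-08-01T00:00:00",  # stale copy
    "ds-003": "2012-07-01T00:00:00",  # no longer listed by the publisher
}

to_fetch, to_remove = plan_sync(index, local)
print(to_fetch)   # ['ds-002', 'ds-004']
print(to_remove)  # ['ds-003']
```

One small request for the index tells the consumer what is new, what changed, and what was removed, so only two datasets are downloaded here instead of re-crawling everything.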




Indexes, unique IDs and meta-data

  • Hard to emphasize enough!


  • Unique IDs for everything: Datasets, columns, entities, ...


  • Why?
    • Continuity: A small change for a man = giant leap for a
      computer
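
A small sketch of the continuity argument, under assumed names: the column layout, the IDs `col-year` and `col-pop`, the values, and the helper `value_by_column_id` are hypothetical. It shows a reworded title and a reordered table between two releases, which a human barely notices but which breaks any program keyed on titles or positions.

```python
def value_by_column_id(columns, row, column_id):
    """Look up a cell by stable column ID rather than by title or position."""
    positions = {col["id"]: i for i, col in enumerate(columns)}
    return row[positions[column_id]]


# Hypothetical release 1 (values are made up).
columns_v1 = [
    {"id": "col-year", "title": "Year"},
    {"id": "col-pop", "title": "Population"},
]
row_v1 = [2011, 100]

# Hypothetical release 2: title reworded and columns reordered, IDs unchanged.
columns_v2 = [
    {"id": "col-pop", "title": "Population (1 Jan)"},
    {"id": "col-year", "title": "Year"},
]
row_v2 = [110, 2012]

# Lookups by ID keep working across both releases.
pop_v1 = value_by_column_id(columns_v1, row_v1, "col-pop")
pop_v2 = value_by_column_id(columns_v2, row_v2, "col-pop")

# A consumer matching on the old title would no longer find the column.
titles_v2 = [col["title"] for col in columns_v2]
title_lookup_still_works = "Population" in titles_v2

print(pop_v1, pop_v2, title_lookup_still_works)  # 100 110 False
```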




Indexes, unique IDs and meta-data

  • Any relevant contextual information
   • URL(s), descriptions, methodology, next updated, authors,
     keywords, units, license information, ...
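
As one possible shape for such a record, here is a hedged sketch that combines the "must" fields (unique ID, title, last updated) with the contextual fields listed above. The class name `DatasetMetadata`, the field names, and the sample values are assumptions for illustration; the talk does not prescribe a schema.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class DatasetMetadata:
    # "Must" fields.
    id: str
    title: str
    last_updated: str
    # "Should" fields: any relevant contextual information.
    urls: list = field(default_factory=list)
    description: str = ""
    methodology: str = ""
    next_update: str = ""
    authors: list = field(default_factory=list)
    keywords: list = field(default_factory=list)
    units: str = ""
    license: str = ""


# Made-up sample record, for illustration only.
record = DatasetMetadata(
    id="ds-001",
    title="Dataset A",
    last_updated="2012-10-01",
    urls=["http://example.org/datasets/ds-001"],
    description="Example description of what the dataset contains.",
    next_update="2013-01-01",
    authors=["Example Agency"],
    keywords=["example", "demo"],
    units="persons",
    license="Example license text",
)

# Publish the record in a simple, machine-readable format.
as_json = json.dumps(asdict(record), indent=2, sort_keys=True)
print(as_json)
```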




FAQs and feedback channels

   #1 reason for not publishing data:

   “There are errors in the data and I don't want others to discover them”

FAQs and feedback channels

   #1 reason for not publishing data:

   “There are errors in the data and I do want others to discover them”

Publishing for Computers


1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
Find and understand data

Hjalmar Gislason, founder & CEO

Twitter: @datamarket · Facebook: DataMarket · E-mail: hg@datamarket.com
