SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
F I N D A N D U N D E R S TA N D D ATA




                  Best Practices for

       Publishing Data


Hjalmar Gislason, founder & CEO - hg@datamarket.com   October, 2012
Hjalmar
                Gislason
                Founder and CEO




Twitter: @datamarket
Slides: http://blog.datamarket.com/
Heavy
Data Consumers

    Providers of

 Data Delivery
  Technology
Computers                                                    Humans




    |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Computers                                                      Humans

• Structure                                                          • Understand and
                                                                       use




      |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Computers                                                      Humans

• Structure                                                          • Understand and
                                                                       use




      |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Publishing for Computers


1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
Simple Formats




"Don't anthropomorphize computers
           - they hate it."
                     - Unknown
Simple Formats
Simple Formats:
Tim Berners-Lee’s Five Stars




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Simple Formats:
You lost me at “Semantics”




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Standards will emerge and there will
be more and more of them



                     • RDF
                     • OData vs. GData
                     • DSPL
                     • SDMX




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique ids and meta-data




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique ids and meta-data




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique ids and meta-data




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique IDs and meta-data

  • Must: Unique ID, Title, Last updated
  • Should: Meta-data


  • Why?
   • No need for scraping
       • Less load on your end
   • Ensures full coverage
   • Ensures content removal and updates




        |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique IDs and meta-data

  • Hard to emphasize enough!


  • Unique IDs for everything: Datsets, columns, entities, ...


  • Why?
    • Continuity: A small change for a man = giant leap for a
      computer




        |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Indexes, unique IDs and meta-data

  • Any relevant contextual information
   • URL(s), descriptions, methodology, next updated, authors,
     keywords, units, license information, ...




        |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels

   #1 reason for not publishing data:




   “There are errors in the data and I don't
       want others to discover them”




       |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels

   #1 reason for not publishing data:




      “There are errors in the data and I do
         want others to discover them”




       |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
FAQs and feedback channels




     |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Publishing for Computers


1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
Computers                                                         Humans

• Structure                                                             • Understand
                                                                          and use




      |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
Publishing for Humans


1. Search / Discovery
2. Visualization
3. Download
Search / Discovery

  • Requirements differ from web/text search
   • A lot less textual content to base on
     • Synonyms, dictionaries, autocomplete
   • But (hopefully) good meta-data = facets and filtering


  • Give people ways to browse
   • Categories vs. tags vs. search
   • Serendipity: Random, related, interesting...
Search / Discovery
Visualize
109 columns
     x
  340 lines
     =
37.060 cells
Visualize

  • What you should offer depends on the data


  • Statistical data
    • Focus on the most common charts and get them right
    • Do NOT invent new visualizations or chart types


  • Use standards compatible technologies
    • No Flash!
    • Charting and visualization libraries
Visualize
Visualize
Download

  • Make it easy to use your data outside your tools
   • Play nicely with those providing functionality beyond what
     you can offer: Tableau, R, SAS, MathLab, Mathematica,
     SPSS, ...




  • Provide downloads in the formats most commonly used by your
    users:
   • Raw data: Excel, CSV, feeds (R, Excel live feeds, APIs)
   • Charts and visualizations: Bitmap, vector, PPT, embeds?
Computers                                                       Humans

• Structure                                                           • Understand and use
 • Simple formats                                                         • Search / Discovery
 • Indexes, unique IDs and                                                • Visualization
   meta-data                                                              • Download
 • FAQs and feedback
   channels




       |   B EST PR ACT ICE S fo r PUBL IS HI NG D ATA   |   Hjalmar Gislason, hg@datamarket.com   |   October 2012
F I N D A N D U N D E R S TA N D D ATA



              Hjalmar Gislason, founder & CEO



Twitter: @datamarket · Facebook: DataMarket · E-mail: hg@datamarket.com

Contenu connexe

Similaire à Strata NY: Best Practices for Publishing Data

Data Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceData Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceDATAVERSITY
 
The Business Value of Big Data
The Business Value of Big DataThe Business Value of Big Data
The Business Value of Big DataClark Boyd
 
Data Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeData Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeDATAVERSITY
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?DATAVERSITY
 
intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...jybufgofasfbkpoovh
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...DATAVERSITY
 
01-Introduction.pdf
01-Introduction.pdf01-Introduction.pdf
01-Introduction.pdfngVnThng12
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016StampedeCon
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceDATAVERSITY
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data IntegrationDATAVERSITY
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata ManagementDATAVERSITY
 
Changing nature of data and its implications on analytics
Changing nature of data and its implications on analyticsChanging nature of data and its implications on analytics
Changing nature of data and its implications on analyticsAshnikbiz
 
Big data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturersBig data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturersJanet Dorenkott
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Sujit Ghosh
 
Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of DataMatt Turner
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling TechniquesDATAVERSITY
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3varshakumar21
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 

Similaire à Strata NY: Best Practices for Publishing Data (20)

Data Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceData Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-Service
 
The Business Value of Big Data
The Business Value of Big DataThe Business Value of Big Data
The Business Value of Big Data
 
Data Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeData Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s Home
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
 
intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
 
01-Introduction.pdf
01-Introduction.pdf01-Introduction.pdf
01-Introduction.pdf
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business Intelligence
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data Integration
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
 
Changing nature of data and its implications on analytics
Changing nature of data and its implications on analyticsChanging nature of data and its implications on analytics
Changing nature of data and its implications on analytics
 
Big data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturersBig data why big data is huge for CPG manufacturers
Big data why big data is huge for CPG manufacturers
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013
 
Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of Data
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
 
Digital Economics
Digital EconomicsDigital Economics
Digital Economics
 
Big_Data.pptx
Big_Data.pptxBig_Data.pptx
Big_Data.pptx
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 

Plus de Hjalmar Gislason

Icelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurshipIcelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurshipHjalmar Gislason
 
Níu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpunNíu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpunHjalmar Gislason
 
What does a random place on Earth look like?
What does a random place on Earth look like?What does a random place on Earth look like?
What does a random place on Earth look like?Hjalmar Gislason
 
Eruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve SystemEruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve SystemHjalmar Gislason
 
Data Visualizations and Storytelling
Data Visualizations and StorytellingData Visualizations and Storytelling
Data Visualizations and StorytellingHjalmar Gislason
 
ICIJ Conference April 2012
ICIJ Conference April 2012ICIJ Conference April 2012
ICIJ Conference April 2012Hjalmar Gislason
 
Data Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with dataData Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with dataHjalmar Gislason
 
9 things nobody told me about the start-up business
9 things nobody told me about the start-up business9 things nobody told me about the start-up business
9 things nobody told me about the start-up businessHjalmar Gislason
 
Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)Hjalmar Gislason
 
Data visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with dataData visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with dataHjalmar Gislason
 
DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011Hjalmar Gislason
 
DataMarket at Nordic Techpolitics
DataMarket at Nordic TechpoliticsDataMarket at Nordic Techpolitics
DataMarket at Nordic TechpoliticsHjalmar Gislason
 
DataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in BergenDataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in BergenHjalmar Gislason
 
DataMarket - Iceland (english)
DataMarket - Iceland (english)DataMarket - Iceland (english)
DataMarket - Iceland (english)Hjalmar Gislason
 
DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009Hjalmar Gislason
 
DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010Hjalmar Gislason
 
Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010Hjalmar Gislason
 

Plus de Hjalmar Gislason (20)

Icelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurshipIcelandic environment for innovation and entrepreneurship
Icelandic environment for innovation and entrepreneurship
 
Níu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpunNíu atriði sem enginn sagði mér um nýsköpun
Níu atriði sem enginn sagði mér um nýsköpun
 
What does a random place on Earth look like?
What does a random place on Earth look like?What does a random place on Earth look like?
What does a random place on Earth look like?
 
Unified Intelligence
Unified IntelligenceUnified Intelligence
Unified Intelligence
 
Eruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve SystemEruptions, Open Data and the Earth's Nerve System
Eruptions, Open Data and the Earth's Nerve System
 
Data Visualizations and Storytelling
Data Visualizations and StorytellingData Visualizations and Storytelling
Data Visualizations and Storytelling
 
ICIJ Conference April 2012
ICIJ Conference April 2012ICIJ Conference April 2012
ICIJ Conference April 2012
 
Data Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with dataData Visualization: Where (normal) people fall in love with data
Data Visualization: Where (normal) people fall in love with data
 
9 things nobody told me about the start-up business
9 things nobody told me about the start-up business9 things nobody told me about the start-up business
9 things nobody told me about the start-up business
 
Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)Effective Data Visualization - Strata (Feb 2012)
Effective Data Visualization - Strata (Feb 2012)
 
Data visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with dataData visualizition - where normal people fall in love with data
Data visualizition - where normal people fall in love with data
 
DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011DataMarket á Haustráðstefnu Skýrr 2011
DataMarket á Haustráðstefnu Skýrr 2011
 
DataMarket at Nordic Techpolitics
DataMarket at Nordic TechpoliticsDataMarket at Nordic Techpolitics
DataMarket at Nordic Techpolitics
 
DataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in BergenDataMarket at Media 3.0 in Bergen
DataMarket at Media 3.0 in Bergen
 
The Business of Open Data
The Business of Open DataThe Business of Open Data
The Business of Open Data
 
DataMarket - Iceland (english)
DataMarket - Iceland (english)DataMarket - Iceland (english)
DataMarket - Iceland (english)
 
Dokkan sept-2010
Dokkan sept-2010Dokkan sept-2010
Dokkan sept-2010
 
DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009DataMarket í Silfri Egils 26. september 2009
DataMarket í Silfri Egils 26. september 2009
 
DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010DataMarket: Haustráðstefna Skýrr, sept 2010
DataMarket: Haustráðstefna Skýrr, sept 2010
 
Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010Landsins gögn og nauðsynjar - HR 9. apríl 2010
Landsins gögn og nauðsynjar - HR 9. apríl 2010
 

Dernier

2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis UsageNeil Kimberley
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?Olivia Kresic
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy Verified Accounts
 
India Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportIndia Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportMintel Group
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCRashishs7044
 
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptxContemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptxMarkAnthonyAurellano
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africaictsugar
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfrichard876048
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchirictsugar
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCRashishs7044
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03DallasHaselhorst
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMintel Group
 
Future Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionFuture Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionMintel Group
 
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / NcrCall Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncrdollysharma2066
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Pereraictsugar
 
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...ShrutiBose4
 

Dernier (20)

2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail Accounts
 
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCREnjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
 
India Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportIndia Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample Report
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR
 
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptxContemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africa
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdf
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchir
 
Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 Edition
 
Future Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionFuture Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted Version
 
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / NcrCall Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Perera
 
Corporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information TechnologyCorporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information Technology
 
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
 

Strata NY: Best Practices for Publishing Data

  • 1. F I N D A N D U N D E R S TA N D D ATA Best Practices for Publishing Data Hjalmar Gislason, founder & CEO - hg@datamarket.com October, 2012
  • 2. Hjalmar Gislason Founder and CEO Twitter: @datamarket Slides: http://blog.datamarket.com/
  • 3.
  • 4.
  • 5. Heavy Data Consumers Providers of Data Delivery Technology
  • 6. Computers Humans | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 7. Computers Humans • Structure • Understand and use | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 8. Computers Humans • Structure • Understand and use | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 9. Publishing for Computers 1. Simple formats 2. Indexes, unique IDs and meta-data 3. FAQs and feedback channels
  • 10. Simple Formats "Don't anthropomorphize computers - they hate it." - Unknown
  • 12. Simple Formats: Tim Berners-Lee’s Five Stars | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 13. Simple Formats: You lost me at “Semantics” | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 14. Standards will emerge and there will be more and more of them • RDF • OData vs. GData • DSPL • SDMX | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 15. Indexes, unique ids and meta-data | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 16. Indexes, unique ids and meta-data | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 17. Indexes, unique ids and meta-data | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 18. Indexes, unique IDs and meta-data • Must: Unique ID, Title, Last updated • Should: Meta-data • Why? • No need for scraping • Less load on your end • Ensures full coverage • Ensures content removal and updates | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 19. Indexes, unique IDs and meta-data • Hard to emphasize enough! • Unique IDs for everything: Datsets, columns, entities, ... • Why? • Continuity: A small change for a man = giant leap for a computer | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 20. Indexes, unique IDs and meta-data • Any relevant contextual information • URL(s), descriptions, methodology, next updated, authors, keywords, units, license information, ... | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 21. FAQs and feedback channels #1 reason for not publishing data: “There are errors in the data and I don't want others to discover them” | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 22. FAQs and feedback channels #1 reason for not publishing data: “There are errors in the data and I do want others to discover them” | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 23. FAQs and feedback channels | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 24. FAQs and feedback channels | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 25. Publishing for Computers 1. Simple formats 2. Indexes, unique IDs and meta-data 3. FAQs and feedback channels
  • 26. Computers Humans • Structure • Understand and use | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 27. Publishing for Humans 1. Search / Discovery 2. Visualization 3. Download
  • 28. Search / Discovery • Requirements differ from web/text search • A lot less textual content to base on • Synonyms, dictionaries, autocomplete • But (hopefully) good meta-data = facets and filtering • Give people ways to browse • Categories vs. tags vs. search • Serendipity: Random, related, interesting...
  • 31.
  • 32. 109 columns x 340 lines = 37.060 cells
  • 33.
  • 34.
  • 35.
  • 36. Visualize • What you should offer depends on the data • Statistical data • Focus on the most common charts and get them right • Do NOT invent new visualizations or chart types • Use standards compatible technologies • No Flash! • Charting and visualization libraries
  • 39. Download • Make it easy to use your data outside your tools • Play nicely with those providing functionality beyond what you can offer: Tableau, R, SAS, MathLab, Mathematica, SPSS, ... • Provide downloads in the formats most commonly used by your users: • Raw data: Excel, CSV, feeds (R, Excel live feeds, APIs) • Charts and visualizations: Bitmap, vector, PPT, embeds?
  • 40. Computers Humans • Structure • Understand and use • Simple formats • Search / Discovery • Indexes, unique IDs and • Visualization meta-data • Download • FAQs and feedback channels | B EST PR ACT ICE S fo r PUBL IS HI NG D ATA | Hjalmar Gislason, hg@datamarket.com | October 2012
  • 41. F I N D A N D U N D E R S TA N D D ATA Hjalmar Gislason, founder & CEO Twitter: @datamarket · Facebook: DataMarket · E-mail: hg@datamarket.com