6. Computers and Humans
Computers:
• Structure
Humans:
• Search
• Visualization
• Download
| BEST PRACTICES for PUBLISHING DATA | Hjalmar Gislason, hg@datamarket.com | October 2012
8. Publishing for Computers
1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
11. Simple formats:
Tim Berners-Lee’s Five Stars
12. Simple formats:
You lost me at “Semantics”
13. Standards will emerge and there will
be more and more of them
• RDF
• OData vs. GData
• DSPL
14. Indexes, unique IDs and meta-data
15. Indexes, unique IDs and meta-data
• Must: Unique ID, Title, Last updated
• Should: Meta-data
• Why?
• No need for scraping
• Less load on your end
• Ensures full coverage
• Ensures content removal and updates
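A rough sketch of how a consumer might use such an index instead of scraping. The field names (`id`, `title`, `last_updated`), the sample entries, and the function `plan_sync` are all hypothetical, chosen only to mirror the "must" fields above; no particular publisher's API is implied.

```python
from datetime import datetime


def plan_sync(index, local):
    """Compare a publisher's index with a local cache.

    index: list of dicts with the assumed keys "id", "title", "last_updated"
           (ISO 8601 date strings).
    local: dict mapping dataset ID -> last_updated string seen at the last sync.
    Returns (to_fetch, to_remove) as sorted lists of dataset IDs.
    """
    published = {entry["id"]: entry["last_updated"] for entry in index}
    # Fetch datasets that are new, or newer than the cached copy.
    to_fetch = sorted(
        ds_id
        for ds_id, updated in published.items()
        if ds_id not in local
        or datetime.fromisoformat(updated) > datetime.fromisoformat(local[ds_id])
    )
    # Anything cached but no longer listed has been withdrawn by the publisher.
    to_remove = sorted(ds_id for ds_id in local if ds_id not in published)
    return to_fetch, to_remove


# Made-up index entries and cache state, for illustration only.
index = [
    {"id": "ds-001", "title": "Population by region", "last_updated": "2012-10-01"},
    {"id": "ds-002", "title": "Fish catch by species", "last_updated": "2012-09-15"},
    {"id": "ds-004", "title": "Energy use", "last_updated": "2012-10-05"},
]
local = {"ds-001": "2012-10-01", "ds-002": "2012-08-01", "ds-003": "2012-01-01"}

to_fetch, to_remove = plan_sync(index, local)
print(to_fetch)   # ['ds-002', 'ds-004']
print(to_remove)  # ['ds-003']
```

One small index request replaces a crawl of every page: unchanged datasets are skipped (less load), new ones are found (full coverage), and withdrawn ones are dropped (content removal).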
16. Indexes, unique IDs and meta-data
• Hard to emphasize enough!
• Unique IDs for everything: Datasets, columns, entities, ...
• Why?
• Continuity: A small change for a man = giant leap for a
computer
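The continuity point can be illustrated with a small sketch. The column records, the `col-pop` identifier, and the helper `find_column` are invented for this example; the assumption is simply that a publisher rewords a column title between two releases while keeping its ID stable.

```python
def find_column(columns, *, col_id=None, title=None):
    """Return the first column matching the given ID or title, else None."""
    for col in columns:
        if col_id is not None and col["id"] == col_id:
            return col
        if title is not None and col["title"] == title:
            return col
    return None


# Hypothetical column descriptions from two releases of the same dataset.
release_1 = [{"id": "col-pop", "title": "Population"}]
release_2 = [{"id": "col-pop", "title": "Population (total)"}]  # title reworded

# A consumer that remembered the title from release 1 loses the column ...
by_title = find_column(release_2, title="Population")
# ... while a consumer that remembered the ID still finds it.
by_id = find_column(release_2, col_id="col-pop")

print(by_title)        # None
print(by_id["title"])  # Population (total)
```

A human reader would recognise the reworded title at once; a program matching on the title cannot, which is why a stable ID matters.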
17. Indexes, unique IDs and meta-data
• Any relevant contextual information
• URL(s), descriptions, methodology, next updated, authors,
keywords, units, license information, ...
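As one possible shape for such a record, here is a minimal Python sketch. The class name `DatasetMetadata`, its field names, and every sample value (including the example.org URL) are assumptions made for illustration; the talk does not prescribe a schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List, Optional


@dataclass
class DatasetMetadata:
    # "Must" fields
    id: str
    title: str
    last_updated: str
    # "Should" fields: any relevant contextual information
    urls: List[str] = field(default_factory=list)
    description: Optional[str] = None
    methodology: Optional[str] = None
    next_update: Optional[str] = None
    authors: List[str] = field(default_factory=list)
    keywords: List[str] = field(default_factory=list)
    units: Optional[str] = None
    license: Optional[str] = None


# Placeholder values only; not taken from any real dataset.
record = DatasetMetadata(
    id="ds-001",
    title="Population by region",
    last_updated="2012-10-01",
    urls=["https://example.org/data/ds-001.csv"],
    keywords=["population", "regions"],
    units="persons",
    license="CC-BY",
)

# Serialize to JSON so the record can be published alongside the data.
as_json = json.dumps(asdict(record), indent=2)
print(as_json)
```

Fields the publisher cannot fill are left as `null` in the output rather than omitted, so consumers can tell "unknown" apart from "field not supported".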
18. FAQs and feedback channels
#1 reason for not publishing data:
“There are errors in the data and I don't
want others to discover them”
19. FAQs and feedback channels
#1 reason for publishing data:
“There are errors in the data and I do
want others to discover them”
20. FAQs and feedback channels
21. FAQs and feedback channels
22. Publishing for Computers
1. Simple formats
2. Indexes, unique IDs and meta-data
3. FAQs and feedback channels
23. Computers and Humans
Computers:
• Structure
Humans:
• Search
• Visualization
• Download
24. FIND AND UNDERSTAND DATA
Hjalmar Gislason, founder & CEO
Twitter: @datamarket · Facebook: DataMarket · E-mail: hg@datamarket.com