(Linked) Data Marketplaces

     Marin Dimitrov (Ontotext)



          v0.6 / Mar 2011
Contents

• Introduction
• Data Marketplaces
  – Factual, InfoChimps, Azure DataMarket, Freebase, Socrata,
    Kasabi
  – DataMarket, Timetric, xIgnite
• Data Marketplaces for Linked Data




                     (Linked) Data Marketplaces    Jan 2011     #2
INTRODUCTION




Definitions

• Data-as-a-Service (DaaS)
   – “Like all members of the "as a Service" (XaaS) family, DaaS is based on
     the concept that the product, data in this case, can be provided on
     demand to the user regardless of geographic or organizational
     separation of provider and consumer. Additionally, the emergence of
     service-oriented architecture (SOA) has rendered the actual platform
     on which the data resides also irrelevant” (Wikipedia)

• Data Marketplaces
   – “Services that make it easy to find data from a range of secondary
     data sources, then consume the data in a usable and unified format.
     Several of these services are trying to create marketplaces for data,
     envisioning that data providers can offer their data sets for sale to
     data seekers” (DataMarket.com)


Data Marketplaces properties

• Proposed classification by Bauereiss & Fensel
   1.   Data domain
   2.   Population of content
   3.   Community management
   4.   Operating party
   5.   Pricing models
   6.   Data exchange
• Some additional differentiating characteristics
   – Data model, Data size, Data export
   – Branded marketplaces, SLA
   – Query languages, Data tools

DATA MARKETPLACES




Factual

• www.factual.com / @factual




Factual (2)

• Data domain
  – Travel, finance, sports, autos, movies, music, TV, books,
    health, food, politics, education, science, arts, …
  – High quality local data
     • USA, Germany, France, Italy, UK, Japan, Switzerland, Australia, …
     • Used by Facebook Places

• Data population
  – Crawling the web
  – Public data sources
  – Community contributions
     • Upload XLS/ODS, CSV

Factual (3)

• Data model
   – Tabular
   – Taxonomy of 400 categories
      • 13 Level 1 categories: Arts, Automotive, Business, Government, …

• Data size – 500,000 datasets
• Company info
   – Factual Inc. (USA)
   – $27M VC funding so far




Factual (4)

• Monetization model
   – Pricing model not finalised yet (currently free)
   – Pay-per-use pricing (per API call) with subscriptions
      • Companies that contribute data will have a fee reduction

• Data access options
   – REST API
      • Read from table, Add/Write to table, Get schema info
   – Web applications
      • Read/write raw data from a web page (JavaScript)
      • Web widgets for visualising, filtering and sorting data


Factual (5)

• Data tools
   – AutoClipper – find tables on the web
   – PageClipper – extract tabular data from a web page
   – FactClipper – find individual facts (query templates)




InfoChimps

• www.infochimps.com / @infochimps




InfoChimps (2)

• Data domain
   – All purpose
      • Including data from Freebase, Wikipedia infoboxes, CKAN, Twitter,
        Data.gov, Data.gov.uk, GeoNames, …

• Data population
   – Public datasets
   – User submitted datasets
• Data model is dataset specific
• 10,000+ datasets organised in 13 collections


InfoChimps (3)

• Company info
  – InfoChimps (USA)
  – $1.6M VC funding so far
  – Acquired DataMarketplace in 12/2010
• Monetization model
  – Charge data sellers
     • Data sellers choose the price & licensing of their data
     • Charge for data storage
     • 30% commission for InfoChimps on each sale



InfoChimps (4)

• Monetization model (2)
   – Charge data buyers
      • Baboon – free, 100K API calls / mo
      • Brass Monkey – $20/mo, 500K API calls / mo
      • Silverback – $250/mo, 2M API calls / mo
      • Golden Ape – $4,000/mo, 15M API calls / mo

• Data access options
   – REST API
      • api.infochimps.com/DATASET/METHOD.json?PARAM=VALUE
   – YQL tables
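
The URL pattern above can be filled in mechanically. A minimal sketch, assuming a hypothetical dataset path (`example/dataset`), method (`lookup`) and parameter (`q`); none of these are confirmed InfoChimps identifiers, the `http` scheme is an assumption, and no network call is made:

```python
from urllib.parse import urlencode

# Host taken from the slide; the scheme is an assumption.
BASE = "http://api.infochimps.com"


def build_url(dataset: str, method: str, **params: str) -> str:
    """Fill in the DATASET/METHOD.json?PARAM=VALUE pattern."""
    query = urlencode(sorted(params.items()))
    url = f"{BASE}/{dataset.strip('/')}/{method}.json"
    return f"{url}?{query}" if query else url


# Hypothetical dataset, method and parameter names, for illustration only.
print(build_url("example/dataset", "lookup", q="linked data"))
```

Sorting the parameters keeps the generated URL stable, which can help when calls are counted or cached against a monthly quota.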

Azure DataMarket

• https://datamarket.azure.com




Azure DataMarket (2)

• Data domain
   – All purpose, incl. Data.gov, UN data, Wolfram|Alpha, ESRI
• Data population
   – Data publishers (need prior approval)
      • Data can be stored on SQL Azure, Azure Storage or 3rd party clouds
        (via Data Access Layers)

• Data model
   – Depends on the dataset and the storage, but always
     presented as OData to consumers
• Data size – 90 datasets
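
Because every dataset is presented as OData, a consumer can query any of them with the same URL conventions. A minimal sketch, assuming a hypothetical service root and entity set (`https://example.org/odata/ExampleDataset`, `Cities`); only the standard OData `$filter` and `$top` query options are used, and nothing is sent over the network:

```python
from urllib.parse import quote

# Hypothetical service root, not an actual Azure DataMarket URL.
SERVICE_ROOT = "https://example.org/odata/ExampleDataset"


def odata_url(entity_set: str, filter_expr: str = "", top: int = 0) -> str:
    """Build an OData query URL using the $filter and $top query options."""
    options = []
    if filter_expr:
        # Percent-encode spaces but keep the quotes around string literals.
        options.append("$filter=" + quote(filter_expr, safe="'"))
    if top > 0:
        options.append(f"$top={top}")
    url = f"{SERVICE_ROOT}/{entity_set}"
    return url + ("?" + "&".join(options) if options else "")


# Hypothetical entity set and property names.
print(odata_url("Cities", "Country eq 'Iceland'", top=5))
```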

Azure DataMarket (3)




[Figure: (c) Microsoft]
Azure DataMarket (4)

• Company info
   – Microsoft
• Monetization model
   – Subscription for data buyers (limited/unlimited API calls)
• Access options
   – OData (feeds, queries, updates)
• Data tools
   – Service Explorer
   – Excel add-in (find, purchase, consume data)
   – Integration with SQL Server Reporting Services /
     Integration Services

DataMarket

• www.datamarket.com / @datamarket




DataMarket (2)

• Data domain
   – Statistical data from 2,000 providers, incl. UN, Eurostat,
     World Bank, US agencies, BP, FIFA, …
• Data population
   – Data aggregation (2,000 data providers)
• Data size
   – 13K datasets, 100M time series, 600M facts
• Company info
   – DataMarket (Iceland)


DataMarket (3)

• Monetization model
  – Charge data sellers
     • Free datasets – $249/mo; Paid datasets – 25% commission;
       Branded datasets – $699/mo + commission
  – Charge data buyers
     • Free – 50 API calls/mo; $99 – 500 API calls/mo; $299 – 10K API
       calls/mo; $799 – 100K API calls/mo

• Data access
  – REST API
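
The buyer tiers above amount to a simple lookup by call volume. A minimal sketch, assuming the listed prices are monthly and in USD (the slide does not state the billing period); the function name `cheapest_plan` is illustrative:

```python
from typing import Optional, Tuple

# (price in USD, API calls included per month), as listed on the slide.
# Treating the price as a monthly fee is an assumption.
BUYER_PLANS = [(0, 50), (99, 500), (299, 10_000), (799, 100_000)]


def cheapest_plan(calls_per_month: int) -> Optional[Tuple[int, int]]:
    """Return the cheapest (price, quota) tier covering the volume, or None."""
    for price, quota in sorted(BUYER_PLANS):
        if calls_per_month <= quota:
            return price, quota
    return None  # volume exceeds the largest listed tier


print(cheapest_plan(2_000))
```

The jump from 500 to 10K calls means that a buyer needing 2,000 calls per month already lands in the $299 tier.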



Socrata

• www.socrata.com / @socrata




Socrata (2)

• Data domain
   – Business, education, government data
• Data population
   – Uploads from data publishers
• Data size
   – 13K datasets
• Data model
   – Tabular



Socrata (3)

• Company info
  – Socrata (USA)
• Monetization model
  – Charge data buyers (“Plans starting at $499 per month”)
     • Basic – 100K API calls/mo + 50GB traffic; Plus – 250K API calls/mo
       + 250GB traffic; Premium – 1M API calls/mo + 1.2TB traffic;
       Ultimate – 10M API calls/mo + 5TB traffic

• Data access
  – REST API (Socrata Open Data API)
  – Data export (XLS, CSV, RDF, XML)
  – RSS updates

Kasabi

• www.kasabi.com / @TeamKasabi




Kasabi (2)

• Data domain
   – All purpose, incl. DBpedia, GeoNames, BBC Linked Data, …
• Data population
   – Public datasets
   – User submitted datasets
• Data size
   – 55 datasets
• Data model
   – RDF


Kasabi (3)

• Company info
  – Talis (UK)
• Monetization model
  – Charge data consumers
  – Data hosting is free
• Data access
  –   SPARQL / Linked Data endpoint
  –   REST API
  –   Additional APIs
  –   PHP & Ruby client libraries
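
A SPARQL endpoint is queried over plain HTTP, which is part of what makes RDF marketplaces easy to consume from any language. A minimal sketch of preparing such a request with the standard SPARQL Protocol `query` parameter; the endpoint URL is hypothetical (not a Kasabi URL), authentication such as API keys is not covered, and the request is built but never sent:

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical endpoint, for illustration only.
ENDPOINT = "https://example.org/sparql"

QUERY = """
SELECT ?s ?label WHERE {
  ?s <http://www.w3.org/2000/01/rdf-schema#label> ?label .
} LIMIT 10
""".strip()


def sparql_request(endpoint: str, query: str) -> Request:
    """Prepare (but do not send) a SPARQL Protocol GET request."""
    url = endpoint + "?" + urlencode({"query": query})
    # Ask for the standard JSON serialisation of SELECT results.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


req = sparql_request(ENDPOINT, QUERY)
print(req.get_method(), req.full_url[:60])
```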

Freebase

• www.freebase.com / @fbase




Freebase (2)

• Data domain
   – General purpose
• Data model
   – Graph (RDF dumps available)
• Data population
   – Community curated data (licensed as CC-BY)
   – Import of public data sources (Wikipedia, MusicBrainz,
     WordNet, LoC, …)
• Data size
   – 20M entities

Freebase (3)

• Company info
  – Metaweb (USA), now Google
• Monetization model
  – Free for 100K read API calls per day (10K write)
  – Paid for higher volumes
• Data access
  –   REST API
  –   Linked Data endpoint (http://rdf.freebase.com)
  –   Triple uploader / RDF dumps
  –   Acre (application hosting platform)

Freebase (4)

• Data tools
   – Web based – schema editor, review queue, viewers, …
   – GridWorks (Google Refine)
      • Exploring, data cleaning, transformation of tabular data
      • Map data to Freebase schema & RDF export (3rd party extension)
   – Acre
      • Application hosting platform
            – User contributed JavaScript code (executed server-side with Rhino)
      • Access & store data directly into Freebase




timetric

• www.timetric.com / @timetric




timetric (2)

• Data domain
   – Economic data
• Data population
   – Aggregated data from the world's leading sources of
     economic data (World Bank, Eurostat, …)
   – User uploaded data
• Data size
   – 2.5M public statistics




timetric (3)

• Company info
  – Timetric Ltd. (UK)
• Monetization model
  – Free public datasets
  – Paid exclusive datasets
• Data access
  – REST API




xIgnite

• www.xignite.com




xIgnite (2)

• Data domain
  – Financial data
• Data population
  – Aggregated data from leading sources (Dow Jones, Thomson
    Reuters, stock exchanges, …)
  – Public datasets (national banks, SEC, Federal Reserve, …)
  – User uploaded data
• Company info
  – Xignite (USA)


xIgnite (3)

• Monetization model
  – Paid subscriptions
• Data access
  – Web services (REST/SOAP)




Coming soon…

• BuzzData
  – www.buzzdata.com / @buzzdata
  – Company: BuzzData




Data marketplaces – features summary

• Data
  – Data model, domain, export options
• Monetization
  – Charge buyers / sellers
  – Free API calls
  – Branded marketplaces & Service Level Agreements
• For developers
  – REST API; query language
  – Tools for data management / integration
  – Application hosting

Feature matrix

                          Factual  InfoChimps  Azure DataMarket  DataMarket  Socrata  Kasabi  Freebase  timetric  xIgnite
DATA
  Data from all domains   +        +           +                 -           +        +       +         -         -
  Data model              tabular  various     various           ?           tabular  RDF     graph     ?         ?
  Data export             -        -           +                 -           +        ?       +         -         -
  RDF export              -        -           -                 -           +        +       +         -         -
MONETIZATION
  Charge buyers           +        +/-         +                 +/-         +        +       +/-       +/-       +
  Charge sellers          ?        +           -                 +           -        ?       -         ?         ?
  Free API calls (month)  ?        100K        ?                 50          -        ?       3M        ?         -
  Branded marketplaces    -        -           +                 +           +        ?       -         -         -
  Service Level guarantee ?        -           -                 -           -        ?       -         -         -
TOOLS
  REST API                +        +           +                 +           +        +       +         +         +
  Query language          +        -           +                 -           -        +       +         -         -
  Tools                   +        -           +                 -           -        +       +         -         -
  App hosting             -        -           +                 -           -        ?       +         -         -

LINKED DATA + MARKETPLACES




Linked Data cloud (Sep 2010)




[Figure: Linked Data cloud diagram, (c) R. Cyganiak and A. Jentzsch]
Benefits of Linked Data for Data Marketplaces

• Unified data representation model (RDF)
   – Easy consumption of the data
• Global identifiers for all objects (URI)
   – Makes incremental data integration & federation easier
• Interlinked datasets
   – New data added to the marketplace can be integrated
     with existing data
   – Network effects
• Data marketplace interoperability
   – Data from different marketplaces can be easily integrated
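
The integration benefit of global identifiers can be shown with plain triples. A minimal sketch, assuming two hypothetical datasets that describe the same resource under one shared URI (all URIs and values below are invented placeholders); merging is then just a set union, with no key mapping step:

```python
from collections import defaultdict

# Hypothetical triples from two datasets that share one global identifier.
CITY = "http://example.org/id/city-1"
dataset_a = {(CITY, "http://example.org/vocab/country", "ExampleLand")}
dataset_b = {(CITY, "http://example.org/vocab/population", "12345")}


def merge(*datasets):
    """Merge triple sets and group the facts by subject URI."""
    merged = set().union(*datasets)
    by_subject = defaultdict(dict)
    for s, p, o in merged:
        # One value per predicate, for brevity; RDF allows several.
        by_subject[s][p] = o
    return dict(by_subject)


print(merge(dataset_a, dataset_b)[CITY])
```

With tabular sources the same merge would first need agreement on which column identifies the city and how its values are spelled.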

Benefits of Linked Data for Data Marketplaces (2)

• Derived knowledge / facts
   – RDF inference of additional implicit facts
   – (see FactForge and LinkedLifeData)
• Rich queries
   – SPARQL offers unmatched query expressivity
• Easy import of existing LOD datasets
   – Linked Open Data cloud already includes 200+ datasets
     with 20+ billion RDF triples
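
Inference of implicit facts can be illustrated with two standard RDFS entailment rules. A minimal sketch, assuming a tiny hypothetical class hierarchy (the `ex:` names are placeholders); it applies rdfs9 (type inheritance) and rdfs11 (subclass transitivity) by naive forward chaining and is not meant to reflect how FactForge or LinkedLifeData compute their inferred statements:

```python
RDF_TYPE = "rdf:type"
SUBCLASS = "rdfs:subClassOf"

# Hypothetical explicit statements.
explicit = {
    ("ex:Sofia", RDF_TYPE, "ex:City"),
    ("ex:City", SUBCLASS, "ex:PopulatedPlace"),
    ("ex:PopulatedPlace", SUBCLASS, "ex:Place"),
}


def infer(triples):
    """Apply rules rdfs9 and rdfs11 until no new triples appear."""
    closure = set(triples)
    while True:
        new = set()
        for s, p, o in closure:
            for s2, p2, o2 in closure:
                if p2 != SUBCLASS or o != s2:
                    continue
                if p == SUBCLASS:      # rdfs11: subClassOf is transitive
                    new.add((s, SUBCLASS, o2))
                elif p == RDF_TYPE:    # rdfs9: instances inherit superclasses
                    new.add((s, RDF_TYPE, o2))
        if new <= closure:
            return closure - set(triples)  # only the inferred statements
        closure |= new


for triple in sorted(infer(explicit)):
    print(triple)
```

Three explicit statements yield three inferred ones here, which gives a feel for how inferred statements can approach the size of the explicit data.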




Linked Data for marketplaces – challenges

• Quality of data
   – Different (public) datasets may come with inconsistent or
     controversial data
   – Quality more important than quantity
• Large scale data integration
   – Ontology (schema) mapping of different datasets &
     vocabularies
• Licensing
   – Some datasets come with “CC-BY-NC” or unclear licensing
• Billing
   – API calls / SPARQL queries with varying computational
     cost

Linked Data for marketplaces – challenges (2)

• Operations
   – Service Level guarantees
   – Availability & scalability challenges
      • Most Linked Data endpoints at present are neither scalable, nor
        available
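
One way to approach the billing challenge is to meter work done rather than calls made. A minimal sketch, assuming a hypothetical cost model (one unit per call, plus units for rows scanned and execution time) and an invented unit price; none of the marketplaces above is known to bill this way:

```python
from dataclasses import dataclass, field


@dataclass
class Meter:
    """Accumulates billable units per API key (hypothetical pricing scheme)."""
    unit_price: float = 0.001  # assumed price per cost unit, in USD
    usage: dict = field(default_factory=dict)

    def record(self, api_key: str, rows_scanned: int, millis: int) -> int:
        # Assumed cost model: one unit per call, plus units for work done.
        units = 1 + rows_scanned // 1000 + millis // 100
        self.usage[api_key] = self.usage.get(api_key, 0) + units
        return units

    def invoice(self, api_key: str) -> float:
        return round(self.usage.get(api_key, 0) * self.unit_price, 6)


meter = Meter()
meter.record("key-1", rows_scanned=500, millis=20)          # cheap lookup
meter.record("key-1", rows_scanned=250_000, millis=4_000)   # heavy SPARQL join
print(meter.usage["key-1"], meter.invoice("key-1"))
```

Under a flat per-call price both requests would cost the same, although the second consumes far more of the endpoint's capacity.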




LinkedLifeData & FactForge




[Figure: Linked Data cloud with FactForge and LinkedLifeData labelled, (c) R. Cyganiak and A. Jentzsch]
LinkedLifeData & FactForge

• FactForge
   –   Integrates some of the most central LOD datasets
   –   General-purpose information (not specific to a domain)
   –   1.2 billion explicit and 1 billion inferred statements
   –   The largest upper-level knowledge base
   –    http://www.FactForge.net
• LinkedLifeData
   – 25 of the most popular life-science datasets
   – 2.7 billion explicit and 1.4 billion inferred statements
   – http://www.LinkedLifeData.com


Strategic questions

• Monetization strategy
  – Which (linked) datasets can be monetized?
  – Charge buyers / charge sellers / free quota
  – Branded marketplaces
• Community building
  – Crowdsource the data curation to the community
  – How to provide incentives to data curators?




Strategic questions (2)

• Operations
  – How to ensure Service Level guarantees?
  – How to deal with licensing issues?
  – Account management, metering, billing
• Platform
  –   RDF database – data volume, query volume
  –   ETL tools
  –   Curation tools
  –   Data export & consumption



Data monetization with WebServius




[Figure: (c) WebServius]


• Benefits
   – user management, quotas & restrictions
   – Metering, pricing, billing
   – Security, scalability, SLAs



Q&A




Questions?
           @ontotext



How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Linked Data Marketplaces

  • 1. (Linked) Data Marketplaces Marin Dimitrov (Ontotext) v0.6 / Mar 2011
  • 2. Contents • Introduction • Data Marketplaces – Factual, InfoChimps, Azure DataMarket, Freebase, Socrata, Kasabi – Data Market, Timetric, xIgnite • Data Marketplaces for Linked Data (Linked) Data Marketplaces Jan 2011 #2
  • 3. INTRODUCTION
  • 4. Definitions • Data-as-a-Service (DaaS) – “Like all members of the "as a Service" (XaaS) family, DaaS is based on the concept that the product, data in this case, can be provided on demand to the user regardless of geographic or organizational separation of provider and consumer. Additionally, the emergence of service-oriented architecture (SOA) has rendered the actual platform on which the data resides also irrelevant” (Wikipedia) • Data Marketplaces – “Services that make it easy to find data from a range of secondary data sources, then consume the data in a usable and unified format. Several of these services are trying to create marketplaces for data, envisioning that data providers can offer their data sets for sale to data seekers” (DataMarket.com)
  • 5. Data Marketplaces properties • Proposed classification by Bauereiss & Fensel 1. Data domain 2. Population of content 3. Community management 4. Operating party 5. Pricing models 6. Data exchange • Some additional differentiating characteristics – Data model, Data size, Data export – Branded marketplaces, SLA – Query languages, Data tools
  • 6. DATA MARKETPLACES
  • 7. Factual • www.factual.com / @factual
  • 8. Factual (2) • Data domain – Travel, finance, sports, autos, movies, music, TV, books, health, food, politics, education, science, arts, … – High quality local data • USA, Germany, France, Italy, UK, Japan, Switzerland, Australia, … • Used by Facebook Places • Data population – Crawling the web – Public data sources – Community contributions • Upload XLS/ODS, CSV
  • 9. Factual (3) • Data model – tabular – Taxonomy of 400 categories • 13 Level 1 categories: Arts, Automotive, Business, Government, … • Data size – 500,000 datasets • Company info – Factual Inc. (USA) – $27M VC funding so far
  • 10. Factual (4) • Monetization model – Pricing model not finalised yet (currently free) – Pay-per-use pricing (per API call) with subscriptions • Companies that contribute data will have a fee reduction • Data access options – REST API • Read from table, Add/Write to table, Get schema info – Web applications • Read/write raw data from a web page (JavaScript) • Web widgets for visualising, filtering and sorting data
  • 11. Factual (5) • Data tools – AutoClipper – find tables on the web – PageClipper – extract tabular data from a web page – FactClipper – find individual facts (query templates)
  • 12. InfoChimps • www.infochimps.com / @infochimps
  • 13. InfoChimps (2) • Data domain – All purpose • Including data from Freebase, Wikipedia infoboxes, CKAN, Twitter, Data.gov, Data.gov.uk, GeoNames, … • Data population – Public datasets – User submitted datasets • Data model is dataset specific • 10,000+ datasets organised in 13 collections
  • 14. InfoChimps (3) • Company info – InfoChimps (USA) – $1.6M VC funding so far – Acquired DataMarketplace in 12/2010 • Monetization model – Charge data sellers • Data sellers choose the price & licensing of their data • Charge for data storage • 30% commission for InfoChimps on each sale
  • 15. InfoChimps (4) • Monetization model (2) – Charge data buyers • Baboon – free, 100K API calls / mo • Brass Monkey – $20/mo, 500K API calls / mo • Silverback – $250/mo, 2M API calls / mo • Golden Ape – $4,000/mo, 15M API calls / mo • Data access options – REST API • api.infochimps.com/DATASET/METHOD.json?PARAM=VALUE – YQL tables
  • 16. Azure DataMarket • https://datamarket.azure.com
  • 17. Azure DataMarket (2) • Data domain – All purpose, incl. Data.gov, UN data, Wolfram|Alpha, ESRI • Data population – Data publishers (need prior approval) • Data can be stored on SQL Azure, Azure Storage or 3rd party clouds (via Data Access Layers) • Data model – Depends on the dataset and the storage, but always presented as OData to consumers • Data size – 90 datasets
  • 18. Azure DataMarket (3) (c) Microsoft
  • 19. Azure DataMarket (4) • Company info – Microsoft • Monetization model – Subscription for data buyers (limited/unlimited API calls) • Access options – OData (feeds, queries, updates) • Data tools – Service Explorer – Excel add-in (find, purchase, consume data) – Integration with SQL Server Reporting Services / Integration Services
  • 20. DataMarket • www.datamarket.com / @datamarket
  • 21. DataMarket (2) • Data domain – Statistical data from 2,000 providers, incl. UN, Eurostat, World Bank, US agencies, BP, FIFA, … • Data population – Data aggregation (2,000 data providers) • Data size – 13K datasets, 100M time series, 600M facts • Company info – DataMarket (Iceland)
  • 22. DataMarket (3) • Monetization model – Charge data sellers • Free datasets – $249/mo; Paid datasets – 25% commission; Branded datasets – $699/mo + commission – Charge data buyers • Free – 50 API calls/mo; $99 – 500 API calls/mo; $299 – 10K API calls/mo; $799 – 100K API calls/mo • Data access – REST API
  • 23. Socrata • www.socrata.com / @socrata
  • 24. Socrata (2) • Data domain – Business, education, government data • Data population – Uploads from data publishers • Data size – 13K datasets • Data model – tabular
  • 25. Socrata (3) • Company info – Socrata (USA) • Monetization model – Charge data buyers (“Plans starting at $499 per month”) • Basic – 100K API calls/mo + 50GB traffic; Plus – 250K API calls/mo + 250GB traffic; Premium – 1M API calls/mo + 1.2TB traffic; Ultimate – 10M API calls/mo + 5TB traffic • Data access – REST API (Socrata Open Data API) – Data export (XLS, CSV, RDF, XML) – RSS updates
  • 26. Kasabi • www.kasabi.com / @TeamKasabi
  • 27. Kasabi (2) • Data domain – All purpose, incl. DBpedia, GeoNames, BBC Linked Data, … • Data population – Public datasets – User submitted datasets • Data size – 55 datasets • Data model – RDF
  • 28. Kasabi (3) • Company info – Talis (UK) • Monetization model – Charge data consumers – Data hosting is free • Data access – SPARQL / Linked Data endpoint – REST API – Additional APIs – PHP & Ruby client libraries
  • 29. Freebase • www.freebase.com / @fbase
  • 30. Freebase (2) • Data domain – General purpose • Data model – Graph (RDF dumps available) • Data population – Community curated data (licensed as CC-BY) – Import of public data sources (Wikipedia, MusicBrainz, WordNet, LoC, …) • Data size – 20M entities
  • 31. Freebase (3) • Company info – Metaweb (USA), now Google • Monetization model – Free for 100K read API calls per day (10K write) – Paid for higher volumes • Data access – REST API – Linked Data endpoint (http://rdf.freebase.com) – Triple uploader / RDF dumps – Acre (application hosting platform)
  • 32. Freebase (4) • Data tools – Web based – schema editor, review queue, viewers, … – GridWorks (Google Refine) • Exploring, data cleaning, transformation of tabular data • Map data to Freebase schema & RDF export (3rd party extension) – Acre • Application hosting platform – User contributed JavaScript code (converted to Java with Rhino) • Access & store data directly into Freebase
  • 33. timetric • www.timetric.com / @timetric
  • 34. timetric (2) • Data domain – Economic data • Data population – aggregate data from the world's leading sources of economic data (World Bank, Eurostat, …) – User uploaded data • Data size – 2.5M public statistics
  • 35. timetric (3) • Company info – Timetric Ltd. (UK) • Monetization model – Free public datasets – Paid exclusive datasets • Data access – REST API
  • 36. xIgnite • www.xignite.com
  • 37. xIgnite (2) • Data domain – Financial data • Data population – aggregate data from leading sources (Dow Jones, Thomson Reuters, stock exchanges, …) – Public datasets (national banks, SEC, Federal Reserve, …) – User uploaded data • Company info – Xignite (USA)
  • 38. xIgnite (3) • Monetization model – Paid subscriptions • Data access – Web services (REST/SOAP)
  • 39. Coming soon… • BuzzData – www.buzzdata.com / @buzzdata – Company: BuzzData
  • 40. Data marketplaces – features summary • Data – Data model, domain, export options • Monetization – Charge buyers / sellers – Free API calls – Branded marketplaces & Service Level Agreement • For developers – REST API; query language – Tools for data management / integration – Application hosting
  • 41. Feature matrix
                              Factual  InfoChimps  Azure DataMarket  DataMarket  Socrata  Kasabi  Freebase  timetric  xIgnite
    DATA
    Data from all domains     +        +           +                 -           +        +       +         -         -
    Data model                tabular  various     various           ?           tabular  RDF     graph     ?         ?
    Data export               -        -           +                 -           +        ?       +         -         -
    RDF export                -        -           -                 -           +        +       +         -         -
    MONETIZATION
    Charge buyers             +        +/-         +                 +/-         +        +       +/-       +/-       +
    Charge sellers            ?        +           -                 +           -        ?       -         ?         ?
    Free API calls (month)    ?        100K        ?                 50          -        ?       3M        ?         -
    Branded marketplaces      -        -           +                 +           +        ?       -         -         -
    Service Level guarantee   ?        -           -                 -           -        ?       -         -         -
    TOOLS
    REST API                  +        +           +                 +           +        +       +         +         +
    Query language            +        -           +                 -           -        +       +         -         -
    Tools                     +        -           +                 -           -        +       +         -         -
    App hosting               -        -           +                 -           -        ?       +         -         -
  • 42. LINKED DATA + MARKETPLACES
  • 43. Linked Data cloud (Sep 2010) (c) R. Cyganiak and A. Jentzsch
  • 44. Benefits of Linked Data for Data Marketplaces • Unified data representation model (RDF) – Easy consumption of the data • Global identifiers for all objects (URI) – Makes incremental data integration & federation easier • Interlinked datasets – New data added to the marketplace can be integrated with existing data – Network effects • Data marketplace interoperability – Data from different marketplaces can be easily integrated
  • 45. Benefits of Linked Data for Data Marketplaces (2) • Derived knowledge / facts – RDF inference of additional implicit facts – (see FactForge and LinkedLifeData) • Rich queries – SPARQL offers unmatched query expressivity • Easy import of existing LOD datasets – Linked Open Data cloud already includes 200+ datasets with 20+ billion RDF triples
  • 46. Linked Data for marketplaces – challenges • Quality of data – Different (public) datasets may come with inconsistent or controversial data – Quality more important than quantity • Large scale data integration – Ontology (schema) mapping of different datasets & vocabularies • Licensing – Some datasets come with “CC-BY-NC” or unclear licensing • Billing – API calls / SPARQL queries with varying computational cost
  • 47. Linked Data for marketplaces – challenges (2) • Billing – API calls / SPARQL queries with varying computational cost • Operations – Service Level guarantees – Availability & scalability challenges • Most Linked Data endpoints at present are neither scalable, nor available
  • 48. LinkedLifeData & FactForge [figure; labels: FactForge, LinkedLifeData] (c) R. Cyganiak and A. Jentzsch
  • 49. LinkedLifeData & FactForge • FactForge – Integrates some of the most central LOD datasets – General-purpose information (not specific to a domain) – 1.2 billion explicit and 1 billion inferred statements – The largest upper-level knowledge base – http://www.FactForge.net • Linked Life Data – 25 of the most popular life-science datasets – 2.7 billion explicit and 1.4 billion inferred statements – http://www.LinkedLifeData.com
  • 50. Strategic questions • Monetization strategy – which (linked) datasets can be monetized – Charge buyers / charge sellers / free quota – Branded marketplaces • Community building – Crowdsource the data curation to the community – How to provide incentives to data curators?
  • 51. Strategic questions (2) • Operations – How to ensure Service Level guarantees? – How to deal with licensing issues? – Account management, metering, billing • Platform – RDF database – data volume, query volume – ETL tools – Curation tools – Data export & consumption
  • 52. Data monetization with WebServius (c) WebServius • Benefits – user management, quotas & restrictions – Metering, pricing, billing – Security, scalability, SLAs
  • 53. Q&A Questions? @ontotext
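
The tiered buyer pricing and seller commission described for InfoChimps on slides 14 and 15 can be illustrated with a short Python sketch. The plan names, prices, quotas and the 30% commission come from those slides. The function names `cheapest_plan` and `seller_payout` are hypothetical helpers introduced here for illustration only, and the sketch assumes a plan is chosen purely by its monthly API call quota.

```python
from typing import Optional

# Buyer plans from slide 15: (name, USD per month, API calls per month).
INFOCHIMPS_PLANS = [
    ("Baboon", 0, 100_000),
    ("Brass Monkey", 20, 500_000),
    ("Silverback", 250, 2_000_000),
    ("Golden Ape", 4_000, 15_000_000),
]


def cheapest_plan(calls_per_month: int) -> Optional[str]:
    """Return the cheapest plan whose quota covers the volume (hypothetical helper).

    Returns None when the volume exceeds every listed quota.
    """
    if calls_per_month < 0:
        raise ValueError("calls_per_month must be non-negative")
    eligible = [
        (price, name)
        for name, price, quota in INFOCHIMPS_PLANS
        if quota >= calls_per_month
    ]
    # min() compares on price first, so the cheapest eligible plan wins.
    return min(eligible)[1] if eligible else None


def seller_payout(sale_price: float, commission: float = 0.30) -> float:
    """Amount left to the data seller after the marketplace commission.

    The 30% default is the InfoChimps commission quoted on slide 14.
    """
    return sale_price * (1.0 - commission)


if __name__ == "__main__":
    for volume in (50_000, 750_000, 20_000_000):
        print(volume, "->", cheapest_plan(volume))
    print("payout on a $100 sale:", seller_payout(100.0))
```

The same lookup could be pointed at the DataMarket or Socrata tiers by swapping the table, which is one way to compare the "Charge buyers" row of the feature matrix across marketplaces.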