DATALIFT
free! your data! link! your data!
in september 2010…
you are here
goal of datalift
accelerate the lifting from raw data
to linked public data
Who?
phase 1: an easy open
end for data
provide an open source
platform assisting publication
assist the selection of data
Selection of data to publish
- Usage studies
- Business models questions
- Selection tools
- Apply to real data from users in the project
identify appropriate schemas
Schema catalog
- Methods and metrics to select schemas
- Build and publish a catalog
- Model specific needs and extensions
- Apply to real data from users in the project
format conversion & connectors
Converting data
- Principles for identifiers
- Tools to convert to RDF (CSV, etc.)
- Application to data of the users of the project
- Attach provenance, licenses, rights, etc.
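As an illustration of the conversion step listed above, here is a minimal sketch of turning CSV rows into RDF triples in N-Triples syntax. It is not the Datalift tooling: the `http://example.org/` namespace, the `resource/` and `property/` URI patterns, the function names and the sample rows are all assumptions made up for this example.

```python
import csv
import io
from urllib.parse import quote

# Hypothetical namespace; a real publisher would choose its own stable URIs.
BASE = "http://example.org/"


def escape_literal(value):
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (value.replace("\\", "\\\\")
                 .replace('"', '\\"')
                 .replace("\n", "\\n")
                 .replace("\r", "\\r"))


def csv_to_ntriples(text, id_column, base=BASE):
    """Turn each CSV row into one subject and each other cell into one triple."""
    reader = csv.DictReader(io.StringIO(text))
    lines = []
    for row in reader:
        # Identifier principle assumed here: base URI + percent-encoded key.
        subject = f"<{base}resource/{quote(row[id_column], safe='')}>"
        for column, value in row.items():
            if column == id_column or not value:
                continue  # skip the key column and empty cells
            predicate = f"<{base}property/{quote(column, safe='')}>"
            lines.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return lines


# Made-up sample data, only to exercise the function.
SAMPLE = 'id,label\nA 1,First item\nB2,"Second ""item"""\n'

if __name__ == "__main__":
    for triple in csv_to_ntriples(SAMPLE, "id"):
        print(triple)
```

Every value is emitted as a plain string literal here; mapping columns to properties from an existing schema and attaching provenance or license statements would be separate steps.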
data publication
Architecture and infrastructure to publish data
- Benchmarks and integration
- Architecture and component for publication
- Real scale experiment with users of the project
interconnecting data
Data interconnection
- Statistical methods to calculate keys in a dataset
- Tools to interconnect datasets
- Interconnect datasets of the users of the project
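To make the idea of calculating keys in a dataset more concrete, the sketch below scores column combinations by the share of distinct value tuples they produce and keeps the minimal combinations that are fully unique. This simple uniqueness ratio is an assumption chosen for illustration, not the statistical method used in the project; the function names and the sample rows are hypothetical.

```python
from itertools import combinations


def key_scores(rows, max_size=2):
    """Map each column combination to distinct-tuples / number-of-rows."""
    if not rows:
        return {}
    columns = sorted(rows[0])
    scores = {}
    for size in range(1, max_size + 1):
        for combo in combinations(columns, size):
            distinct = {tuple(row[c] for c in combo) for row in rows}
            scores[combo] = len(distinct) / len(rows)
    return scores


def candidate_keys(rows, max_size=2, threshold=1.0):
    """Return minimal column combinations whose score reaches the threshold."""
    scores = key_scores(rows, max_size)
    keys = []
    # Visit smaller combinations first so supersets of a key can be skipped.
    for combo in sorted(scores, key=lambda c: (len(c), c)):
        is_superset = any(set(k) <= set(combo) for k in keys)
        if scores[combo] >= threshold and not is_superset:
            keys.append(combo)
    return keys


# Made-up rows, only to exercise the functions.
ROWS = [
    {"name": "A", "region": "north", "year": "2010"},
    {"name": "A", "region": "south", "year": "2010"},
    {"name": "B", "region": "north", "year": "2011"},
]

if __name__ == "__main__":
    print(candidate_keys(ROWS))
```

Lowering `threshold` below 1.0 tolerates a few duplicates, which can be useful on noisy data. Columns that behave as keys in two datasets are natural candidates for comparing records when interconnecting them.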
phase 2: publish real datasets
validate, apply and test the
platform on real datasets.
other topics identified
● toolbox to visualize, browse and query data
● API for mobile applications
● clouds & distribution
● legal advice and granularity
● cookbook & best practices
R&D challenges
● methods & metrics for schema selection
● balance specific needs & reusability
● data conversion & identifiers generation
● automation of dataset interconnection
● named graphs, provenance, licenses and rights
● integration and scalability
milestones
M1 October 2010
M6 April 2011
M12 Sept. 2011
M18 April 2012
M24 Sept. 2012
M30 April 2013
M36 Sept. 2013
WWW 2012
- architecture specification
- forge for the open-source
- state of the art & benchmarks
- datasets from IGN & INSEE
- first integrated prototype
- data publication tools & experimentation
- schema catalog
- conversion tools
- interconnection prototype
- usages and business models
- 2nd version (cloud)
- more and more datasets
- encourage external adoption and application development
- functional platform…
other identified data providers
and users’ club
• Regards Citoyens
• Direction de l’information légale et administrative
• Fédération des parcs naturels régionaux de France
• Eurostat
• Communauté Urbaine de Bordeaux
• Data Publica
…
other projects
● Data Publica
● LATC
● LOD2
● Planet Data
● Publink (LOD2 + LATC)
● RPI Data-gov wiki
● Data incubator
DATALIFT

Datalift, publishing public data in France
