SlideShare une entreprise Scribd logo
1  sur  14
Télécharger pour lire hors ligne
You	
  rang,	
  M’LOD?	
                                                        ì	
  
Google	
  Refine	
  in	
  the	
  world	
  of	
  LOD	
  




Mateja	
  Verlic	
  




                       Seman/c	
  Tech	
  &	
  Business	
  Conference	
  
                          June	
  3-­‐7,	
  2012	
  |	
  San	
  Francisco	
  
2	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
     June	
  7,	
  2012	
  
3	
  




                                                                                     Google	
  Refine	
  

                                   ì  What	
  we’ve	
  seen	
  so	
  far	
  
                                       ì  Messy	
  data	
  gone	
  clean	
  
                                              ì  Filtering,	
  faceted	
  browsing	
  
                                              ì  Edi/ng	
  cells	
  and	
  columns,	
  clustering,	
  expor/ng	
  
                                              ì  Bulk	
  transforma/ons	
  
                                              ì  History	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                          June	
  7,	
  2012	
  
4	
  




                                                …	
  and	
  the	
  powerful	
  dark	
  side	
  

                                   ì  Reconcilia/on	
  

                                   ì  Extending	
  data	
  	
  

                                   ì  Regular	
  expressions	
  

                                   ì  Integrated	
  GREL	
  commands	
  

                                   ì  Jython	
  

                                   ì  Extensions	
  (actually	
  not	
  so	
  many	
  of	
  them)	
  


©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                 June	
  7,	
  2012	
  
5	
  




                                                                                                                      LOD2	
  

                                   ì  Crea/ng	
  knowledge	
  out	
  of	
  Interlinked	
  Data	
  

                                   ì  EU	
  FP7	
  project	
  

                                   ì  15	
  partners	
  

                                   ì  LOD2	
  in	
  a	
  	
  	
  	
  	
  	
  	
  	
  	
  :	
  	
  
                                              LOD2	
  is	
  like	
  Batman	
  and	
  Robin:	
  business	
  and	
  academics	
  
                                              -­‐	
  mighty	
  tools	
  and	
  a	
  bunch	
  of	
  real	
  &	
  good	
  use	
  cases	
  
                                              supported	
  by	
  science	
  for	
  success.	
  

                                                                                                      	
  

©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                                       June	
  7,	
  2012	
  
6	
  




                                                                             Linked	
  Open	
  Data	
  

                                   ì  Distributed	
  data,	
  different	
  sources,	
  formats	
  

                                   ì  Open	
  Government	
  data	
  

                                   ì  Open	
  Data	
  Business	
  &	
  Business	
  of	
  Open	
  Data	
  

                                   ì  CKAN	
  



                                   ì  LOD2:	
  hap://www.lod2.eu	
  

                                   	
  
©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                     June	
  7,	
  2012	
  
7	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
     June	
  7,	
  2012	
  
8	
  




                                                                                               LODGrefine	
  

                                   ì  LOD-­‐friendly	
  GoogleRefine	
  extensions	
  


                                              ì  RDF	
  extension	
  
                                              	
         hap://lab.linkeddata.deri.ie/2010/grefine-­‐rdf-­‐extension/	
  
                                              	
  
                                              ì  DBpedia	
  extension	
  	
  


                                   ì  LODGrefine:	
  Google	
  Refine	
  +	
  integrated	
  extensions	
  


©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                               June	
  7,	
  2012	
  
9	
  




                                                                             LODGrefine	
  toolbox	
  

                                   ì  Google	
  Refine	
  func/onali/es	
  +	
  	
  
                                       ì  Registering	
  reconcilia/on	
  service	
  based	
  on	
  a	
  SPARQL	
  
                                           endpoint,	
  RDF	
  dump	
  or	
  Sindice	
  search	
  
                                       ì  RDF	
  Export	
  
                                              ì  Extending	
  reconciled	
  column	
  with	
  data	
  from	
  
                                                  DBpedia	
  	
  
                                              ì  Extrac/ng	
  en//es	
  from	
  full	
  text	
  using	
  Zemanta	
  API	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                             June	
  7,	
  2012	
  
10	
  




                                                                             Mechanical	
  Future	
  

                                   ì  Integra/on	
  with	
  Amazon	
  Mechanical	
  Turk?	
  

                                   ì  Leveraging	
  crowd	
  intelligence	
  

                                   ì  Do	
  Workers	
  dream	
  reconciled	
  data?	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                         June	
  7,	
  2012	
  
11	
  




                                                                                 Short	
  Summary	
  

                                   ì  Google	
  Refine	
  -­‐	
  what	
  we	
  had	
  

                                   ì  LOD	
  1st	
  class	
  ci/zen	
  in	
  GR	
  -­‐	
  what	
  we	
  wanted	
  

                                   ì  Google	
  Refine	
  extension(s)	
  -­‐	
  what	
  we	
  did	
  

                                   ì  LODGrefine	
  -­‐	
  what	
  we	
  have	
  

                                   ì  Mechanical	
  future	
  –	
  what	
  (we	
  think)	
  we	
  want	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                              June	
  7,	
  2012	
  
12	
  




                                                                                                      Demo	
  #1	
  

                                   ì  Pimp	
  my	
  data	
  under	
  10	
  minutes:	
  A	
  showcase	
  how	
  
                                              to	
  convert	
  data	
  from	
  a	
  website	
  into	
  a	
  linked	
  dataset	
  
                                              under	
  10	
  minutes.	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                                June	
  7,	
  2012	
  
13	
  




                                                                                                    Demo	
  #2	
  

                                   ì  Yes,	
  we	
  C(K)AN:	
  Conver/ng	
  one	
  of	
  the	
  CKAN	
  Open	
  
                                              Data	
  datasets	
  into	
  a	
  LOD	
  dataset	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                                     June	
  7,	
  2012	
  
14	
  




                                                                                Thank	
  you	
  

                                   Mateja	
  Verlic	
  

                                   E-­‐mail:	
  mateja.verlic@zemanta.com	
  

                                   Web:	
  hap://www-­‐zemanta.com	
  

                                   LODGrefine:	
  hap://code.zemanta.com/sparkica	
  

                                   Twiaer:	
  @sparkica	
  




©2012	
  Seman/c	
  Technology	
  Conference	
  June	
  3-­‐7,	
  2012	
                  June	
  7,	
  2012	
  

Contenu connexe

Similaire à You rang, M’LOD? Google Refine in the world of LOD

Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting
 
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
EDF2012   Chris Taggart - How the biggest Open Database of Companies was builtEDF2012   Chris Taggart - How the biggest Open Database of Companies was built
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
European Data Forum
 
Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3
Yamika Mehra
 
Pal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integrationPal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integration
Mustafa Jarrar
 

Similaire à You rang, M’LOD? Google Refine in the world of LOD (20)

Productivity Future Vision
Productivity Future VisionProductivity Future Vision
Productivity Future Vision
 
Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012Advocate Consulting - Tangoe Summit Keynote Presentation 2012
Advocate Consulting - Tangoe Summit Keynote Presentation 2012
 
JIST 2012
JIST 2012JIST 2012
JIST 2012
 
Building Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph HughesBuilding Agile Data Warehouses with Ralph Hughes
Building Agile Data Warehouses with Ralph Hughes
 
Benchmark METRICS THAT MATTER October 4 2012
Benchmark METRICS THAT MATTER October 4 2012Benchmark METRICS THAT MATTER October 4 2012
Benchmark METRICS THAT MATTER October 4 2012
 
ORCID Outreach Meeting dev breakout session
ORCID Outreach Meeting dev breakout sessionORCID Outreach Meeting dev breakout session
ORCID Outreach Meeting dev breakout session
 
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
EDF2012   Chris Taggart - How the biggest Open Database of Companies was builtEDF2012   Chris Taggart - How the biggest Open Database of Companies was built
EDF2012 Chris Taggart - How the biggest Open Database of Companies was built
 
Virtual Worlds: A Future History
Virtual Worlds: A Future HistoryVirtual Worlds: A Future History
Virtual Worlds: A Future History
 
Ashnik corporate presentation Dec 2012
Ashnik corporate presentation Dec 2012Ashnik corporate presentation Dec 2012
Ashnik corporate presentation Dec 2012
 
SMX Landing Page Optimization
SMX Landing Page OptimizationSMX Landing Page Optimization
SMX Landing Page Optimization
 
True Drivers of MDM webinar
True Drivers of MDM webinarTrue Drivers of MDM webinar
True Drivers of MDM webinar
 
Who’s using my apps
Who’s using my appsWho’s using my apps
Who’s using my apps
 
EDF2012 Nuria de Lama - BIG
EDF2012   Nuria de Lama - BIGEDF2012   Nuria de Lama - BIG
EDF2012 Nuria de Lama - BIG
 
2012 - 2013 bulk ieee projects for sale
2012 - 2013 bulk ieee projects for sale2012 - 2013 bulk ieee projects for sale
2012 - 2013 bulk ieee projects for sale
 
2012-2013 IEEE PROJECT TITLES
2012-2013 IEEE PROJECT TITLES2012-2013 IEEE PROJECT TITLES
2012-2013 IEEE PROJECT TITLES
 
Lod2
Lod2Lod2
Lod2
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project Guidance
 
Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3Ideal-Analytics - Introduction to Version 3.3
Ideal-Analytics - Introduction to Version 3.3
 
Pal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integrationPal gov.tutorial2.session13 1.data schema integration
Pal gov.tutorial2.session13 1.data schema integration
 
3 Jahre OGD In Österreich - eine Billanz
3 Jahre OGD In Österreich - eine Billanz3 Jahre OGD In Österreich - eine Billanz
3 Jahre OGD In Österreich - eine Billanz
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

You rang, M’LOD? Google Refine in the world of LOD

  • 1. You  rang,  M’LOD?   ì   Google  Refine  in  the  world  of  LOD   Mateja  Verlic   Seman/c  Tech  &  Business  Conference   June  3-­‐7,  2012  |  San  Francisco  
  • 2. 2   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 3. 3   Google  Refine   ì  What  we’ve  seen  so  far   ì  Messy  data  gone  clean   ì  Filtering,  faceted  browsing   ì  Edi/ng  cells  and  columns,  clustering,  expor/ng   ì  Bulk  transforma/ons   ì  History   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 4. 4   …  and  the  powerful  dark  side   ì  Reconcilia/on   ì  Extending  data     ì  Regular  expressions   ì  Integrated  GREL  commands   ì  Jython   ì  Extensions  (actually  not  so  many  of  them)   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 5. 5   LOD2   ì  Crea/ng  knowledge  out  of  Interlinked  Data   ì  EU  FP7  project   ì  15  partners   ì  LOD2  in  a                  :     LOD2  is  like  Batman  and  Robin:  business  and  academics   -­‐  mighty  tools  and  a  bunch  of  real  &  good  use  cases   supported  by  science  for  success.     ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 6. 6   Linked  Open  Data   ì  Distributed  data,  different  sources,  formats   ì  Open  Government  data   ì  Open  Data  Business  &  Business  of  Open  Data   ì  CKAN   ì  LOD2:  hap://www.lod2.eu     ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 7. 7   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 8. 8   LODGrefine   ì  LOD-­‐friendly  GoogleRefine  extensions   ì  RDF  extension     hap://lab.linkeddata.deri.ie/2010/grefine-­‐rdf-­‐extension/     ì  DBpedia  extension     ì  LODGrefine:  Google  Refine  +  integrated  extensions   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 9. 9   LODGrefine  toolbox   ì  Google  Refine  func/onali/es  +     ì  Registering  reconcilia/on  service  based  on  a  SPARQL   endpoint,  RDF  dump  or  Sindice  search   ì  RDF  Export   ì  Extending  reconciled  column  with  data  from   DBpedia     ì  Extrac/ng  en//es  from  full  text  using  Zemanta  API   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 10. 10   Mechanical  Future   ì  Integra/on  with  Amazon  Mechanical  Turk?   ì  Leveraging  crowd  intelligence   ì  Do  Workers  dream  reconciled  data?   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 11. 11   Short  Summary   ì  Google  Refine  -­‐  what  we  had   ì  LOD  1st  class  ci/zen  in  GR  -­‐  what  we  wanted   ì  Google  Refine  extension(s)  -­‐  what  we  did   ì  LODGrefine  -­‐  what  we  have   ì  Mechanical  future  –  what  (we  think)  we  want   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 12. 12   Demo  #1   ì  Pimp  my  data  under  10  minutes:  A  showcase  how   to  convert  data  from  a  website  into  a  linked  dataset   under  10  minutes.   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 13. 13   Demo  #2   ì  Yes,  we  C(K)AN:  Conver/ng  one  of  the  CKAN  Open   Data  datasets  into  a  LOD  dataset   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012  
  • 14. 14   Thank  you   Mateja  Verlic   E-­‐mail:  mateja.verlic@zemanta.com   Web:  hap://www-­‐zemanta.com   LODGrefine:  hap://code.zemanta.com/sparkica   Twiaer:  @sparkica   ©2012  Seman/c  Technology  Conference  June  3-­‐7,  2012   June  7,  2012