Identity Crisis in Linked Data

Ora Lassila (Nokia)
Ryan J. McDonough (Nokia)
Susan Malaika (IBM)

Position Statement for the W3C Workshop on
Linked Enterprise Data Patterns
2011-12-06



   © 2011 Nokia LOD-identity-crisis.pptx 2011-12-06 ora
Background: Web Architecture and Identity	
•  Ability to identify things (resources, objects, concepts, etc.) is a
   cornerstone of the Web Architecture
•  Identity via URIs is critical for the Semantic Web and Linked Data
•  W3C’s guidance is clear: All things “worth talking about” deserve to
   have a URI as a global, unambiguous identifier
•  But, problems exist, all related to URIs as identifiers:
       1.  identity vs. location
       2.  missing or ambiguous identity
       3.  versioning of data and identity
       4.  lack of stable identity
•  More guidance is needed…

Problem #1: Identity vs. Location	
•  Two mutually confusing observations:
       − URIs are used to provide identity for resources
       − URLs are used as queries (= “locators”)
•  Many different query URLs can “yield” the same resource
       − (i.e., a resource whose identity is a particular URI)
•  Particularly difficult with REST
       − identity URI often “embedded” in a query URL
•  Protocol URIs confuse the hell out of Web developers




Problem #1: Identity vs. Location
URL Query with Internal Identity	
   http://a.com/things?id=13


•  Exposes an internal identifier (often a primary key in some
   database) – identity (as a URI) now tied to a particular domain or
   server
•  There may be practical reasons why developers prefer internal
   identifiers in databases
       − no tie-ins with servers/domains
       − no perceived need of “external” linking
       − efficiency…?

Problem #1: Identity vs. Location
“RESTish” URL Query	
   http://a.com/things/13


•  As an identity URI, people typically do not treat this as opaque
•  The identity is still internal, but is now embedded in the URL as a
   path parameter
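
A minimal sketch of the point that several locator URLs can yield the same resource. It assumes a hypothetical service at a.com that accepts both URL styles shown so far, and it arbitrarily picks the path style as the single identity URI; the function name and the template are illustrative, not part of any specification.

```python
from typing import Optional
from urllib.parse import urlsplit, parse_qs

# Assumption: the identity URI is written in the path style.
IDENTITY_TEMPLATE = "http://a.com/things/{key}"


def identity_of(locator: str) -> Optional[str]:
    """Map a locator URL in either style to one identity URI, or None."""
    parts = urlsplit(locator)
    if parts.netloc != "a.com":
        return None  # this sketch only knows about one server
    # Query style: http://a.com/things?id=13
    if parts.path == "/things":
        ids = parse_qs(parts.query).get("id")
        return IDENTITY_TEMPLATE.format(key=ids[0]) if ids else None
    # Path style: http://a.com/things/13
    prefix = "/things/"
    key = parts.path[len(prefix):] if parts.path.startswith(prefix) else ""
    return IDENTITY_TEMPLATE.format(key=key) if key else None


same = identity_of("http://a.com/things?id=13") == identity_of("http://a.com/things/13")
```

Both locators resolve to one identity here, but that identity is still bound to the a.com domain and to the internal key 13.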




Problem #1: Identity vs. Location
URL Query with URI Identifier	
             http://b.org/things?id=http%3A//a.com/foo/13	


•  Makes sense, but does not look particularly “pretty”
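
The URL above can be produced and taken apart with the Python standard library. This is only a sketch: the helper names query_url and identity_from are made up for illustration, and the parameter name "id" is taken from the slide.

```python
from urllib.parse import quote, urlsplit, parse_qs


def query_url(service: str, identity_uri: str) -> str:
    """Embed an identity URI as the 'id' parameter of a query URL."""
    # quote() leaves "/" unescaped by default and escapes ":" as %3A.
    return f"{service}?id={quote(identity_uri)}"


def identity_from(url: str) -> str:
    """Recover the embedded identity URI from a query URL."""
    # parse_qs() percent-decodes the parameter values.
    return parse_qs(urlsplit(url).query)["id"][0]


url = query_url("http://b.org/things", "http://a.com/foo/13")
# url is "http://b.org/things?id=http%3A//a.com/foo/13"
identity = identity_from(url)
# identity is "http://a.com/foo/13"
```

The locator (b.org) and the identity (a URI minted at a.com) stay separate: the server answering the query does not have to be the one that minted the identifier.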




Problem #1: Identity vs. Location
URL Query with URN Identifier	
   http://a.com/things?id=urn%3Acom%3Aa%3Athings%3A13


•  Still not so pretty, but makes sense




Problem #1: Identity vs. Location
URL Query with URI Identifier (same domain)	
   http://a.com/things?id=http%3A//a.com/foo/13


•  Confusing to many as it does not appear to make sense
•  Has the perception of being somehow inefficient




Problem #1: Identity vs. Location
Separating Identity from Location	
•  Web Developers have a difficult time with protocol URIs
•  If a URI starts with “http”, it must be dereferenceable
•  It seems that the new JSON specs (json-schema, json-ref, etc.) do
   not even attempt to make the distinction between identity and
   location




Problem #2: Missing or Ambiguous Identity	
•  Many “real-world” objects do not have a URI, but they can be
   uniquely identified via some attribute
        − e.g., SSN for people living in the US
        − use via owl:InverseFunctionalProperty, but this implies
          the need for some kind of reasoning services
•  Some attributes normally thought of as unique in fact are not
        − e.g., ISBN numbers for books
•  A set of attributes can uniquely identify an object, but is this a real
   identity or merely a query?
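
To make the "reasoning services" remark concrete, here is a deliberately tiny stand-in for what an OWL reasoner would infer from an inverse functional property: two subjects sharing a value for such a property denote the same thing. The triples, the ex: property names and the SSN values are all made up, and a real system would use an RDF store and a reasoner rather than Python tuples.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical data. Subjects are blank-node-style labels because
# these "real-world" people have no URI of their own.
TRIPLES = [
    ("_:p1", "ex:ssn", "000-00-0001"),
    ("_:p1", "ex:name", "A. Smith"),
    ("_:p2", "ex:ssn", "000-00-0001"),
    ("_:p2", "ex:name", "Alice Smith"),
    ("_:p3", "ex:ssn", "000-00-0002"),
]

# Assumption: ex:ssn has been declared an owl:InverseFunctionalProperty.
INVERSE_FUNCTIONAL = {"ex:ssn"}


def infer_same(triples, ifps):
    """Return pairs of subjects that share a value for some IFP."""
    by_value = defaultdict(set)
    for subject, prop, value in triples:
        if prop in ifps:
            by_value[(prop, value)].add(subject)
    pairs = set()
    for subjects in by_value.values():
        # Every two subjects in one group are inferred to be identical.
        pairs.update(combinations(sorted(subjects), 2))
    return pairs


inferred = infer_same(TRIPLES, INVERSE_FUNCTIONAL)
```

Here _:p1 and _:p2 are merged although their names differ. If the property is not truly unique (the ISBN case), the same rule silently merges distinct objects.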




Problem #3: Versioning of Data and Identity	
•  W3C has largely ignored versioning
        − certainly true of versioning of data
        − sometimes also true of specifications (e.g., RDF)
•  Should version information be part of the identity of an object?
•  Related to “lifecycle” issues
•  Note that recent SQL standardization has added features that may
   help in resource (data) versioning
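
One possible answer to the question above, sketched only to make it concrete: keep the identity URI version-independent and treat the version as a separate coordinate. The VersionedStore class and its methods are hypothetical and in-memory; they do not correspond to any W3C or SQL feature.

```python
class VersionedStore:
    """Keeps every state of a resource under one stable identity URI."""

    def __init__(self):
        # identity URI -> list of states, oldest first
        self._history = {}

    def put(self, uri, state):
        """Record a new state and return its 1-based version number."""
        versions = self._history.setdefault(uri, [])
        versions.append(state)
        return len(versions)

    def get(self, uri, version=None):
        """Return the latest state, or a specific 1-based version."""
        versions = self._history[uri]
        if version is None:
            return versions[-1]
        if not 1 <= version <= len(versions):
            raise KeyError((uri, version))
        return versions[version - 1]


store = VersionedStore()
v1 = store.put("http://a.com/things/13", {"label": "draft"})
v2 = store.put("http://a.com/things/13", {"label": "final"})
```

The alternative design, minting a new URI per version, makes each version linkable on its own but then needs an explicit statement relating the versions to each other.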




Problem #4: Lack of Stable Identity	
•  W3C proclaims that “cool URIs do not change”
        − (but in reality we know they do…)
        − actually, we see a confusion between URIs and URLs here
•  Mitigating lack of stability via redirection (or “address resolution”)
        − PURLs tied to network access, not database access
        − lifecycle issues might benefit as well
        − “hash vs. slash”, httpRange-14, …
•  “Local” URLs are not globally unique and lend themselves to
   ambiguity, errors and confusion
        − e.g., file:///C:/My Documents/Resume.doc
•  With reasoning support, owl:sameAs could be used as a means
   of declaring mappings (some caveats apply)
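
A sketch of how owl:sameAs mappings could be applied once they exist: treat the statements as an equivalence relation and pick one representative URI per group. The URIs are invented, and choosing the lexicographically smallest URI as representative is an arbitrary assumption of this example. The caveats mentioned above still hold, since a single wrong sameAs statement merges two whole groups.

```python
def canonicalize(same_as_pairs):
    """Map every URI to one representative of its sameAs group."""
    parent = {}

    def find(uri):
        parent.setdefault(uri, uri)
        while parent[uri] != uri:
            parent[uri] = parent[parent[uri]]  # path halving
            uri = parent[uri]
        return uri

    for a, b in same_as_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the lexicographically smaller URI as representative.
            keep, drop = sorted((root_a, root_b))
            parent[drop] = keep
    return {uri: find(uri) for uri in list(parent)}


# Hypothetical mappings, e.g. after a resource moved twice.
PAIRS = [
    ("http://a.com/things/13", "http://b.org/items/x"),
    ("http://b.org/items/x", "http://old.a.com/13"),
]
canonical = canonicalize(PAIRS)
```

All three URIs end up with http://a.com/things/13 as their representative, so data published under any of them can be joined.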
Conclusions	
•  Confusion in matters of identity leads to a lack of interoperability
•  Consequently, this delays and hinders the deployment of
   Semantic Web and Linked Data systems
•  We would like to alleviate the confusion, and are seeking
   discussion, guidance, etc.




