SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
The	
  ARK	
  Iden+fier	
  Scheme	
  at	
  
          Ten	
  Years	
  Old	
  
                                       7 	
   M a y 	
   2 0 1 2 	
  

                                         J o h n 	
   K u n z e 	
  
      U n i v e r s i t y 	
   o f 	
   C a l i f o r n i a 	
   C u r a + o n 	
   C e n t e r 	
  
                    C a l i f o r n i a 	
   D i g i t a l 	
   L i b r a r y 	
  
California	
  Digital	
  Library	
  
Serving	
  the	
  University	
  of	
  California	
     CDL	
  supports	
  the	
  research	
  lifecycle	
  	
  
•  10	
  campuses	
                                    •  Collec+ons	
  
•  360K	
  students,	
  faculty,	
  and	
  staff	
      •  Digital	
  Special	
  Collec+ons	
  
•  100’s	
  of	
  museums,	
  art	
  galleries,	
      •  Discovery	
  &	
  Delivery	
  
   observatories,	
  marine	
  centers,	
              •  Publishing	
  Group	
  
   botanical	
  gardens	
                              •  UC	
  Cura+on	
  Center	
  (UC3)	
  
•  5	
  medical	
  centers	
  
•  5	
  law	
  schools	
  
•  3	
  Na+onal	
  Laboratories	
  
California	
  Digital	
  Library	
  (CDL)	
  
Today’s	
  journey	
  

           • What	
  are	
  ARKs?	
  
           • Separa+on	
  of	
  concerns	
  
               • Naming	
  ≠	
  hos+ng	
  
               • Scheme	
  ≠	
  resolu+on	
  
               • Syntax	
  ≠	
  persistence	
  
           • Inflec+ons	
  and	
  metadata	
  
           • EZID	
  (easy	
  iden+fiers)	
  and	
  
           N2T	
  (name-­‐to-­‐thing)	
  
           • Data	
  cita+on,	
  passthrough	
  
What’s	
  an	
  ARK	
  iden+fier?	
  
ARK	
  =	
  Archival	
  Resource	
  Key	
  
ARKs	
  support	
  long-­‐term	
  access	
  to	
  informa+on	
  objects	
  
ARKs	
  iden+fy	
  objects	
  of	
  any	
  type:	
  
•  digital	
  objects	
  –	
  data,	
  documents,	
  images,	
  sodware,	
  ...	
  
•  physical	
  objects	
  –	
  books,	
  bones,	
  statues,	
  ...	
  
•  groups	
  &	
  living	
  beings	
  –	
  people,	
  animals,	
  orchestras,	
  ...	
  
•  Intangibles	
  –	
  places,	
  chemicals,	
  diseases,	
  terms,	
  ...	
  
The	
  URL	
  is	
  dead,	
  long	
  live	
  the	
  URL!	
  
Fallacy	
  #1:	
  	
  URLs	
  are	
  unreliable,	
  so	
  instead	
  use	
  
  this...	
  um...	
  well...	
  ah	
  ...	
  (shhh!)	
  “URL”	
  
Some	
  of	
  your	
  best	
  friends	
  are	
  URLs:	
  
                        hlp://dx.doi.org/10.1234/98765	
  
               hlp://hdl.handle.net/10.1234/98765	
  
                          hlp://purl.org/10.1234/98765	
  
                       hlp://n2t.net/ark:/101234/98765	
  
Persistence	
  is	
  about	
  service	
  
•  Imagine	
  the	
  “perfect”	
  golden	
  iden+fier	
  
•  Apply	
  bankruptcy,	
  disk	
  crash,	
  human	
  error,	
  or	
  
   war,	
  and	
  there’s	
  nothing	
  that	
  syntax,	
  scheme,	
  or	
  
   resolver	
  can	
  do	
  to	
  prevent	
  iden+fier	
  breakage.	
  
What’s	
  an	
  ARK	
  iden+fier?	
  (take	
  2)	
  
An	
  ARK	
  is	
  a	
  URL,	
  with	
  some	
  extra	
  rules	
  
ARK	
  reserves	
  /	
  and	
  .	
  for	
  what	
  we	
  oden	
  assume	
  
•  A/B/C	
  means	
  C	
  is	
  contained	
  in	
  A/B,	
  and	
  B	
  in	
  A	
  
•  A.pdf,	
  A.html,	
  and	
  A.docx	
  are	
  all	
  variants	
  of	
  A	
  
Could	
  dras+cally	
  improve	
  search	
  result	
  display	
  
•  No	
  need	
  to	
  lookup	
  rela+onships	
  
ARK	
  inflec+ons	
  (declina+ons)	
  
An	
  ARK	
  is	
  a	
  special	
  URL	
  with	
  access	
  to	
  3	
  things	
  
1.  An	
  informa+on	
  object	
  
2.  Its	
  metadata,	
  by	
  appending	
  ‘?’	
  inflec+on	
  
3.  A	
  provider’s	
  promise,	
  by	
  appending	
  a	
  ‘??’	
  
An	
  inflec1on	
  changes	
  a	
  name	
  ending	
  for	
  a	
  purpose	
  
•  Reduces	
  the	
  number	
  of	
  different	
  names	
  needed	
  
•  Use	
  seman+c	
  web	
  without	
  hiring	
  a	
  programmer	
  
‘?’	
  Inflec+on	
  returns	
  Dublin	
  Kernel	
  
Same	
  machine-­‐readable	
  informa+on	
  as	
  before:	
  
erc:!
who:       National Research Council!
what:      The Digital Dilemma!
when:      2000!
where:     http://books.nap.edu/html/digital%5Fdilemma!

Even	
  shorter:	
  
erc: National Research Council!
     | The Digital Dilemma | 2000 !
     | http://books.nap.edu/html/digital%5Fdilemma!

See	
  hlp://dublincore.org/groups/kernel/	
  for	
  more	
  informa+on!
Why	
  use	
  ARKs?	
  
ARKs	
  are	
  assigned	
  for	
  a	
  variety	
  of	
  reasons:	
  
•  affordability	
  –	
  there	
  are	
  no	
  fees	
  to	
  assign	
  or	
  use	
  ARKs	
  
•  self-­‐sufficiency	
  –	
  can	
  host	
  ARKs	
  on	
  your	
  own	
  web	
  server	
  
•  portability	
  –	
  can	
  move	
  ARKs	
  without	
  change	
  of	
  iden+ty	
  
         http://cdlib.org/ark:/12025/654xz321
       http://rutgers.edu/ark:/12025/654xz321
           http://n2t.net/ark:/12025/654xz321	
  

•  global	
  resolvability	
  –	
  can	
  host	
  ARKs	
  at	
  N2T	
  resolver	
  
•  density	
  –	
  mixed	
  case	
  means	
  CD,	
  Cd,	
  cD,	
  cd	
  are	
  all	
  dis+nct	
  
Some	
  unique	
  advantages	
  of	
  ARKs	
  
•  simplicity	
  –	
  uses	
  only	
  ordinary	
  "redirects”	
  &	
  "get"	
  requests	
  
•  versa+lity	
  –	
  with	
  "inflec+ons"	
  (different	
  endings),	
  an	
  ARK	
  
   should	
  access	
  data,	
  metadata,	
  promises,	
  and	
  more	
  
•  transparency	
  –	
  no	
  iden+fier	
  can	
  guarantee	
  stability,	
  and	
  
   ARK	
  inflec+ons	
  help	
  users	
  make	
  informed	
  judgments	
  
•  visibility	
  –	
  syntax	
  rules	
  make	
  ARKs	
  easy	
  to	
  extract	
  and	
  to	
  
   compare	
  for	
  containment	
  and	
  	
  variant	
  rela+onships	
  
•  reserved	
  characters:	
  	
  -­‐	
  (hyphen),	
  	
  /	
  (slash),	
  	
  .	
  (period)	
  
What’s	
  an	
  ARK	
  iden+fier?	
  (take	
  3)	
  
ARK	
  is	
  a	
  collec+on	
  of	
  good	
  ideas	
  
•  Separates	
  scheme	
  syntax	
  from	
  resolver	
  rules	
  
    –  Resolu1on	
  is	
  a	
  process	
  of	
  mapping	
  an	
  id	
  to	
  a	
  thing	
  
•  Separates	
  name	
  assigning	
  from	
  name	
  mapping	
  
•  All	
  schemes	
  encouraged	
  to	
  use	
  these	
  ideas,	
  even	
  
   ordinary	
  URLs	
  
•  N2T	
  resolver	
  can	
  support	
  them	
  for	
  any	
  scheme	
  
Iden+fier	
  schemes	
  are	
  highly	
  parallel	
  
                           Scheme
                              :
Name Mapping Authority        : Name Assigning Authority
          :     (NMA)         :    :        Number (NAAN)
          v                   v    v
|..........................|....+..................|
           http://dx.doi.org/doi:10.30/tqb3kh97gh8w
       http://hdl.handle.net/hdl:13030/tqb3kh97gh8w
                       http://purl.org/tqb3kh97gh8w
                       ...   urn:13030:tqb3kh97gh8w
             http://n2t.net/ark:/13030/tqb3kh97gh8w
 http://OwlBike.example.org/ark:/13030/tqb3kh97gh8w
|..........................|.......................|......
    Branded or neutral          Base identifier     Suffix
Locksmith	
  jargon:	
  shoulder,	
  blade,	
  +p,	
  bow,	
  cover	
  
       _____    slips on     _____
    .-' ,_,'-.. ----> .-'          '-.
   /    (o,o)         /             
  :     {`"'} ||       :               `____
 / .-. -"-"- ||       / .-.                 '--^.   .^--^.        .^.
{ (    )       ||    { (     )                   `-'      `-^--^-'   '--^.
  `-'    _o   ||      '-'            ===================================}
  :     _|<,_ ||       :               __________________________________/
      (*)/(*) /                     /
    `-._____.-'          `-._____.-'
|....................|...............|....|..........................|..|
          ^                      ^        ^                ^            ^
          :                      :        :                :            :
        Cover=                Bow=     Shoulder .------ Blade          Tip
         NMA              Scheme+NAAN     :       : .-------------------'
          :                   :     :     :      : :
          v                    v    v     v       v v
|..........................|....+.....|...|......|.|
 http://OwlBike.example.org/ark:/13030/tqb3kh97gh8w     <---- Example Key
                              doi:10.30/tqb3kh97gh8w          with parallel
                              hdl:13030/tqb3kh97gh8w         parts in other
                              urn:13030:tqb3kh97gh8w           id schemes.
|..........................|.......................|....
   Name Mapping Authority         Base identifier      ...
ARK	
  usage	
  in	
  10	
  years	
  
•  In	
  2001-­‐2011	
  ~100	
  organiza+ons	
  registered	
  for	
  ARKs	
  
•  Registry	
  is	
  replicated	
  at	
  BnF	
  and	
  NLM	
  
•  Some	
  of	
  the	
  largest	
  users	
  are	
  
    –  The	
  California	
  Digital	
  Library	
  
    –  The	
  Internet	
  Archive	
  
    –  Bibliothèque	
  na+onale	
  de	
  France	
  
    –  Por+co	
  Digital	
  Preserva+on	
  Service	
  
    –  University	
  of	
  California	
  Berkeley	
  
    –  University	
  of	
  Chicago	
  
Some	
  other	
  ARK	
  registrants	
  
	
  	
  	
  	
  	
  	
  12025	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  US	
  Na+onal	
  Library	
  of	
  Medicine	
  
	
  	
  	
  	
  	
  	
  86077	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Cornell	
  Ins+tute	
  for	
  Social	
  and	
  Economic	
  Research	
  
	
  	
  	
  	
  	
  	
  26677	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Library	
  and	
  Archives	
  Canada	
  
	
  	
  	
  	
  	
  	
  77635	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Humboldt-­‐Universität	
  zu	
  Berlin	
  
	
  	
  	
  	
  	
  	
  13038	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  World	
  Intellectual	
  Property	
  Organiza+on	
  
	
  	
  	
  	
  	
  	
  78319	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Google	
  
	
  	
  	
  	
  	
  	
  61001	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  University	
  of	
  Chicago	
  
	
  	
  	
  	
  	
  	
  28722	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  University	
  of	
  California	
  Berkeley	
  
	
  	
  	
  	
  	
  	
  64269	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  UK	
  Digital	
  Cura+on	
  Centre	
  
	
  	
  	
  	
  	
  	
  87895	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Centre	
  Informa+que	
  Na+onal	
  de	
  l'Enseignement	
  Supérieur	
  
	
  	
  	
  	
  	
  	
  61903	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Family	
  Search	
  
	
  	
  	
  	
  	
  	
  52327	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Na+onal	
  Library	
  and	
  Archives	
  of	
  Quebec	
  
	
  	
  	
  	
  	
  	
  10261	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Jüdisches	
  Museum	
  Berlin	
  
	
  	
  	
  	
  	
  	
  71479	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Spanish	
  Na+onal	
  Research	
  Council	
  
	
  	
  	
  	
  	
  	
  32833	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Massachusels	
  Ins+tute	
  of	
  Technology	
  
	
  	
  	
  	
  	
  	
  81055	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Bri+sh	
  Library	
  
	
  	
  	
  	
  	
  	
  80713	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Biblioteca	
  Nacional	
  de	
  Portugal	
  
Immersion	
  vs	
  landing	
  page	
  
What	
  do	
  you	
  mean	
  by	
  “get	
  the	
  data”?	
  
What	
  inflec+ons	
  might	
  dis+nguish	
  these?	
  


• Immersion	
  –	
  a	
  consump+ve	
  experience	
  or	
  



• Landing	
  page	
  –	
  a	
  menu-­‐study	
  experience?	
  
Vision	
  for	
  a	
  “data	
  paper”	
  	
  
•  Wrap	
  the	
  unfamiliar	
  in	
  a	
  familiar	
  façade	
  
•  A	
  “data	
  paper”	
  is	
  minimally	
  a	
  cover	
  sheet	
  
   and	
  a	
  set	
  of	
  links	
  to	
  archived	
  ar+facts	
  	
  
•  Cover	
  sheet	
  contains	
  familiar	
  elements:	
  
   +tle,	
  date,	
  authors,	
  abstract,	
  and	
  
   persistent	
  iden+fier	
  (DOI,	
  ARK,	
  etc.)	
  
•  Just	
  enough	
  to	
  permit	
  basic	
  exposure	
  and	
  
   discovery	
  
–  Building	
  a	
  basic	
  data	
  cita+on	
  	
  
–  Indexing	
  by	
  services	
  such	
  as	
  Web	
  of	
  
   Science,	
  Google	
  Scholar	
  
–  Ins+lling	
  	
  confidence	
  in	
  the	
  iden+fier’s	
  	
  
   stability	
  	
  
New	
  distributed	
  framework	
  
           Coordina9ng	
  Nodes	
              Flexible,	
  scalable,	
  
              Member	
  Nodes	
  
•  retain	
  complete	
  metadata	
  
                                              sustainable	
  network	
  
• 	
  catalog	
  	
   ins+tu+ons	
  
      	
  diverse	
  
•  subset	
  of	
  all	
  data	
  
• 	
  	
  serve	
  local	
  community	
  
•  perform	
  basic	
  indexing	
  
• 	
  provide	
  network-­‐wide	
  
•  	
  provide	
  resources	
  for	
  
managing	
  their	
  data	
  
     services	
  
•  ensure	
  data	
  availability	
  
     (preserva+on)	
  	
  	
  
•  provide	
  replica+on	
  
     services	
  
ARKs	
  –	
  coming	
  soon	
  
•  Community	
  forum	
  
•  Standardiza+on	
  as	
  an	
  Internet	
  RFC	
  
•  New	
  inflec+ons	
  for	
  landing	
  page	
  &	
  immersion	
  
N2T/EZID	
  –	
  coming	
  soon	
  
•  Indexing	
  by	
  A&I	
  vendors	
  
•  Suffix	
  pass-­‐through	
  
   –  Register	
  Name	
  -­‐>	
  target	
  T	
  
   –  Resolve	
  Name/a/b/c	
  -­‐>	
  T/a/b/c	
  automa+cally	
  
   –  Greatly	
  reduce	
  number	
  of	
  ids	
  to	
  manage	
  
•  URNs	
  

Contenu connexe

Similaire à The ARK Identifier Scheme at Ten Years Old

Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...MongoDB
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingPlanetData Network of Excellence
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...Oscar Corcho
 
Data Wrangling For Kaggle Data Science Competitions
Data Wrangling For Kaggle Data Science CompetitionsData Wrangling For Kaggle Data Science Competitions
Data Wrangling For Kaggle Data Science CompetitionsKrishna Sankar
 
Data Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldData Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldSøren Schaffstein
 
Javascript done right - Open Web Camp III
Javascript done right - Open Web Camp IIIJavascript done right - Open Web Camp III
Javascript done right - Open Web Camp IIIDirk Ginader
 
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark MeetupPySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark MeetupFrens Jan Rumph
 
Greg Hogan – To Petascale and Beyond- Apache Flink in the Clouds
Greg Hogan – To Petascale and Beyond- Apache Flink in the CloudsGreg Hogan – To Petascale and Beyond- Apache Flink in the Clouds
Greg Hogan – To Petascale and Beyond- Apache Flink in the CloudsFlink Forward
 
"R, Hadoop, and Amazon Web Services (20 December 2011)"
"R, Hadoop, and Amazon Web Services (20 December 2011)""R, Hadoop, and Amazon Web Services (20 December 2011)"
"R, Hadoop, and Amazon Web Services (20 December 2011)"Portland R User Group
 
Tech, Reference, AND PATRON Views of our new Front-End
Tech, Reference, AND PATRON Views of our new Front-EndTech, Reference, AND PATRON Views of our new Front-End
Tech, Reference, AND PATRON Views of our new Front-Endkramsey
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic WebRoberto García
 
Use r 2013 tutorial - r and cloud computing for higher education and research
Use r 2013   tutorial - r and cloud computing for higher education and researchUse r 2013   tutorial - r and cloud computing for higher education and research
Use r 2013 tutorial - r and cloud computing for higher education and researchkchine3
 
OU RSE Tutorial Big Data Cluster
OU RSE Tutorial Big Data ClusterOU RSE Tutorial Big Data Cluster
OU RSE Tutorial Big Data ClusterEnrico Daga
 

Similaire à The ARK Identifier Scheme at Ten Years Old (20)

Labs_20210809.pdf
Labs_20210809.pdfLabs_20210809.pdf
Labs_20210809.pdf
 
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
Big Data Analytics 3: Machine Learning to Engage the Customer, with Apache Sp...
 
Accessing Treasure on lands and peoples
Accessing Treasure on lands and peoplesAccessing Treasure on lands and peoples
Accessing Treasure on lands and peoples
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream Processing
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
Data Wrangling For Kaggle Data Science Competitions
Data Wrangling For Kaggle Data Science CompetitionsData Wrangling For Kaggle Data Science Competitions
Data Wrangling For Kaggle Data Science Competitions
 
Data Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into GoldData Alchemy: Turn your Data into Gold
Data Alchemy: Turn your Data into Gold
 
Paris Data Geeks
Paris Data GeeksParis Data Geeks
Paris Data Geeks
 
Data Science
Data ScienceData Science
Data Science
 
Javascript done right - Open Web Camp III
Javascript done right - Open Web Camp IIIJavascript done right - Open Web Camp III
Javascript done right - Open Web Camp III
 
R tutorial
R tutorialR tutorial
R tutorial
 
Spark etl
Spark etlSpark etl
Spark etl
 
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark MeetupPySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark Meetup
 
Greg Hogan – To Petascale and Beyond- Apache Flink in the Clouds
Greg Hogan – To Petascale and Beyond- Apache Flink in the CloudsGreg Hogan – To Petascale and Beyond- Apache Flink in the Clouds
Greg Hogan – To Petascale and Beyond- Apache Flink in the Clouds
 
"R, Hadoop, and Amazon Web Services (20 December 2011)"
"R, Hadoop, and Amazon Web Services (20 December 2011)""R, Hadoop, and Amazon Web Services (20 December 2011)"
"R, Hadoop, and Amazon Web Services (20 December 2011)"
 
R, Hadoop and Amazon Web Services
R, Hadoop and Amazon Web ServicesR, Hadoop and Amazon Web Services
R, Hadoop and Amazon Web Services
 
Tech, Reference, AND PATRON Views of our new Front-End
Tech, Reference, AND PATRON Views of our new Front-EndTech, Reference, AND PATRON Views of our new Front-End
Tech, Reference, AND PATRON Views of our new Front-End
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic Web
 
Use r 2013 tutorial - r and cloud computing for higher education and research
Use r 2013   tutorial - r and cloud computing for higher education and researchUse r 2013   tutorial - r and cloud computing for higher education and research
Use r 2013 tutorial - r and cloud computing for higher education and research
 
OU RSE Tutorial Big Data Cluster
OU RSE Tutorial Big Data ClusterOU RSE Tutorial Big Data Cluster
OU RSE Tutorial Big Data Cluster
 

Plus de John Kunze

The YAMZ Metadictionary
The YAMZ MetadictionaryThe YAMZ Metadictionary
The YAMZ MetadictionaryJohn Kunze
 
YAMZ Metadata Vocabulary Builder
YAMZ Metadata Vocabulary BuilderYAMZ Metadata Vocabulary Builder
YAMZ Metadata Vocabulary BuilderJohn Kunze
 
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...John Kunze
 
EZID and N2T at CDL
EZID and N2T at CDLEZID and N2T at CDL
EZID and N2T at CDLJohn Kunze
 
YAMZ.net: better, faster, cheaper taxonomy building
YAMZ.net:  better, faster, cheaper taxonomy buildingYAMZ.net:  better, faster, cheaper taxonomy building
YAMZ.net: better, faster, cheaper taxonomy buildingJohn Kunze
 
A Vocabulary for Persistence
A Vocabulary for PersistenceA Vocabulary for Persistence
A Vocabulary for PersistenceJohn Kunze
 
Identifiers obey Resolvers not Schemes
Identifiers obey Resolvers not SchemesIdentifiers obey Resolvers not Schemes
Identifiers obey Resolvers not SchemesJohn Kunze
 
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKsNames, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKsJohn Kunze
 
YAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabularyYAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabularyJohn Kunze
 
DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014John Kunze
 
Selected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout groupSelected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout groupJohn Kunze
 
Annotating Research Datasets
Annotating Research DatasetsAnnotating Research Datasets
Annotating Research DatasetsJohn Kunze
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
Library Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich ResearchLibrary Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich ResearchJohn Kunze
 
Big Data's Long Tail
Big Data's Long TailBig Data's Long Tail
Big Data's Long TailJohn Kunze
 
Scalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History CollectionsScalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History CollectionsJohn Kunze
 
Future-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do TodayFuture-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do TodayJohn Kunze
 
Supporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many FrontsSupporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many FrontsJohn Kunze
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsJohn Kunze
 

Plus de John Kunze (20)

The YAMZ Metadictionary
The YAMZ MetadictionaryThe YAMZ Metadictionary
The YAMZ Metadictionary
 
YAMZ Metadata Vocabulary Builder
YAMZ Metadata Vocabulary BuilderYAMZ Metadata Vocabulary Builder
YAMZ Metadata Vocabulary Builder
 
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
 
EZID and N2T at CDL
EZID and N2T at CDLEZID and N2T at CDL
EZID and N2T at CDL
 
YAMZ.net: better, faster, cheaper taxonomy building
YAMZ.net:  better, faster, cheaper taxonomy buildingYAMZ.net:  better, faster, cheaper taxonomy building
YAMZ.net: better, faster, cheaper taxonomy building
 
A Vocabulary for Persistence
A Vocabulary for PersistenceA Vocabulary for Persistence
A Vocabulary for Persistence
 
Identifiers obey Resolvers not Schemes
Identifiers obey Resolvers not SchemesIdentifiers obey Resolvers not Schemes
Identifiers obey Resolvers not Schemes
 
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKsNames, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
 
YAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabularyYAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabulary
 
DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014
 
Selected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout groupSelected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout group
 
Annotating Research Datasets
Annotating Research DatasetsAnnotating Research Datasets
Annotating Research Datasets
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
Library Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich ResearchLibrary Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich Research
 
Big Data's Long Tail
Big Data's Long TailBig Data's Long Tail
Big Data's Long Tail
 
Pamwg 2012ahm
Pamwg 2012ahmPamwg 2012ahm
Pamwg 2012ahm
 
Scalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History CollectionsScalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History Collections
 
Future-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do TodayFuture-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do Today
 
Supporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many FrontsSupporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many Fronts
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data Citations
 

Dernier

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesExploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesSanjay Willie
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 

Dernier (20)

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesExploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 

The ARK Identifier Scheme at Ten Years Old

  • 1. The  ARK  Iden+fier  Scheme  at   Ten  Years  Old   7   M a y   2 0 1 2   J o h n   K u n z e   U n i v e r s i t y   o f   C a l i f o r n i a   C u r a + o n   C e n t e r   C a l i f o r n i a   D i g i t a l   L i b r a r y  
  • 2. California  Digital  Library   Serving  the  University  of  California   CDL  supports  the  research  lifecycle     •  10  campuses   •  Collec+ons   •  360K  students,  faculty,  and  staff   •  Digital  Special  Collec+ons   •  100’s  of  museums,  art  galleries,   •  Discovery  &  Delivery   observatories,  marine  centers,   •  Publishing  Group   botanical  gardens   •  UC  Cura+on  Center  (UC3)   •  5  medical  centers   •  5  law  schools   •  3  Na+onal  Laboratories  
  • 4. Today’s  journey   • What  are  ARKs?   • Separa+on  of  concerns   • Naming  ≠  hos+ng   • Scheme  ≠  resolu+on   • Syntax  ≠  persistence   • Inflec+ons  and  metadata   • EZID  (easy  iden+fiers)  and   N2T  (name-­‐to-­‐thing)   • Data  cita+on,  passthrough  
  • 5. What’s  an  ARK  iden+fier?   ARK  =  Archival  Resource  Key   ARKs  support  long-­‐term  access  to  informa+on  objects   ARKs  iden+fy  objects  of  any  type:   •  digital  objects  –  data,  documents,  images,  sodware,  ...   •  physical  objects  –  books,  bones,  statues,  ...   •  groups  &  living  beings  –  people,  animals,  orchestras,  ...   •  Intangibles  –  places,  chemicals,  diseases,  terms,  ...  
  • 6. The  URL  is  dead,  long  live  the  URL!   Fallacy  #1:    URLs  are  unreliable,  so  instead  use   this...  um...  well...  ah  ...  (shhh!)  “URL”   Some  of  your  best  friends  are  URLs:   hlp://dx.doi.org/10.1234/98765   hlp://hdl.handle.net/10.1234/98765   hlp://purl.org/10.1234/98765   hlp://n2t.net/ark:/101234/98765  
  • 7. Persistence  is  about  service   •  Imagine  the  “perfect”  golden  iden+fier   •  Apply  bankruptcy,  disk  crash,  human  error,  or   war,  and  there’s  nothing  that  syntax,  scheme,  or   resolver  can  do  to  prevent  iden+fier  breakage.  
  • 8. What’s  an  ARK  iden+fier?  (take  2)   An  ARK  is  a  URL,  with  some  extra  rules   ARK  reserves  /  and  .  for  what  we  oden  assume   •  A/B/C  means  C  is  contained  in  A/B,  and  B  in  A   •  A.pdf,  A.html,  and  A.docx  are  all  variants  of  A   Could  dras+cally  improve  search  result  display   •  No  need  to  lookup  rela+onships  
  • 9. ARK  inflec+ons  (declina+ons)   An  ARK  is  a  special  URL  with  access  to  3  things   1.  An  informa+on  object   2.  Its  metadata,  by  appending  ‘?’  inflec+on   3.  A  provider’s  promise,  by  appending  a  ‘??’   An  inflec1on  changes  a  name  ending  for  a  purpose   •  Reduces  the  number  of  different  names  needed   •  Use  seman+c  web  without  hiring  a  programmer  
  • 10. ‘?’  Inflec+on  returns  Dublin  Kernel   Same  machine-­‐readable  informa+on  as  before:   erc:! who: National Research Council! what: The Digital Dilemma! when: 2000! where: http://books.nap.edu/html/digital%5Fdilemma! Even  shorter:   erc: National Research Council! | The Digital Dilemma | 2000 ! | http://books.nap.edu/html/digital%5Fdilemma! See  hlp://dublincore.org/groups/kernel/  for  more  informa+on!
  • 11. Why  use  ARKs?   ARKs  are  assigned  for  a  variety  of  reasons:   •  affordability  –  there  are  no  fees  to  assign  or  use  ARKs   •  self-­‐sufficiency  –  can  host  ARKs  on  your  own  web  server   •  portability  –  can  move  ARKs  without  change  of  iden+ty   http://cdlib.org/ark:/12025/654xz321 http://rutgers.edu/ark:/12025/654xz321 http://n2t.net/ark:/12025/654xz321   •  global  resolvability  –  can  host  ARKs  at  N2T  resolver   •  density  –  mixed  case  means  CD,  Cd,  cD,  cd  are  all  dis+nct  
  • 12. Some  unique  advantages  of  ARKs   •  simplicity  –  uses  only  ordinary  "redirects”  &  "get"  requests   •  versa+lity  –  with  "inflec+ons"  (different  endings),  an  ARK   should  access  data,  metadata,  promises,  and  more   •  transparency  –  no  iden+fier  can  guarantee  stability,  and   ARK  inflec+ons  help  users  make  informed  judgments   •  visibility  –  syntax  rules  make  ARKs  easy  to  extract  and  to   compare  for  containment  and    variant  rela+onships   •  reserved  characters:    -­‐  (hyphen),    /  (slash),    .  (period)  
  • 13. What’s  an  ARK  iden+fier?  (take  3)   ARK  is  a  collec+on  of  good  ideas   •  Separates  scheme  syntax  from  resolver  rules   –  Resolu1on  is  a  process  of  mapping  an  id  to  a  thing   •  Separates  name  assigning  from  name  mapping   •  All  schemes  encouraged  to  use  these  ideas,  even   ordinary  URLs   •  N2T  resolver  can  support  them  for  any  scheme  
  • 14. Iden+fier  schemes  are  highly  parallel   Scheme : Name Mapping Authority : Name Assigning Authority : (NMA) : : Number (NAAN) v v v |..........................|....+..................| http://dx.doi.org/doi:10.30/tqb3kh97gh8w http://hdl.handle.net/hdl:13030/tqb3kh97gh8w http://purl.org/tqb3kh97gh8w ... urn:13030:tqb3kh97gh8w http://n2t.net/ark:/13030/tqb3kh97gh8w http://OwlBike.example.org/ark:/13030/tqb3kh97gh8w |..........................|.......................|...... Branded or neutral Base identifier Suffix
  • 15. Locksmith  jargon:  shoulder,  blade,  +p,  bow,  cover   _____ slips on _____ .-' ,_,'-.. ----> .-' '-. / (o,o) / : {`"'} || : `____ / .-. -"-"- || / .-. '--^. .^--^. .^. { ( ) || { ( ) `-' `-^--^-' '--^. `-' _o || '-' ===================================} : _|<,_ || : __________________________________/ (*)/(*) / / `-._____.-' `-._____.-' |....................|...............|....|..........................|..| ^ ^ ^ ^ ^ : : : : : Cover= Bow= Shoulder .------ Blade Tip NMA Scheme+NAAN : : .-------------------' : : : : : : v v v v v v |..........................|....+.....|...|......|.| http://OwlBike.example.org/ark:/13030/tqb3kh97gh8w <---- Example Key doi:10.30/tqb3kh97gh8w with parallel hdl:13030/tqb3kh97gh8w parts in other urn:13030:tqb3kh97gh8w id schemes. |..........................|.......................|.... Name Mapping Authority Base identifier ...
  • 16. ARK  usage  in  10  years   •  In  2001-­‐2011  ~100  organiza+ons  registered  for  ARKs   •  Registry  is  replicated  at  BnF  and  NLM   •  Some  of  the  largest  users  are   –  The  California  Digital  Library   –  The  Internet  Archive   –  Bibliothèque  na+onale  de  France   –  Por+co  Digital  Preserva+on  Service   –  University  of  California  Berkeley   –  University  of  Chicago  
  • 17. Some  other  ARK  registrants              12025                      US  Na+onal  Library  of  Medicine              86077                      Cornell  Ins+tute  for  Social  and  Economic  Research              26677                      Library  and  Archives  Canada              77635                      Humboldt-­‐Universität  zu  Berlin              13038                      World  Intellectual  Property  Organiza+on              78319                      Google              61001                      University  of  Chicago              28722                      University  of  California  Berkeley              64269                      UK  Digital  Cura+on  Centre              87895                      Centre  Informa+que  Na+onal  de  l'Enseignement  Supérieur              61903                      Family  Search              52327                      Na+onal  Library  and  Archives  of  Quebec              10261                      Jüdisches  Museum  Berlin              71479                      Spanish  Na+onal  Research  Council              32833                      Massachusels  Ins+tute  of  Technology              81055                      Bri+sh  Library              80713                      Biblioteca  Nacional  de  Portugal  
  • 18. Immersion  vs  landing  page   What  do  you  mean  by  “get  the  data”?   What  inflec+ons  might  dis+nguish  these?   • Immersion  –  a  consump+ve  experience  or   • Landing  page  –  a  menu-­‐study  experience?  
  • 19.
  • 20. Vision  for  a  “data  paper”     •  Wrap  the  unfamiliar  in  a  familiar  façade   •  A  “data  paper”  is  minimally  a  cover  sheet   and  a  set  of  links  to  archived  ar+facts     •  Cover  sheet  contains  familiar  elements:   +tle,  date,  authors,  abstract,  and   persistent  iden+fier  (DOI,  ARK,  etc.)   •  Just  enough  to  permit  basic  exposure  and   discovery   –  Building  a  basic  data  cita+on     –  Indexing  by  services  such  as  Web  of   Science,  Google  Scholar   –  Ins+lling    confidence  in  the  iden+fier’s     stability    
  • 21. New  distributed  framework   Coordina9ng  Nodes   Flexible,  scalable,   Member  Nodes   •  retain  complete  metadata   sustainable  network   •   catalog     ins+tu+ons    diverse   •  subset  of  all  data   •     serve  local  community   •  perform  basic  indexing   •   provide  network-­‐wide   •   provide  resources  for   managing  their  data   services   •  ensure  data  availability   (preserva+on)       •  provide  replica+on   services  
  • 22. ARKs  –  coming  soon   •  Community  forum   •  Standardiza+on  as  an  Internet  RFC   •  New  inflec+ons  for  landing  page  &  immersion  
  • 23. N2T/EZID  –  coming  soon   •  Indexing  by  A&I  vendors   •  Suffix  pass-­‐through   –  Register  Name  -­‐>  target  T   –  Resolve  Name/a/b/c  -­‐>  T/a/b/c  automa+cally   –  Greatly  reduce  number  of  ids  to  manage   •  URNs