SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
ResourceSync - An Introduction



                                                    Todd Carpenter
                                           Executive Director, NISO
                                             Wolfram Data Summit
                                     Thursday, September 6, 2012
 With thanks to Herbert Van de Sompel and Robert Sanderson (LANL)
@TAC_NISO Twitter Highlights

•   Presenting this afternoon on the ResourceSync project at Wolfram Data Summit #wolframsummit

•   I’m pre-tweeing my slides during #rsync presentation. Slides will be posted later today
    #wolframsummit

•   NISO mission develop & maintain technical standards related to information, documentation, discovery
    & distribution of content #wolframsummit

•   Machines don’t talk like people do.  Then again some people don’t talk like other people do,
    particularly teenagers #wolframsummit

•   So where did the ResourceSync project start?  #NISO approached OAI about updating the PMH
    protocol. #wolframsummit

•   The #NISO / OAI ResourceSync project was possible through the generous support of the Alfred P.
    Sloan Foundation.  Thank you! #wolframsummit

•   What is RSync trying to solve (1/2): Source Server has resources that change. Destination servers want
    to leverage some/all of Source #wolframsummit

•   What is RSync trying to solve (2/2): How to sync on ongoing basis in near-real-time & at web scale
    with as little system overhead as poss #wolframsummit

•   RSync studied # of existing protocols to determine protocols that best meet needs. Bias against
    developing new spec from scratch. #wolframsummit

•   The goal of ResourceSync is to find the model that most efficiently distributes the content, while
    limiting the tax on the source system. #wolframsummit

•   Very early days in process of standards development. Still in incubation stage. Consensus & adoption
    phases coming ‘13 & beyond #wolframsummit

•   Draft alpha specification of ResourceSync posted in August, Team meeting in Sept to review comments
    #wolframsummit
About
Non-profit industry trade association accredited by ANSI
Mission of developing and maintaining technical standards
related to information, documentation, discovery and
distribution of published materials and media
125+ Member organizations
Volunteer driven organization: 400+ spread out across the
world
Represent US interests to ISO in the areas of Information &
Documentation
Standards	
  are	
  familiar,	
  even	
  if	
  you	
  don’t	
  no4ce




September	
  6,	
  2012           Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   4
Machines don’t talk like people do




September	
  6,	
  2012   Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   5
Machines talk like this




September	
  6,	
  2012         Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   6
How	
  did	
  we	
  get	
  here?
• OAI-­‐PMH	
  Protocol
        – Developed	
  in	
  2001	
  (v	
  1.1,	
  v	
  2.0	
  -­‐	
  2002)
        – Developed	
  by	
  Herbert	
  van	
  de	
  Sompel,	
  Carl	
  Lagoze,	
  
          Michael	
  Nelson,	
  and	
  Simeon	
  Warner
        – Fairly	
  wide	
  adopQon	
  in	
  scholarly	
  community
• In	
  spring	
  2011,	
  NISO	
  approached	
  OAI	
  to	
  discuss	
  
  updaQng	
  PMH	
  Protocol
• Response	
  was	
  “Let’s	
  try	
  something	
  else	
  more	
  in	
  
  line	
  with	
  more	
  modern	
  technology”	
  
September	
  6,	
  2012            Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter      7
A partnership is born
        Agreement to launch RSync as a
         NISO standards initiative
        Partnership on grant application
        OAI team comprised core
         technology team
        Vetting & education by NISO
September	
  6,	
  2012     Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   8
Special	
  thanks	
  are	
  due	
  to...	
  




September	
  6,	
  2012           Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   9
ResourceSync	
  Working	
  Group
Herbert Van de Sompel (Chair)
Los Alamos National Laboratory                                         Peter Murray
                                                                       Lyrasis
Todd Carpenter (Co-Chair)
National Information Standards Organization (NISO)                     Michael Nelson
                                                                       Old Dominion University
Nettie Lagace                                                          David Rosenthal
National Information Standards Organization (NISO)                     Stanford University

Manuel Bernhardt                                                       David Rosenthal
Delving B.V.                                                           LOCKSS

Kevin Ford                                                             Christian Sadilek
Library of Congress                                                    Red Hat

Bernhard Haslhofer                                                     Shlomo Sanders
Cornell University                                                     Ex Libris, Inc.

Richard Jones                                                          Robert Sanderson
Joint Information Systems Committee (JISC)                             Los Alamos National Laboratory

Martin Klein                                                           Sjoerd Siebinga
Los Alamos National Laboratory                                         Delving B.V.

Graham Klyne                                                           Ed Summers
Joint Information Systems Committee (JISC)                             Library of Congress

Carl Lagoze                                                            Simeon Warner
Cornell University                                                     Cornell University

Stuart Lewis                                                           Jeff Young
Joint Information Systems Committee (JISC)                             OCLC Online Computer Library Center

September	
  6,	
  2012                      Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter                   10
What	
  are	
  we	
  trying	
  to	
  do?
• Synchronize	
  web	
  resources	
  –	
  things	
  with	
  a	
  URI	
  
      that	
  can	
  be	
  dereferenced	
  and	
  are	
  cache-­‐able	
  
•     Improve	
  on	
  web	
  synchroniza>on	
  methods
•     For	
  small	
  websites/repositories	
  (a	
  few	
  
      resources)	
  to	
  large	
  repositories/datasets/linked	
  
      data	
  collec>ons	
  (many	
  millions	
  of	
  resources)
•     That	
  change	
  slowly	
  or	
  rapidly
•     Focus	
  on	
  needs	
  of	
  research	
  and	
  cultural	
  heritage	
  
      organiza>ons,	
  but	
  aim	
  for	
  generality	
  
September	
  6,	
  2012          Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   11
Use	
  Cases




September	
  6,	
  2012   Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   12
More	
  Use	
  Cases




September	
  6,	
  2012       Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   13
Not	
  (yet)	
  Use	
  Cases	
  
                             (i.e.:	
  Out	
  of	
  Scope)	
  

• Bidirectional synchronization

• Destination-defined selective synchronization
   (query)

• Bulk URI migration




September	
  6,	
  2012            Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   14
Use	
  cases	
  differ
                 How good is the synchronization?


                 Perfect                                                             Good	
  enough

        How fast is the synchronization?



                          Fast                                                       Fast	
  enough


September	
  6,	
  2012              Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter               15
3	
  disQnct	
  needs	
  regarding	
  resource	
  synchronizaQon

      Baseline	
  matching:	
  An	
  approach	
  to	
  allow	
  a	
  DesQnaQon	
  that	
  wants	
  
      to	
  start	
  synchronizing	
  with	
  a	
  Source	
  to	
  perform	
  an	
  iniQal	
  catch	
  up	
  
      –	
  Dump.

      Incremental	
  resource	
  synchronizaQon:	
  An	
  approach	
  to	
  allow	
  a	
  
      DesQnaQon	
  to	
  remain	
  up-­‐to-­‐date	
  regarding	
  changes	
  at	
  the	
  
      Source.

      Audit:	
  An	
  approach	
  to	
  allow	
  checking	
  whether	
  a	
  DesQnaQon	
  is	
  in	
  
      sync	
  with	
  a	
  Source	
  	
  –	
  Inventory.

      =>	
  All	
  3	
  are	
  considered	
  in	
  scope	
  for	
  ResourceSync


September	
  6,	
  2012                  Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter                    16
Incremental	
  Synchroniza9on	
  

        Change	
  NoQficaQon	
  (CN)
               Alert	
  that	
  something	
  happened	
  
                 (create,update,delete)


        Content	
  Transfer	
  (CT)
                    Transfer	
  of	
  just	
  the	
  change	
  or	
  the	
  full	
  resource


September	
  6,	
  2012                  Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter         17
Trivial	
  versus	
  OpQmal	
  Approaches
• Trivial	
  Approach	
  -­‐	
  Retrieve	
  &	
  Compare




• OpQmal	
  Approach	
  -­‐	
  push only the change to only the
      destinations monitoring the resource




September	
  6,	
  2012      Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   18
More	
  advanced	
  opQon
     Feed	
  Extension	
  SoluQon:
     ConQnue	
  the	
  Feed	
  paradigm,	
  but	
  introduce	
  
     aggregaQng	
  service	
  and	
  ping	
  noQficaQon	
  to	
  re-­‐pull	
  
     (simulated	
  push)
     Only	
  advantageous	
  if	
  Source	
  already	
  supports	
  a	
  Feed




September	
  6,	
  2012          Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   19
Change	
  NoQficaQon	
  -­‐	
  Protocols
           Atom PubSubHubbub (PuSH)
        XMPP
           PubSub extension
           BoSH (XMPP over HTTP)
        Comet / HTTP Streaming
           Open an HTTP connection and keep reading from it
           Bayeux Protocol
        Long Polling
           Keep HTTP connection open until a message, then reopen
           BoSH, Bayeux option
        WebSockets
           NullMQ / ZeroMQ
           XMPP over WebSockets?



September	
  6,	
  2012     Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   20
hjp://imgs.xkcd.com/comics/standards.png/

8/23/11              Data	
  AjribuQon	
  and	
  CitaQon	
  Workshop
                                                                       21
RSync	
  Alpha




September	
  6,	
  2012    Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   22
A Framework Based on Sitemaps
• Developing a Modular framework allowing selective
  deployment
• Sitemap is the core component throughout the framework
• Introducing extension elements and attributes:
   – In ResourceSync namespace (rs:) to accommodate
      synchronization needs
   – In XHTML namespace (xhtml:) mainly to
      accommodate discovery needs
• Reuse Sitemap format for Change Sets (both current and
  historical) and for manifest in Dump


September	
  6,	
  2012   Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   23
CommunicaQons	
  structure




September	
  6,	
  2012     Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   24
The	
  lifecycle	
  of	
  standards	
  




               You	
  are	
  here




September	
  6,	
  2012             Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   25
Timeline
• Project	
  Launch	
  =	
  November	
  2011
• Approved	
  work	
  item	
  =	
  December	
  2011
• Working	
  Group	
  formed	
  =	
  February	
  2012
• Webinar	
  on	
  project	
  =	
  March	
  2012
• JCDL	
  meeQng,	
  Washington	
  DC	
  =	
  June	
  2012
• Alpha	
  =	
  September	
  2012
• Team	
  meeQng,	
  Denver,	
  CO	
  =	
  September	
  2012	
  
        – forthcoming	
  D-­‐Lib	
  arQcle
• Beta/Dral	
  for	
  trail	
  use	
  =	
  ??	
  December	
  2012
• Comment	
  period	
  =	
  ??	
  December	
  2012	
  -­‐	
  March	
  2012
• Training	
  =	
  ??	
  May	
  -­‐	
  July	
  2013
• Approval	
  =	
  ??	
  December	
  2013
September	
  6,	
  2012                   Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   26
More	
  informaQon
Background	
  webinar	
  (March	
  6,	
  2012)	
  recording	
  

First	
  draL	
  spec:	
  hNp://www.openarchives.org/rs/0.1/
resourcesync	
  

Simulator	
  code	
  on	
  github	
  hNp://github.org/resync/simulator

NISO	
  workspace	
  hNp://www.niso.org/workrooms/
resourcesync/

List	
  for	
  public	
  comment	
  coming	
  soon

September	
  6,	
  2012        Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter   27
Standards for Data/Exchange:
            New Work areas?
Many potential areas for work in sharing of data including:
• Author/Contributor disambiguation & other issues
• Data Equivalence – How does one know that this thing and that are
  equivalent (i.e., contain same data)?
• Systemic metadata
  What is the form of this information?  
  What are its structural components?
• Archival issues
  Storage, physical level, metadata, but also migration issues
• Bibliographic information for discovery, delivery and reuse
• Bibliometrics / Assessment & impact
• Rights issues – Ownership, recognition, sharing, privacy
What are appropriate metrics?
• For datasets, what
  is a download?
• How does one use
  compare with
  another?
• Citation ecosystem
  needs to develop
• Should data papers
  count?
Data Equivalence
Basically, is an Excel file
equivalent to a text file?
Creation of a high-level
conceptual model of
data description
A “FRBR” for data
Defines the distinctions
between states &
transformations of data
Basis for identification &
description
Thank you!



                                                                                    Todd Carpenter
                                                                                  Executive Director
                                                                                tcarpenter@niso.org

        National Information Standards Organization (NISO)
        3600 Clipper Mill Road, Suite 302
        Baltimore, MD 21211 USA               NOTE	
  =>	
  NISO	
  HAS	
  MOVED!!	
  <=
        +1 (301) 654-2512
        www.niso.org


September	
  6,	
  2012               Wolfram	
  Data	
  Summit	
  -­‐	
  Carpenter                31

Contenu connexe

Tendances

"Web Archive services framework for tighter integration between the past and ...
"Web Archive services framework for tighter integration between the past and ..."Web Archive services framework for tighter integration between the past and ...
"Web Archive services framework for tighter integration between the past and ...Ahmed AlSum
 
Hadoop ecosystem for health/life sciences
Hadoop ecosystem for health/life sciencesHadoop ecosystem for health/life sciences
Hadoop ecosystem for health/life sciencesUri Laserson
 
SSHELCO 2016 metadata workshop
SSHELCO 2016 metadata workshopSSHELCO 2016 metadata workshop
SSHELCO 2016 metadata workshopWilliam Fee
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2tcloudcomputing-tw
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3tcloudcomputing-tw
 
Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...Humphrey Southall
 
Annotating Scholarly Resources
Annotating Scholarly ResourcesAnnotating Scholarly Resources
Annotating Scholarly ResourcesRobert Sanderson
 
Large Scale Data With Hadoop
Large Scale Data With HadoopLarge Scale Data With Hadoop
Large Scale Data With Hadoopguest27e6764
 
Druid Scaling Realtime Analytics
Druid Scaling Realtime AnalyticsDruid Scaling Realtime Analytics
Druid Scaling Realtime AnalyticsAaron Brooks
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTHerbert Van de Sompel
 
20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introduction20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introductionXuan-Chao Huang
 
Introduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduceIntroduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduceeakasit_dpu
 
Hadoop tools with Examples
Hadoop tools with ExamplesHadoop tools with Examples
Hadoop tools with ExamplesJoe McTee
 
Borthakur hadoop univ-research
Borthakur hadoop univ-researchBorthakur hadoop univ-research
Borthakur hadoop univ-researchsaintdevil163
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014eswcsummerschool
 

Tendances (20)

NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...
NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...
NISO Forum, Denver, September 24, 2012: ResourceSync: Web-Based Resource Sync...
 
"Web Archive services framework for tighter integration between the past and ...
"Web Archive services framework for tighter integration between the past and ..."Web Archive services framework for tighter integration between the past and ...
"Web Archive services framework for tighter integration between the past and ...
 
Hadoop ecosystem for health/life sciences
Hadoop ecosystem for health/life sciencesHadoop ecosystem for health/life sciences
Hadoop ecosystem for health/life sciences
 
SSHELCO 2016 metadata workshop
SSHELCO 2016 metadata workshopSSHELCO 2016 metadata workshop
SSHELCO 2016 metadata workshop
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
 
Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...Maintaining scholarly standards in the digital age: Publishing historical gaz...
Maintaining scholarly standards in the digital age: Publishing historical gaz...
 
Annotating Scholarly Resources
Annotating Scholarly ResourcesAnnotating Scholarly Resources
Annotating Scholarly Resources
 
Large Scale Data With Hadoop
Large Scale Data With HadoopLarge Scale Data With Hadoop
Large Scale Data With Hadoop
 
Hadoop Family and Ecosystem
Hadoop Family and EcosystemHadoop Family and Ecosystem
Hadoop Family and Ecosystem
 
Druid Scaling Realtime Analytics
Druid Scaling Realtime AnalyticsDruid Scaling Realtime Analytics
Druid Scaling Realtime Analytics
 
List of Engineering Colleges in Uttarakhand
List of Engineering Colleges in UttarakhandList of Engineering Colleges in Uttarakhand
List of Engineering Colleges in Uttarakhand
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
 
20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introduction20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introduction
 
Introduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduceIntroduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduce
 
Memento 101
Memento 101Memento 101
Memento 101
 
Hadoop tools with Examples
Hadoop tools with ExamplesHadoop tools with Examples
Hadoop tools with Examples
 
Borthakur hadoop univ-research
Borthakur hadoop univ-researchBorthakur hadoop univ-research
Borthakur hadoop univ-research
 
Reminiscing about interoperability
Reminiscing about interoperabilityReminiscing about interoperability
Reminiscing about interoperability
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
 

Similaire à Carpenter - Wolfram Data Summit ResourceSync

A Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology ConferenceA Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology ConferenceBasis Technology
 
Deep Learning and Recurrent Neural Networks in the Enterprise
Deep Learning and Recurrent Neural Networks in the EnterpriseDeep Learning and Recurrent Neural Networks in the Enterprise
Deep Learning and Recurrent Neural Networks in the EnterpriseJosh Patterson
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxLIS EPI Meeting
 
Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...Research Data Alliance
 
EDF2012 Peter Boncz - LOD benchmarking SRbench
EDF2012   Peter Boncz - LOD benchmarking SRbenchEDF2012   Peter Boncz - LOD benchmarking SRbench
EDF2012 Peter Boncz - LOD benchmarking SRbenchEuropean Data Forum
 
OAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumOAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumRobert Sanderson
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataPascal-Nicolas Becker
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard
 
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...Daniel Bryant
 
20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicagoDeborah McGuinness
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic WebRoberto García
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
ESWC 2015 Closing and "General Chair's minute of Madness"
ESWC 2015 Closing and "General Chair's minute of Madness"ESWC 2015 Closing and "General Chair's minute of Madness"
ESWC 2015 Closing and "General Chair's minute of Madness"Fabien Gandon
 

Similaire à Carpenter - Wolfram Data Summit ResourceSync (20)

ResourceSync - An Introduction
ResourceSync - An IntroductionResourceSync - An Introduction
ResourceSync - An Introduction
 
A Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology ConferenceA Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology Conference
 
Deep Learning and Recurrent Neural Networks in the Enterprise
Deep Learning and Recurrent Neural Networks in the EnterpriseDeep Learning and Recurrent Neural Networks in the Enterprise
Deep Learning and Recurrent Neural Networks in the Enterprise
 
ODSC and iRODS
ODSC and iRODSODSC and iRODS
ODSC and iRODS
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-redux
 
Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...
 
EDF2012 Peter Boncz - LOD benchmarking SRbench
EDF2012   Peter Boncz - LOD benchmarking SRbenchEDF2012   Peter Boncz - LOD benchmarking SRbench
EDF2012 Peter Boncz - LOD benchmarking SRbench
 
OAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumOAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall Forum
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked Data
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
 
Spark
SparkSpark
Spark
 
20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago
 
ApacheCon NA 2013
ApacheCon NA 2013ApacheCon NA 2013
ApacheCon NA 2013
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic Web
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
ESWC 2015 Closing and "General Chair's minute of Madness"
ESWC 2015 Closing and "General Chair's minute of Madness"ESWC 2015 Closing and "General Chair's minute of Madness"
ESWC 2015 Closing and "General Chair's minute of Madness"
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 

Dernier (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 

Carpenter - Wolfram Data Summit ResourceSync

  • 1. ResourceSync - An Introduction Todd Carpenter Executive Director, NISO Wolfram Data Summit Thursday, September 6, 2012 With thanks to Herbert Van de Sompel and Robert Sanderson (LANL)
  • 2. @TAC_NISO Twitter Highlights • Presenting this afternoon on the ResourceSync project at Wolfram Data Summit #wolframsummit • I’m pre-tweeing my slides during #rsync presentation. Slides will be posted later today #wolframsummit • NISO mission develop & maintain technical standards related to information, documentation, discovery & distribution of content #wolframsummit • Machines don’t talk like people do.  Then again some people don’t talk like other people do, particularly teenagers #wolframsummit • So where did the ResourceSync project start?  #NISO approached OAI about updating the PMH protocol. #wolframsummit • The #NISO / OAI ResourceSync project was possible through the generous support of the Alfred P. Sloan Foundation.  Thank you! #wolframsummit • What is RSync trying to solve (1/2): Source Server has resources that change. Destination servers want to leverage some/all of Source #wolframsummit • What is RSync trying to solve (2/2): How to sync on ongoing basis in near-real-time & at web scale with as little system overhead as poss #wolframsummit • RSync studied # of existing protocols to determine protocols that best meet needs. Bias against developing new spec from scratch. #wolframsummit • The goal of ResourceSync is to find the model that most efficiently distributes the content, while limiting the tax on the source system. #wolframsummit • Very early days in process of standards development. Still in incubation stage. Consensus & adoption phases coming ‘13 & beyond #wolframsummit • Draft alpha specification of ResourceSync posted in August, Team meeting in Sept to review comments #wolframsummit
  • 3. About Non-profit industry trade association accredited by ANSI Mission of developing and maintaining technical standards related to information, documentation, discovery and distribution of published materials and media 125+ Member organizations Volunteer driven organization: 400+ spread out across the world Represent US interests to ISO in the areas of Information & Documentation
  • 4. Standards  are  familiar,  even  if  you  don’t  no4ce September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 4
  • 5. Machines don’t talk like people do September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 5
  • 6. Machines talk like this September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 6
  • 7. How  did  we  get  here? • OAI-­‐PMH  Protocol – Developed  in  2001  (v  1.1,  v  2.0  -­‐  2002) – Developed  by  Herbert  van  de  Sompel,  Carl  Lagoze,   Michael  Nelson,  and  Simeon  Warner – Fairly  wide  adopQon  in  scholarly  community • In  spring  2011,  NISO  approached  OAI  to  discuss   updaQng  PMH  Protocol • Response  was  “Let’s  try  something  else  more  in   line  with  more  modern  technology”   September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 7
  • 8. A partnership is born Agreement to launch RSync as a NISO standards initiative Partnership on grant application OAI team comprised core technology team Vetting & education by NISO September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 8
  • 9. Special  thanks  are  due  to...   September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 9
  • 10. ResourceSync  Working  Group Herbert Van de Sompel (Chair) Los Alamos National Laboratory Peter Murray Lyrasis Todd Carpenter (Co-Chair) National Information Standards Organization (NISO) Michael Nelson Old Dominion University Nettie Lagace David Rosenthal National Information Standards Organization (NISO) Stanford University Manuel Bernhardt David Rosenthal Delving B.V. LOCKSS Kevin Ford Christian Sadilek Library of Congress Red Hat Bernhard Haslhofer Shlomo Sanders Cornell University Ex Libris, Inc. Richard Jones Robert Sanderson Joint Information Systems Committee (JISC) Los Alamos National Laboratory Martin Klein Sjoerd Siebinga Los Alamos National Laboratory Delving B.V. Graham Klyne Ed Summers Joint Information Systems Committee (JISC) Library of Congress Carl Lagoze Simeon Warner Cornell University Cornell University Stuart Lewis Jeff Young Joint Information Systems Committee (JISC) OCLC Online Computer Library Center September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 10
  • 11. What  are  we  trying  to  do? • Synchronize  web  resources  –  things  with  a  URI   that  can  be  dereferenced  and  are  cache-­‐able   • Improve  on  web  synchroniza>on  methods • For  small  websites/repositories  (a  few   resources)  to  large  repositories/datasets/linked   data  collec>ons  (many  millions  of  resources) • That  change  slowly  or  rapidly • Focus  on  needs  of  research  and  cultural  heritage   organiza>ons,  but  aim  for  generality   September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 11
  • 12. Use  Cases September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 12
  • 13. More  Use  Cases September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 13
  • 14. Not  (yet)  Use  Cases   (i.e.:  Out  of  Scope)   • Bidirectional synchronization • Destination-defined selective synchronization (query) • Bulk URI migration September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 14
  • 15. Use  cases  differ How good is the synchronization? Perfect Good  enough How fast is the synchronization? Fast Fast  enough September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 15
  • 16. 3  disQnct  needs  regarding  resource  synchronizaQon Baseline  matching:  An  approach  to  allow  a  DesQnaQon  that  wants   to  start  synchronizing  with  a  Source  to  perform  an  iniQal  catch  up   –  Dump. Incremental  resource  synchronizaQon:  An  approach  to  allow  a   DesQnaQon  to  remain  up-­‐to-­‐date  regarding  changes  at  the   Source. Audit:  An  approach  to  allow  checking  whether  a  DesQnaQon  is  in   sync  with  a  Source    –  Inventory. =>  All  3  are  considered  in  scope  for  ResourceSync September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 16
  • 17. Incremental  Synchroniza9on   Change  NoQficaQon  (CN) Alert  that  something  happened   (create,update,delete) Content  Transfer  (CT) Transfer  of  just  the  change  or  the  full  resource September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 17
  • 18. Trivial  versus  OpQmal  Approaches • Trivial  Approach  -­‐  Retrieve  &  Compare • OpQmal  Approach  -­‐  push only the change to only the destinations monitoring the resource September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 18
  • 19. More  advanced  opQon Feed  Extension  SoluQon: ConQnue  the  Feed  paradigm,  but  introduce   aggregaQng  service  and  ping  noQficaQon  to  re-­‐pull   (simulated  push) Only  advantageous  if  Source  already  supports  a  Feed September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 19
  • 20. Change  NoQficaQon  -­‐  Protocols Atom PubSubHubbub (PuSH) XMPP PubSub extension BoSH (XMPP over HTTP) Comet / HTTP Streaming Open an HTTP connection and keep reading from it Bayeux Protocol Long Polling Keep HTTP connection open until a message, then reopen BoSH, Bayeux option WebSockets NullMQ / ZeroMQ XMPP over WebSockets? September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 20
  • 21. hjp://imgs.xkcd.com/comics/standards.png/ 8/23/11 Data  AjribuQon  and  CitaQon  Workshop 21
  • 22. RSync  Alpha September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 22
  • 23. A Framework Based on Sitemaps • Developing a Modular framework allowing selective deployment • Sitemap is the core component throughout the framework • Introducing extension elements and attributes: – In ResourceSync namespace (rs:) to accommodate synchronization needs – In XHTML namespace (xhtml:) mainly to accommodate discovery needs • Reuse Sitemap format for Change Sets (both current and historical) and for manifest in Dump September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 23
  • 24. CommunicaQons  structure September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 24
  • 25. The  lifecycle  of  standards   You  are  here September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 25
  • 26. Timeline • Project  Launch  =  November  2011 • Approved  work  item  =  December  2011 • Working  Group  formed  =  February  2012 • Webinar  on  project  =  March  2012 • JCDL  meeQng,  Washington  DC  =  June  2012 • Alpha  =  September  2012 • Team  meeQng,  Denver,  CO  =  September  2012   – forthcoming  D-­‐Lib  arQcle • Beta/Dral  for  trail  use  =  ??  December  2012 • Comment  period  =  ??  December  2012  -­‐  March  2012 • Training  =  ??  May  -­‐  July  2013 • Approval  =  ??  December  2013 September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 26
  • 27. More  informaQon Background  webinar  (March  6,  2012)  recording   First  draL  spec:  hNp://www.openarchives.org/rs/0.1/ resourcesync   Simulator  code  on  github  hNp://github.org/resync/simulator NISO  workspace  hNp://www.niso.org/workrooms/ resourcesync/ List  for  public  comment  coming  soon September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 27
  • 28. Standards for Data/Exchange: New Work areas? Many potential areas for work in sharing of data including: • Author/Contributor disambiguation & other issues • Data Equivalence – How does one know that this thing and that are equivalent (i.e., contain same data)? • Systemic metadata What is the form of this information?   What are its structural components? • Archival issues Storage, physical level, metadata, but also migration issues • Bibliographic information for discovery, delivery and reuse • Bibliometrics / Assessment & impact • Rights issues – Ownership, recognition, sharing, privacy
  • 29. What are appropriate metrics? • For datasets, what is a download? • How does one use compare with another? • Citation ecosystem needs to develop • Should data papers count?
  • 30. Data Equivalence Basically, is an Excel file equivalent to a text file? Creation of a high-level conceptual model of data description A “FRBR” for data Defines the distinctions between states & transformations of data Basis for identification & description
  • 31. Thank you! Todd Carpenter Executive Director tcarpenter@niso.org National Information Standards Organization (NISO) 3600 Clipper Mill Road, Suite 302 Baltimore, MD 21211 USA NOTE  =>  NISO  HAS  MOVED!!  <= +1 (301) 654-2512 www.niso.org September  6,  2012 Wolfram  Data  Summit  -­‐  Carpenter 31