ResourceSync - An Introduction



                                                    Todd Carpenter
                                           Executive Director, NISO
                                             Wolfram Data Summit
                                     Thursday, September 6, 2012
 With thanks to Herbert Van de Sompel and Robert Sanderson (LANL)
@TAC_NISO Twitter Highlights

•   Presenting this afternoon on the ResourceSync project at Wolfram Data Summit #wolframsummit

•   I’m pre-tweeting my slides during #rsync presentation. Slides will be posted later today
    #wolframsummit

•   NISO mission develop & maintain technical standards related to information, documentation, discovery
    & distribution of content #wolframsummit

•   Machines don’t talk like people do.  Then again some people don’t talk like other people do,
    particularly teenagers #wolframsummit

•   So where did the ResourceSync project start?  #NISO approached OAI about updating the PMH
    protocol. #wolframsummit

•   The #NISO / OAI ResourceSync project was possible through the generous support of the Alfred P.
    Sloan Foundation.  Thank you! #wolframsummit

•   What is RSync trying to solve (1/2): Source Server has resources that change. Destination servers want
    to leverage some/all of Source #wolframsummit

•   What is RSync trying to solve (2/2): How to sync on ongoing basis in near-real-time & at web scale
    with as little system overhead as poss #wolframsummit

•   RSync studied # of existing protocols to determine protocols that best meet needs. Bias against
    developing new spec from scratch. #wolframsummit

•   The goal of ResourceSync is to find the model that most efficiently distributes the content, while
    limiting the tax on the source system. #wolframsummit

•   Very early days in process of standards development. Still in incubation stage. Consensus & adoption
    phases coming ‘13 & beyond #wolframsummit

•   Draft alpha specification of ResourceSync posted in August, Team meeting in Sept to review comments
    #wolframsummit
About
Non-profit industry trade association accredited by ANSI
Mission of developing and maintaining technical standards
related to information, documentation, discovery and
distribution of published materials and media
125+ Member organizations
Volunteer driven organization: 400+ spread out across the
world
Represent US interests to ISO in the areas of Information &
Documentation
Standards are familiar, even if you don’t notice




September 6, 2012    Wolfram Data Summit - Carpenter    4
Machines don’t talk like people do




Machines talk like this




How did we get here?
• OAI-PMH Protocol
        – Developed in 2001 (v 1.1, v 2.0 - 2002)
        – Developed by Herbert van de Sompel, Carl Lagoze,
          Michael Nelson, and Simeon Warner
        – Fairly wide adoption in scholarly community
• In spring 2011, NISO approached OAI to discuss
  updating PMH Protocol
• Response was “Let’s try something else more in
  line with more modern technology”
A partnership is born
        Agreement to launch RSync as a
         NISO standards initiative
        Partnership on grant application
        OAI team comprised core
         technology team
        Vetting & education by NISO
Special thanks are due to...




ResourceSync Working Group
Herbert Van de Sompel (Chair), Los Alamos National Laboratory
Todd Carpenter (Co-Chair), National Information Standards Organization (NISO)
Nettie Lagace, National Information Standards Organization (NISO)
Manuel Bernhardt, Delving B.V.
Kevin Ford, Library of Congress
Bernhard Haslhofer, Cornell University
Richard Jones, Joint Information Systems Committee (JISC)
Martin Klein, Los Alamos National Laboratory
Graham Klyne, Joint Information Systems Committee (JISC)
Carl Lagoze, Cornell University
Stuart Lewis, Joint Information Systems Committee (JISC)
Peter Murray, Lyrasis
Michael Nelson, Old Dominion University
David Rosenthal, Stanford University
David Rosenthal, LOCKSS
Christian Sadilek, Red Hat
Shlomo Sanders, Ex Libris, Inc.
Robert Sanderson, Los Alamos National Laboratory
Sjoerd Siebinga, Delving B.V.
Ed Summers, Library of Congress
Simeon Warner, Cornell University
Jeff Young, OCLC Online Computer Library Center

What are we trying to do?
• Synchronize web resources – things with a URI
  that can be dereferenced and are cache-able
• Improve on web synchronization methods
• For small websites/repositories (a few
  resources) to large repositories/datasets/linked
  data collections (many millions of resources)
• That change slowly or rapidly
• Focus on needs of research and cultural heritage
  organizations, but aim for generality
Use Cases




More Use Cases




Not (yet) Use Cases (i.e.: Out of Scope)

• Bidirectional synchronization

• Destination-defined selective synchronization
   (query)

• Bulk URI migration




Use cases differ
How good is the synchronization? Perfect vs. Good enough
How fast is the synchronization? Fast vs. Fast enough


3 distinct needs regarding resource synchronization

      Baseline matching: An approach to allow a Destination that wants
      to start synchronizing with a Source to perform an initial catch up
      – Dump.

      Incremental resource synchronization: An approach to allow a
      Destination to remain up-to-date regarding changes at the
      Source.

      Audit: An approach to allow checking whether a Destination is in
      sync with a Source – Inventory.

      => All 3 are considered in scope for ResourceSync
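
The Audit need can be illustrated with a minimal sketch. It assumes, purely for illustration, that a Source exposes an inventory mapping each resource URI to a content digest and that a Destination can build the same mapping for its own copies. The function name `audit`, the dictionary layout, and the example URIs are hypothetical and are not taken from the ResourceSync draft.

```python
def audit(source_inventory, destination_inventory):
    """Compare two {uri: digest} inventories and report how they differ."""
    src_uris = set(source_inventory)
    dst_uris = set(destination_inventory)
    return {
        # At the Source but never copied to the Destination.
        "missing": sorted(src_uris - dst_uris),
        # Still at the Destination but no longer at the Source.
        "extra": sorted(dst_uris - src_uris),
        # Present on both sides, but the content digests disagree.
        "stale": sorted(
            uri for uri in src_uris & dst_uris
            if source_inventory[uri] != destination_inventory[uri]
        ),
    }


# Hypothetical inventories; the digests are shortened placeholders.
source = {
    "http://example.org/a": "d1",
    "http://example.org/b": "d2",
    "http://example.org/c": "d3",
}
destination = {
    "http://example.org/a": "d1",
    "http://example.org/b": "old",
    "http://example.org/x": "d9",
}

report = audit(source, destination)
in_sync = not any(report.values())
```

An empty report means the Destination matches the Source; anything else tells the Destination which resources to fetch again or discard.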


Incremental Synchronization

        Change Notification (CN)
               Alert that something happened
                 (create,update,delete)

        Content Transfer (CT)
               Transfer of just the change or the full resource
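
The split between Change Notification and Content Transfer can be sketched as follows. This is an illustration under stated assumptions, not the ResourceSync wire format: the `Change` record, the `apply_changes` helper, and the in-memory `source_store` are hypothetical, and Content Transfer is reduced to fetching the full resource.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Change:
    """A change notification: which resource changed, and how."""
    uri: str
    kind: str  # "create", "update" or "delete"


def apply_changes(copy: Dict[str, bytes],
                  changes: List[Change],
                  fetch: Callable[[str], bytes]) -> None:
    """Bring a Destination copy up to date from a list of notifications."""
    for change in changes:
        if change.kind == "delete":
            copy.pop(change.uri, None)
        elif change.kind in ("create", "update"):
            # Content Transfer step: here the full resource is retrieved.
            copy[change.uri] = fetch(change.uri)
        else:
            raise ValueError(f"unknown change kind: {change.kind}")


# Hypothetical Source state after the changes, and a stale Destination copy.
source_store = {"/a": b"v2", "/c": b"new"}
copy = {"/a": b"v1", "/b": b"gone"}
changes = [Change("/a", "update"), Change("/b", "delete"), Change("/c", "create")]

apply_changes(copy, changes, source_store.__getitem__)
```

Only the three notified resources are touched, which is the point of notifying instead of re-reading everything.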


Trivial versus Optimal Approaches
• Trivial Approach - Retrieve & Compare

• Optimal Approach - push only the change to only the
      destinations monitoring the resource
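
A minimal sketch of the trivial Retrieve & Compare approach shows why it taxes the Source. The helper names, the use of SHA-256 digests for the comparison, and the in-memory `source` dictionary standing in for HTTP retrieval are all assumptions made for illustration.

```python
import hashlib


def digest(content: bytes) -> str:
    """Fingerprint of a resource's content."""
    return hashlib.sha256(content).hexdigest()


def retrieve_and_compare(uris, fetch, known_digests):
    """Fetch every resource and return the URIs whose content changed."""
    changed = []
    for uri in uris:
        content = fetch(uri)  # every resource is transferred on every pass
        current = digest(content)
        if known_digests.get(uri) != current:
            known_digests[uri] = current
            changed.append(uri)
    return changed


# Hypothetical Source with two resources.
source = {"u1": b"one", "u2": b"two"}
known = {}

first_pass = retrieve_and_compare(list(source), source.__getitem__, known)
source["u2"] = b"two, revised"
second_pass = retrieve_and_compare(list(source), source.__getitem__, known)
```

The second pass detects a single change but still retrieves both resources, so the cost grows with the size of the collection and not with the number of changes.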




More advanced option
     Feed Extension Solution:
     Continue the Feed paradigm, but introduce
     aggregating service and ping notification to re-pull
     (simulated push)
     Only advantageous if Source already supports a Feed




Change Notification - Protocols
           Atom PubSubHubbub (PuSH)
        XMPP
           PubSub extension
           BoSH (XMPP over HTTP)
        Comet / HTTP Streaming
           Open an HTTP connection and keep reading from it
           Bayeux Protocol
        Long Polling
           Keep HTTP connection open until a message, then reopen
           BoSH, Bayeux option
        WebSockets
           NullMQ / ZeroMQ
           XMPP over WebSockets?
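
Of the options listed, long polling is the simplest to sketch. The example below assumes a blocking `wait_for_message` callable that returns a message, or `None` when the request times out; a local queue stands in for the HTTP connection so the sketch runs without a server. All names here are hypothetical.

```python
import queue


def make_waiter(messages, timeout):
    """Simulate one long-poll request against an in-memory queue."""
    def wait_for_message():
        try:
            # Block until a message arrives or the request times out.
            return messages.get(timeout=timeout)
        except queue.Empty:
            return None
    return wait_for_message


def long_poll(wait_for_message, handle, max_requests):
    """Issue requests one after another, handling each message received."""
    for _ in range(max_requests):
        message = wait_for_message()  # the connection stays open while waiting
        if message is not None:
            handle(message)
        # Falling through to the next iteration reopens the connection.


pending = queue.Queue()
pending.put("update /a")
pending.put("delete /b")

received = []
long_poll(make_waiter(pending, timeout=0.01), received.append, max_requests=3)
```

The third request times out with nothing to deliver, which is the idle cost a Destination pays for staying connected.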



http://imgs.xkcd.com/comics/standards.png/

8/23/11    Data Attribution and Citation Workshop    21
RSync Alpha




A Framework Based on Sitemaps
• Developing a Modular framework allowing selective
  deployment
• Sitemap is the core component throughout the framework
• Introducing extension elements and attributes:
   – In ResourceSync namespace (rs:) to accommodate
      synchronization needs
   – In XHTML namespace (xhtml:) mainly to
      accommodate discovery needs
• Reuse Sitemap format for Change Sets (both current and
  historical) and for manifest in Dump
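
To make the Sitemap-based framework concrete, here is a minimal sketch that builds a Sitemap document carrying one extension element per resource. The Sitemap namespace is the published one; the `rs` namespace URI and the `rs:size` element are placeholders invented for this example and should not be read as the elements defined in the ResourceSync draft.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
# Placeholder URI: NOT the real ResourceSync namespace.
RS_NS = "http://example.org/rs-placeholder"


def build_sitemap(resources):
    """Serialize resources as a Sitemap with a hypothetical rs:size element."""
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("rs", RS_NS)
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for res in resources:
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = res["loc"]
        ET.SubElement(url, f"{{{SITEMAP_NS}}}lastmod").text = res["lastmod"]
        # Extension element in the (placeholder) synchronization namespace.
        ET.SubElement(url, f"{{{RS_NS}}}size").text = str(res["size"])
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical resources at a Source.
resources = [
    {"loc": "http://example.org/res1", "lastmod": "2012-09-01", "size": 1024},
    {"loc": "http://example.org/res2", "lastmod": "2012-09-05", "size": 2048},
]
sitemap_xml = build_sitemap(resources)
```

Because the extensions live in their own namespace, a consumer that understands only plain Sitemaps can still read the `loc` and `lastmod` values and ignore the rest.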


Communications structure




The lifecycle of standards




               You are here




Timeline
• Project Launch = November 2011
• Approved work item = December 2011
• Working Group formed = February 2012
• Webinar on project = March 2012
• JCDL meeting, Washington DC = June 2012
• Alpha = September 2012
• Team meeting, Denver, CO = September 2012
        – forthcoming D-Lib article
• Beta/Draft for trial use = ?? December 2012
• Comment period = ?? December 2012 - March 2013
• Training = ?? May - July 2013
• Approval = ?? December 2013
More information
Background webinar (March 6, 2012) recording

First draft spec: http://www.openarchives.org/rs/0.1/resourcesync

Simulator code on github http://github.org/resync/simulator

NISO workspace http://www.niso.org/workrooms/resourcesync/

List for public comment coming soon

Standards for Data/Exchange:
            New Work areas?
Many potential areas for work in sharing of data including:
• Author/Contributor disambiguation & other issues
• Data Equivalence – How does one know that this thing and that are
  equivalent (i.e., contain same data)?
• Systemic metadata
  What is the form of this information?  
  What are its structural components?
• Archival issues
  Storage, physical level, metadata, but also migration issues
• Bibliographic information for discovery, delivery and reuse
• Bibliometrics / Assessment & impact
• Rights issues – Ownership, recognition, sharing, privacy
What are appropriate metrics?
• For datasets, what
  is a download?
• How does one use
  compare with
  another?
• Citation ecosystem
  needs to develop
• Should data papers
  count?
Data Equivalence
Basically, is an Excel file
equivalent to a text file?
Creation of a high-level
conceptual model of
data description
A “FRBR” for data
Defines the distinctions
between states &
transformations of data
Basis for identification &
description
Thank you!



                                                                                    Todd Carpenter
                                                                                  Executive Director
                                                                                tcarpenter@niso.org

        National Information Standards Organization (NISO)
        3600 Clipper Mill Road, Suite 302
        Baltimore, MD 21211 USA               NOTE => NISO HAS MOVED!! <=
        +1 (301) 654-2512
        www.niso.org



 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 

Carpenter - Wolfram Data Summit ResourceSync

  • 1. ResourceSync - An Introduction Todd Carpenter Executive Director, NISO Wolfram Data Summit Thursday, September 6, 2012 With thanks to Herbert Van de Sompel and Robert Sanderson (LANL)
  • 2. @TAC_NISO Twitter Highlights • Presenting this afternoon on the ResourceSync project at Wolfram Data Summit #wolframsummit • I’m pre-tweeting my slides during #rsync presentation. Slides will be posted later today #wolframsummit • NISO mission: develop & maintain technical standards related to information, documentation, discovery & distribution of content #wolframsummit • Machines don’t talk like people do. Then again some people don’t talk like other people do, particularly teenagers #wolframsummit • So where did the ResourceSync project start? #NISO approached OAI about updating the PMH protocol. #wolframsummit • The #NISO / OAI ResourceSync project was possible through the generous support of the Alfred P. Sloan Foundation. Thank you! #wolframsummit • What is RSync trying to solve (1/2): Source Server has resources that change. Destination servers want to leverage some/all of Source #wolframsummit • What is RSync trying to solve (2/2): How to sync on ongoing basis in near-real-time & at web scale with as little system overhead as poss #wolframsummit • RSync studied # of existing protocols to determine protocols that best meet needs. Bias against developing new spec from scratch. #wolframsummit • The goal of ResourceSync is to find the model that most efficiently distributes the content, while limiting the tax on the source system. #wolframsummit • Very early days in process of standards development. Still in incubation stage. Consensus & adoption phases coming ‘13 & beyond #wolframsummit • Draft alpha specification of ResourceSync posted in August. Team meeting in Sept to review comments #wolframsummit
  • 3. About • Non-profit industry trade association accredited by ANSI • Mission of developing and maintaining technical standards related to information, documentation, discovery and distribution of published materials and media • 125+ member organizations • Volunteer-driven organization: 400+ spread out across the world • Represent US interests to ISO in the areas of Information & Documentation
  • 4. Standards are familiar, even if you don’t notice September 6, 2012 Wolfram Data Summit - Carpenter
  • 5. Machines don’t talk like people do
  • 6. Machines talk like this
  • 7. How did we get here? • OAI-PMH Protocol – Developed in 2001 (v 1.1, v 2.0 - 2002) – Developed by Herbert van de Sompel, Carl Lagoze, Michael Nelson, and Simeon Warner – Fairly wide adoption in scholarly community • In spring 2011, NISO approached OAI to discuss updating PMH Protocol • Response was “Let’s try something else more in line with more modern technology”
  • 8. A partnership is born • Agreement to launch RSync as a NISO standards initiative • Partnership on grant application • OAI team comprised core technology team • Vetting & education by NISO
  • 9. Special thanks are due to...
  • 10. ResourceSync Working Group • Herbert Van de Sompel (Chair), Los Alamos National Laboratory • Peter Murray, Lyrasis • Todd Carpenter (Co-Chair), National Information Standards Organization (NISO) • Michael Nelson, Old Dominion University • Nettie Lagace, National Information Standards Organization (NISO) • David Rosenthal, Stanford University • Manuel Bernhardt, Delving B.V. • David Rosenthal, LOCKSS • Kevin Ford, Library of Congress • Christian Sadilek, Red Hat • Bernhard Haslhofer, Cornell University • Shlomo Sanders, Ex Libris, Inc. • Richard Jones, Joint Information Systems Committee (JISC) • Robert Sanderson, Los Alamos National Laboratory • Martin Klein, Los Alamos National Laboratory • Sjoerd Siebinga, Delving B.V. • Graham Klyne, Joint Information Systems Committee (JISC) • Ed Summers, Library of Congress • Carl Lagoze, Cornell University • Simeon Warner, Cornell University • Stuart Lewis, Joint Information Systems Committee (JISC) • Jeff Young, OCLC Online Computer Library Center
  • 11. What are we trying to do? • Synchronize web resources – things with a URI that can be dereferenced and are cache-able • Improve on web synchronization methods • For small websites/repositories (a few resources) to large repositories/datasets/linked data collections (many millions of resources) • That change slowly or rapidly • Focus on needs of research and cultural heritage organizations, but aim for generality
  • 12. Use Cases
  • 13. More Use Cases
  • 14. Not (yet) Use Cases (i.e., Out of Scope) • Bidirectional synchronization • Destination-defined selective synchronization (query) • Bulk URI migration
  • 15. Use cases differ • How good is the synchronization? Perfect / Good enough • How fast is the synchronization? Fast / Fast enough
  • 16. 3 distinct needs regarding resource synchronization • Baseline matching: An approach to allow a Destination that wants to start synchronizing with a Source to perform an initial catch up – Dump. • Incremental resource synchronization: An approach to allow a Destination to remain up-to-date regarding changes at the Source. • Audit: An approach to allow checking whether a Destination is in sync with a Source – Inventory. => All 3 are considered in scope for ResourceSync
  • 17. Incremental Synchronization • Change Notification (CN): Alert that something happened (create, update, delete) • Content Transfer (CT): Transfer of just the change or the full resource
  • 18. Trivial versus Optimal Approaches • Trivial Approach - Retrieve & Compare • Optimal Approach - push only the change to only the destinations monitoring the resource
  • 19. More advanced option • Feed Extension Solution: Continue the Feed paradigm, but introduce aggregating service and ping notification to re-pull (simulated push) • Only advantageous if Source already supports a Feed
  • 20. Change Notification - Protocols • Atom • PubSubHubbub (PuSH) • XMPP PubSub extension • BoSH (XMPP over HTTP) • Comet / HTTP Streaming: Open an HTTP connection and keep reading from it • Bayeux Protocol • Long Polling: Keep HTTP connection open until a message, then reopen (BoSH, Bayeux option) • WebSockets • NullMQ / ZeroMQ • XMPP over WebSockets?
  • 21. http://imgs.xkcd.com/comics/standards.png/ 8/23/11 Data Attribution and Citation Workshop
  • 22. RSync Alpha
  • 23. A Framework Based on Sitemaps • Developing a modular framework allowing selective deployment • Sitemap is the core component throughout the framework • Introducing extension elements and attributes: – In ResourceSync namespace (rs:) to accommodate synchronization needs – In XHTML namespace (xhtml:) mainly to accommodate discovery needs • Reuse Sitemap format for Change Sets (both current and historical) and for manifest in Dump
  • 24. Communications structure
  • 25. The lifecycle of standards: You are here
  • 26. Timeline • Project Launch = November 2011 • Approved work item = December 2011 • Working Group formed = February 2012 • Webinar on project = March 2012 • JCDL meeting, Washington DC = June 2012 • Alpha = September 2012 • Team meeting, Denver, CO = September 2012 – forthcoming D-Lib article • Beta/Draft for trial use = ?? December 2012 • Comment period = ?? December 2012 - March 2013 • Training = ?? May - July 2013 • Approval = ?? December 2013
  • 27. More information • Background webinar (March 6, 2012) recording • First draft spec: http://www.openarchives.org/rs/0.1/resourcesync • Simulator code on GitHub: http://github.org/resync/simulator • NISO workspace: http://www.niso.org/workrooms/resourcesync/ • List for public comment coming soon
  • 28. Standards for Data/Exchange: New Work areas? Many potential areas for work in sharing of data including: • Author/Contributor disambiguation & other issues • Data Equivalence – How does one know that this thing and that are equivalent (i.e., contain same data)? • Systemic metadata – What is the form of this information? What are its structural components? • Archival issues – Storage, physical level, metadata, but also migration issues • Bibliographic information for discovery, delivery and reuse • Bibliometrics / Assessment & impact • Rights issues – Ownership, recognition, sharing, privacy
  • 29. What are appropriate metrics? • For datasets, what is a download? • How does one use compare with another? • Citation ecosystem needs to develop • Should data papers count?
  • 30. Data Equivalence • Basically, is an Excel file equivalent to a text file? • Creation of a high-level conceptual model of data description • A “FRBR” for data • Defines the distinctions between states & transformations of data • Basis for identification & description
  • 31. Thank you! Todd Carpenter, Executive Director, tcarpenter@niso.org • National Information Standards Organization (NISO), 3600 Clipper Mill Road, Suite 302, Baltimore, MD 21211 USA • NOTE => NISO HAS MOVED!! <= • +1 (301) 654-2512 • www.niso.org
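
The "Retrieve & Compare" approach from slide 18, applied to the Audit need from slide 16, can be illustrated with a minimal sketch. It is not taken from the ResourceSync draft: the function names, the URI-to-digest inventory shape, and the choice of SHA-256 are all assumptions made for illustration. The change types (create, update, delete) follow slide 17.

```python
import hashlib
from typing import Dict, List, Tuple


def digest(content: bytes) -> str:
    """Hypothetical fingerprint for a resource; SHA-256 is an assumption."""
    return hashlib.sha256(content).hexdigest()


def compare_inventories(
    source: Dict[str, str], destination: Dict[str, str]
) -> List[Tuple[str, str]]:
    """Return (change_type, uri) pairs the Destination needs to apply.

    Both arguments map a resource URI to a content digest. This is the
    trivial approach: every resource is inspected on every run.
    """
    changes: List[Tuple[str, str]] = []
    for uri, source_digest in source.items():
        if uri not in destination:
            changes.append(("create", uri))
        elif destination[uri] != source_digest:
            changes.append(("update", uri))
    for uri in destination:
        if uri not in source:
            changes.append(("delete", uri))
    # Sort by URI so the result is stable regardless of dict ordering.
    return sorted(changes, key=lambda change: change[1])


# Illustrative inventories (URIs and contents are made up).
source_inventory = {
    "http://example.org/a": digest(b"unchanged"),
    "http://example.org/b": digest(b"new version"),
    "http://example.org/c": digest(b"brand new"),
}
destination_inventory = {
    "http://example.org/a": digest(b"unchanged"),
    "http://example.org/b": digest(b"old version"),
    "http://example.org/d": digest(b"removed at source"),
}

pending = compare_inventories(source_inventory, destination_inventory)
```

An empty result means the Destination is in sync with the Source. The cost grows with the total number of resources rather than with the number of changes, which is the overhead the push-based approach on slide 18 aims to avoid.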