SlideShare une entreprise Scribd logo
1  sur  13
Télécharger pour lire hors ligne
Metadata and its Relationship
    to Journal Workflow

 SSP Seminar in Philadelphia, PA
David Yakimischak, Chief Technology Officer
 November 18, 2004         www.jstor.org
Agenda

•   What is JSTOR?
•   What is Metadata?
•   What is Journal Workflow?
•   What is the relationship?
•   Discussion
JSTOR Mission

• JSTOR is a not-for-profit organization with a
  mission to help the scholarly community take
  advantage of the advances in information
  technology. This includes: (1) building a
  reliable and comprehensive archive of core
  scholarly journals, and (2) dramatically
  improve access to this scholarly material
• In pursuing its mission, JSTOR takes a
  system-wide perspective, seeking benefits
  for libraries, publishers and scholars
JSTOR Today

• 2,160 participating libraries
• 269 participating publishers
• 449 journals online
• 16,379,559 pages scanned (and
  counting!)
• Formed an Electronic Archiving
  initiative in 2003
Jan
      -97




                  5,000,000
                              10,000,000
                                           15,000,000
                                                        20,000,000
                                                                     25,000,000




              0
Ap
   r-9
        7
  Ju
      l-9
         7
Oc
    t-9
        7
 Jan
     -98
Ap
   r-9
       8
  Jul
       -98
Oc
    t-98
 Jan
      -99
Ap
   r-9
        9
  Ju
      l-9
          9
Oc
     t-9
         9
 Jan
      -00
Ap
   r-0
       0
  Jul
       -00
Oc
    t-00
 Jan
      -01
Ap
   r-0
        1
  Ju
      l-0
          1
Oc
     t-0
         1
                                                                                  Meaningful Accesses per Month




 Jan
      -02
Ap
   r-0
       2
  Jul
      -02
Oc
   t-02
 Jan
     -03
Ap
   r-0
       3
  Ju
      l-0
         3
Oc
    t-0
        3
 Jan
     -04
Ap
   r-0
       4
 Jul
     -04
Oc
  t-04
                                                                                                                  JSTOR Monthly Usage
What is Metadata?

• Data about data
• Internally we outlaw the unqualified
  use of the word metadata
• Three broad categories
  • Descriptive Metadata
  • Technical Metadata
  • Administrative Metadata
Descriptive Metadata

•   Describes the content or underlying asset
•   Bibliographic information
•   Title, author, journal, date, page, etc.
•   Used to create citations
•   Use to categorize or organize information
•   Creates the browse interface to content
Technical Metadata

• Describes the structure or format of an
  item
• Allows for verification or validation
• Typically used for internal operations
• A registry can be used to catalog these
• Commonly needs to be migrated,
  although this applies to all metadata
• Most suitable for automated processing
Administrative Metadata

• Generally deals with how an item has
  been handled
• Most relevant to the archival records
• Tracks history of change or access
• Closely related to workflow
• Can be gathered automatically and
  unobtrusively
What is Journal Workflow?

• The process of creating the finished
  product
• Can be formal or ad-hoc
• Automated or manual
• Stress-relieving or stress-inducing
What is the relationship?

• Metadata can be managed during the journal
  workflow process
   • Added, ingested, modified, calculated, stored, versioned
• Metadata is used to drive the workflow process
  itself
• The intended use must be considered carefully
   •   Why are we gathering this?
   •   Who is the audience? When? Why?
   •   What about statistics, for example?
   •   Connections into other business process systems
JSTOR Observations

• Metadata is created for every word, illustration,
  page, article, issue, title
• Authorizable units are described with metadata
• Users are described with metadata
• We have established internal guidelines and
  standards
   • Relationship to, but not the same as external exposure
• Quality assurance and quality control are involved in
  oversight
• Formalize workflows early when it is easier
• Get it right up front, it is expensive to go back
Discussion

Contenu connexe

En vedette (7)

2 c.2
2 c.22 c.2
2 c.2
 
280 eileen fenton presentation
280 eileen fenton presentation280 eileen fenton presentation
280 eileen fenton presentation
 
76 linda drumheller
76 linda drumheller76 linda drumheller
76 linda drumheller
 
141 web based-peer-review
141 web based-peer-review141 web based-peer-review
141 web based-peer-review
 
255 shaw
255 shaw255 shaw
255 shaw
 
Seminar4.4
Seminar4.4Seminar4.4
Seminar4.4
 
302 sargent word2007-ssp2008
302 sargent word2007-ssp2008302 sargent word2007-ssp2008
302 sargent word2007-ssp2008
 

Similaire à 50 david yakimischak

Role of-analytics-in-db as-life
Role of-analytics-in-db as-lifeRole of-analytics-in-db as-life
Role of-analytics-in-db as-lifeNavneet Upneja
 
What’s our stack: process, technology, community and ideas for the future of RTI
What’s our stack: process, technology, community and ideas for the future of RTIWhat’s our stack: process, technology, community and ideas for the future of RTI
What’s our stack: process, technology, community and ideas for the future of RTIDavid Eaves
 
Results of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisionsResults of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisionsNASIG
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTechWell
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTechWell
 
Talavant Data Lake Analytics
Talavant Data Lake Analytics Talavant Data Lake Analytics
Talavant Data Lake Analytics Sean Forgatch
 
Accelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayAccelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayMongoDB
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowskCura_Relativity
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014ALTER WAY
 
Clare Somerville Trish O’Kane Data in Databases
Clare Somerville Trish O’Kane Data in DatabasesClare Somerville Trish O’Kane Data in Databases
Clare Somerville Trish O’Kane Data in DatabasesFuture Perfect 2012
 
Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014ALTER WAY
 
Data presentation and transfer
Data presentation and transferData presentation and transfer
Data presentation and transferIyad Abou Rabii
 
Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsQuontra Solutions
 
Managing productions across Supply Chain
Managing productions across Supply ChainManaging productions across Supply Chain
Managing productions across Supply ChainSushovan Bej
 
Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18guest61f1b7d
 

Similaire à 50 david yakimischak (20)

Role of-analytics-in-db as-life
Role of-analytics-in-db as-lifeRole of-analytics-in-db as-life
Role of-analytics-in-db as-life
 
What’s our stack: process, technology, community and ideas for the future of RTI
What’s our stack: process, technology, community and ideas for the future of RTIWhat’s our stack: process, technology, community and ideas for the future of RTI
What’s our stack: process, technology, community and ideas for the future of RTI
 
Results of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisionsResults of Web-scale discovery: Data, discussions and decisions
Results of Web-scale discovery: Data, discussions and decisions
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big Problems
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big Problems
 
Talavant Data Lake Analytics
Talavant Data Lake Analytics Talavant Data Lake Analytics
Talavant Data Lake Analytics
 
Accelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayAccelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO Way
 
I Don’t Have Time for Metadata!
I Don’t Have Time for Metadata!I Don’t Have Time for Metadata!
I Don’t Have Time for Metadata!
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and Workflows
 
Assessing the quality of open access journals suzhou presentation
Assessing the quality of open access journals suzhou presentationAssessing the quality of open access journals suzhou presentation
Assessing the quality of open access journals suzhou presentation
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
 
Using Data from Google Analytics
Using Data from Google AnalyticsUsing Data from Google Analytics
Using Data from Google Analytics
 
Clare Somerville Trish O’Kane Data in Databases
Clare Somerville Trish O’Kane Data in DatabasesClare Somerville Trish O’Kane Data in Databases
Clare Somerville Trish O’Kane Data in Databases
 
Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014
 
Data presentation and transfer
Data presentation and transferData presentation and transfer
Data presentation and transfer
 
Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra Solutions
 
Managing productions across Supply Chain
Managing productions across Supply ChainManaging productions across Supply Chain
Managing productions across Supply Chain
 
Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18
 
New Frontiers of Lean Practice
New Frontiers of Lean PracticeNew Frontiers of Lean Practice
New Frontiers of Lean Practice
 
NISO Webinar: What to Expect When You're Expecting a Platform Change: Perspec...
NISO Webinar: What to Expect When You're Expecting a Platform Change: Perspec...NISO Webinar: What to Expect When You're Expecting a Platform Change: Perspec...
NISO Webinar: What to Expect When You're Expecting a Platform Change: Perspec...
 

Plus de Society for Scholarly Publishing

04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadowsSociety for Scholarly Publishing
 
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterickSociety for Scholarly Publishing
 

Plus de Society for Scholarly Publishing (20)

10052016 ssp seminar2_newsham
10052016 ssp seminar2_newsham10052016 ssp seminar2_newsham
10052016 ssp seminar2_newsham
 
10052016 ssp seminar2_rivera
10052016 ssp seminar2_rivera10052016 ssp seminar2_rivera
10052016 ssp seminar2_rivera
 
10052016 ssp seminar2_pesanelli
10052016 ssp seminar2_pesanelli10052016 ssp seminar2_pesanelli
10052016 ssp seminar2_pesanelli
 
10052016 ssp seminar2_harley
10052016 ssp seminar2_harley10052016 ssp seminar2_harley
10052016 ssp seminar2_harley
 
10042016 ssp seminar1_session4_myers
10042016 ssp seminar1_session4_myers10042016 ssp seminar1_session4_myers
10042016 ssp seminar1_session4_myers
 
10042016 ssp seminar1_session4_demers
10042016 ssp seminar1_session4_demers10042016 ssp seminar1_session4_demers
10042016 ssp seminar1_session4_demers
 
10042016 ssp seminar1_session4_cochran
10042016 ssp seminar1_session4_cochran10042016 ssp seminar1_session4_cochran
10042016 ssp seminar1_session4_cochran
 
10042016 ssp seminar1_session3_stanley
10042016 ssp seminar1_session3_stanley10042016 ssp seminar1_session3_stanley
10042016 ssp seminar1_session3_stanley
 
10042016 ssp seminar1_session3_ranganathan
10042016 ssp seminar1_session3_ranganathan10042016 ssp seminar1_session3_ranganathan
10042016 ssp seminar1_session3_ranganathan
 
10042016 ssp seminar1_session3_odike
10042016 ssp seminar1_session3_odike10042016 ssp seminar1_session3_odike
10042016 ssp seminar1_session3_odike
 
10042016 ssp seminar1_session3_cochran
10042016 ssp seminar1_session3_cochran10042016 ssp seminar1_session3_cochran
10042016 ssp seminar1_session3_cochran
 
10042016 ssp seminar1_session2_walker
10042016 ssp seminar1_session2_walker10042016 ssp seminar1_session2_walker
10042016 ssp seminar1_session2_walker
 
10042016 ssp seminar1_session2_ivins
10042016 ssp seminar1_session2_ivins10042016 ssp seminar1_session2_ivins
10042016 ssp seminar1_session2_ivins
 
10042016 ssp seminar1_session2_holland
10042016 ssp seminar1_session2_holland10042016 ssp seminar1_session2_holland
10042016 ssp seminar1_session2_holland
 
10042016 ssp seminar1_session1_stanley
10042016 ssp seminar1_session1_stanley10042016 ssp seminar1_session1_stanley
10042016 ssp seminar1_session1_stanley
 
10042016 ssp seminar1_session1_keane
10042016 ssp seminar1_session1_keane10042016 ssp seminar1_session1_keane
10042016 ssp seminar1_session1_keane
 
10042016 ssp seminar1_session1_ivins
10042016 ssp seminar1_session1_ivins10042016 ssp seminar1_session1_ivins
10042016 ssp seminar1_session1_ivins
 
10042016 ssp seminar1_session1_asadilari
10042016 ssp seminar1_session1_asadilari10042016 ssp seminar1_session1_asadilari
10042016 ssp seminar1_session1_asadilari
 
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
 
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
 

50 david yakimischak

  • 1. Metadata and its Relationship to Journal Workflow SSP Seminar in Philadelphia, PA David Yakimischak, Chief Technology Officer November 18, 2004 www.jstor.org
  • 2. Agenda • What is JSTOR? • What is Metadata? • What is Journal Workflow? • What is the relationship? • Discussion
  • 3. JSTOR Mission • JSTOR is a not-for-profit organization with a mission to help the scholarly community take advantage of the advances in information technology. This includes: (1) building a reliable and comprehensive archive of core scholarly journals, and (2) dramatically improve access to this scholarly material • In pursuing its mission, JSTOR takes a system-wide perspective, seeking benefits for libraries, publishers and scholars
  • 4. JSTOR Today • 2,160 participating libraries • 269 participating publishers • 449 journals online • 16,379,559 pages scanned (and counting!) • Formed an Electronic Archiving initiative in 2003
  • 5. Jan -97 5,000,000 10,000,000 15,000,000 20,000,000 25,000,000 0 Ap r-9 7 Ju l-9 7 Oc t-9 7 Jan -98 Ap r-9 8 Jul -98 Oc t-98 Jan -99 Ap r-9 9 Ju l-9 9 Oc t-9 9 Jan -00 Ap r-0 0 Jul -00 Oc t-00 Jan -01 Ap r-0 1 Ju l-0 1 Oc t-0 1 Meaningful Accesses per Month Jan -02 Ap r-0 2 Jul -02 Oc t-02 Jan -03 Ap r-0 3 Ju l-0 3 Oc t-0 3 Jan -04 Ap r-0 4 Jul -04 Oc t-04 JSTOR Monthly Usage
  • 6. What is Metadata? • Data about data • Internally we outlaw the unqualified use of the word metadata • Three broad categories • Descriptive Metadata • Technical Metadata • Administrative Metadata
  • 7. Descriptive Metadata • Describes the content or underlying asset • Bibliographic information • Title, author, journal, date, page, etc. • Used to create citations • Use to categorize or organize information • Creates the browse interface to content
  • 8. Technical Metadata • Describes the structure or format of an item • Allows for verification or validation • Typically used for internal operations • A registry can be used to catalog these • Commonly needs to be migrated, although this applies to all metadata • Most suitable for automated processing
  • 9. Administrative Metadata • Generally deals with how an item has been handled • Most relevant to the archival records • Tracks history of change or access • Closely related to workflow • Can be gathered automatically and unobtrusively
  • 10. What is Journal Workflow? • The process of creating the finished product • Can be formal or ad-hoc • Automated or manual • Stress-relieving or stress-inducing
  • 11. What is the relationship? • Metadata can be managed during the journal workflow process • Added, ingested, modified, calculated, stored, versioned • Metadata is used to drive the workflow process itself • The intended use must be considered carefully • Why are we gathering this? • Who is the audience? When? Why? • What about statistics, for example? • Connections into other business process systems
  • 12. JSTOR Observations • Metadata is created for every word, illustration, page, article, issue, title • Authorizable units are described with metadata • Users are described with metadata • We have established internal guidelines and standards • Relationship to, but not the same as external exposure • Quality assurance and quality control are involved in oversight • Formalize workflows early when it is easier • Get it right up front, it is expensive to go back