SlideShare une entreprise Scribd logo
1  sur  38
Télécharger pour lire hors ligne
Jabin White
                    Director of Strategic Content
Wolters Kluwer Health – Professional & Education

                        SSP 31st Annual Meeting
                   Baltimore, MD | May 28, 2009
 What  is metadata, and why should
  publishers care?
 Impact on publishers – how metadata
  impacts processes
 Case Studies – This isn’t your Daddy’s
  publishing business
 Final Thoughts, Recommendations
 Reading   most definitions of metadata and
  related standards is like trying to resolve
  disputes with my kids
 As Ed said, metadata is “data about data”
  • But what does that mean?
 Its
    use may be increasing, but metadata is
  NOT new
 In the move from print publishing to digital,
  metadata is a powerful tool to help publishers
  get content in the right place, in the right
  format, and known to the right systems and
  people, at the right time
 Print books were easy
  • Everyone knew what they were
  • You could really only use them one way
  • They had a beginning, an end, a physical presence,
       and a set price (mostly)
 Today, computers are often communicating
  with one another as much as they are with
  users (people)
 Metadata becomes critical in:
  • B2B relationships
  • Enhancing B2C relationships
  • B2-_________ relationships
 Thequality of the metadata gives
 publishers a more powerful voice in what
 happens to their content
 For example:
   • A digital asset (an image)
   • What file format is it?
   • How big is the image?
   • Who took the picture?
   • Who owns the picture?
   • Can you use it on your web site? If you do, what credit
     do you have to give to the owner?
   • What date was it created?
   • Is it part of a collection?
   • Is it related to another piece of content?
 Ifa publisher’s goal is to disseminate
  content to the widest possible audience,
  metadata is critical
   Again, in books you had one use model
   Metadata allows publishers to have diverse
    relationships with content consumers and other
    information providers
    • Customers (duh)
    • Aggregators
    • The Open Web (not Google, but other search engines)
       But don’t try to “game” the search engines with adult keywords;
        that’s just wrong
       There have been lawsuits over use of meta keywords, including
        Playboy suing two adult web sites
    • Technology partners/developers
    • Systems wherein content is a “value add”
    • Multiple output formats
   HTML Metadata
    • <meta http-equiv="Content-Type" content="text/html; charset=iso-
        8859-1">
    •   <meta name="verify-v1"
        content="kBoFGUuwppiWVWGx4Ypzkw1Cs1GgMYEMMbfNr7F
        Y65w=" />                      For people
    •   <meta name="description" content="International publisher of
        professional health information for physicians, nurses, specialized
        clinicians & students. Medical & nursing charts, journals, and pda
        software.">                      For search enginges
    •   <meta name="keywords" content="springhouse, medical book,
        nursing journal, medical pda software, lippincott medical
        reference, lww, lippincott, lww com, medical publisher">
    •   <link rel="stylesheet" href="/css/style.css" type="text/css">
 Classifying    Metadata     DescriptiveMetadata
  • ISBN (I told you this     (sorry, my examples
      wasn’t new)             are from STM)
  •   Dewey Decimal            • ICD-9 and ICD-10
      System                       Codes
  •   Books in                 •   MeSH
      Print/CIP/Library of     •   SNOMED-CT
      Congress data
                               •   NANDA, NIC, NOC for
  •   MARC records                 Nursing
  •   DOI (Digital Object      •   NDC, HCPCS for drugs
      Identifier)
 Classifying    Metadata     DescriptiveMetadata
  • ISBN (I told you this     (sorry, my examples
      wasn’t new)             are from STM)
  •   Dewey Decimal            • ICD-9 and ICD-10
      System                     Codes
  •   Books in                 • MeSH
      Print/CIP/Library of     • SNOMED-CT
      Congress data
                               • NANDA, NIC, NOC for
  •   MARC records               Nursing
  •   DOI (Digital Object      • NDC, HCPCS for drugs
      Identifier)
                               • DOI (Digital Object
                                 Identifier)
 Usingcontrolled vocabularies, extra power
 can be added to content via semantic
 tagging to drive:
  • More precise searching
  • Contextually-based connections
  • Lowering of “two terms meaning the same thing”
    syndrome (hypertension vs. high blood pressure;
    heart attack vs. myocardial infarction)
  • Filling in of content gaps
How Metadata Changes
           Processes
 Impacton publishers depends on answers
 to questions in previous section
  • i.e., what am I going to get in return for investing
    in metadata, and is it worth it?
  • More and more, this is not an “if” proposition, it’s
    “how much”
 Publisherswho buy in have two basic
 choices on approach:
   Requires deeper commitment, but has bigger
    potential upside
    • Positive impact on product creation and development
 Requires thinking about tools, workflows, and
  enterprise-level systems to allow for creation and
  maintenance of metadata
 Combination of good metadata in the workflow and
  creativity in product development team can pay big
  benefits
 Allows participation of authors (or subject matter
  experts in lieu of) at the beginning of the workflow
 Requires   lesser commitment, but potentially
  fewer rewards
 Can be done with zero impact on current
  systems
 Has benefit of content being in “final form”
  (whatever that means anymore) when
  intelligence is added in metadata
 Can keep SMEs as a separate offshoot of the
  workflow – easily outsourced
 Can replace all of the above with software
  solutions (Darrell and Chris will talk about
  that) 
 Chris,  Darrell and I do NOT disagree
 There are justifications that can be made
  for tagging or entity extraction approaches
  (or both)
 Just as there is no “one size fits all”
  metadata, there is no ONE solution
 But if you must pick one, I’m right 
 Active   vs. Passive Metadata
  • Active metadata
     Publisher intentionally associates markup with certain
      pieces of content
     Often using controlled vocabulary
     Includes semantic indexing
     Can also be machine-based, using scripts, etc.
  • Passive metadata
     Metadata created based on use of content
     Inheritance of properties from parent objects
   The use of active metadata usually means an
    impact on support tools
    • Re-think authoring tools to allow for capture of metadata by
      authors
        This can be outsourced to external SMEs – help is available
    • Re-think content management to allow for
      preservation/management of metadata
   How deep you go depends on how big the payoff
    • Good semantic indexing can drive new features and
      functionality, but must used appropriately
   If you decide to add active metadata, a controlled
    vocabulary just became your new best friend
– a specific specification of a
 Ontology
 conceptualization
  • In English: a controlled vocabulary used to
   describe a group of topics
 Taxonomy     – same as ontology, but with
  hierarchy implied
 Caveat – These two terms are so misused,
  their definitions no longer matter (think
  Content Management circa 2000)
 PRISM (Publishing Requirements for Industry
  Standard Metadata) – an XML metadata vocabulary for
  handling content – started out in magazines and
  journals, but has added other types
 Dublin Core – named after a 1995 workshop in Dublin,
  Ohio, it is, very simply, a set of 15 agreed-upon
  metadata elements used to describe objects
    • PRISM uses Dublin Core elements and then makes them specific
      to publishing
   RDF (Resource Description Format): an XML
    implementation that lets you richly describe
    relationships between data on web pages. Explain
    triplets
   Semantic Web – A web of data. Envisioned by Tim
    Berners-Lee, it will be a web driven by data that “talks”
    to other data
    • My kids will work on this
 FOAF Project (Friend of a Friend): Uses RDF to
  describe people and their preferences to the web, so
  you can find people with similar interests; all about
  social networking
 SPARQL (Simple Protocol and RDF Query Language)
  – once you have used RDF to describe resources and
  their connection points, you use SPARQL to ask
  questions about those connections and find stuff
 OWL (Web Ontology Language) – extends ability of
  RDF and XML Schemas to describe information
 Drug Reference Product
 Perfect, structured information that is a great
  example of metadata becoming just as
  important as content
 Examples of things that were stored in
  metadata:
  •   Codes, codes, and more codes
  •   Drug interaction information
  •   Classifications (this one was actually redundant)
  •   Formulary information
  •   FDA approval date (could also be redundant)
 Four editors spent as much time working
  on metadata as they did on content itself
 All work on import/export from DB was
  done by:
  • Acting on metadata
  • Keeping metadata at top of priority list on output
  • “Output all drugs anticoagulants that were
   approved before 1982”
 Medical  content (5 years ago I would have
  said “book”)
 Thousands of topics, sometimes printed,
  always updated, sent to web, handhelds
 How/when they are updated, whether or
  not they are printed, and whether or not
  they get extracted is all driven by ….

 Metadata!
 Extracts
        all are made by acting on
 metadata
  • What is the subject area of the topic? (this can be
    a MANY to ONE relationship)
  • When was the topic last updated?
  • Who was the author of the last update?
ID Values assigned during XML conversion
Gender values assigned by authors
 Have a metadata strategy
  • Business case should support investment in metadata
  • Be careful, and stay alert for mission creep – this stuff
    can get out of control very easily
 Know your organization
  • Is it a change tolerant organization? “All in” vs.
     measured, incremental approach should be
     considered
   • Show me someone who says they have the correct
     universal approach to metadata, and I’ll show you a
     liar
A   little bit of metadata understanding by
  product development people can go a long
  way
 If a content set can benefit from metadata
  in the creation of new products, that could
  justify investment in metadata strategy and
  tools within the workflow
Jabin White
Jabin.white@wolterskluwer.com
1.   Contributor   9.    Publisher
2.   Coverage      10.   Relation
3.   Creator       11.   Rights
4.   Date          12.   Source
5.   Description   13.   Subject
6.   Format        14.   Title
7.   Identifier    15.   Type
8.   Language

                                     Return

Contenu connexe

Tendances

Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introductionhktripathy
 
Choosing which big data, nosql or database technology to use
Choosing which big data, nosql or database technology to useChoosing which big data, nosql or database technology to use
Choosing which big data, nosql or database technology to usemark madsen
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebAmit Sheth
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebAmit Sheth
 
Data preparation and processing chapter 2
Data preparation and processing chapter  2Data preparation and processing chapter  2
Data preparation and processing chapter 2Mahmoud Alfarra
 
Introduction-to-Knowledge Discovery in Database
Introduction-to-Knowledge Discovery in DatabaseIntroduction-to-Knowledge Discovery in Database
Introduction-to-Knowledge Discovery in DatabaseKartik Kalpande Patil
 
Lecture2 big data life cycle
Lecture2 big data life cycleLecture2 big data life cycle
Lecture2 big data life cyclehktripathy
 
Big Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesBig Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesKaran Deep Singh
 
1.2 steps and functionalities
1.2 steps and functionalities1.2 steps and functionalities
1.2 steps and functionalitiesRajendran
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)Kartik Kalpande Patil
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introductionhktripathy
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Gihan Wikramanayake
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data MiningSi Krishan
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata managementOpen Data Support
 
Some Questions About Your Data
Some Questions About Your DataSome Questions About Your Data
Some Questions About Your DataDamian T. Gordon
 
Chapter 13 data warehousing
Chapter 13   data warehousingChapter 13   data warehousing
Chapter 13 data warehousingsumit621
 
Datamining
DataminingDatamining
Dataminingsumit621
 

Tendances (19)

Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introduction
 
Choosing which big data, nosql or database technology to use
Choosing which big data, nosql or database technology to useChoosing which big data, nosql or database technology to use
Choosing which big data, nosql or database technology to use
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
Data preparation and processing chapter 2
Data preparation and processing chapter  2Data preparation and processing chapter  2
Data preparation and processing chapter 2
 
Introduction-to-Knowledge Discovery in Database
Introduction-to-Knowledge Discovery in DatabaseIntroduction-to-Knowledge Discovery in Database
Introduction-to-Knowledge Discovery in Database
 
Lecture2 big data life cycle
Lecture2 big data life cycleLecture2 big data life cycle
Lecture2 big data life cycle
 
Big Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesBig Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and Issues
 
1.2 steps and functionalities
1.2 steps and functionalities1.2 steps and functionalities
1.2 steps and functionalities
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the table
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introduction
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata management
 
Some Questions About Your Data
Some Questions About Your DataSome Questions About Your Data
Some Questions About Your Data
 
Chapter 13 data warehousing
Chapter 13   data warehousingChapter 13   data warehousing
Chapter 13 data warehousing
 
Datamining
DataminingDatamining
Datamining
 

En vedette (9)

166 sspcc1 b_newman
166 sspcc1 b_newman166 sspcc1 b_newman
166 sspcc1 b_newman
 
113 sem 3_c-koscher
113 sem 3_c-koscher113 sem 3_c-koscher
113 sem 3_c-koscher
 
367 peter binfield
367 peter binfield367 peter binfield
367 peter binfield
 
41 cc 4_c_j-plotkin
41 cc 4_c_j-plotkin41 cc 4_c_j-plotkin
41 cc 4_c_j-plotkin
 
213 roni zeiger society for scholarly publishing zeiger
213 roni zeiger society for scholarly publishing   zeiger213 roni zeiger society for scholarly publishing   zeiger
213 roni zeiger society for scholarly publishing zeiger
 
77 mandelbaum
77 mandelbaum77 mandelbaum
77 mandelbaum
 
2 c.1
2 c.12 c.1
2 c.1
 
338 seminar4 keithlawrenz
338 seminar4 keithlawrenz338 seminar4 keithlawrenz
338 seminar4 keithlawrenz
 
376 sspin2011 bradleyallen
376 sspin2011 bradleyallen376 sspin2011 bradleyallen
376 sspin2011 bradleyallen
 

Similaire à 1 d.1

Mining Institutional Knowledge: Using Text and Data Mining to Enhance Discovery
Mining Institutional Knowledge: Using Text and Data Mining to Enhance DiscoveryMining Institutional Knowledge: Using Text and Data Mining to Enhance Discovery
Mining Institutional Knowledge: Using Text and Data Mining to Enhance DiscoveryMary Ellen Bates
 
Data Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityData Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityPrecisely
 
Data-Ed: Trends in Data Modeling
Data-Ed: Trends in Data ModelingData-Ed: Trends in Data Modeling
Data-Ed: Trends in Data ModelingData Blueprint
 
Data-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingData-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingDATAVERSITY
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareHealth Catalyst
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareDale Sanders
 
IWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise ItIWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise ItIWMW
 
Making Meaning with Metadata
Making Meaning with MetadataMaking Meaning with Metadata
Making Meaning with MetadataJohn Horodyski
 
ASA conference Feb 2013
ASA conference Feb 2013ASA conference Feb 2013
ASA conference Feb 2013mrkwr
 
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...Concept Searching, Inc
 
[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker AIIM International
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolkfear
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciencesChris Dwan
 
Streamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey SwartStreamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey SwartHewlett Packard Enterprise Services
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 

Similaire à 1 d.1 (20)

Mining Institutional Knowledge: Using Text and Data Mining to Enhance Discovery
Mining Institutional Knowledge: Using Text and Data Mining to Enhance DiscoveryMining Institutional Knowledge: Using Text and Data Mining to Enhance Discovery
Mining Institutional Knowledge: Using Text and Data Mining to Enhance Discovery
 
Data Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityData Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data Quality
 
Data-Ed: Trends in Data Modeling
Data-Ed: Trends in Data ModelingData-Ed: Trends in Data Modeling
Data-Ed: Trends in Data Modeling
 
Data-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingData-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data Modeling
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of Healthcare
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
The Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of HealthcareThe Data Operating System: Changing the Digital Trajectory of Healthcare
The Data Operating System: Changing the Digital Trajectory of Healthcare
 
IWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise ItIWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise It
 
Making Meaning with Metadata
Making Meaning with MetadataMaking Meaning with Metadata
Making Meaning with Metadata
 
ASA conference Feb 2013
ASA conference Feb 2013ASA conference Feb 2013
ASA conference Feb 2013
 
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
 
[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker
 
Big data analytics - hadoop
Big data analytics - hadoopBig data analytics - hadoop
Big data analytics - hadoop
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciences
 
Streamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey SwartStreamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
 
Elsevier
ElsevierElsevier
Elsevier
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Tec2010 Buckley Share
Tec2010 Buckley ShareTec2010 Buckley Share
Tec2010 Buckley Share
 
Intro big data analytics
Intro big data analyticsIntro big data analytics
Intro big data analytics
 

Plus de Society for Scholarly Publishing

04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadowsSociety for Scholarly Publishing
 
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterickSociety for Scholarly Publishing
 

Plus de Society for Scholarly Publishing (20)

10052016 ssp seminar2_newsham
10052016 ssp seminar2_newsham10052016 ssp seminar2_newsham
10052016 ssp seminar2_newsham
 
10052016 ssp seminar2_rivera
10052016 ssp seminar2_rivera10052016 ssp seminar2_rivera
10052016 ssp seminar2_rivera
 
10052016 ssp seminar2_pesanelli
10052016 ssp seminar2_pesanelli10052016 ssp seminar2_pesanelli
10052016 ssp seminar2_pesanelli
 
10052016 ssp seminar2_harley
10052016 ssp seminar2_harley10052016 ssp seminar2_harley
10052016 ssp seminar2_harley
 
10042016 ssp seminar1_session4_myers
10042016 ssp seminar1_session4_myers10042016 ssp seminar1_session4_myers
10042016 ssp seminar1_session4_myers
 
10042016 ssp seminar1_session4_demers
10042016 ssp seminar1_session4_demers10042016 ssp seminar1_session4_demers
10042016 ssp seminar1_session4_demers
 
10042016 ssp seminar1_session4_cochran
10042016 ssp seminar1_session4_cochran10042016 ssp seminar1_session4_cochran
10042016 ssp seminar1_session4_cochran
 
10042016 ssp seminar1_session3_stanley
10042016 ssp seminar1_session3_stanley10042016 ssp seminar1_session3_stanley
10042016 ssp seminar1_session3_stanley
 
10042016 ssp seminar1_session3_ranganathan
10042016 ssp seminar1_session3_ranganathan10042016 ssp seminar1_session3_ranganathan
10042016 ssp seminar1_session3_ranganathan
 
10042016 ssp seminar1_session3_odike
10042016 ssp seminar1_session3_odike10042016 ssp seminar1_session3_odike
10042016 ssp seminar1_session3_odike
 
10042016 ssp seminar1_session3_cochran
10042016 ssp seminar1_session3_cochran10042016 ssp seminar1_session3_cochran
10042016 ssp seminar1_session3_cochran
 
10042016 ssp seminar1_session2_walker
10042016 ssp seminar1_session2_walker10042016 ssp seminar1_session2_walker
10042016 ssp seminar1_session2_walker
 
10042016 ssp seminar1_session2_ivins
10042016 ssp seminar1_session2_ivins10042016 ssp seminar1_session2_ivins
10042016 ssp seminar1_session2_ivins
 
10042016 ssp seminar1_session2_holland
10042016 ssp seminar1_session2_holland10042016 ssp seminar1_session2_holland
10042016 ssp seminar1_session2_holland
 
10042016 ssp seminar1_session1_stanley
10042016 ssp seminar1_session1_stanley10042016 ssp seminar1_session1_stanley
10042016 ssp seminar1_session1_stanley
 
10042016 ssp seminar1_session1_keane
10042016 ssp seminar1_session1_keane10042016 ssp seminar1_session1_keane
10042016 ssp seminar1_session1_keane
 
10042016 ssp seminar1_session1_ivins
10042016 ssp seminar1_session1_ivins10042016 ssp seminar1_session1_ivins
10042016 ssp seminar1_session1_ivins
 
10042016 ssp seminar1_session1_asadilari
10042016 ssp seminar1_session1_asadilari10042016 ssp seminar1_session1_asadilari
10042016 ssp seminar1_session1_asadilari
 
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
04142015 ssp webinar_theworldisflatforscholarlypublishing_caitlinmeadows
 
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
04142015 ssp webinar_theworldisflatforscholarlypublishing_bruceheterick
 

1 d.1

  • 1. Jabin White Director of Strategic Content Wolters Kluwer Health – Professional & Education SSP 31st Annual Meeting Baltimore, MD | May 28, 2009
  • 2.  What is metadata, and why should publishers care?  Impact on publishers – how metadata impacts processes  Case Studies – This isn’t your Daddy’s publishing business  Final Thoughts, Recommendations
  • 3.  Reading most definitions of metadata and related standards is like trying to resolve disputes with my kids  As Ed said, metadata is “data about data” • But what does that mean?  Its use may be increasing, but metadata is NOT new
  • 4.  In the move from print publishing to digital, metadata is a powerful tool to help publishers get content in the right place, in the right format, and known to the right systems and people, at the right time  Print books were easy • Everyone knew what they were • You could really only use them one way • They had a beginning, an end, a physical presence, and a set price (mostly)
  • 5.  Today, computers are often communicating with one another as much as they are with users (people)  Metadata becomes critical in: • B2B relationships • Enhancing B2C relationships • B2-_________ relationships  Thequality of the metadata gives publishers a more powerful voice in what happens to their content
  • 6.  For example: • A digital asset (an image) • What file format is it? • How big is the image? • Who took the picture? • Who owns the picture? • Can you use it on your web site? If you do, what credit do you have to give to the owner? • What date was it created? • Is it part of a collection? • Is it related to another piece of content?
  • 7.  Ifa publisher’s goal is to disseminate content to the widest possible audience, metadata is critical
  • 8. Again, in books you had one use model  Metadata allows publishers to have diverse relationships with content consumers and other information providers • Customers (duh) • Aggregators • The Open Web (not Google, but other search engines)  But don’t try to “game” the search engines with adult keywords; that’s just wrong  There have been lawsuits over use of meta keywords, including Playboy suing two adult web sites • Technology partners/developers • Systems wherein content is a “value add” • Multiple output formats
  • 9. HTML Metadata • <meta http-equiv="Content-Type" content="text/html; charset=iso- 8859-1"> • <meta name="verify-v1" content="kBoFGUuwppiWVWGx4Ypzkw1Cs1GgMYEMMbfNr7F Y65w=" /> For people • <meta name="description" content="International publisher of professional health information for physicians, nurses, specialized clinicians & students. Medical & nursing charts, journals, and pda software."> For search enginges • <meta name="keywords" content="springhouse, medical book, nursing journal, medical pda software, lippincott medical reference, lww, lippincott, lww com, medical publisher"> • <link rel="stylesheet" href="/css/style.css" type="text/css">
  • 10.  Classifying Metadata  DescriptiveMetadata • ISBN (I told you this (sorry, my examples wasn’t new) are from STM) • Dewey Decimal • ICD-9 and ICD-10 System Codes • Books in • MeSH Print/CIP/Library of • SNOMED-CT Congress data • NANDA, NIC, NOC for • MARC records Nursing • DOI (Digital Object • NDC, HCPCS for drugs Identifier)
  • 11.  Classifying Metadata  DescriptiveMetadata • ISBN (I told you this (sorry, my examples wasn’t new) are from STM) • Dewey Decimal • ICD-9 and ICD-10 System Codes • Books in • MeSH Print/CIP/Library of • SNOMED-CT Congress data • NANDA, NIC, NOC for • MARC records Nursing • DOI (Digital Object • NDC, HCPCS for drugs Identifier) • DOI (Digital Object Identifier)
  • 12.  Usingcontrolled vocabularies, extra power can be added to content via semantic tagging to drive: • More precise searching • Contextually-based connections • Lowering of “two terms meaning the same thing” syndrome (hypertension vs. high blood pressure; heart attack vs. myocardial infarction) • Filling in of content gaps
  • 13. How Metadata Changes Processes
  • 14.  Impacton publishers depends on answers to questions in previous section • i.e., what am I going to get in return for investing in metadata, and is it worth it? • More and more, this is not an “if” proposition, it’s “how much”  Publisherswho buy in have two basic choices on approach:
  • 15. Requires deeper commitment, but has bigger potential upside • Positive impact on product creation and development  Requires thinking about tools, workflows, and enterprise-level systems to allow for creation and maintenance of metadata  Combination of good metadata in the workflow and creativity in product development team can pay big benefits  Allows participation of authors (or subject matter experts in lieu of) at the beginning of the workflow
  • 16.  Requires lesser commitment, but potentially fewer rewards  Can be done with zero impact on current systems  Has benefit of content being in “final form” (whatever that means anymore) when intelligence is added in metadata  Can keep SMEs as a separate offshoot of the workflow – easily outsourced  Can replace all of the above with software solutions (Darrell and Chris will talk about that) 
  • 17.  Chris, Darrell and I do NOT disagree  There are justifications that can be made for tagging or entity extraction approaches (or both)  Just as there is no “one size fits all” metadata, there is no ONE solution  But if you must pick one, I’m right 
  • 18.  Active vs. Passive Metadata • Active metadata  Publisher intentionally associates markup with certain pieces of content  Often using controlled vocabulary  Includes semantic indexing  Can also be machine-based, using scripts, etc. • Passive metadata  Metadata created based on use of content  Inheritance of properties from parent objects
  • 19. The use of active metadata usually means an impact on support tools • Re-think authoring tools to allow for capture of metadata by authors  This can be outsourced to external SMEs – help is available • Re-think content management to allow for preservation/management of metadata  How deep you go depends on how big the payoff • Good semantic indexing can drive new features and functionality, but must used appropriately  If you decide to add active metadata, a controlled vocabulary just became your new best friend
  • 20. – a specific specification of a  Ontology conceptualization • In English: a controlled vocabulary used to describe a group of topics  Taxonomy – same as ontology, but with hierarchy implied  Caveat – These two terms are so misused, their definitions no longer matter (think Content Management circa 2000)
  • 21.  PRISM (Publishing Requirements for Industry Standard Metadata) – an XML metadata vocabulary for handling content – started out in magazines and journals, but has added other types  Dublin Core – named after a 1995 workshop in Dublin, Ohio, it is, very simply, a set of 15 agreed-upon metadata elements used to describe objects • PRISM uses Dublin Core elements and then makes them specific to publishing  RDF (Resource Description Format): an XML implementation that lets you richly describe relationships between data on web pages. Explain triplets
  • 22. Semantic Web – A web of data. Envisioned by Tim Berners-Lee, it will be a web driven by data that “talks” to other data • My kids will work on this  FOAF Project (Friend of a Friend): Uses RDF to describe people and their preferences to the web, so you can find people with similar interests; all about social networking  SPARQL (Simple Protocol and RDF Query Language) – once you have used RDF to describe resources and their connection points, you use SPARQL to ask questions about those connections and find stuff  OWL (Web Ontology Language) – extends ability of RDF and XML Schemas to describe information
  • 23.  Drug Reference Product  Perfect, structured information that is a great example of metadata becoming just as important as content  Examples of things that were stored in metadata: • Codes, codes, and more codes • Drug interaction information • Classifications (this one was actually redundant) • Formulary information • FDA approval date (could also be redundant)
  • 24.  Four editors spent as much time working on metadata as they did on content itself  All work on import/export from DB was done by: • Acting on metadata • Keeping metadata at top of priority list on output • “Output all drugs anticoagulants that were approved before 1982”
  • 25.  Medical content (5 years ago I would have said “book”)  Thousands of topics, sometimes printed, always updated, sent to web, handhelds  How/when they are updated, whether or not they are printed, and whether or not they get extracted is all driven by ….  Metadata!
  • 26.  Extracts all are made by acting on metadata • What is the subject area of the topic? (this can be a MANY to ONE relationship) • When was the topic last updated? • Who was the author of the last update?
  • 27.
  • 28.
  • 29.
  • 30.
  • 31. ID Values assigned during XML conversion
  • 33.
  • 34.
  • 35.  Have a metadata strategy • Business case should support investment in metadata • Be careful, and stay alert for mission creep – this stuff can get out of control very easily  Know your organization • Is it a change tolerant organization? “All in” vs. measured, incremental approach should be considered • Show me someone who says they have the correct universal approach to metadata, and I’ll show you a liar
  • 36. A little bit of metadata understanding by product development people can go a long way  If a content set can benefit from metadata in the creation of new products, that could justify investment in metadata strategy and tools within the workflow
  • 38. 1. Contributor 9. Publisher 2. Coverage 10. Relation 3. Creator 11. Rights 4. Date 12. Source 5. Description 13. Subject 6. Format 14. Title 7. Identifier 15. Type 8. Language Return