SlideShare une entreprise Scribd logo
1  sur  20
Télécharger pour lire hors ligne
Managing Mature Taxonomies:
Resolving Orphan Terms
SLA Taxonomy Division Webinar
December 12, 2016
Heather Hedden
Senior Vocabulary Editor
Metadata Standards and Services
Gale | Cengage Learning
Heather Hedden
 Senior vocabulary editor, Cengage Learning, 1996-2004, 2014-present
 Author of The Accidental Taxonomist (2010, 2016)
 Continuing education instructor, Simmons College School of Library and Information Science
 Former taxonomy consultant
Gale, a Cengage Learning Company
 Subscription databases to libraries: GVRL ebooks, In Context, Academic OneFile,
Business Collection, Literature Resource Center, etc.
 Web products to the public: Questia, Books & Authors, HighBeam Research,
Encyclopedia.com
 Gale Research reference books, directories, and other book imprints (Greenhaven,
Thorndike, St. James Press, etc.)
 Primary Source Media digital archives (Artemis)
 Legacy library database vendor companies: Information Access Company, Predicasts
2
 Managed by four vocabulary editors, divided
by broad subject area
Outline
 Taxonomies, Thesauri, and Orphan Terms
 The Gale Project to Review Orphan Terms
 Issues in Finding Parents to Orphan Terms
4
Taxonomies, Thesauri, and Orphan Terms
Taxonomies and Thesauri Compared
5
Less MoreControlled Vocabularies - Complexity
Pick List Synonym
Ring
Authority
File
Taxonomy Thesaurus Ontology
Ambiguity
control
Synonym
control
Ambiguity
control
Synonym
control
Ambiguity
control
(Synonym
control)
Hierarchical
relationships
Ambiguity
control
Synonym
control
Hierarchical
relationship
Associative
relationships
Ambiguity
control
(Synonym
control)
Semantic
relationships
Classes
Taxonomies, Thesauri, and Orphan Terms
Taxonomies and Thesauri Compared
6
Taxonomies
 All terms belong to a limited
number of major hierarchies (or
facets)
 May bend standard hierarchical
rules.
 Supports classification,
categorization, and concept
organization.
(Like Linnaean taxonomy.)
 Approach is a top-down
navigation.
Thesauri
 All terms have relationships, but
“hierarchies” can comprise as few as
2 terms.
 ANSI/NISO or ISO standard rules are
strictly followed.
 Supports concept scoping,
disambiguation, and relationships
with similar concepts.
(Like looking up in Roget’s.)
 Approach is term-centered and what
terms are linked to/from it.
Taxonomies, Thesauri, and Orphan Terms
Hierarchical Relationship Rules (ANSI/NISO Z.39.19 Guidelines)
7
1. Generic-Specific
Category or class
NT members/types
Narrower term “is/are a
kind of” broader term.
Plants
NT Trees
3. Whole-Part
Concept or entity
NT Part or sub-entity
Narrower term ‘is in” broader
term (as an integral part).
France
NT Paris
Digestive system
NT Stomach
2. Generic-Instance
Common noun
NT Proper noun
Narrower term is an
instance of broader term.
Smartphones
NT Samsung Galaxy
Taxonomies, Thesauri, and Orphan Terms
Orphan Term Definitions
1. Terms with no hierarchical or associative relationships (ANSI/NISO Z.39.19 definition)
 Not permitted in taxonomies or thesauri
2. Terms with no hierarchical (broader or narrower) relationships (“hierarchical orphans”)
 Not permitted in taxonomies; may be permitted in thesauri
3. Terms with no broader terms (no broader/parent, thus “orphans”) that are not
intended as top terms
 Not desired in taxonomies or thesauri
The problem:
Due to the lack of relationships to other term, orphan terms cannot be found by users
when browsing the taxonomy/thesaurus. (Can be found by search, though)
8
Gale Orphan Term Review Project
Gale Subject Thesaurus
 Used along with multiple separate name authority files and other classification
metadata for indexing articles and various other content resources
 60,000 preferred terms and always growing
 Managed by four vocabulary editors, divided by broad subject area
 Terms belong to one or more of 6 subject areas: Business, Health/Medicine,
Humanities, Social Sciences, Science Technology
 Developed in the 1970s based on LCSH
 Thoroughly revised in the early 2000s to become an ANSI/NISO Z.39.19-compliant
thesaurus
̶ Project changed See also relationships to either BT/NT or to RT as appropriate.
̶ If terms were left as hierarchical orphans, that was ignored.
9
Gale Orphan Term Review Project
Orphan Term Review Project Background
1. Orphan terms with lacking any relationships (hierarchical or associative)
 Thesaurus management software has report option for this kind of “orphans”
 Vocabulary editors can/should periodically run reports on their sections of the
vocabulary to clean up these kinds of orphans, which are always
unacceptable.
2. “Orphan” terms lacking only broader terms
 A back-end system report needs to be run for this
 Vocabulary editors review the report to either approve these terms as top
terms and/or to add relationships to them.
10
Gale Orphan Term Review Project
Orphan Term Review Project Background
 Started as a project in April 2014 when a new vocabulary editor joined the team.
 A “back burner” project for vocabulary editors to work on when they are not busy
with higher priorities. No timeline or deadline.
 Identified 2420 “orphan” terms (those with no BTs), put them in a spreadsheet
 Two people split the list and provided an initial review with recommendations of
broader terms, where applicable, or comments.
 The orphan term list was sorted by subject category and sub-lists of orphan terms
for each category were assigned to each vocabulary editor for more detail.
11
Gale Orphan Term Review Project
12
Gale Orphan Term Review Project
Orphan Term Project Methodology
Goal: Create broader term relationships to existing terms, if it complies with
ANSI/NISO rules.
If not…
 Creating a new broader term is possible, but must follow policies of justification
for creating new terms: usage warrant, authoritative source(s), and practicality of
a new term.
 Leaving a term as an orphan is OK, but then at least an RT relationship should be
present, ideally more than one.
 Changing or deleting the term (or subsuming into an existing term) might also be
considered, upon further research. Occasionally, orphans are simply not good
terms. 13
Gale Orphan Term Review Project
Orphan Term Project Methodology
Resolutions indicated on spreadsheet and entered in thesaurus management system
14
Gale Orphan Term Review Project
Causes of Orphan Terms
 During previous project that put the Subjects into a thesaurus format (changing
See also relationships to either BT/NT or to RT as appropriate), if terms were left
as orphans, that was ignored.
 Quickly created terms for immediate indexing needs, whose relationships were
not completed.
 Terms for which a broader term is uncertain, and it would take time and effort,
and perhaps changes to other terms (disambiguation) to resolve.
 Terms correctly created, with all correct relationships, for which a correct broader
term simply does not exist.
15
Issues in Finding Parents to Orphan Terms
Finding imperfect Broader Terms
Stretching the permissibility of BT/NT rules
Example orphan terms and their proposed questionable broader terms:
 Atmospheric composition BT Atmosphere?
 Atmospheric haze BT Atmosphere?
 Conflict termination (Military science) BT Wars?
 Behavior problems BT Behavior?
 Probably OK
16
Issues in Finding Parents to Orphan Terms
Finding imperfect Broader Terms
Stretching the permissibility of BT/NT rules: Topics within a field
Considering the narrower term “is in” the field.
Example orphan terms and their proposed questionable broader terms:
 Coping (Psychology) BT Psychology?
 Convergence (Mathematics) BT Mathematics?
 Decision analysis BT Management science?
 Cell population BT Cytology?
 Chemical models BT Chemistry?
 Carbon rationing BT Environmental economics?
 Maybe not OK. (Would be OK in a hierarchical taxonomy, rather than a thesaurus.)
17
Issues in Finding Parents to Orphan Terms
Leaving terms as orphans (although including RTs)
Logical parent would be too broad
 College applications - We won’t create BT Applications
 Animal tracks – We won’t create BT Tracks
Abstract terms without broader terms
 Controversy
Complex concepts that are not what they seem
 Haunted houses – It does not belong as a narrower term to Housing (UF Houses)
Legacy LC pre-coordinated concepts that have no single broader term
 Computers and children not NT to either computers or children
18
Issues in Finding Parents to Orphan Terms
Parents Found!
Examples
 Alteration (Clothing) BT Tailoring
 Apathy BT Emotions
 Conscious sedation BT Anesthesia
 Stockrooms BT Storage (Physical)
19
Questions/Contact
Heather Hedden
Senior Vocabulary Editor
Indexing & Vocabulary Services
Metadata Standards and Services
Gale | Cengage Learning
20 Channel Center St., Boston, MA 02210
(o) 617-757-8211 | (m) 978-467-5195
heather.hedden@cengage.com
www.gale.com
www.cengage.com
heather@hedden.net
www.accidental-taxonomist.com
20

Contenu connexe

Tendances

Ontology and Ontology Libraries: a critical study
Ontology and Ontology Libraries: a critical studyOntology and Ontology Libraries: a critical study
Ontology and Ontology Libraries: a critical studyDebashisnaskar
 
Taxonomies in Support of Search
Taxonomies in Support of SearchTaxonomies in Support of Search
Taxonomies in Support of SearchHeather Hedden
 
SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing Colleen Farrelly
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application ProfilesDiane Hillmann
 
Annotating for Individual experiences
Annotating for Individual experiencesAnnotating for Individual experiences
Annotating for Individual experiencesliddy
 
Jarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology LanguageJarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology LanguageMustafa Jarrar
 
Eswc2012 ss ontologies
Eswc2012 ss ontologiesEswc2012 ss ontologies
Eswc2012 ss ontologiesElena Simperl
 
Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...Ryan Scicluna
 
Personalised Terms Derivative- Semantic Stemming
Personalised Terms Derivative- Semantic StemmingPersonalised Terms Derivative- Semantic Stemming
Personalised Terms Derivative- Semantic Stemmingnitin jha
 

Tendances (15)

Ontology
Ontology Ontology
Ontology
 
Taxonomy Fundamentals Workshop 2013
Taxonomy Fundamentals Workshop 2013Taxonomy Fundamentals Workshop 2013
Taxonomy Fundamentals Workshop 2013
 
Tools for Taxonomies
Tools for TaxonomiesTools for Taxonomies
Tools for Taxonomies
 
Ontology and Ontology Libraries: a critical study
Ontology and Ontology Libraries: a critical studyOntology and Ontology Libraries: a critical study
Ontology and Ontology Libraries: a critical study
 
Taxonomies in Support of Search
Taxonomies in Support of SearchTaxonomies in Support of Search
Taxonomies in Support of Search
 
SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing
 
Hlava, Davis, Corson-Rikert, and Parr "Control Your Vocabulary: Real-World A...
Hlava, Davis, Corson-Rikert, and Parr "Control Your Vocabulary:  Real-World A...Hlava, Davis, Corson-Rikert, and Parr "Control Your Vocabulary:  Real-World A...
Hlava, Davis, Corson-Rikert, and Parr "Control Your Vocabulary: Real-World A...
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application Profiles
 
Annotating for Individual experiences
Annotating for Individual experiencesAnnotating for Individual experiences
Annotating for Individual experiences
 
RDF and OWL
RDF and OWLRDF and OWL
RDF and OWL
 
Jarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology LanguageJarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology Language
 
Eswc2012 ss ontologies
Eswc2012 ss ontologiesEswc2012 ss ontologies
Eswc2012 ss ontologies
 
Subject analysis, lcsh part 2
Subject analysis, lcsh part 2Subject analysis, lcsh part 2
Subject analysis, lcsh part 2
 
Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...
 
Personalised Terms Derivative- Semantic Stemming
Personalised Terms Derivative- Semantic StemmingPersonalised Terms Derivative- Semantic Stemming
Personalised Terms Derivative- Semantic Stemming
 

Similaire à Managing Mature Taxonomies: Resolving Orphan Terms

What do the fields of cosmology, financial matters, fund, law, scien.pdf
What do the fields of cosmology, financial matters, fund, law, scien.pdfWhat do the fields of cosmology, financial matters, fund, law, scien.pdf
What do the fields of cosmology, financial matters, fund, law, scien.pdfannaielectronicsvill
 
Metaphic or the art of looking another way.
Metaphic or the art of looking another way.Metaphic or the art of looking another way.
Metaphic or the art of looking another way.Suresh Manian
 
LIS415 Class PBCVC
LIS415 Class PBCVCLIS415 Class PBCVC
LIS415 Class PBCVCAlisonNoel
 
Taxonomy Development and Digital Projects
Taxonomy Development and Digital ProjectsTaxonomy Development and Digital Projects
Taxonomy Development and Digital Projects daniela barbosa
 
Theresa regli bw-3
Theresa regli bw-3Theresa regli bw-3
Theresa regli bw-3R Aunpad
 
Topic Maps - Human-oriented semantics?
Topic Maps - Human-oriented semantics?Topic Maps - Human-oriented semantics?
Topic Maps - Human-oriented semantics?Lars Marius Garshol
 
What are learning theories good for?
What are learning theories good for?What are learning theories good for?
What are learning theories good for?James Atherton
 
Writing the introduction chapter of your disseration
Writing the introduction chapter of your disserationWriting the introduction chapter of your disseration
Writing the introduction chapter of your disserationThe Free School
 
1Assignment Guidelines1. Policy AnalysisA. The poli.docx
1Assignment Guidelines1. Policy AnalysisA.  The poli.docx1Assignment Guidelines1. Policy AnalysisA.  The poli.docx
1Assignment Guidelines1. Policy AnalysisA. The poli.docxfelicidaddinwoodie
 
Litmus Test for a Doctoral-Level Research ProblemBackground on.docx
Litmus Test for a Doctoral-Level Research ProblemBackground on.docxLitmus Test for a Doctoral-Level Research ProblemBackground on.docx
Litmus Test for a Doctoral-Level Research ProblemBackground on.docxjeremylockett77
 
Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...
Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...
Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...taxonbytes
 
Jarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesJarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesMustafa Jarrar
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processingATHMAN HAJ-HAMOU
 
Background Information
Background  InformationBackground  Information
Background Informationguestc3053f
 
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...Sarah Morrow
 

Similaire à Managing Mature Taxonomies: Resolving Orphan Terms (20)

Thesaurus 2101
Thesaurus 2101Thesaurus 2101
Thesaurus 2101
 
What do the fields of cosmology, financial matters, fund, law, scien.pdf
What do the fields of cosmology, financial matters, fund, law, scien.pdfWhat do the fields of cosmology, financial matters, fund, law, scien.pdf
What do the fields of cosmology, financial matters, fund, law, scien.pdf
 
Class14
Class14Class14
Class14
 
Metaphic or the art of looking another way.
Metaphic or the art of looking another way.Metaphic or the art of looking another way.
Metaphic or the art of looking another way.
 
LIS415 Class PBCVC
LIS415 Class PBCVCLIS415 Class PBCVC
LIS415 Class PBCVC
 
Taxonomy Development and Digital Projects
Taxonomy Development and Digital ProjectsTaxonomy Development and Digital Projects
Taxonomy Development and Digital Projects
 
Theresa regli bw-3
Theresa regli bw-3Theresa regli bw-3
Theresa regli bw-3
 
Topic Maps - Human-oriented semantics?
Topic Maps - Human-oriented semantics?Topic Maps - Human-oriented semantics?
Topic Maps - Human-oriented semantics?
 
What are learning theories good for?
What are learning theories good for?What are learning theories good for?
What are learning theories good for?
 
Writing the introduction chapter of your disseration
Writing the introduction chapter of your disserationWriting the introduction chapter of your disseration
Writing the introduction chapter of your disseration
 
1Assignment Guidelines1. Policy AnalysisA. The poli.docx
1Assignment Guidelines1. Policy AnalysisA.  The poli.docx1Assignment Guidelines1. Policy AnalysisA.  The poli.docx
1Assignment Guidelines1. Policy AnalysisA. The poli.docx
 
Exercise Science
Exercise ScienceExercise Science
Exercise Science
 
Litmus Test for a Doctoral-Level Research ProblemBackground on.docx
Litmus Test for a Doctoral-Level Research ProblemBackground on.docxLitmus Test for a Doctoral-Level Research ProblemBackground on.docx
Litmus Test for a Doctoral-Level Research ProblemBackground on.docx
 
Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...
Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...
Franz Et Al - Concepts and Tools Needed to Increase Bottom-Up Taxonomic Exper...
 
Ny3424442448
Ny3424442448Ny3424442448
Ny3424442448
 
Jarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesJarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing Ontologies
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processing
 
Background Information
Background  InformationBackground  Information
Background Information
 
20100427 Earthster Core Ontology
20100427 Earthster Core Ontology20100427 Earthster Core Ontology
20100427 Earthster Core Ontology
 
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
 

Plus de Heather Hedden

Introduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdfIntroduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdfHeather Hedden
 
Benefits of Taxonomies
Benefits of TaxonomiesBenefits of Taxonomies
Benefits of TaxonomiesHeather Hedden
 
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...Heather Hedden
 
A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOSHeather Hedden
 
Mapping Taxonomies, Thesauri, and Ontologies
Mapping Taxonomies, Thesauri, and OntologiesMapping Taxonomies, Thesauri, and Ontologies
Mapping Taxonomies, Thesauri, and OntologiesHeather Hedden
 
Selecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology ManagementSelecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology ManagementHeather Hedden
 
A Brief Introduction to Knowledge Graphs
A Brief Introduction to Knowledge GraphsA Brief Introduction to Knowledge Graphs
A Brief Introduction to Knowledge GraphsHeather Hedden
 
Managing Taxonomy Tagging
Managing Taxonomy TaggingManaging Taxonomy Tagging
Managing Taxonomy TaggingHeather Hedden
 
Taxonomy Design for SharePoint
Taxonomy Design for SharePointTaxonomy Design for SharePoint
Taxonomy Design for SharePointHeather Hedden
 
Taxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPressTaxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPressHeather Hedden
 
Taxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy DesignTaxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy DesignHeather Hedden
 
Taxonomies for E-commerce
Taxonomies for E-commerceTaxonomies for E-commerce
Taxonomies for E-commerceHeather Hedden
 
Mapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual TaxonomiesMapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual TaxonomiesHeather Hedden
 
Making Decisions in Creating Taxonomies
Making Decisions in Creating TaxonomiesMaking Decisions in Creating Taxonomies
Making Decisions in Creating TaxonomiesHeather Hedden
 
Taxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-IndexingTaxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-IndexingHeather Hedden
 

Plus de Heather Hedden (17)

Introduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdfIntroduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdf
 
Benefits of Taxonomies
Benefits of TaxonomiesBenefits of Taxonomies
Benefits of Taxonomies
 
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
 
A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOS
 
Mapping Taxonomies, Thesauri, and Ontologies
Mapping Taxonomies, Thesauri, and OntologiesMapping Taxonomies, Thesauri, and Ontologies
Mapping Taxonomies, Thesauri, and Ontologies
 
Selecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology ManagementSelecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology Management
 
A Brief Introduction to Knowledge Graphs
A Brief Introduction to Knowledge GraphsA Brief Introduction to Knowledge Graphs
A Brief Introduction to Knowledge Graphs
 
Managing Taxonomy Tagging
Managing Taxonomy TaggingManaging Taxonomy Tagging
Managing Taxonomy Tagging
 
Taxonomies for Users
Taxonomies for UsersTaxonomies for Users
Taxonomies for Users
 
Taxonomy Design for SharePoint
Taxonomy Design for SharePointTaxonomy Design for SharePoint
Taxonomy Design for SharePoint
 
Taxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPressTaxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPress
 
Taxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy DesignTaxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy Design
 
Testing Taxonomies
Testing TaxonomiesTesting Taxonomies
Testing Taxonomies
 
Taxonomies for E-commerce
Taxonomies for E-commerceTaxonomies for E-commerce
Taxonomies for E-commerce
 
Mapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual TaxonomiesMapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual Taxonomies
 
Making Decisions in Creating Taxonomies
Making Decisions in Creating TaxonomiesMaking Decisions in Creating Taxonomies
Making Decisions in Creating Taxonomies
 
Taxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-IndexingTaxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-Indexing
 

Dernier

Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Dernier (20)

Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Managing Mature Taxonomies: Resolving Orphan Terms

  • 1. Managing Mature Taxonomies: Resolving Orphan Terms SLA Taxonomy Division Webinar December 12, 2016 Heather Hedden Senior Vocabulary Editor Metadata Standards and Services Gale | Cengage Learning
  • 2. Heather Hedden  Senior vocabulary editor, Cengage Learning, 1996-2004, 2014-present  Author of The Accidental Taxonomist (2010, 2016)  Continuing education instructor, Simmons College School of Library and Information Science  Former taxonomy consultant Gale, a Cengage Learning Company  Subscription databases to libraries: GVRL ebooks, In Context, Academic OneFile, Business Collection, Literature Resource Center, etc.  Web products to the public: Questia, Books & Authors, HighBeam Research, Encyclopedia.com  Gale Research reference books, directories, and other book imprints (Greenhaven, Thorndike, St. James Press, etc.)  Primary Source Media digital archives (Artemis)  Legacy library database vendor companies: Information Access Company, Predicasts 2
  • 3.  Managed by four vocabulary editors, divided by broad subject area
  • 4. Outline  Taxonomies, Thesauri, and Orphan Terms  The Gale Project to Review Orphan Terms  Issues in Finding Parents to Orphan Terms 4
  • 5. Taxonomies, Thesauri, and Orphan Terms Taxonomies and Thesauri Compared 5 Less MoreControlled Vocabularies - Complexity Pick List Synonym Ring Authority File Taxonomy Thesaurus Ontology Ambiguity control Synonym control Ambiguity control Synonym control Ambiguity control (Synonym control) Hierarchical relationships Ambiguity control Synonym control Hierarchical relationship Associative relationships Ambiguity control (Synonym control) Semantic relationships Classes
  • 6. Taxonomies, Thesauri, and Orphan Terms Taxonomies and Thesauri Compared 6 Taxonomies  All terms belong to a limited number of major hierarchies (or facets)  May bend standard hierarchical rules.  Supports classification, categorization, and concept organization. (Like Linnaean taxonomy.)  Approach is a top-down navigation. Thesauri  All terms have relationships, but “hierarchies” can comprise as few as 2 terms.  ANSI/NISO or ISO standard rules are strictly followed.  Supports concept scoping, disambiguation, and relationships with similar concepts. (Like looking up in Roget’s.)  Approach is term-centered and what terms are linked to/from it.
  • 7. Taxonomies, Thesauri, and Orphan Terms Hierarchical Relationship Rules (ANSI/NISO Z.39.19 Guidelines) 7 1. Generic-Specific Category or class NT members/types Narrower term “is/are a kind of” broader term. Plants NT Trees 3. Whole-Part Concept or entity NT Part or sub-entity Narrower term ‘is in” broader term (as an integral part). France NT Paris Digestive system NT Stomach 2. Generic-Instance Common noun NT Proper noun Narrower term is an instance of broader term. Smartphones NT Samsung Galaxy
  • 8. Taxonomies, Thesauri, and Orphan Terms Orphan Term Definitions 1. Terms with no hierarchical or associative relationships (ANSI/NISO Z.39.19 definition)  Not permitted in taxonomies or thesauri 2. Terms with no hierarchical (broader or narrower) relationships (“hierarchical orphans”)  Not permitted in taxonomies; may be permitted in thesauri 3. Terms with no broader terms (no broader/parent, thus “orphans”) that are not intended as top terms  Not desired in taxonomies or thesauri The problem: Due to the lack of relationships to other term, orphan terms cannot be found by users when browsing the taxonomy/thesaurus. (Can be found by search, though) 8
  • 9. Gale Orphan Term Review Project Gale Subject Thesaurus  Used along with multiple separate name authority files and other classification metadata for indexing articles and various other content resources  60,000 preferred terms and always growing  Managed by four vocabulary editors, divided by broad subject area  Terms belong to one or more of 6 subject areas: Business, Health/Medicine, Humanities, Social Sciences, Science Technology  Developed in the 1970s based on LCSH  Thoroughly revised in the early 2000s to become an ANSI/NISO Z.39.19-compliant thesaurus ̶ Project changed See also relationships to either BT/NT or to RT as appropriate. ̶ If terms were left as hierarchical orphans, that was ignored. 9
  • 10. Gale Orphan Term Review Project Orphan Term Review Project Background 1. Orphan terms with lacking any relationships (hierarchical or associative)  Thesaurus management software has report option for this kind of “orphans”  Vocabulary editors can/should periodically run reports on their sections of the vocabulary to clean up these kinds of orphans, which are always unacceptable. 2. “Orphan” terms lacking only broader terms  A back-end system report needs to be run for this  Vocabulary editors review the report to either approve these terms as top terms and/or to add relationships to them. 10
  • 11. Gale Orphan Term Review Project Orphan Term Review Project Background  Started as a project in April 2014 when a new vocabulary editor joined the team.  A “back burner” project for vocabulary editors to work on when they are not busy with higher priorities. No timeline or deadline.  Identified 2420 “orphan” terms (those with no BTs), put them in a spreadsheet  Two people split the list and provided an initial review with recommendations of broader terms, where applicable, or comments.  The orphan term list was sorted by subject category and sub-lists of orphan terms for each category were assigned to each vocabulary editor for more detail. 11
  • 12. Gale Orphan Term Review Project 12
  • 13. Gale Orphan Term Review Project Orphan Term Project Methodology Goal: Create broader term relationships to existing terms, if it complies with ANSI/NISO rules. If not…  Creating a new broader term is possible, but must follow policies of justification for creating new terms: usage warrant, authoritative source(s), and practicality of a new term.  Leaving a term as an orphan is OK, but then at least an RT relationship should be present, ideally more than one.  Changing or deleting the term (or subsuming into an existing term) might also be considered, upon further research. Occasionally, orphans are simply not good terms. 13
  • 14. Gale Orphan Term Review Project Orphan Term Project Methodology Resolutions indicated on spreadsheet and entered in thesaurus management system 14
  • 15. Gale Orphan Term Review Project Causes of Orphan Terms  During previous project that put the Subjects into a thesaurus format (changing See also relationships to either BT/NT or to RT as appropriate), if terms were left as orphans, that was ignored.  Quickly created terms for immediate indexing needs, whose relationships were not completed.  Terms for which a broader term is uncertain, and it would take time and effort, and perhaps changes to other terms (disambiguation) to resolve.  Terms correctly created, with all correct relationships, for which a correct broader term simply does not exist. 15
  • 16. Issues in Finding Parents to Orphan Terms Finding imperfect Broader Terms Stretching the permissibility of BT/NT rules Example orphan terms and their proposed questionable broader terms:  Atmospheric composition BT Atmosphere?  Atmospheric haze BT Atmosphere?  Conflict termination (Military science) BT Wars?  Behavior problems BT Behavior?  Probably OK 16
  • 17. Issues in Finding Parents to Orphan Terms Finding imperfect Broader Terms Stretching the permissibility of BT/NT rules: Topics within a field Considering the narrower term “is in” the field. Example orphan terms and their proposed questionable broader terms:  Coping (Psychology) BT Psychology?  Convergence (Mathematics) BT Mathematics?  Decision analysis BT Management science?  Cell population BT Cytology?  Chemical models BT Chemistry?  Carbon rationing BT Environmental economics?  Maybe not OK. (Would be OK in a hierarchical taxonomy, rather than a thesaurus.) 17
  • 18. Issues in Finding Parents to Orphan Terms Leaving terms as orphans (although including RTs) Logical parent would be too broad  College applications - We won’t create BT Applications  Animal tracks – We won’t create BT Tracks Abstract terms without broader terms  Controversy Complex concepts that are not what they seem  Haunted houses – It does not belong as a narrower term to Housing (UF Houses) Legacy LC pre-coordinated concepts that have no single broader term  Computers and children not NT to either computers or children 18
  • 19. Issues in Finding Parents to Orphan Terms Parents Found! Examples  Alteration (Clothing) BT Tailoring  Apathy BT Emotions  Conscious sedation BT Anesthesia  Stockrooms BT Storage (Physical) 19
  • 20. Questions/Contact Heather Hedden Senior Vocabulary Editor Indexing & Vocabulary Services Metadata Standards and Services Gale | Cengage Learning 20 Channel Center St., Boston, MA 02210 (o) 617-757-8211 | (m) 978-467-5195 heather.hedden@cengage.com www.gale.com www.cengage.com heather@hedden.net www.accidental-taxonomist.com 20