Mitchell, Joan S., Marcia Lei Zeng, and Maja Zumer. Presented at the International UDC Seminar 2011, Classification & Ontology, The Hague, The Netherlands, Sept. 19-20, 2011.
Extending models for controlled vocabularies to classification systems: modelling DDC with FRSAD
1. Extending Models for Controlled Vocabularies to Classification Systems: Modelling DDC with FRSAD. Joan S. Mitchell (OCLC, Inc.); Marcia Lei Zeng (Kent State University); Maja Žumer (University of Ljubljana, Slovenia)
2. The big question: Can the FRSAD conceptual model be extended beyond subject authority data (its original focus) to model classification data?
3.
4. From Knowledge Organisation Systems to Data and Conceptual Models: Timeline. DDC (1876); LCSH (1898); UDC (1905); TEST* (1967); ISO 2788* (1974); ISO 5964* (1985); FRBR (1998); SKOS and OWL (2004-2009); FRAD (2009); FRSAD (2010). * TEST: Thesaurus of Engineering and Scientific Terms. ISO 2788 (1974): Guidelines for the Establishment and Development of Monolingual Thesauri. ISO 5964 (1985): Guidelines for the Establishment and Development of Multilingual Thesauri.
5. From Knowledge Organisation Systems to Data and Conceptual Models: Modelling efforts. Thesauri: mostly comply with ISO 2788 and ISO 5964. Subject heading schemes: adopted the basic structure of the thesaurus since the 1990s. Classification systems: implemented different practices and are usually constructed according to specific conventions and examples. [Timeline diagram, 1876 to 2010, annotated with scope labels: classification, subject headings, thesauri, KOS, ontology; standards and models shown: ISO 2788, ISO 5964, FRBR, FRAD, FRSAD, SKOS, OWL.]
6.
7. The FRBR family models: main entities and relationships FRBR FRAD FRSAD
10. The core of the FRSAD conceptual model. FRSAD Part 1: WORK has as subject THEMA / THEMA is subject of WORK. FRSAD Part 2: THEMA has appellation NOMEN / NOMEN is appellation of THEMA. NOMEN = any sign or sequence of signs (alphanumeric characters, symbols, sound, etc.) that a thema is known by, referred to or addressed as.
11. The ‘has appellation’ relationship between thema and nomen in a controlled vocabulary. Note: in a given controlled vocabulary and within a domain, a nomen should be an appellation of only one thema.
22. Nomens: DDC number, full caption, URI. DDC number: 025.04. Full caption: Computer science, information & general works/Library & information sciences/Operations of libraries, archives, information centers/Information storage and retrieval systems. URI: http://dewey.info/class/025.04/
23. Thema: any topic co-extensive with the full meaning of the class; topics that are functionally equivalent to the class.
24. Scope note: text describing or defining the thema or specifying its scope within a particular system. Scope note (≠ thema/class).
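The relationships on slides 10-11 and the DDC example on slides 22-24 can be sketched in Python. This is only an illustration under stated assumptions: the names `Thema`, `Vocabulary`, `add_nomen`, and `lookup` are hypothetical and are not part of FRSAD or of any DDC tooling. The sketch shows three things: several nomens can name one thema, a nomen names only one thema within a vocabulary, and a scope note is stored apart from both.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Thema:
    """A subject entity; `label` is only a local handle (hypothetical)."""
    label: str
    scope_note: str = ""  # descriptive text: neither a nomen nor the thema itself
    nomens: List[str] = field(default_factory=list)


class Vocabulary:
    """One controlled vocabulary enforcing 'a nomen names only one thema'."""

    def __init__(self) -> None:
        self._by_nomen: Dict[str, Thema] = {}

    def add_nomen(self, thema: Thema, nomen: str) -> None:
        owner = self._by_nomen.get(nomen)
        if owner is not None and owner is not thema:
            raise ValueError(f"{nomen!r} already names another thema")
        if owner is None:
            self._by_nomen[nomen] = thema
            thema.nomens.append(nomen)

    def lookup(self, nomen: str) -> Thema:
        return self._by_nomen[nomen]


# The three nomens from slide 22, all naming the same thema.
CAPTION = (
    "Computer science, information & general works/"
    "Library & information sciences/"
    "Operations of libraries, archives, information centers/"
    "Information storage and retrieval systems"
)
ddc = Vocabulary()
isr = Thema(label="information storage and retrieval systems")
for nomen in ("025.04", CAPTION, "http://dewey.info/class/025.04/"):
    ddc.add_nomen(isr, nomen)
```

Looking up any of the three nomens returns the same `Thema` object, while attaching "025.04" to a different thema in the same vocabulary raises an error.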
34. French (DDC 22); German (DDC 22); Italian (DDC 22); Swedish Mixed (DDC 22); Italian (A14); Vietnamese (A14); French (A14); Spanish (A14); Hebrew (A14); 200 Religion Class; Guide (French); DDC Sach-Gruppen (German). Legend: DDC 22, A14. DDC Summaries: English, French, Italian, Rhaeto-Romansch; Afrikaans, Arabic, Chinese, French, German, Norwegian, Portuguese, Russian, Scots Gaelic, Spanish, Swedish.
35. Mappings and crosswalks: DDC, LCSH, MeSH, SWD, RAMEAU, SAB, BISAC, SEARS, CSH, UDC, LCC, SAO, Nuovo Soggettario.
Note: a line in white font on the slide reads: Thema = “any entity used as a subject of a work”. Indirect question: Will a general conceptual model be useful to model classification data?
This slide places the development of knowledge organization systems on a timeline. It shows that standards and models were not developed ahead of the systems; rather, they were initiated much later.
Ref: http://catalogingandclassificationquarterly.com/ccq29nr1-2ed.htm Alva Stone, "The LCSH Century: A Brief History of the Library of Congress Subject Headings, and Introduction to the Centennial Essays." Cataloging & Classification Quarterly, Volume 29, Number 1-2, 2000: "Technically speaking, LC’s publication of its subject headings list did not really begin in 1898. That was instead the year in which the Library of Congress converted from an author- plus a classed-catalog to a dictionary catalog, which incorporated author, title, and subject entries into a single file. The first actual printing of Subject Headings Used in the Dictionary Catalogues of the Library of Congress (later to be titled Library of Congress Subject Headings) began in the summer of 1909, …"
UDC was published in 1905-1907; according to sources, work on UDC started in 1896 (as suggested by Aida).
ISO 2788 (1974), Guidelines for the Establishment and Development of Monolingual Thesauri, published by the International Organization for Standardization. ISO 5964 (1985), Guidelines for the Establishment and Development of Multilingual Thesauri, published by the International Organization for Standardization.
Another view of this chart. Although standards were developed, they were written specifically for thesauri. Thesauri usually comply with international standards such as ISO 2788 and ISO 5964. Subject heading schemes such as LCSH have also adopted the basic structure of the thesaurus over the last two decades. Classification systems have implemented different practices and are usually constructed according to specific conventions and examples. SKOS and FRSAD also focused more on controlled vocabularies than on classifications. Although they aimed at KOS in general, they were heavily influenced by thesaurus standards and practices, simply because no standards for classification systems existed yet.
An overview of the FRBR family conceptual models. Ref: Žumer, Zeng, and Salaba, 2010. FRBR: A Generalized Approach to Dublin Core Application Profiles. Proc. Int’l Conf. on Dublin Core and Metadata Applications 2010.
This model is generated within the FRBR framework. See next slide.
The three major entities (work, thema, and nomen) are still visible. Here the original FRBR entities and relationships are used to show the roots of FRSAD's development. In this context, thema is defined as ‘any entity used as a subject of a work’.
Other hierarchical relationships include polyhierarchical, faceted, and perspective hierarchical structures.
This is why nomen (in general) has to be an entity, not an attribute of thema. An instance of a nomen may have parts: a personal name is a combination of first name and last name (‘Albert Einstein’ has parts ‘Albert’ and ‘Einstein’); a subject heading string is a combination of terms (‘Universities and colleges--Employees--Labor unions--Germany’). In such cases a whole-part relationship (partial relationship) exists between the nomen and its components. In a particular knowledge organization system, rules are established to govern the creation of complex nomens from such components.
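The whole-part relationship between a nomen and its components can be sketched in Python as follows. The `Nomen` class and the `compose` helper are hypothetical illustrations, not FRSAD-defined structures. The `--` separator follows the subject heading string quoted above; joining a personal name with a single space is an assumed simplification of real name-construction rules.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class Nomen:
    """A nomen as an entity in its own right, possibly built from parts."""
    value: str
    parts: Tuple["Nomen", ...] = ()


def compose(components: Iterable[str], separator: str) -> Nomen:
    """Build a complex nomen from simpler ones under one system's rule."""
    parts = tuple(Nomen(c) for c in components)
    return Nomen(separator.join(p.value for p in parts), parts)


# Personal name: two parts joined by a space (assumed rule).
name = compose(["Albert", "Einstein"], " ")

# Subject heading string: terms joined by "--", as in the example above.
heading = compose(
    ["Universities and colleges", "Employees", "Labor unions", "Germany"],
    "--",
)

print(name.value)
print(heading.value)
```

Because each part is itself a `Nomen`, it can carry its own attributes or be reused in other complex nomens, which is harder to express if a nomen is stored as a plain string attribute of a thema.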
Case-by-case analysis may be needed to align FRSAD elements with macro-structures that inherit certain conventions.