Contenu connexe Similaire à 2011 Taxonomy Standards Update Similaire à 2011 Taxonomy Standards Update (20) 2011 Taxonomy Standards Update1. Taxonomy Boot Camp
Standards Update
Marjorie M.K. Hlava
mhlava@accessinn.com
Data Harmony Software
Access Innovations, Inc.
www.accessinn.com
Leveraging your content semantically
2. What do we need a standard for??
2
© 2010. Access Innovations, Inc. All Rights Reserved.
3. Why Standards?
“Standards save money.”
“Standards accelerate projects.”
“Standards make better solutions.”
“Standards provide better governance for
solutions.”
“Standards protect your investment for the
future.”
“Standards save money for future projects.”
http://michaelrowell.com/wp/
3
© 2010. Access Innovations, Inc. All Rights Reserved.
5. What does it take to make the
components of the digital
information model work…
5
© 2010. Access Innovations, Inc. All Rights Reserved.
8. Standards Groups
ISO
W3C – World Wide Web Consortium
NISO
W3C
IFLA
US Library of Congress
Dublin Core
New things as they come up
© 2010. Access Innovations, Inc. All Rights Reserved.
9. Standard Definition
Universally or widely accepted, agreed upon, or
established means of determining what something
should be. Major classifications of this term include:
Material or substance whose properties are known
with a level of accuracy that is sufficient to allow its
use as a physical reference in calibrating or
measuring the same properties of another material or
substance.
Concept, norm, or principle established by
agreement, authority, or custom, and used generally
as an example or model to compare or measure the
quality or performance of a practice or procedure.
9
10. Standard Definition
Written definition, limit, or rule approved
and monitored for compliance by an
authoritative agency (or professional or
recognized body) as a minimum
acceptable benchmark. This is the usual
meaning of the plural term (standards).
See also specification.
10
11. Traditional Standards
ISO
TC 46
SC 9
ISO/CD 25964-1
ANSI
NISO
Z39.19
BSI
BS 8723
1
1
© 2010. Access Innovations, Inc. All Rights Reserved.
12. More Standards Groups
W3C
OWL
SKOS
US Government
Office of Management and
Budget
European Union
Individual Organization Attempts
12
© 2010. Access Innovations, Inc. All Rights Reserved.
13. The Process
ISO
137 Nations one vote each
ANSI
US organization
NISO
US Information and Library Standards
Tag 46 – ISO information Standards
NISO is the secretariat
13
© 2010. Access Innovations, Inc. All Rights Reserved.
14. Impatient others
W3C
World Wide Web Consortium
Ad hoc groups
Author ID’s
ORCID
VIVO
Industry and National Groups
Library of Congress
Associations for their industry
SMPTE
Book Industry Study Group
ONIX
14
© 2010. Access Innovations, Inc. All Rights Reserved.
15. Proprietary ―Standards‖
PDF – Portable Document Format
Adobe
CSV
Comma Separated Values
Microsoft
15
© 2010. Access Innovations, Inc. All Rights Reserved.
16. Soliciting Input – Formal Process
Standard need
Group with a frustration – need
Standard proposal
NISO topic groups, CCM and others
Call for participation
Write the draft standard
Open for comments
NO must be resolved
Standard for vote
Approved for five years
16
© 2010. Access Innovations, Inc. All Rights Reserved.
17. Soliciting Input – Informal Process
Standard need
Group with a frustration – need
Create a Working Group
Write the draft standard
Publish to the web
Allow comments
Resolve comments
When no comments considered OK
Start implementing / Early implementers
17
© 2010. Access Innovations, Inc. All Rights Reserved.
18. Taxonomies don’t exist in a vacuum
They are part of metadata
They are used to tag information objects
They are used on
• Web sites
• In Search
• To profile people
• To link resources
So we have to know a little about those
standards as well
18
19. Groups of standards
Identifiers
Records management
Country Codes
Transliteration
Formats
Taxonomy related
19
© 2010. Access Innovations, Inc. All Rights Reserved.
20. Identifiers
ISO/IEC CD 27037, Information technology —
Security techniques —
Guidelines for identification, collection, acquisition,
and preservation of digital evidence
Information and documentation –
International standard document link
ISO/FDIS 24619, Language resource
management —
Persistent identification and sustainable access
(PISA)
20
© 2010. Access Innovations, Inc. All Rights Reserved.
21. Records management
ISO/FDIS 16175-2, Information and documentation
Principles and functional requirements for records
in electronic office environments —
Part 2: Guidelines and functional requirements for
digital records management systems
ISO/DTR 17068 Information and documentation
Records management — Third party repository for
electronic records
ISO/DIS 13008, Information and documentation —
Digital records conversion and migration process
21
© 2010. Access Innovations, Inc. All Rights Reserved.
22. Country Codes
Systematic review ISO 3166-2:2007,
Codes for the representation of names of
countries and their subdivisions — Part 2:
Country subdivision code
22
© 2010. Access Innovations, Inc. All Rights Reserved.
23. Transliteration
Systematic review ISO 3602:1989,
Documentation –
Romanization of Japanese (kana script)
Systematic review of ISO 11940-2:2007,
Information and documentation —
Transliteration of Thai characters into Latin
characters, Part 2: Simplified transcription of Thai
language
23
© 2010. Access Innovations, Inc. All Rights Reserved.
24. Formats
ISO/DIS 14289-1, Document
management applications — Electronic
document file format enhancement for
accessibility — Part 1: Use of ISO 32000-
1 (PDF/UA-1)
24
© 2010. Access Innovations, Inc. All Rights Reserved.
25. RFID
ISO/FDIS 28560, Information and
documentation — RFID in libraries
Part 1: Data elements and general guidelines for
implementation
Part 2: Encoding of RFID data elements based on
rules from ISO/IEC 15962
Part 3: Fixed length encoding
25
© 2010. Access Innovations, Inc. All Rights Reserved.
26. Internet Engineering Task Force
(IETF)
Brought you EMAIL
Brought you the Internet
Now XSF
XMPP Standards Foundation
define open protocols for presence, instant
messaging, and real-time communication
collaboration on top of the IETF’s Extensible
Messaging and Presence Protocol (XMPP).
Now in Google 2011
26
© 2010. Access Innovations, Inc. All Rights Reserved.
27. W3C
HTML 5
Linked Data
Ontologies (OWL) and SKOS
Simple Knowledge Organization System
Cascading Style Sheets (CSS),
Adding style to Web content.
Widgets
Widget Packaging and XML Configuration,
Widget Interface,
API to metadata and persistently storing data.
XML Digital Signatures for Widgets,
27
© 2010. Access Innovations, Inc. All Rights Reserved.
28. Big Library Followings
DCMI – Dublin Core Metadata Initiative
Functional requirements
Library of Congress
28
© 2010. Access Innovations, Inc. All Rights Reserved.
29. Library of Congress
MARC 21 formats and MARCXML
VRA Core -- them
METS (Metadata Encoding & Transmission
Standard)
MIX (NISO Metadata for Images in XML) -
PREMIS (Preservation Metadata) -
TextMD (Technical Metadata for Text) –
ALTO - Technical Metadata for Optical
Character Recognition
Extended Date/Time Format (EDTF)
29
© 2010. Access Innovations, Inc. All Rights Reserved.
30. Taxonomy related standards
Vocabularies
Taxonomies
Thesaurus
Knowledge Organization Systems
Terminologies
30
© 2010. Access Innovations, Inc. All Rights Reserved.
31. Vocabularies
ISO/TC 46
SC 6 and SC 9
Information and documentation
ISO/TC 37,
Terminology and other language and content
resources
31
© 2010. Access Innovations, Inc. All Rights Reserved.
32. Thesaurus related
NISO Z39.19 2010 www.niso.org
ISO 2788 - Monolingual (1986)
ISO 5964 - Multilingual (1985)
ISO 5127, Information and documentation
Vocabulary
BSI (BS 8723) the next revised ISO
ISO 25964 / part 1 – Controlled Vocabularies
ISO25964 / part 2 – Taxonomy Interoperability
OWL from W3C
SKOS the W3C thesaurus standard
Dublin Core DCMI Functional requirements 3
2
© 2010. Access Innovations, Inc. All Rights Reserved.
33. Thesaurus and Indexing Standards –
ANSI/NISO
NISO Z39.19-2005 (2010) Guidelines for the
Construction, Format, and Management of
Monolingual Controlled Vocabularies
NISO TR02-1997 Guidelines for Indexes and
Related Information Retrieval Devices
by James D. Anderson
3
3
© 2010. Access Innovations, Inc. All Rights Reserved.
34. ISO TC 46/SC 9
Information and Documentation - Identification
and Description
TC 46 is ISO's Technical Committee (TC) for
information and documentation standards.
SC 9 is the TC 46 Subcommittee (SC) that
develops and maintains ISO standards on the
identification and description of information
resources.
3
4
© 2010. Access Innovations, Inc. All Rights Reserved.
35. Thesaurus and Indexing
Standards – ISO
ISO 2788:1986
Documentation - Guidelines for the establishment and
development of monolingual thesauri
ISO 5964:1985
Documentation - Guidelines for the establishment and
development of multilingual thesauri
ISO 5963:1985
Documentation - Methods for examining documents,
determining their subjects, and selecting indexing terms
ISO 999:1996
Information and documentation - Guidelines for the
content, organization and presentation of indexes
3
5
© 2010. Access Innovations, Inc. All Rights Reserved.
36. New ISO Taxonomy Standard
ISO 25964. Thesauri and interoperability
with other vocabularies
Part 1: Thesauri for information retrieval
(Approved)
Part 2: Interoperability with other
vocabularies (released for comment )
Stella Dexter Clarke, principal author
3
6
© 2010. Access Innovations, Inc. All Rights Reserved.
37. ISO TC 37
Scope of ISO TC 37:
Standardization of principles, methods and
applications relating to terminology and other
language resources.
TC 37/SC 1 - Principles and methods
TC 37/SC 2 - Terminography and lexicography
TC 37/SC 3 - Computer applications for
terminology
TC 37/SC 4 - Language resource management
3
7
© 2010. Access Innovations, Inc. All Rights Reserved.
38. OntoIOp
Terminographical and lexicographical
working methods, for a new project
proposal on: Ontology Integration and
Interoperability (OntoIOp).
A reference ontology for the interchange of
cultural heritage information.
ICOM (International Council of Museums) CIDOC
(International Committee for Documentation)
38
© 2010. Access Innovations, Inc. All Rights Reserved.
39. ISO 1087 Terminology Work
Vocabulary
ISO 1087-1:2000 Terminology work -
Vocabulary - Part 1: Theory and
application
ISO 1087-2:2000 Terminology work -
Vocabulary - Part 2: Computer
applications
39
© 2010. Access Innovations, Inc. All Rights Reserved.
40. ISO 1087
40
© 2010. Access Innovations, Inc. All Rights Reserved.
41. Terminology work — Vocabulary
41
© 2010. Access Innovations, Inc. All Rights Reserved.
43. Other ISO standards:
Concept-oriented terminology
ISO 704:2000 Terminology work - Principles and
methods
ISO 860:1996 Terminology work - Harmonization of
concepts and terms
ISO 10241:1992 Preparation and layout of
international terminology standards
43
© 2010. Access Innovations, Inc. All Rights Reserved.
44. ISO - Data Categories
ISO 12200:1999 Computer applications in
terminology - Machine-readable terminology
interchange format (MARTIF) - Negotiated
interchange
ISO 12616:2002 Translation-oriented
terminography
ISO/TR 12618:1994 Computer aids in terminology -
Creation and use of terminological databases and
text corpora
ISO 12620:1999 Computer applications in
terminology - Data categories used to create
glossaries
44
Others in the work codes
© 2010. Access Innovations, Inc. All Rights Reserved.
45. W3C
OWL – Web Ontology Language
RDF – Resource Description Format
Topic Maps
SKOS - Simple Knowledge Organization
Systems
SKOS 2 DCMI
TURTLE
Which community to serve?
45
© 2010. Access Innovations, Inc. All Rights Reserved.
47. A data wrapper
47
© 2010. Access Innovations, Inc. All Rights Reserved.
48. RDF Triples
Subject
Object
Predicate
Linked data source standard
Action path of ontologies
48
© 2010. Access Innovations, Inc. All Rights Reserved.
52. Other Relevant ISO & W3C
Standards
Metadata standards overview
http://www.slis.kent.edu/~mzeng/metadataba
sics/completelist.htm
Review of SKOS / DCMI / Taxonomy
Standards
http://nkos.slis.kent.edu/
52
© 2010. Access Innovations, Inc. All Rights Reserved.
54. SKOS
SKOS 1 –
no synonyms,
no polyhierarchies
SKOS 2 –
Added the above
Allow other fields (elements) on request
OWL Crosswalk
NISO Z39.19, BSI 8723, and ISO 25964
54
© 2010. Access Innovations, Inc. All Rights Reserved.
55. Who supports SKOS - Everyone
Data Harmony Thesaurus Master
Synaptica
SmartLogic
WordMap
Poolparty
Top Quadrant
Protégé
Etc.
55
© 2010. Access Innovations, Inc. All Rights Reserved.
56. Places to watch
Other W3C and ISO areas
Support groups
Blogs
Communities of Practice
WSDL – Web Services Digital Library
DCMI
NKOS
ISKO
Linked Data 56
© 2010. Access Innovations, Inc. All Rights Reserved.
63. Lists of Taxonomy Resources
Registry? NKOS KOS of KOS
SKOS participants – W3C
KOS typology – Tudhope
TaxoBank.org
Tesauro.com – Spanish - Salama
Kent.edu site – Marcia Zeng
Taxonomy Warehouse – Synaptica
UMLS - Unified Medical Language
System - NIH 63
© 2010. Access Innovations, Inc. All Rights Reserved.
65. What about?
Authority Files
Other pick lists
Roget's and other synonym rings
Dictionaries
Gazetteers
Glossaries
Etc.
65
© 2010. Access Innovations, Inc. All Rights Reserved.
66. Standards and pragmatism
Use Standards
Lead to richer, more informative product
Promote interoperability -- Allow you to adopt or
adapt other controlled vocabularies
Promote predictability
Allow repurposing within your organization and by
other organizations
Follow thesaurus standards for taxonomy
Incorporate authority files / final nodes as needed
Your taxonomy or thesaurus must meet your needs 66
© 2010. Access Innovations, Inc. All Rights Reserved.
67. Author Systems & Disambiguation
Vivo
UniPHY AIP
Collexis
Now part of ESP
SciVal Experts
Author Net
Cambridge University Press
ORCID
67
© 2010. Access Innovations, Inc. All Rights Reserved.
69. The Problem – KEEPING UP
Many players we know and don’t know
Between controlled vocabulary standards
ISO 2788 and 5964,
BSI 8723
Groups developing guidelines and standards
W3C with SKOS and OWL
Governments world wide developing and
mandating taxonomies
Communities
Increase reuse
Mapping interoperability between controlled 6
vocabularies 9
© 2010. Access Innovations, Inc. All Rights Reserved.
70. How to be active -Standards Role
Voting member of NISO, CEBEMA, etc
Member of the NISO Content Board
Crosswalks for NISO, ISO, W3C SKOS
Standards committees SLA, NFAIS, ASIST
DCMI Functional requirements for KOS
Active in NKOS
Active in ISKO
Read the blogs, COP
70
ORCID Prototype (one of…)
© 2010. Access Innovations, Inc. All Rights Reserved.
71. The Games
71
© 2010. Access Innovations, Inc. All Rights Reserved.
72. Standards are our guiding light
Thank you
Marjorie M.K. Hlava
President
Access Innovations, Inc.
Data Harmony
mhlava@accessinn.com
505-998-0800
www.taxodiary.com - the
taxonomy news blog
mmkhlava = twitter
mhlava = facebook,
linkedin, eacademy, plaxo
© 2010. Access Innovations, Inc. All Rights Reserved.
73. Questions??
Okay Enough – go have a drink!
Marjorie M.K. Hlava
President
Data Harmony Software
Access Innovations, Inc.
mhlava@accessinn.com