SlideShare une entreprise Scribd logo
1  sur  25
#mashcat:
Evolving MarcEdit
LEVERAGING SEMANTIC DATA IN MARCEDIT
Little History
MarcEdit Development started around 1999ish (as parts)
◦ Originally coded in 3 programming languages: Assembler (libraries), Visual Basic (UI) and Delphi (COM).
◦ I started writing it as an undergraduate to better understand MARC & circumvent OCLC’s Passport for
Windows program
◦ First “MarcEdit” was released Sept. 11, 2000 (thank you WayBack Machine:
http://web.archive.org/web/20001017105529/http://ucs.orst.edu/~reeset/marcedit/indexb.html)
Today:
◦ Written in C# (Windows/Linux) & Object-C/C# (OSX)
◦ Active user community is ~20,000ish (based on update logs)
◦ Used in ~190ish countries/political regions
◦ Roughly 1/3 of the users reside outside of Canada/United States*
* Based on loose analysis of server logs by my server-side stats software
MarcEdit Evolution
MarcEdit 1.0-2.0 Main Window MarcEdit MARC Tools 1.0-2.0
MarcEdit 1.0-2.0 MarcEditor
MarcEdit Evolution
Early application was developed to (again, thank you Internet Archive):
1. Be user-friendly (whether I’ve accomplished that is debatable – I’m not a UI designer)
2. Support LC’s MARCBreakr/Maker diacritics (largely yes)
3. Be fast (which I think that it is)
4. Simplify editing records in batch
5. Provide a set of programming tools to solve my own local needs
MarcEdit Today
Three development rules I follow
MarcEdit is a real-world metadata tool
◦ Tool is designed to provide workflows for data problems currently facing libraries right now
MarcEdit is MARC Agnostic
◦ Too many metadata tools are anglo-centric; MarcEdit has been designed to work within the very
heterogeneous metadata environment that we find ourselves today, which includes:
◦ Support for MARC (not a particular flavor*)
◦ Near universal characterset support (because the world is bigger than MARC8 and UTF8)
◦ Supports a wide range of Library metadata standards beyond MARC
MarcEdit is one part of the larger library metadata tooling environment
◦ So integrations with OCLC, ILSs (when possible), OpenRefine are important
* And if something assumes MARC21 – call it out
So how does any of this relate to
semantic data in Libraries?
http://musictheorysite.com/img/dwight_question.jpg
A lot of metadata people I talk to fall into
two camps
BibFrame and Linked Data as RDA 2.0
BibFrame
http://www.wired.com/wp-content/uploads/archive/news/images/full/duke_nukem_frever_f.16807.jpg
http://astronomy.nmsu.edu/cwc/Group/magiicat/images/magiicat-logo.gif
Linked Data
BibFrame and linked data as datacorns
https://whatsthebigdata.files.wordpress.com/2015/10/datascience_unicorn.png?w=640
I prefer a more practical outlook…
https://www.etsy.com/search?q=unicorn+cat+hat
MarcEdit’s MARCNext
MarcEdit’s MARCNext is a first attempt to start
having this discussion by:
1. Integrating a linked data framework into
MarcEdit, including tooling for:
a. JSON-LD
b. SPARQL
c. RDF
2. Providing catalogers with proof of concept
tools to begin experimenting with their own
data
3. Provide a method to integrate semantic
concepts into legacy data
4. Provide a toolset that MarcEdit can use to
build new tools.
Let’s take a closer look at two
Link Identifiers Tool
◦ This tool embeds URIs into MARC data
◦ Is rules driven (i.e., not MARC21 centric)
◦ Supports ~24 different in-use data sources
Validate Headings Tool
◦ First tool in MarcEdit to make use of the tools linked data platform and available data services to
provide a real-world application.
Link Identifiers Tool
Link Identifiers Tool
Initially released in Aug. 2014[1] as a proof of concept for testing the linked data framework
being developed in MarcEdit
◦ Initially only processed LCSH and NAF
Currently, I’ve profiled ~24 data sources, and the tool can be integrated in MarcEdit’s Task
Workflow.
◦ Translation profiles are currently in flux, as I work with a PCC group developing recommendations for
embedding URIs in MARC records.
◦ Working on a process that would allow users to self-profile identifier services, so long as they supported
JSON-LD or SPARQL.
[1] MarcEdit’s Research Toolkit: MARCNext: http://blog.reeset.net/archives/1359
Link Identifiers Tool
Tool has evolved over the last year to utilize a rules based configuration (example):
<field type="bibliographic">
<tag>630</tag>
<ind2 value="0" vocab="naf_lcsh" />
<ind2 value="1" vocab="lcshac" />
<ind2 value="2" vocab="mesh" />
<subfields>adfkqnp</subfields>
<uri>0</uri>
<special_instructions>mixed</special_instructions>
</field>
<field type="authority|bibliographic">
<tag>336</tag>
<subfields>a</subfields>
<index>2</index>
<uri>0</uri>
</field>
Linked Identifiers: Turning strings
=336 $atext$btxt$2rdacontent
=337 $aunmediated$bn$2rdamedia
=338 $avolume$bnc$2rdacarrier
=600 10$6880-06$aHu, Zongnan,$d1896-1962$vDiaries.
=650 0$aGenerals$zChina$vBiography.
=650 0$aGenerals$zTaiwan$vBiography.
=600 17$aHu, Zongnan,$d1896-1962.$2fast$0(OCoLC)fst00131171
=650 7$aGenerals.$2fast$0(OCoLC)fst00939841
=651 7$aChina.$2fast$0(OCoLC)fst01206073
=651 7$aTaiwan.$2fast$0(OCoLC)fst01207854
=655 7$aDiaries.$2lcgft
=655 7$aAutobiographies.$2lcgft
Linked Identifiers: into strings+
=336 $atext$btxt$2rdacontent$0http://id.loc.gov/vocabulary/contentTypes/txt
=337 $aunmediated$bn$2rdamedia$0http://id.loc.gov/vocabulary/mediaTypes/n
=338 $avolume$bnc$2rdacarrier$0http://id.loc.gov/vocabulary/carriers/nc
=600 10$6880-06$aHu, Zongnan,$d1896-1962$vDiaries.$0http://id.loc.gov/authorities/names/n84029846
=650 0$aGenerals$zChina$vBiography.$0http://id.loc.gov/authorities/subjects/sh2008105087
=650 0$aGenerals$zTaiwan$vBiography.$0http://id.loc.gov/authorities/subjects/sh2008105117
=600 17$aHu, Zongnan,$d1896-1962.$2fast$0http://id.worldcat.org/fast/00131171
=650 7$aGenerals.$2fast$0http://id.worldcat.org/fast/00939841
=651 7$aChina.$2fast$0http://id.worldcat.org/fast/01206073
=651 7$aTaiwan.$2fast$0http://id.worldcat.org/fast/01207854
=655 7$aDiaries.$2lcgft$0http://id.loc.gov/authorities/genreForms/gf2014026085
=655 7$aAutobiographies.$2lcgft$0http://id.loc.gov/authorities/genreForms/gf2014026047
Example
Linked Data tools
Things that are still hard:
◦ Most identifier services use their own rules for data escaping – and they aren’t documented
◦ Many services are still not well suited for this work
◦ Anything that doesn’t provide an option to do an exact lookup like ULAN, AAT, or VIAF – all these require additional
processing to ensure that results match the queried term.
◦ Many services are little “p” production in that lots of look-ups can (and do) cause problems.
Validate Headings
Automated authority control processing
◦ Utilizes id.loc.gov
◦ Provides reports of data that isn’t currently “authorized”
◦ Provides options for generating brief authorities
◦ Extracts for further data processing
◦ Ability to embed URIs during validation
◦ If URIs are present – they are used rather than a direct look up
◦ Automatic heading correction when variants are encountered
Validate Headings
Validate Headings can be run from inside the
MarcEditor, or outside as a stand alone tool
Example
Continued work…
Would like to continue to add additional vocabularies
Expand headings validation to more than just LCSH/NAF
Include Linking Profiles for UNIMARC
Using Linked Data sources for sameas subject generation
Questions
Contact Information:
Terry Reese
Email: reese.2179@osu.edu or reeset@gmail.com
MarcEdit Website: http://marcedit.reeset.net
Help: http://marcedit.reeset.net/help

Contenu connexe

Tendances

Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History Terry Reese
 
Ms access 1
Ms access 1Ms access 1
Ms access 1aliamla
 
Enhancing a library OPAC with linked data
Enhancing a library OPAC with linked dataEnhancing a library OPAC with linked data
Enhancing a library OPAC with linked dataMichael Cummings
 
Large Scale Data Clean-ups & Challenges for the Library
Large Scale Data Clean-ups & Challenges for the Library Large Scale Data Clean-ups & Challenges for the Library
Large Scale Data Clean-ups & Challenges for the Library Ksenija Mincic Obradovic
 
Emerging technologies in academic libraries
Emerging technologies in academic librariesEmerging technologies in academic libraries
Emerging technologies in academic librariesMichael Cummings
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Anja Jentzsch
 
Semantic web
Semantic webSemantic web
Semantic webKarthik
 
Metadata & brokering - a modern approach #2
Metadata & brokering - a modern approach #2Metadata & brokering - a modern approach #2
Metadata & brokering - a modern approach #2Daniele Bailo
 
Participation reports webinar May 2020
Participation reports webinar May 2020Participation reports webinar May 2020
Participation reports webinar May 2020Crossref
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the webChiara Del Vescovo
 
Participation reports webinar November 2020
Participation reports webinar November 2020Participation reports webinar November 2020
Participation reports webinar November 2020Crossref
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? Debra Shapiro
 
Linked data as a library data platform
Linked data as a library data platformLinked data as a library data platform
Linked data as a library data platformJindřich Mynarz
 

Tendances (20)

Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History
 
Ms access 1
Ms access 1Ms access 1
Ms access 1
 
Enhancing a library OPAC with linked data
Enhancing a library OPAC with linked dataEnhancing a library OPAC with linked data
Enhancing a library OPAC with linked data
 
Ms access
Ms access Ms access
Ms access
 
Large Scale Data Clean-ups & Challenges for the Library
Large Scale Data Clean-ups & Challenges for the Library Large Scale Data Clean-ups & Challenges for the Library
Large Scale Data Clean-ups & Challenges for the Library
 
Emerging technologies in academic libraries
Emerging technologies in academic librariesEmerging technologies in academic libraries
Emerging technologies in academic libraries
 
Linked Data
Linked DataLinked Data
Linked Data
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)
 
Semantic web
Semantic webSemantic web
Semantic web
 
Metadata & brokering - a modern approach #2
Metadata & brokering - a modern approach #2Metadata & brokering - a modern approach #2
Metadata & brokering - a modern approach #2
 
Participation reports webinar May 2020
Participation reports webinar May 2020Participation reports webinar May 2020
Participation reports webinar May 2020
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
Participation reports webinar November 2020
Participation reports webinar November 2020Participation reports webinar November 2020
Participation reports webinar November 2020
 
Linked data for Libraries
Linked data for LibrariesLinked data for Libraries
Linked data for Libraries
 
Linked library data
Linked library dataLinked library data
Linked library data
 
Session x(ado.net)
Session x(ado.net)Session x(ado.net)
Session x(ado.net)
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection?
 
Linking library data
Linking library dataLinking library data
Linking library data
 
Wiggins-7-jun15
Wiggins-7-jun15Wiggins-7-jun15
Wiggins-7-jun15
 
Linked data as a library data platform
Linked data as a library data platformLinked data as a library data platform
Linked data as a library data platform
 

Similaire à #mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit

Semantic Web, an introduction for bioscientists
Semantic Web, an introduction for bioscientistsSemantic Web, an introduction for bioscientists
Semantic Web, an introduction for bioscientistsEmanuele Della Valle
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
Bits and Pieces from the UPEI Experience
Bits and Pieces from the UPEI ExperienceBits and Pieces from the UPEI Experience
Bits and Pieces from the UPEI ExperienceEvergreen ILS
 
Notespane - A community based learning system
Notespane - A community based learning systemNotespane - A community based learning system
Notespane - A community based learning systemIRJET Journal
 
Digital Library Applications Of Social Networking Jeju Intl Conference
Digital Library Applications Of Social Networking Jeju Intl ConferenceDigital Library Applications Of Social Networking Jeju Intl Conference
Digital Library Applications Of Social Networking Jeju Intl Conferenceguestbba8ac
 
Linked lists
Linked listsLinked lists
Linked listscaffed
 
Lessons learned from Semantic Wiki
Lessons learned from Semantic WikiLessons learned from Semantic Wiki
Lessons learned from Semantic WikiJie Bao
 
Moving from Web 1.0 to Web 2.0
Moving from Web 1.0 to Web 2.0Moving from Web 1.0 to Web 2.0
Moving from Web 1.0 to Web 2.0Estelle Weyl
 
Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013François Belleau
 
OrientDB for real & Web App development
OrientDB for real & Web App developmentOrientDB for real & Web App development
OrientDB for real & Web App developmentLuca Garulli
 
Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaThomas Kurz
 
Daniel Egan Msdn Tech Days Oc
Daniel Egan Msdn Tech Days OcDaniel Egan Msdn Tech Days Oc
Daniel Egan Msdn Tech Days OcDaniel Egan
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategySemantic Web Company
 
What's new with Apache Spark?
What's new with Apache Spark?What's new with Apache Spark?
What's new with Apache Spark?Paco Nathan
 
Linked Media Management with Apache Marmotta
Linked Media Management with Apache MarmottaLinked Media Management with Apache Marmotta
Linked Media Management with Apache MarmottaThomas Kurz
 

Similaire à #mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit (20)

Semantic Web, an introduction for bioscientists
Semantic Web, an introduction for bioscientistsSemantic Web, an introduction for bioscientists
Semantic Web, an introduction for bioscientists
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Wedi
WediWedi
Wedi
 
Bits and Pieces from the UPEI Experience
Bits and Pieces from the UPEI ExperienceBits and Pieces from the UPEI Experience
Bits and Pieces from the UPEI Experience
 
Notespane - A community based learning system
Notespane - A community based learning systemNotespane - A community based learning system
Notespane - A community based learning system
 
Digital Library Applications Of Social Networking Jeju Intl Conference
Digital Library Applications Of Social Networking Jeju Intl ConferenceDigital Library Applications Of Social Networking Jeju Intl Conference
Digital Library Applications Of Social Networking Jeju Intl Conference
 
Digital Library Applications Of Social Networking
Digital Library Applications Of Social Networking  Digital Library Applications Of Social Networking
Digital Library Applications Of Social Networking
 
Linked lists
Linked listsLinked lists
Linked lists
 
Lessons learned from Semantic Wiki
Lessons learned from Semantic WikiLessons learned from Semantic Wiki
Lessons learned from Semantic Wiki
 
Moving from Web 1.0 to Web 2.0
Moving from Web 1.0 to Web 2.0Moving from Web 1.0 to Web 2.0
Moving from Web 1.0 to Web 2.0
 
Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013
 
OrientDB for real & Web App development
OrientDB for real & Web App developmentOrientDB for real & Web App development
OrientDB for real & Web App development
 
Final Presentation
Final PresentationFinal Presentation
Final Presentation
 
Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache Marmotta
 
Daniel Egan Msdn Tech Days Oc
Daniel Egan Msdn Tech Days OcDaniel Egan Msdn Tech Days Oc
Daniel Egan Msdn Tech Days Oc
 
SADI CSHALS 2013
SADI CSHALS 2013SADI CSHALS 2013
SADI CSHALS 2013
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data Strategy
 
What's new with Apache Spark?
What's new with Apache Spark?What's new with Apache Spark?
What's new with Apache Spark?
 
Linked Media Management with Apache Marmotta
Linked Media Management with Apache MarmottaLinked Media Management with Apache Marmotta
Linked Media Management with Apache Marmotta
 
URL Design
URL DesignURL Design
URL Design
 

Plus de Terry Reese

MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...
MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...
MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...Terry Reese
 
MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...
MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...
MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...Terry Reese
 
MarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A Primer
MarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A PrimerMarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A Primer
MarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A PrimerTerry Reese
 
MarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEdit
MarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEditMarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEdit
MarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEditTerry Reese
 
MarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit Mac
MarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit MacMarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit Mac
MarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit MacTerry Reese
 
Slides from the NASIG 2018 Preconference
Slides from the NASIG 2018 PreconferenceSlides from the NASIG 2018 Preconference
Slides from the NASIG 2018 PreconferenceTerry Reese
 
Making complicated processes simple: a look at how MarcEdit 7 is expanding th...
Making complicated processes simple: a look at how MarcEdit 7 is expanding th...Making complicated processes simple: a look at how MarcEdit 7 is expanding th...
Making complicated processes simple: a look at how MarcEdit 7 is expanding th...Terry Reese
 
Rejoining the Information access landscape
Rejoining the Information access landscapeRejoining the Information access landscape
Rejoining the Information access landscapeTerry Reese
 
Open metadata, open systems…redrawing the library metadata landscape
Open metadata, open systems…redrawing the library metadata landscapeOpen metadata, open systems…redrawing the library metadata landscape
Open metadata, open systems…redrawing the library metadata landscapeTerry Reese
 
Getting Started with Regular Expressions In MarcEdit
Getting Started with Regular Expressions In MarcEditGetting Started with Regular Expressions In MarcEdit
Getting Started with Regular Expressions In MarcEditTerry Reese
 
Thinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowThinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowTerry Reese
 
The world beyond MARC: let’s focus on asking the right questions
The world beyond MARC: let’s focus on asking the right questionsThe world beyond MARC: let’s focus on asking the right questions
The world beyond MARC: let’s focus on asking the right questionsTerry Reese
 
Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...
Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...
Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...Terry Reese
 
Making RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEditMaking RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEditTerry Reese
 
Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...
Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...
Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...Terry Reese
 
MarcEdit and OCLC Integration -- Summer ALA 2014
MarcEdit and OCLC Integration -- Summer ALA 2014MarcEdit and OCLC Integration -- Summer ALA 2014
MarcEdit and OCLC Integration -- Summer ALA 2014Terry Reese
 
Make MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services RetreatMake MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services RetreatTerry Reese
 
Automated Interactions With WorldCat: A Look at OCLC’s WorldCat Metadata API
Automated Interactions With WorldCat:  A Look at OCLC’s WorldCat Metadata APIAutomated Interactions With WorldCat:  A Look at OCLC’s WorldCat Metadata API
Automated Interactions With WorldCat: A Look at OCLC’s WorldCat Metadata APITerry Reese
 
Rethinking shared metadata at the platform level
Rethinking shared metadata at the platform levelRethinking shared metadata at the platform level
Rethinking shared metadata at the platform levelTerry Reese
 
Dragging old data forward: finding yourself an RDA Helper
Dragging old data forward:  finding yourself an RDA HelperDragging old data forward:  finding yourself an RDA Helper
Dragging old data forward: finding yourself an RDA HelperTerry Reese
 

Plus de Terry Reese (20)

MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...
MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...
MarcEdit Shelter-In-Place Webinar 8: Automated editing through scripts and to...
 
MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...
MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...
MarcEdit Shelter-In-Place Webinar 7: Making Regular Expressions work for you ...
 
MarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A Primer
MarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A PrimerMarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A Primer
MarcEdit Shelter-In-Place Webinar 6: Regular Expressions and .NET, A Primer
 
MarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEdit
MarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEditMarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEdit
MarcEdit Shelter-In-Place Webinar 5.5: Transliterations in MarcEdit
 
MarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit Mac
MarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit MacMarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit Mac
MarcEdit Shelter-in-place Webinar 2.5: Getting Started with MarcEdit Mac
 
Slides from the NASIG 2018 Preconference
Slides from the NASIG 2018 PreconferenceSlides from the NASIG 2018 Preconference
Slides from the NASIG 2018 Preconference
 
Making complicated processes simple: a look at how MarcEdit 7 is expanding th...
Making complicated processes simple: a look at how MarcEdit 7 is expanding th...Making complicated processes simple: a look at how MarcEdit 7 is expanding th...
Making complicated processes simple: a look at how MarcEdit 7 is expanding th...
 
Rejoining the Information access landscape
Rejoining the Information access landscapeRejoining the Information access landscape
Rejoining the Information access landscape
 
Open metadata, open systems…redrawing the library metadata landscape
Open metadata, open systems…redrawing the library metadata landscapeOpen metadata, open systems…redrawing the library metadata landscape
Open metadata, open systems…redrawing the library metadata landscape
 
Getting Started with Regular Expressions In MarcEdit
Getting Started with Regular Expressions In MarcEditGetting Started with Regular Expressions In MarcEdit
Getting Started with Regular Expressions In MarcEdit
 
Thinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowThinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage Workflow
 
The world beyond MARC: let’s focus on asking the right questions
The world beyond MARC: let’s focus on asking the right questionsThe world beyond MARC: let’s focus on asking the right questions
The world beyond MARC: let’s focus on asking the right questions
 
Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...
Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...
Harnessing the Lifecycle: Planning and Implementing a Strategic Digital Coll...
 
Making RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEditMaking RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEdit
 
Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...
Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...
Open Repositories 2014 Poster -- Managing Change: An Organizational Outline f...
 
MarcEdit and OCLC Integration -- Summer ALA 2014
MarcEdit and OCLC Integration -- Summer ALA 2014MarcEdit and OCLC Integration -- Summer ALA 2014
MarcEdit and OCLC Integration -- Summer ALA 2014
 
Make MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services RetreatMake MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services Retreat
 
Automated Interactions With WorldCat: A Look at OCLC’s WorldCat Metadata API
Automated Interactions With WorldCat:  A Look at OCLC’s WorldCat Metadata APIAutomated Interactions With WorldCat:  A Look at OCLC’s WorldCat Metadata API
Automated Interactions With WorldCat: A Look at OCLC’s WorldCat Metadata API
 
Rethinking shared metadata at the platform level
Rethinking shared metadata at the platform levelRethinking shared metadata at the platform level
Rethinking shared metadata at the platform level
 
Dragging old data forward: finding yourself an RDA Helper
Dragging old data forward:  finding yourself an RDA HelperDragging old data forward:  finding yourself an RDA Helper
Dragging old data forward: finding yourself an RDA Helper
 

Dernier

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 

Dernier (20)

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 

#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit

  • 2. Little History MarcEdit Development started around 1999ish (as parts) ◦ Originally coded in 3 programming languages: Assembler (libraries), Visual Basic (UI) and Delphi (COM). ◦ I started writing it as an undergraduate to better understand MARC & circumvent OCLC’s Passport for Windows program ◦ First “MarcEdit” was released Sept. 11, 2000 (thank you WayBack Machine: http://web.archive.org/web/20001017105529/http://ucs.orst.edu/~reeset/marcedit/indexb.html) Today: ◦ Written in C# (Windows/Linux) & Object-C/C# (OSX) ◦ Active user community is ~20,000ish (based on update logs) ◦ Used in ~190ish countries/political regions ◦ Roughly 1/3 of the users reside outside of Canada/United States* * Based on loose analysis of server logs by my server-side stats software
  • 3. MarcEdit Evolution MarcEdit 1.0-2.0 Main Window MarcEdit MARC Tools 1.0-2.0 MarcEdit 1.0-2.0 MarcEditor
  • 4. MarcEdit Evolution Early application was developed to (again, thank you Internet Archive): 1. Be user-friendly (whether I’ve accomplished that is debatable – I’m not a UI designer) 2. Support LC’s MARCBreakr/Maker diacritics (largely yes) 3. Be fast (which I think that it is) 4. Simplify editing records in batch 5. Provide a set of programming tools to solve my own local needs
  • 6. Three development rules I follow MarcEdit is a real-world metadata tool ◦ Tool is designed to provide workflows for data problems currently facing libraries right now MarcEdit is MARC Agnostic ◦ Too many metadata tools are anglo-centric; MarcEdit has been designed to work within the very heterogeneous metadata environment that we find ourselves today, which includes: ◦ Support for MARC (not a particular flavor*) ◦ Near universal characterset support (because the world is bigger than MARC8 and UTF8) ◦ Supports a wide range of Library metadata standards beyond MARC MarcEdit is one part of the larger library metadata tooling environment ◦ So integrations with OCLC, ILSs (when possible), OpenRefine are important * And if something assumes MARC21 – call it out
  • 7. So how does any of this relate to semantic data in Libraries? http://musictheorysite.com/img/dwight_question.jpg
  • 8. A lot of metadata people I talk to fall into two camps
  • 9. BibFrame and Linked Data as RDA 2.0 BibFrame http://www.wired.com/wp-content/uploads/archive/news/images/full/duke_nukem_frever_f.16807.jpg http://astronomy.nmsu.edu/cwc/Group/magiicat/images/magiicat-logo.gif Linked Data
  • 10. BibFrame and linked data as datacorns https://whatsthebigdata.files.wordpress.com/2015/10/datascience_unicorn.png?w=640
  • 11. I prefer a more practical outlook… https://www.etsy.com/search?q=unicorn+cat+hat
  • 12. MarcEdit’s MARCNext MarcEdit’s MARCNext is a first attempt to start having this discussion by: 1. Integrating a linked data framework into MarcEdit, including tooling for: a. JSON-LD b. SPARQL c. RDF 2. Providing catalogers with proof of concept tools to begin experimenting with their own data 3. Provide a method to integrate semantic concepts into legacy data 4. Provide a toolset that MarcEdit can use to build new tools.
  • 13. Let’s take a closer look at two Link Identifiers Tool ◦ This tool embeds URIs into MARC data ◦ Is rules driven (i.e., not MARC21 centric) ◦ Supports ~24 different in-use data sources Validate Headings Tool ◦ First tool in MarcEdit to make use of the tools linked data platform and available data services to provide a real-world application.
  • 15. Link Identifiers Tool Initially released in Aug. 2014[1] as a proof of concept for testing the linked data framework being developed in MarcEdit ◦ Initially only processed LCSH and NAF Currently, I’ve profiled ~24 data sources, and the tool can be integrated in MarcEdit’s Task Workflow. ◦ Translation profiles are currently in flux, as I work with a PCC group developing recommendations for embedding URIs in MARC records. ◦ Working on a process that would allow users to self-profile identifier services, so long as they supported JSON-LD or SPARQL. [1] MarcEdit’s Research Toolkit: MARCNext: http://blog.reeset.net/archives/1359
  • 16. Link Identifiers Tool Tool has evolved over the last year to utilize a rules based configuration (example): <field type="bibliographic"> <tag>630</tag> <ind2 value="0" vocab="naf_lcsh" /> <ind2 value="1" vocab="lcshac" /> <ind2 value="2" vocab="mesh" /> <subfields>adfkqnp</subfields> <uri>0</uri> <special_instructions>mixed</special_instructions> </field> <field type="authority|bibliographic"> <tag>336</tag> <subfields>a</subfields> <index>2</index> <uri>0</uri> </field>
  • 17. Linked Identifiers: Turning strings =336 $atext$btxt$2rdacontent =337 $aunmediated$bn$2rdamedia =338 $avolume$bnc$2rdacarrier =600 10$6880-06$aHu, Zongnan,$d1896-1962$vDiaries. =650 0$aGenerals$zChina$vBiography. =650 0$aGenerals$zTaiwan$vBiography. =600 17$aHu, Zongnan,$d1896-1962.$2fast$0(OCoLC)fst00131171 =650 7$aGenerals.$2fast$0(OCoLC)fst00939841 =651 7$aChina.$2fast$0(OCoLC)fst01206073 =651 7$aTaiwan.$2fast$0(OCoLC)fst01207854 =655 7$aDiaries.$2lcgft =655 7$aAutobiographies.$2lcgft
  • 18. Linked Identifiers: into strings+ =336 $atext$btxt$2rdacontent$0http://id.loc.gov/vocabulary/contentTypes/txt =337 $aunmediated$bn$2rdamedia$0http://id.loc.gov/vocabulary/mediaTypes/n =338 $avolume$bnc$2rdacarrier$0http://id.loc.gov/vocabulary/carriers/nc =600 10$6880-06$aHu, Zongnan,$d1896-1962$vDiaries.$0http://id.loc.gov/authorities/names/n84029846 =650 0$aGenerals$zChina$vBiography.$0http://id.loc.gov/authorities/subjects/sh2008105087 =650 0$aGenerals$zTaiwan$vBiography.$0http://id.loc.gov/authorities/subjects/sh2008105117 =600 17$aHu, Zongnan,$d1896-1962.$2fast$0http://id.worldcat.org/fast/00131171 =650 7$aGenerals.$2fast$0http://id.worldcat.org/fast/00939841 =651 7$aChina.$2fast$0http://id.worldcat.org/fast/01206073 =651 7$aTaiwan.$2fast$0http://id.worldcat.org/fast/01207854 =655 7$aDiaries.$2lcgft$0http://id.loc.gov/authorities/genreForms/gf2014026085 =655 7$aAutobiographies.$2lcgft$0http://id.loc.gov/authorities/genreForms/gf2014026047
  • 20. Linked Data tools Things that are still hard: ◦ Most identifier services use their own rules for data escaping – and they aren’t documented ◦ Many services are still not well suited for this work ◦ Anything that doesn’t provide an option to do an exact lookup like ULAN, AAT, or VIAF – all these require additional processing to ensure that results match the queried term. ◦ Many services are little “p” production in that lots of look-ups can (and do) cause problems.
  • 21. Validate Headings Automated authority control processing ◦ Utilizes id.loc.gov ◦ Provides reports of data that isn’t currently “authorized” ◦ Provides options for generating brief authorities ◦ Extracts for further data processing ◦ Ability to embed URIs during validation ◦ If URIs are present – they are used rather than a direct look up ◦ Automatic heading correction when variants are encountered
  • 22. Validate Headings Validate Headings can be run from inside the MarcEditor, or outside as a stand alone tool
  • 24. Continued work… Would like to continue to add additional vocabularies Expand headings validation to more than just LCSH/NAF Include Linking Profiles for UNIMARC Using Linked Data sources for sameas subject generation
  • 25. Questions Contact Information: Terry Reese Email: reese.2179@osu.edu or reeset@gmail.com MarcEdit Website: http://marcedit.reeset.net Help: http://marcedit.reeset.net/help