Museum Data Exchange


A presentation focusing on the data analysis OCLC Research performed on 900K museum records, plus next steps for the nine project museums who now have the capacity to share standards-based records.


Museum Data Exchange: Making hay with harvestable data
Günter Waibel, Program Officer, OCLC Research
12 November 2009, MCN Portland

Museum Data Exchange
Create Tools: Extract CDWA Lite XML; Publish via OAI-PMH
Harvest Data: Test tools; Create Research Aggregation
Analyze Data: Standards compliance? Interoperability?
What now?

Open Archives Initiative (OAI) Protocol for Metadata Harvesting
[Diagram labels: Museums; OCLC Research; TMS; COBOAT; OAICatMuseum; Research Aggregation; Museums; Basic Interface]

Extracting data and harvesting
Data extraction
- 6 out of 9 museums used COBOAT
- 3 used alternative means: XSLT, Custom, Vendor support (MIMSY XG)
Data harvest
- 6 out of 9 museums used OAI-PMH
- an additional 2 wanted to use OAI-PMH
- 3 used alternative means: SQL dumps, FTP
Data validation
- 2 out of 9 harvests completely validated
- (Mostly) minor errors for the rest
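The harvesting step above can be sketched in a few lines. This is only an illustration of the OAI-PMH `ListRecords` verb and its `resumptionToken` paging, not the tooling the project used: the base URL, the `cdwalite` metadata prefix, and the sample response are assumptions made up for the example.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespace of OAI-PMH 2.0 responses.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(base_url, metadata_prefix=None, resumption_token=None):
    """Build a ListRecords request URL for an OAI-PMH repository."""
    if resumption_token:
        # When resuming, resumptionToken is the only argument besides the verb.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_page(xml_text):
    """Return (record identifiers, resumption token or None) for one response page."""
    root = ET.fromstring(xml_text)
    ids = [e.text for e in root.iter(OAI_NS + "identifier")]
    token_el = root.find(f"{OAI_NS}ListRecords/{OAI_NS}resumptionToken")
    token = token_el.text if token_el is not None and token_el.text else None
    return ids, token


# Hypothetical, heavily abbreviated response page (metadata payload omitted).
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier></header></record>
    <record><header><identifier>oai:example.org:2</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

ids, token = parse_page(SAMPLE)
first_url = list_records_url("http://example.org/oai", metadata_prefix="cdwalite")
next_url = list_records_url("http://example.org/oai", resumption_token=token)
```

A harvester would fetch `first_url`, then keep requesting the URL built from each returned token until a page arrives without one.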

Research Aggregation 9 Museums 887,572 Records

Analyzing the Research Aggregation

Available at: http://hangingtogether.org/wp-content/uploads/2009/04/methodology_03-30-2009.pdf

Use of CDWA Lite
[Chart, axis: Museums] 131 CDWA Lite units of information: 54% not used; 8% used by all

Conformance to CDWA Lite: Required/Highly recommended elements
[Chart, axis: Records] 90% consistency for 7 of 17 elements

Impact of COBOAT mapping

Controlled Vocabularies match rates < 42%

Data Values correlated to records
Top 100 <objectWorkType> values: AAT Match 27; No AAT Match 73
All the Institution’s 100K+ Records: 73% (AAT match); 99% (all top 100 values)

Data Values correlated to records
Top 100 <nameCreator> values: ULAN Match 49; No ULAN Match 51
All the Institution’s 200K+ records: 25% (ULAN match); 50% (all top 100 values)

What now?
“Universal access to cultural heritage will likely soon become a reality, but museums may be losing their role as key players.” - Nicholas Crofts 2008

Thank you!


Slides 15 to 20 (titles only):
15. publishing your collections
16. reaching educators
17. reaching higher education
18. sharing infrastructure
19. projecting authority
20. national collaboration


Editor's notes

Overall, a total of 887,572 records were contributed by nine museums. 6 out of 9 museums contributed all accessioned objects in their database at the time of harvest. Of the remaining three, one chose a subset of all materials published on their website, while two made decisions based on the perceived state of cataloging. Among those providing subsets of data, the approximate percentages range from one third to a little over 50% of the data in their collections management system.
This graph shows that few of the available units of information were consistently and widely used, some were used a little, and many were not used at all.
Discounting the outlier <subjectTerm>, the consistency with which these data elements occur is greater than 65% overall. For 7 of 17 data elements, consistency is above 90%.
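One way to read "consistency" here is the share of records in which an element occurs with a non-empty value. The sketch below computes that share under this assumption; it is not the analysis code used for the Research Aggregation. The record fragments are simplified, hypothetical stand-ins rather than schema-valid CDWA Lite, and elements are compared by local name so that any namespace prefix is ignored.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag, if present."""
    return tag.rsplit("}", 1)[-1]


def element_consistency(records_xml, elements):
    """Return, per element name, the fraction of records that contain it.

    An element counts as present when it occurs at least once in the
    record with non-empty text (an assumption of this sketch).
    """
    wanted = set(elements)
    counts = Counter()
    for xml_text in records_xml:
        root = ET.fromstring(xml_text)
        present = {
            local_name(el.tag)
            for el in root.iter()
            if (el.text or "").strip()
        }
        # Each record contributes at most 1 per element.
        counts.update(present & wanted)
    total = len(records_xml)
    return {name: (counts[name] / total if total else 0.0) for name in elements}


# Hypothetical, simplified records for illustration only.
SAMPLE_RECORDS = [
    "<record><objectWorkType>print</objectWorkType><nameCreator>A</nameCreator></record>",
    "<record><objectWorkType>painting</objectWorkType><nameCreator>B</nameCreator></record>",
    "<record><objectWorkType>print</objectWorkType><nameCreator>C</nameCreator><subjectTerm>sea</subjectTerm></record>",
    "<record><objectWorkType>drawing</objectWorkType><nameCreator></nameCreator></record>",
]

rates = element_consistency(
    SAMPLE_RECORDS, ["objectWorkType", "nameCreator", "subjectTerm"]
)
```

With these four toy records, `objectWorkType` occurs in all of them, `nameCreator` in three (the empty element is not counted), and `subjectTerm` in one.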
The high record count with which many individual data values on the Top 100 lists are associated suggests opportunities for adding value to the data: controlling a relatively low number of terms affects a relatively high number of records per data set. Consider, for example, <objectWorkType> values in records from the Metropolitan Museum of Art. The top 100 <objectWorkType> values represent 99% of <objectWorkType> values in all 112,000 Metropolitan records. Of these top 100 values, 27 match on either a preferred or a non-preferred AAT term, and together they account for 73% of all Metropolitan records. (8 of these 27 matches contain terms which match on more than one AAT entry.) The remaining 73 values do not match on any AAT term and account for 26% of all Metropolitan records. In other words, by tending to 73 <objectWorkType> values and disambiguating an additional 8, the Metropolitan could extend <objectWorkType> control from 73% to 99% of all 112,000 Metropolitan Museum records.
A data element like <objectWorkType> would be expected to produce favorable numbers in this type of analysis: by its very nature, <objectWorkType> contains a relatively low number of values which presumably reappear across many records in a collection. For <nameCreator>, a data element which has a relatively high number of values across aggregation records, one would expect a less impressive result from focusing on Top 100 terms. Consider this example from the Harvard Art Museum. The top 100 <nameCreator> values represent 50% of <nameCreator> values in all 236,000 Harvard records. Of these top 100 values, 49 match on either a preferred or a non-preferred ULAN term, and together they account for 25% of all Harvard records. (2 of these 49 matches contain terms which match on more than one ULAN entry.) The remaining 51 values do not match on any ULAN term and account for 26% of all Harvard records. In other words, by tending to 51 <nameCreator> values and disambiguating an additional 2, Harvard could extend <nameCreator> control from 25% to 50% of all 236,000 Harvard records. While the overall percentages for <nameCreator> are necessarily lower than for <objectWorkType>, doubling the rate of control still constitutes a formidable result.
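The Top 100 arithmetic in the two examples above can be sketched as follows. This is an illustration under stated assumptions, not the matching procedure used in the study: the vocabulary is a plain set of lower-cased terms standing in for AAT or ULAN, matching is a case-insensitive exact comparison, terms that match more than one vocabulary entry are not modeled, and the data values are invented.

```python
from collections import Counter


def top_n_coverage(values, vocabulary, n=100):
    """Summarize how much of a data set the top-n values cover.

    values: one data value per occurrence in the records.
    vocabulary: set of lower-cased controlled terms (hypothetical stand-in
    for a thesaurus lookup).
    """
    counts = Counter(values)
    total = sum(counts.values())
    top = counts.most_common(n)
    matched = [(v, c) for v, c in top if v.strip().lower() in vocabulary]
    unmatched = [(v, c) for v, c in top if v.strip().lower() not in vocabulary]
    return {
        # Share of all occurrences covered by the top-n values.
        "top_n_share": sum(c for _, c in top) / total,
        "matched_terms": len(matched),
        "matched_share": sum(c for _, c in matched) / total,
        "unmatched_terms": len(unmatched),
        "unmatched_share": sum(c for _, c in unmatched) / total,
    }


# Invented toy data: 100 occurrences spread over four values.
toy_values = ["print"] * 50 + ["painting"] * 30 + ["Drawing"] * 15 + ["misc"] * 5
toy_vocabulary = {"print", "painting"}

summary = top_n_coverage(toy_values, toy_vocabulary, n=3)
```

In this toy case the top 3 values cover 95% of occurrences, 2 of them match the vocabulary (80%), and tending to the 1 unmatched value would extend control from 80% to 95%, which mirrors the shape of the 73% to 99% and 25% to 50% results reported above.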
The willingness of museums to share data more widely is tied to the existence of compelling applications for that shared data. When there are applications for sharing data which directly support the museum mission, more data is shared. When more data is shared, more such compelling applications emerge. This chicken-and-egg conundrum poses a challenge both to museum policy makers and to those wishing to aggregate data.
OMEKA is a free, open source web-based publishing platform for collections created by the Center for History and New Media, George Mason University. For a museum to take advantage of this tool, a core set of data has to migrate from the local collections management system into the OMEKA platform. A museum can use COBOAT and OAICatMuseum to create an OAI data content provider, and OMEKA’s OAI-PMH Harvester plug-in [1] to ingest and update the data. At least one of the MDE project participants supported the test of this plug-in by providing access to their OAI-PMH installations, and others may follow. [1] Cf. http://omeka.org/codex/Plugins/OaipmhHarvester
ArtsConnectEd is an interactive Web site that provides access to works of art and educational resources from the Minneapolis Institute of Arts and the Walker Art Center. The Institute and the Walker pool their resources using a CDWA Lite / OAI-PMH infrastructure, and the Institute of Arts has successfully implemented the MDE tools to contribute to this aggregation. At the final face-to-face MDE meeting, both ArtsConnectEd representatives and grant participants speculated about whether the resource could grow to include additional contributors. Robin Dowden (Walker) demonstrated a private research prototype site including the MDE datasets from the National Gallery of Canada and the Harvard Art Museums, as well as records from the Brooklyn Museum (6K) accessed via an API. These three new datasets had been loaded into ArtsConnectEd within 72 hours by Nate Solas (Walker), and even without any smoothing around the edges, the five-institution version of ArtsConnectEd provided a solid experience: “CDWA Lite format and indexing is very good at first glance,” as Robin observed.
As one of the original co-creators of CDWA Lite, ARTstor welcomes contributions in CDWA Lite. During our final face-to-face project meeting, Bill Ying and Christine Kuan (both ARTstor) emphasized that ARTstor is eager to create relationships with data contributors in which repeat OAI harvesting, to update and add to the data, becomes a matter of routine.
At Yale, both the Yale University Art Gallery (grant participant) and the Yale Center for British Art (an avid follower of the project) are implementing COBOAT / OAICatMuseum with the goal of using this capacity for a variety of university-wide initiatives. The museums contribute data to a cross-collection search effort via OAI-PMH (Princeton has similar ambitions), and the Yale museums will also use the same set-up to sync data with OpenText’s Artesia, a university-wide digital assets management system.
CDWA Lite as a contribution format to CONA.
The Canadian Heritage Information Network (CHIN) is currently redeveloping Artefacts Canada (http://www.chin.gc.ca/English/Artefacts_Canada/), a resource of more than 3 million object records and 580,000 images from hundreds of museums across the country. So far, the resource grows largely via contributions of tab-delimited files and spreadsheets representing museum data, and CHIN would like to explore other mechanisms for aggregation. Corina MacDonald (CHIN), who attended the MDE final project meeting, speculated that a testbed of large Canadian institutions using COBOAT / OAICatMuseum might provide lessons for a way forward.
