SlideShare une entreprise Scribd logo
1  sur  35
Télécharger pour lire hors ligne
Bridging research and
     collections
Vyacheslav Tykhonov - Software Developer
vty@iisg.nl
http://www.linkedin.com/in/vyacheslavtikhonov
Jerry de Vries - Information Analyst
jvr@iisg.nl
http://nl.linkedin.com/pub/jerry-de-vries/13/751/537
Bridging research and collections
                                                                      25/03/2013
                                                                                2




This presentation

• Mission statement of IISH
• Adjusting ICT-strategy
• Requirements for software development
  • Solutions
    – Demo / Proof Of Concepts (POC) of projects & tools
• Questions
Bridging research and collections
                                                                        25/03/2013
                                                                                  3




Mission statement

The IISH conducts historical research on labour relations at a
 global scale and to this end collects data, which are made
           available to other researchers as well

•   Do research
•   Search, use, visualize and update data
•   Collecting and preserving the data
•   Make data available for research and public
Bridging research and collections
                                                                       25/03/2013
                                                                                 4




What is our data?

•   Metadata (describing data and collections)
•   Scans / Full-text
•   Image, sound, movie, books & serials
•   Datasets
•   Aggregation (Metadata, full-text papers and datasets)

• Analogue, digitized and digital
Bridging research and collections
                                                                     25/03/2013
                                                                               5




What are our target groups?

We are listening to our target groups:
• Researchers
• Collectors
• Public Audience

We are collecting all ideas and requirements from you!
Bridging research and collections
                                                                      25/03/2013
                                                                                6




Historical Research Methodology

What is historical research in IISH?

1.   Formulation of the research question
2.   Data collection and/or literature review
3.   Evaluation of materials
4.   Data analysis
5.   Write and publish articles
6.   Sharing datasets
Bridging research and collections
                                                                      25/03/2013
                                                                                7




Data Collecting Methodology

What is collecting in IISH?

• Collecting data
• Storing data
• Preservation
  •   Digitization/scanning
  •   OCR/full text
  •   Metadata/MARC21/Indexing
• Make data public available in digital infrastructure
Bridging research and collections
                                                                     25/03/2013
                                                                               8




Customer Development Methodology

What is software development in IISH?

Our target groups are sharing with us:
• Requirements
• Experiments
• Insights and ideas

1. Create prototype based on requirements and ideas
2. Present prototypes of software tools to our target group
3. Improve software tools after feedback from our target
   group
Bridging research and collections
                                                        25/03/2013
                                                                  9




Where? Who?


                  IISH



    CODI                       Research
                   WE



              General public
Bridging research and collections
                                                                      25/03/2013
                                                                               10




Mission statement

The IISH conducts historical research on labour relations at a
 global scale and to this end collects data, which are made
           available to other researchers as well

Let’s now look into collecting first!
Bridging research and collections
                                                                                             25/03/2013
                                                                                                      11




Typical collectors requirements

• Describe, index and store Metadata in digital library
  system
• Improve Metadata
  •   Based on computer based analysis and Natural Language Processing tools
• Link Metadata from IISH to other Metadata systems
• Search and discover digitized and digital born materials
• Transform Metadata into research data (datasets)
Bridging research and collections
                                                                          25/03/2013
                                                                                   12




Indexing

Extract entities and store it as terms in Metadata

           Collections                  Scans
                         Metadata



        Manual                             Automatic




          CODI                                  HiTIME
Bridging research and collections
                                                                                 25/03/2013
                                                                                          13




Automatic indexing example
Input from scan:

Founded at the initiative of Vladimir I. Lenin in 1901 in Switzerland after
the Second Congress of the RSDRP in 1903 the League became the
main bulwark of Menshevism abroad until it disbanded in 1905.

Metadata linked with Evergreen Authorities:

Vladimir;;VladiMir;;566353;;Personal Name
Congress;;Video Congress;;316063;;Meeting Name
Lenin;;Lenin;;570134;;Uniform Title
Switzerland;;Switzerland;;350823;;Geographic Name
Second Congress of the RSDRP;;411162;;Meeting Name
Bridging research and collections
                                                                     25/03/2013
                                                                              14




Solutions for collectors

• Evergreen Library System
  Product
• Metadata management
  Product
• Metadata reports
  Product
• Evergreen OAI protocol
  Product
• Text analyzing tools (collectors & researchers)
  Prototype API
Bridging research and collections
                                                                      25/03/2013
                                                                               15




Project overview: Evergreen
Collectors Metadata Storage System


• Perfect library solution to store Metadata in MARC21
  standard
• Open-Source License (free of charge for usage)
• Flexible and Powerful solution, works with millions of
  MARC records
• Export of all data in OAI-PMH protocol to link data with
  other systems
• Visualization tools to present data online
Bridging research and collections
                                                 25/03/2013
                                                          16




Evergreen Library System
Bridging research and collections
                                            25/03/2013
                                                     17




Metadata management
Bridging research and collections
                                                                      25/03/2013
                                                                               18




Mission statement

Remember:

The IISH conducts historical research on labour relations at a
 global scale and to this end collects data, which are made
           available to other researchers as well

Let’s do some research!
Bridging research and collections
                                                                       25/03/2013
                                                                                19




What is historical research?

The process of systematically examining past events to give
       an account of what has happened in the past

Why do we conduct historical research?
• To uncover the unknown
• To answer questions
• To identify the relationship that the past has to the present
• To record and evaluate the accomplishments of
  individuals, agencies, or institutions
• To assist in understanding the culture in which we live

And much, much, much more…
Bridging research and collections
                                                                                     25/03/2013
                                                                                              20




Typical research requirements
Access to information


• Find digital materials relevant for research
• Search information stored in Metadata
    •    Poor quality of Metadata = Poor quality of research
•       Searching, filtering, navigating, summarization of data
•       Analyze papers for research online
•       Link materials relevant to research from other sources
•       Collection descriptions are relevant to the topic of
        research, but papers aren't
Bridging research and collections
                                                                       25/03/2013
                                                                                21




Typical research requirements
Datasets


Store datasets in a digital infrastructure to answer research
questions

•   Use best practice for visualization of datasets
•   Generate custom datasets for new research
•   Combine/compare datasets in time and/or place
•   Share datasets with other researchers (collaboration and
    crowdsourcing)
Bridging research and collections
                                                       25/03/2013
                                                                22




General goal of research

                All Data


           Possibly relevant
                 Data

           Definately relevant
                  Data

              Structured
              Knowledge
Bridging research and collections
                                                                                                    25/03/2013
                                                                                                             23




Sharing your research

• Publish scientific articles on websites relevant to the topic
  of research
• Share research datasets with other researchers
• Generate charts and maps in real-time in digital
  infrastructure based on live data
  •   Publish in articles and share on Wikipedia and other popular websites
• Make biographies of famous people more attractive with
  timelines of visual materials
Bridging research and collections
                                                                    25/03/2013
                                                                             24




Indexing (keywords)
For researchers


• Researchers publishing keywords in the beginning of
  every research paper
• Keyword in research paper = Index term in Metadata
• Keywords from papers stored as Metadata in library
  system
• Keywords used in text analyzing systems to create links
  with other papers on the same topic
Bridging research and collections
                                                            25/03/2013
                                                                     25




Solutions for researchers

Data:              Datasets:
• Search engines   • Maps
  Prototype          Product

• Linked data      • Charts
                     Product
  Prototype
• Timelines        • Visual Library System
                     Prototype
  Prototype
Bridging research and collections
                                                   25/03/2013
                                                            26




Datasets visualization tools: Maps
Bridging research and collections
                                                    25/03/2013
                                                             27




Datasets visualization tools: Charts
Bridging research and collections
                                                    25/03/2013
                                                             28




Datasets visual library explorer
Bridging research and collections
                                                                      25/03/2013
                                                                               29




Linked data for collectors

• 500000+ authority records in IISH collection
• Bibliographic records linked to authorities by collectors
• Link bibliographic records to authorities automatically in
  real time with Authority Linking Module
• Import Metadata from other sources (Google Books,
  WorldCat, etc) and link with our authorities
Bridging research and collections
                                                                        25/03/2013
                                                                                 30




Linked data for researchers

Metadata from IISH is available for harvesting:

•   Search (search.socialhistory.org)
•   OCLC's WorldCat
•   Europeana
•   Nederlab and other projects

Link authorities from Evergreen automatically to all other
systems to get more data for doing research
Bridging research and collections
                                                                     25/03/2013
                                                                              31




Project overview: Clio Infra

• Datasets Storage System
• Online Visualization of Datasets:
  •   maps, charts, timeline
• Tools to compare data for different countries in time
• Export of custom datasets
Bridging research and collections
                                                                     25/03/2013
                                                                              32




Project overview: HiTIME
Text Analyzing System


• Matching/linking of authority records from other systems:
  •   Locations
  •   Persons                 Named
  •   Organizations
  •   Dates
                              Entities
• NLP tools to recognize unknown entities
• Export to library as Metadata
• Visualization of Metadata on timelines, maps, charts
Bridging research and collections
                                                                      25/03/2013
                                                                               33




Workflow

    Presentation                     Research




                   Metadata system




                                      Storage
Bridging research and collections
                                                                         25/03/2013
                                                                                  34




What have we seen today?

We are here for you and together we work on:
•   Search & Discovery
    Metadata searching and filtering, Full-Text Search engines,
    Linked Data tools, Research Indexes (Controlled
    Vocabularies)
•   Visualization
    Charts, graphs, timelines, network connections tools
•   Analysis
    Data Mining, Summarization, Topic Modeling, Tools for
    Datasets
Bridging research and collections
                                                                                 25/03/2013
                                                                                          35




Questions?

•    Feel free to ask now
•    Ideas and questions can be sent by email to us




    Vyacheslav Tykhonov - Software Developer
    vty@iisg.nl
    http://www.linkedin.com/in/vyacheslavtikhonov
    Jerry de Vries - Information Analyst
    jvr@iisg.nl
    http://nl.linkedin.com/pub/jerry-de-vries/13/751/537

Contenu connexe

Tendances

'Data Management Planning: the role of institutions and researchers' eResearc...
'Data Management Planning: the role of institutions and researchers' eResearc...'Data Management Planning: the role of institutions and researchers' eResearc...
'Data Management Planning: the role of institutions and researchers' eResearc...
Marta Ribeiro
 

Tendances (20)

Ala cspace aspace rep services demo 2015
Ala cspace aspace rep services demo 2015Ala cspace aspace rep services demo 2015
Ala cspace aspace rep services demo 2015
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
圖書館趨勢觀察
圖書館趨勢觀察圖書館趨勢觀察
圖書館趨勢觀察
 
'Data Management Planning: the role of institutions and researchers' eResearc...
'Data Management Planning: the role of institutions and researchers' eResearc...'Data Management Planning: the role of institutions and researchers' eResearc...
'Data Management Planning: the role of institutions and researchers' eResearc...
 
Open Access & sharing research data: a Dutch workshop for phd in economics
Open Access & sharing research data: a Dutch workshop for phd in economicsOpen Access & sharing research data: a Dutch workshop for phd in economics
Open Access & sharing research data: a Dutch workshop for phd in economics
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminar
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & services
 
Rdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreRdap12 wrap up reagan moore
Rdap12 wrap up reagan moore
 
Moving the repository upstream
Moving the repository upstreamMoving the repository upstream
Moving the repository upstream
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability Science
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Open Science and Identifiers
Open Science and IdentifiersOpen Science and Identifiers
Open Science and Identifiers
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of Edinburgh
 
EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasets
 
Metadata - What Works, What Doesn't? 2009
Metadata - What Works, What Doesn't? 2009Metadata - What Works, What Doesn't? 2009
Metadata - What Works, What Doesn't? 2009
 
Data managementbasics issr_20130301
Data managementbasics issr_20130301Data managementbasics issr_20130301
Data managementbasics issr_20130301
 

En vedette (7)

HiTIME project
HiTIME projectHiTIME project
HiTIME project
 
The recovery of netherlands geographic information system (nlgis 2)
The recovery of netherlands geographic information system (nlgis 2)The recovery of netherlands geographic information system (nlgis 2)
The recovery of netherlands geographic information system (nlgis 2)
 
Data analysis in dataverse & visualization of datasets on historical maps
Data analysis in dataverse & visualization of datasets on historical mapsData analysis in dataverse & visualization of datasets on historical maps
Data analysis in dataverse & visualization of datasets on historical maps
 
FAIR Dataverse
FAIR DataverseFAIR Dataverse
FAIR Dataverse
 
API economy
API economyAPI economy
API economy
 
Clio infra Collabs data analysis tools
Clio infra Collabs data analysis toolsClio infra Collabs data analysis tools
Clio infra Collabs data analysis tools
 
Dataverse opportunities
Dataverse opportunitiesDataverse opportunities
Dataverse opportunities
 

Similaire à Bridging research and collections

Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Brigitte Jörg
 
DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12
Lee Dirks
 

Similaire à Bridging research and collections (20)

"Data in Context" IG sessions @ RDA 3rd Plenary
"Data in Context" IG sessions @  RDA 3rd Plenary"Data in Context" IG sessions @  RDA 3rd Plenary
"Data in Context" IG sessions @ RDA 3rd Plenary
 
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
 
Getting to grips with research data management
Getting to grips with research data management Getting to grips with research data management
Getting to grips with research data management
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data Management
 
RDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian ExperienceRDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian Experience
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data Management
 
Opendatasessions
OpendatasessionsOpendatasessions
Opendatasessions
 
DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12DataViz & Future of Research - LDirks SXSWiMar12
DataViz & Future of Research - LDirks SXSWiMar12
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Data sharing: How, what and why?
Data sharing: How, what and why?Data sharing: How, what and why?
Data sharing: How, what and why?
 
Getting to Grips with Research Data Management
Getting to Grips with Research Data Management Getting to Grips with Research Data Management
Getting to Grips with Research Data Management
 
Research Data Management in GLAM: Managing Data for Cultural Heritage
Research Data Management in GLAM: Managing Data for Cultural HeritageResearch Data Management in GLAM: Managing Data for Cultural Heritage
Research Data Management in GLAM: Managing Data for Cultural Heritage
 
Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...
 
AKVS - Edinburgh Data Repository Experiences June 2016
AKVS - Edinburgh Data Repository Experiences June 2016AKVS - Edinburgh Data Repository Experiences June 2016
AKVS - Edinburgh Data Repository Experiences June 2016
 
National Research Data Archive MIDAS
National Research Data Archive MIDASNational Research Data Archive MIDAS
National Research Data Archive MIDAS
 
Presentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesPresentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research Series
 
EDINA / Data Library Overview
EDINA / Data Library OverviewEDINA / Data Library Overview
EDINA / Data Library Overview
 
TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...
 

Plus de vty

Plus de vty (20)

Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs
 
Decentralisation and knowledge graphs
Decentralisation and knowledge graphs Decentralisation and knowledge graphs
Decentralisation and knowledge graphs
 
Decentralised identifiers for CLARIAH infrastructure
Decentralised identifiers for CLARIAH infrastructure Decentralised identifiers for CLARIAH infrastructure
Decentralised identifiers for CLARIAH infrastructure
 
Dataverse repository for research data in the COVID-19 Museum
Dataverse repository for research data  in the COVID-19 MuseumDataverse repository for research data  in the COVID-19 Museum
Dataverse repository for research data in the COVID-19 Museum
 
Metaverse for Dataverse
Metaverse for DataverseMetaverse for Dataverse
Metaverse for Dataverse
 
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
 
External CV support in Dataverse 5.7
External CV support in Dataverse 5.7External CV support in Dataverse 5.7
External CV support in Dataverse 5.7
 
Building COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhyBuilding COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhy
 
CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes
 
Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21
 
Controlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryControlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repository
 
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligence
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Project
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataverse
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
 
5 years of Dataverse evolution
5 years of Dataverse evolution 5 years of Dataverse evolution
5 years of Dataverse evolution
 
Ontologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and DataverseOntologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and Dataverse
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 

Bridging research and collections

  • 1. Bridging research and collections Vyacheslav Tykhonov - Software Developer vty@iisg.nl http://www.linkedin.com/in/vyacheslavtikhonov Jerry de Vries - Information Analyst jvr@iisg.nl http://nl.linkedin.com/pub/jerry-de-vries/13/751/537
  • 2. Bridging research and collections 25/03/2013 2 This presentation • Mission statement of IISH • Adjusting ICT-strategy • Requirements for software development • Solutions – Demo / Proof Of Concepts (POC) of projects & tools • Questions
  • 3. Bridging research and collections 25/03/2013 3 Mission statement The IISH conducts historical research on labour relations at a global scale and to this end collects data, which are made available to other researchers as well • Do research • Search, use, visualize and update data • Collecting and preserving the data • Make data available for research and public
  • 4. Bridging research and collections 25/03/2013 4 What is our data? • Metadata (describing data and collections) • Scans / Full-text • Image, sound, movie, books & serials • Datasets • Aggregation (Metadata, full-text papers and datasets) • Analogue, digitized and digital
  • 5. Bridging research and collections 25/03/2013 5 What are our target groups? We are listening to our target groups: • Researchers • Collectors • Public Audience We are collecting all ideas and requirements from you!
  • 6. Bridging research and collections 25/03/2013 6 Historical Research Methodology What is historical research in IISH? 1. Formulation of the research question 2. Data collection and/or literature review 3. Evaluation of materials 4. Data analysis 5. Write and publish articles 6. Sharing datasets
  • 7. Bridging research and collections 25/03/2013 7 Data Collecting Methodology What is collecting in IISH? • Collecting data • Storing data • Preservation • Digitization/scanning • OCR/full text • Metadata/MARC21/Indexing • Make data public available in digital infrastructure
  • 8. Bridging research and collections 25/03/2013 8 Customer Development Methodology What is software development in IISH? Our target groups are sharing with us: • Requirements • Experiments • Insights and ideas 1. Create prototype based on requirements and ideas 2. Present prototypes of software tools to our target group 3. Improve software tools after feedback from our target group
  • 9. Bridging research and collections 25/03/2013 9 Where? Who? IISH CODI Research WE General public
  • 10. Bridging research and collections 25/03/2013 10 Mission statement The IISH conducts historical research on labour relations at a global scale and to this end collects data, which are made available to other researchers as well Let’s now look into collecting first!
  • 11. Bridging research and collections 25/03/2013 11 Typical collectors requirements • Describe, index and store Metadata in digital library system • Improve Metadata • Based on computer based analysis and Natural Language Processing tools • Link Metadata from IISH to other Metadata systems • Search and discover digitized and digital born materials • Transform Metadata into research data (datasets)
  • 12. Bridging research and collections 25/03/2013 12 Indexing Extract entities and store it as terms in Metadata Collections Scans Metadata Manual Automatic CODI HiTIME
  • 13. Bridging research and collections 25/03/2013 13 Automatic indexing example Input from scan: Founded at the initiative of Vladimir I. Lenin in 1901 in Switzerland after the Second Congress of the RSDRP in 1903 the League became the main bulwark of Menshevism abroad until it disbanded in 1905. Metadata linked with Evergreen Authorities: Vladimir;;VladiMir;;566353;;Personal Name Congress;;Video Congress;;316063;;Meeting Name Lenin;;Lenin;;570134;;Uniform Title Switzerland;;Switzerland;;350823;;Geographic Name Second Congress of the RSDRP;;411162;;Meeting Name
  • 14. Bridging research and collections 25/03/2013 14 Solutions for collectors • Evergreen Library System Product • Metadata management Product • Metadata reports Product • Evergreen OAI protocol Product • Text analyzing tools (collectors & researchers) Prototype API
  • 15. Bridging research and collections 25/03/2013 15 Project overview: Evergreen Collectors Metadata Storage System • Perfect library solution to store Metadata in MARC21 standard • Open-Source License (free of charge for usage) • Flexible and Powerful solution, works with millions of MARC records • Export of all data in OAI-PMH protocol to link data with other systems • Visualization tools to present data online
  • 16. Bridging research and collections 25/03/2013 16 Evergreen Library System
  • 17. Bridging research and collections 25/03/2013 17 Metadata management
  • 18. Bridging research and collections 25/03/2013 18 Mission statement Remember: The IISH conducts historical research on labour relations at a global scale and to this end collects data, which are made available to other researchers as well Let’s do some research!
  • 19. Bridging research and collections 25/03/2013 19 What is historical research? The process of systematically examining past events to give an account of what has happened in the past Why do we conduct historical research? • To uncover the unknown • To answer questions • To identify the relationship that the past has to the present • To record and evaluate the accomplishments of individuals, agencies, or institutions • To assist in understanding the culture in which we live And much, much, much more…
  • 20. Bridging research and collections 25/03/2013 20 Typical research requirements Access to information • Find digital materials relevant for research • Search information stored in Metadata • Poor quality of Metadata = Poor quality of research • Searching, filtering, navigating, summarization of data • Analyze papers for research online • Link materials relevant to research from other sources • Collection descriptions are relevant to the topic of research, but papers aren't
  • 21. Bridging research and collections 25/03/2013 21 Typical research requirements Datasets Store datasets in a digital infrastructure to answer research questions • Use best practice for visualization of datasets • Generate custom datasets for new research • Combine/compare datasets in time and/or place • Share datasets with other researchers (collaboration and crowdsourcing)
  • 22. Bridging research and collections 25/03/2013 22 General goal of research All Data Possibly relevant Data Definately relevant Data Structured Knowledge
  • 23. Bridging research and collections 25/03/2013 23 Sharing your research • Publish scientific articles on websites relevant to the topic of research • Share research datasets with other researchers • Generate charts and maps in real-time in digital infrastructure based on live data • Publish in articles and share on Wikipedia and other popular websites • Make biographies of famous people more attractive with timelines of visual materials
  • 24. Bridging research and collections 25/03/2013 24 Indexing (keywords) For researchers • Researchers publishing keywords in the beginning of every research paper • Keyword in research paper = Index term in Metadata • Keywords from papers stored as Metadata in library system • Keywords used in text analyzing systems to create links with other papers on the same topic
  • 25. Bridging research and collections 25/03/2013 25 Solutions for researchers Data: Datasets: • Search engines • Maps Prototype Product • Linked data • Charts Product Prototype • Timelines • Visual Library System Prototype Prototype
  • 26. Bridging research and collections 25/03/2013 26 Datasets visualization tools: Maps
  • 27. Bridging research and collections 25/03/2013 27 Datasets visualization tools: Charts
  • 28. Bridging research and collections 25/03/2013 28 Datasets visual library explorer
  • 29. Bridging research and collections 25/03/2013 29 Linked data for collectors • 500000+ authority records in IISH collection • Bibliographic records linked to authorities by collectors • Link bibliographic records to authorities automatically in real time with Authority Linking Module • Import Metadata from other sources (Google Books, WorldCat, etc) and link with our authorities
  • 30. Bridging research and collections 25/03/2013 30 Linked data for researchers Metadata from IISH is available for harvesting: • Search (search.socialhistory.org) • OCLC's WorldCat • Europeana • Nederlab and other projects Link authorities from Evergreen automatically to all other systems to get more data for doing research
  • 31. Bridging research and collections 25/03/2013 31 Project overview: Clio Infra • Datasets Storage System • Online Visualization of Datasets: • maps, charts, timeline • Tools to compare data for different countries in time • Export of custom datasets
  • 32. Bridging research and collections 25/03/2013 32 Project overview: HiTIME Text Analyzing System • Matching/linking of authority records from other systems: • Locations • Persons Named • Organizations • Dates Entities • NLP tools to recognize unknown entities • Export to library as Metadata • Visualization of Metadata on timelines, maps, charts
  • 33. Bridging research and collections 25/03/2013 33 Workflow Presentation Research Metadata system Storage
  • 34. Bridging research and collections 25/03/2013 34 What have we seen today? We are here for you and together we work on: • Search & Discovery Metadata searching and filtering, Full-Text Search engines, Linked Data tools, Research Indexes (Controlled Vocabularies) • Visualization Charts, graphs, timelines, network connections tools • Analysis Data Mining, Summarization, Topic Modeling, Tools for Datasets
  • 35. Bridging research and collections 25/03/2013 35 Questions? • Feel free to ask now • Ideas and questions can be sent by email to us Vyacheslav Tykhonov - Software Developer vty@iisg.nl http://www.linkedin.com/in/vyacheslavtikhonov Jerry de Vries - Information Analyst jvr@iisg.nl http://nl.linkedin.com/pub/jerry-de-vries/13/751/537