SlideShare une entreprise Scribd logo
1  sur  50
Télécharger pour lire hors ligne
Hvordan sette opp en OAI-PMH
     metadata-innhøster
(PKP Open Archives Harvester)



            Magnus Enger
    Biblioteklaboratorie-workshop
   Bergen, 12. - 13. november 2007
System Requirements
●   PHP >= 4.2.x (including PHP 5.x); Microsoft IIS
    requires PHP 5.x
●   MySQL >= 3.23.23 (including MySQL 4.x/5.x)
    or PostgreSQL >= 7.1 (including PostgreSQL
    8.x)
●   Apache >= 1.3.2x or >= 2.0.4x or Microsoft IIS
    6 (untested)
●   Operating system: Any OS that supports the
    above software, including Linux, BSD, Solaris,
    Mac OS X, Windows
Support
Med SSH (PuTTY)/Telnet
●   Logg på serveren
●   Last ned fila
$ wget http://pkp.sfu.ca/harvester2/download/harvester-2.0.1.tar.gz

●   Pakk ut fila
$ tar -xvf harvester-2.0.1.tar.gz

●   Gå inn i mappa
$ cd harvester-2.0.1

●   Flytt innholdet til ønsket plass
$ mv * ~/subdomener/harvester/
Med FTP
●   Last ned fila til lokal maskin
●   Pakk ut fila
●   Logg på server med FTP-klient
●   Last opp de utpakkede filene til ønsket plass på
    serveren
Fremgangsmåte
●   Er beskrevet i fila docs/README
Endre filrettigheter
●   Gjør følgende filer/mapper skrivbare:
    –   config.inc.php (optional -- if not writable you will be
        prompted to manually overwrite this file during
        installation)
    –   public
    –   cache
    –   cache/t_cache
    –   cache/t_config
    –   cache/t_compile
    –   cache/_db
Lag opplastingsmappe
●   Lag en mappe for å lagre opplastede filer,
    gjerne utenfor server-rota
●   Gjør denne mappa skrivbar
Installasjon
●   Via vevleser
    –   http://yourdomain.com/path/to/harvester2/
●   Eller vi kommandolinja
    –   php tools/install.php
Rediger fila config.inc.php


; Use URL parameters instead of CGI PATH_INFO. This is useful for
; broken server setups that don't support the PATH_INFO environment
; variable.
disable_path_info = Off



Endre siste linja til:

disable_path_info = On
Problem
●   «login»:
    http://harvester.collib.info/index.php/login
●   Viser bare forsiden
●   Klikk på «HOME»:
    http://harvester.collib.info/index.php?page=index
●   Gå til URLen:
    http://harvester.collib.info/index.php?page=login
Hvordan finne høstbare arkiver?
●   «Registered Data Providers» fra OAI:
    http://www.openarchives.org/Register/BrowseSites
Eksempel: DUO ved UIO
Sets
●   Document types
●   Frequent occurrences of languages in the
    database
●   Documents where online fulltext-versions are
    available
●   Units at the university
Document types
●   Master thesis
●   Dissertation
●   Student thesis
●   Series titles
●   Report
●   Monography
●   Article
Frequent occurrences of languages
         in the database
●   Norwegian
●   English
●   French
●   Norwegian Bokmål
●   Norwegian Nynorsk
●   Swedish
●   German
Documents where online fulltext-
    versions are available
Units at the university
●   Humanities(2437)
●   HumanitiesMedia and Communication(358)
●   HumanitiesMedia and CommunicationMedia
    studies(331)
●   HumanitiesMedia and
    CommunicationJournalism(23)
●   HumanitiesArcheology, Conservation and
    Historical Studies(459)
«Update Metadata Index»




   Bruk tools/harvest.php !
Nyttige «kommandoer»
●   php harvest.php
    –   lister opp diverse opsjoner
●   php harvest.php list
    –   liste over arkivene
●   php harvest.php 1
    –   Høster metadata fra ett arkiv, tallet tilsvarer tallet i
        lista over
●   php harvest.php 1 verbose
    –   Som over, men med detaljerte meldinger om
        fremdriften
Flere nyttige «kommandoer»
●   php harvest.php all
    –   Høster data fra alle arkivene
●   php harvest.php all from=last
    –   Høster alle nye metadata siden sist innhøsting
    –   Kjør denne jevnlig ved hjelp av cron!
Hacke databasen
●   archive_settings       ●   records
●   archives               ●   rt_contexts
●   captchas               ●   rt_searches
●   crosswalk_fields       ●   rt_versions
●   crosswalks             ●   schema_plugins
●   email_templates        ●   search_keyword_list
●   email_templates_data   ●   search_object_keywords
●   entries                ●   search_objects
●   entry_attributes       ●   sessions
●   plugin_settings        ●   site_settings
●   raw_fields             ●   versions
Tabell: records
mysql> describe records;

+------------------+--------------+------+-----+---------+----------------+

| Field            | Type         | Null | Key | Default | Extra          |

+------------------+--------------+------+-----+---------+----------------+

| record_id        | int(11)      |       | PRI | NULL   | auto_increment |

| archive_id       | int(11)      |       |    | 0       |                |

| schema_plugin_id | int(11)      |       |    | 0       |                |

| identifier       | varchar(255) | YES   |    | NULL    |                |

| datestamp        | datetime     | YES   |    | NULL    |                |

+------------------+--------------+------+-----+---------+----------------+
Tabell: entries
mysql> describe entries;

+--------------+---------+------+-----+---------+----------------+

| Field       | Type       | Null | Key | Default | Extra          |

+--------------+---------+------+-----+---------+----------------+

| entry_id    | int(11) |          | PRI | NULL   | auto_increment |

| record_id   | int(11) |          | MUL | 0      |                |

| raw_field_id | int(11) |         | MUL | 0      |                |

| value       | text       | YES   |     | NULL   |                |

+--------------+---------+------+-----+---------+----------------+
Tabell: raw_fields
mysql> describe raw_fields;

+------------------+-------------+------+-----+---------+----------------+

| Field            | Type        | Null | Key | Default | Extra          |

+------------------+-------------+------+-----+---------+----------------+

| raw_field_id     | int(11)     |      | PRI | NULL    | auto_increment |

| name             | varchar(60) |      | MUL |         |                |

| schema_plugin_id | int(11)     |      |     | 0       |                |

+------------------+-------------+------+-----+---------+----------------+
Strukturen til en «record»
●   Record
    –   Entry a
         ●   raw_field_id = x
              –  raw_fields name = 'Title'
         ●   value = 'Kasus før og nå'

    –   Entry b
         ●   raw_field_id = y
              –  raw_fields name = 'Author'
         ●   value = 'Hansen, Hans'

    –   Entry c
         ●   raw_field_id = z
              –  raw_fields name = 'Publisher'
         ●   value = 'Universitetet i Bodø'
Hacke systemet
●   Interessante kataloger
    –   classes
    –   locale - språk
    –   pages
    –   plugins
    –   styles – CSS
    –   templates – benytter Smarty
Lære mer
●   README
    –   http://pkp.sfu.ca/harvester2/README
●   Administrator's Guide (17 s.)
    –   http://pkp.sfu.ca/harvester2/AdminGuide.pdf
●   Technical Reference (50 s.)
    –   http://pkp.sfu.ca/harvester2/TechnicalReference.pdf
Spørsmål
●   Bruk BibLab-wikien (Allmenningen)!
●   eller
●   magnus@enger.priv.no

Contenu connexe

Tendances

XtraDB 5.6 and 5.7: Key Performance Algorithms
XtraDB 5.6 and 5.7: Key Performance AlgorithmsXtraDB 5.6 and 5.7: Key Performance Algorithms
XtraDB 5.6 and 5.7: Key Performance AlgorithmsLaurynas Biveinis
 
Performance Profiling in Rust
Performance Profiling in RustPerformance Profiling in Rust
Performance Profiling in RustInfluxData
 
Hadoop 20111215
Hadoop 20111215Hadoop 20111215
Hadoop 20111215exsuns
 
From zero to hero - Easy log centralization with Logstash and Elasticsearch
From zero to hero - Easy log centralization with Logstash and ElasticsearchFrom zero to hero - Easy log centralization with Logstash and Elasticsearch
From zero to hero - Easy log centralization with Logstash and ElasticsearchRafał Kuć
 
Hadoop Installation and basic configuration
Hadoop Installation and basic configurationHadoop Installation and basic configuration
Hadoop Installation and basic configurationGerrit van Vuuren
 
Web scraping with nutch solr
Web scraping with nutch solrWeb scraping with nutch solr
Web scraping with nutch solrMike Frampton
 
Centralized + Unified Logging
Centralized + Unified LoggingCentralized + Unified Logging
Centralized + Unified LoggingGabor Kozma
 
Http capturing
Http capturingHttp capturing
Http capturingEric Ahn
 
Pgbr 2013 postgres on aws
Pgbr 2013   postgres on awsPgbr 2013   postgres on aws
Pgbr 2013 postgres on awsEmanuel Calvo
 
Perl for System Automation - 01 Advanced File Processing
Perl for System Automation - 01 Advanced File ProcessingPerl for System Automation - 01 Advanced File Processing
Perl for System Automation - 01 Advanced File ProcessingDanairat Thanabodithammachari
 
dns.workshop.hsgr
dns.workshop.hsgrdns.workshop.hsgr
dns.workshop.hsgrebalaskas
 
PGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRest
PGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRestPGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRest
PGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRestPGDay.Amsterdam
 
Boosting MongoDB performance
Boosting MongoDB performanceBoosting MongoDB performance
Boosting MongoDB performanceAlexei Panin
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functionsRupak Roy
 
Beyond php - it's not (just) about the code
Beyond php - it's not (just) about the codeBeyond php - it's not (just) about the code
Beyond php - it's not (just) about the codeWim Godden
 
Advanced backup methods (Postgres@CERN)
Advanced backup methods (Postgres@CERN)Advanced backup methods (Postgres@CERN)
Advanced backup methods (Postgres@CERN)Anastasia Lubennikova
 
phptek13 - Caching and tuning fun tutorial
phptek13 - Caching and tuning fun tutorialphptek13 - Caching and tuning fun tutorial
phptek13 - Caching and tuning fun tutorialWim Godden
 

Tendances (20)

XtraDB 5.6 and 5.7: Key Performance Algorithms
XtraDB 5.6 and 5.7: Key Performance AlgorithmsXtraDB 5.6 and 5.7: Key Performance Algorithms
XtraDB 5.6 and 5.7: Key Performance Algorithms
 
Performance Profiling in Rust
Performance Profiling in RustPerformance Profiling in Rust
Performance Profiling in Rust
 
Hadoop 20111215
Hadoop 20111215Hadoop 20111215
Hadoop 20111215
 
From zero to hero - Easy log centralization with Logstash and Elasticsearch
From zero to hero - Easy log centralization with Logstash and ElasticsearchFrom zero to hero - Easy log centralization with Logstash and Elasticsearch
From zero to hero - Easy log centralization with Logstash and Elasticsearch
 
Hadoop Installation and basic configuration
Hadoop Installation and basic configurationHadoop Installation and basic configuration
Hadoop Installation and basic configuration
 
Web scraping with nutch solr
Web scraping with nutch solrWeb scraping with nutch solr
Web scraping with nutch solr
 
Centralized + Unified Logging
Centralized + Unified LoggingCentralized + Unified Logging
Centralized + Unified Logging
 
Http capturing
Http capturingHttp capturing
Http capturing
 
Perl Programming - 03 Programming File
Perl Programming - 03 Programming FilePerl Programming - 03 Programming File
Perl Programming - 03 Programming File
 
Pgbr 2013 postgres on aws
Pgbr 2013   postgres on awsPgbr 2013   postgres on aws
Pgbr 2013 postgres on aws
 
Perl for System Automation - 01 Advanced File Processing
Perl for System Automation - 01 Advanced File ProcessingPerl for System Automation - 01 Advanced File Processing
Perl for System Automation - 01 Advanced File Processing
 
dns.workshop.hsgr
dns.workshop.hsgrdns.workshop.hsgr
dns.workshop.hsgr
 
PGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRest
PGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRestPGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRest
PGDay.Amsterdam 2018 - Stefan Fercot - Save your data with pgBackRest
 
Boosting MongoDB performance
Boosting MongoDB performanceBoosting MongoDB performance
Boosting MongoDB performance
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functions
 
Beyond php - it's not (just) about the code
Beyond php - it's not (just) about the codeBeyond php - it's not (just) about the code
Beyond php - it's not (just) about the code
 
Advanced backup methods (Postgres@CERN)
Advanced backup methods (Postgres@CERN)Advanced backup methods (Postgres@CERN)
Advanced backup methods (Postgres@CERN)
 
phptek13 - Caching and tuning fun tutorial
phptek13 - Caching and tuning fun tutorialphptek13 - Caching and tuning fun tutorial
phptek13 - Caching and tuning fun tutorial
 
Backups
BackupsBackups
Backups
 
Containers for sysadmins
Containers for sysadminsContainers for sysadmins
Containers for sysadmins
 

En vedette

F/LOSS in Norwegian libraries
F/LOSS in Norwegian librariesF/LOSS in Norwegian libraries
F/LOSS in Norwegian librariesLibriotech
 
Hva er fri programvare? (2011)
Hva er fri programvare? (2011)Hva er fri programvare? (2011)
Hva er fri programvare? (2011)Libriotech
 
Hva er Koha? - Versjon 2
Hva er Koha? - Versjon 2Hva er Koha? - Versjon 2
Hva er Koha? - Versjon 2Libriotech
 
Koha og åpenhet
Koha og åpenhetKoha og åpenhet
Koha og åpenhetLibriotech
 
Snik-semantifisering av bibliotekkatalogen
Snik-semantifisering av bibliotekkatalogenSnik-semantifisering av bibliotekkatalogen
Snik-semantifisering av bibliotekkatalogenLibriotech
 
Hva er fri programvare
Hva er fri programvareHva er fri programvare
Hva er fri programvareLibriotech
 
Pode - en kort presentasjon
Pode - en kort presentasjonPode - en kort presentasjon
Pode - en kort presentasjonLibriotech
 

En vedette (9)

F/LOSS in Norwegian libraries
F/LOSS in Norwegian librariesF/LOSS in Norwegian libraries
F/LOSS in Norwegian libraries
 
Hva er fri programvare? (2011)
Hva er fri programvare? (2011)Hva er fri programvare? (2011)
Hva er fri programvare? (2011)
 
Hva er Koha? - Versjon 2
Hva er Koha? - Versjon 2Hva er Koha? - Versjon 2
Hva er Koha? - Versjon 2
 
Hva er Koha?
Hva er Koha?Hva er Koha?
Hva er Koha?
 
Koha og åpenhet
Koha og åpenhetKoha og åpenhet
Koha og åpenhet
 
Snik-semantifisering av bibliotekkatalogen
Snik-semantifisering av bibliotekkatalogenSnik-semantifisering av bibliotekkatalogen
Snik-semantifisering av bibliotekkatalogen
 
Hva er fri programvare
Hva er fri programvareHva er fri programvare
Hva er fri programvare
 
Må MARC dø?
Må MARC dø?Må MARC dø?
Må MARC dø?
 
Pode - en kort presentasjon
Pode - en kort presentasjonPode - en kort presentasjon
Pode - en kort presentasjon
 

Similaire à Hvordan sette opp en OAI-PMH metadata-innhøster

MySQL 5.7 innodb_enhance_partii_20160527
MySQL 5.7 innodb_enhance_partii_20160527MySQL 5.7 innodb_enhance_partii_20160527
MySQL 5.7 innodb_enhance_partii_20160527Saewoong Lee
 
Streaming ETL - from RDBMS to Dashboard with KSQL
Streaming ETL - from RDBMS to Dashboard with KSQLStreaming ETL - from RDBMS to Dashboard with KSQL
Streaming ETL - from RDBMS to Dashboard with KSQLBjoern Rost
 
MySQL 5.7. Tutorial - Dutch PHP Conference 2015
MySQL 5.7. Tutorial - Dutch PHP Conference 2015MySQL 5.7. Tutorial - Dutch PHP Conference 2015
MySQL 5.7. Tutorial - Dutch PHP Conference 2015Dave Stokes
 
MySQL 5.7 Tutorial Dutch PHP Conference 2015
MySQL 5.7 Tutorial Dutch PHP Conference 2015MySQL 5.7 Tutorial Dutch PHP Conference 2015
MySQL 5.7 Tutorial Dutch PHP Conference 2015Dave Stokes
 
MariaDB with SphinxSE
MariaDB with SphinxSEMariaDB with SphinxSE
MariaDB with SphinxSEColin Charles
 
Common schema my sql uc 2012
Common schema   my sql uc 2012Common schema   my sql uc 2012
Common schema my sql uc 2012Roland Bouman
 
Common schema my sql uc 2012
Common schema   my sql uc 2012Common schema   my sql uc 2012
Common schema my sql uc 2012Roland Bouman
 
Applied Partitioning And Scaling Your Database System Presentation
Applied Partitioning And Scaling Your Database System PresentationApplied Partitioning And Scaling Your Database System Presentation
Applied Partitioning And Scaling Your Database System PresentationRichard Crowley
 
DB Floripa - ProxySQL para MySQL
DB Floripa - ProxySQL para MySQLDB Floripa - ProxySQL para MySQL
DB Floripa - ProxySQL para MySQLMarcelo Altmann
 
Beyond php - it's not (just) about the code
Beyond php - it's not (just) about the codeBeyond php - it's not (just) about the code
Beyond php - it's not (just) about the codeWim Godden
 
Synapse india reviews on php website development
Synapse india reviews on php website developmentSynapse india reviews on php website development
Synapse india reviews on php website developmentsaritasingh19866
 
ProxySQL - High Performance and HA Proxy for MySQL
ProxySQL - High Performance and HA Proxy for MySQLProxySQL - High Performance and HA Proxy for MySQL
ProxySQL - High Performance and HA Proxy for MySQLRené Cannaò
 
Basic MySQL Troubleshooting for Oracle Database Administrators
Basic MySQL Troubleshooting for Oracle Database AdministratorsBasic MySQL Troubleshooting for Oracle Database Administrators
Basic MySQL Troubleshooting for Oracle Database AdministratorsSveta Smirnova
 
10 things I learned building Nomad packs
10 things I learned building Nomad packs10 things I learned building Nomad packs
10 things I learned building Nomad packsBram Vogelaar
 

Similaire à Hvordan sette opp en OAI-PMH metadata-innhøster (20)

MySQL 5.7 innodb_enhance_partii_20160527
MySQL 5.7 innodb_enhance_partii_20160527MySQL 5.7 innodb_enhance_partii_20160527
MySQL 5.7 innodb_enhance_partii_20160527
 
Streaming ETL - from RDBMS to Dashboard with KSQL
Streaming ETL - from RDBMS to Dashboard with KSQLStreaming ETL - from RDBMS to Dashboard with KSQL
Streaming ETL - from RDBMS to Dashboard with KSQL
 
MySQLinsanity
MySQLinsanityMySQLinsanity
MySQLinsanity
 
MySQL 5.7. Tutorial - Dutch PHP Conference 2015
MySQL 5.7. Tutorial - Dutch PHP Conference 2015MySQL 5.7. Tutorial - Dutch PHP Conference 2015
MySQL 5.7. Tutorial - Dutch PHP Conference 2015
 
MySQL 5.7 Tutorial Dutch PHP Conference 2015
MySQL 5.7 Tutorial Dutch PHP Conference 2015MySQL 5.7 Tutorial Dutch PHP Conference 2015
MySQL 5.7 Tutorial Dutch PHP Conference 2015
 
Puppet Data Mining
Puppet Data MiningPuppet Data Mining
Puppet Data Mining
 
MariaDB with SphinxSE
MariaDB with SphinxSEMariaDB with SphinxSE
MariaDB with SphinxSE
 
Common schema my sql uc 2012
Common schema   my sql uc 2012Common schema   my sql uc 2012
Common schema my sql uc 2012
 
Common schema my sql uc 2012
Common schema   my sql uc 2012Common schema   my sql uc 2012
Common schema my sql uc 2012
 
Applied Partitioning And Scaling Your Database System Presentation
Applied Partitioning And Scaling Your Database System PresentationApplied Partitioning And Scaling Your Database System Presentation
Applied Partitioning And Scaling Your Database System Presentation
 
Percona toolkit
Percona toolkitPercona toolkit
Percona toolkit
 
DB Floripa - ProxySQL para MySQL
DB Floripa - ProxySQL para MySQLDB Floripa - ProxySQL para MySQL
DB Floripa - ProxySQL para MySQL
 
Beyond php - it's not (just) about the code
Beyond php - it's not (just) about the codeBeyond php - it's not (just) about the code
Beyond php - it's not (just) about the code
 
Mysql56 replication
Mysql56 replicationMysql56 replication
Mysql56 replication
 
Cache and Drupal
Cache and DrupalCache and Drupal
Cache and Drupal
 
Synapse india reviews on php website development
Synapse india reviews on php website developmentSynapse india reviews on php website development
Synapse india reviews on php website development
 
ProxySQL - High Performance and HA Proxy for MySQL
ProxySQL - High Performance and HA Proxy for MySQLProxySQL - High Performance and HA Proxy for MySQL
ProxySQL - High Performance and HA Proxy for MySQL
 
Basic MySQL Troubleshooting for Oracle Database Administrators
Basic MySQL Troubleshooting for Oracle Database AdministratorsBasic MySQL Troubleshooting for Oracle Database Administrators
Basic MySQL Troubleshooting for Oracle Database Administrators
 
10 things I learned building Nomad packs
10 things I learned building Nomad packs10 things I learned building Nomad packs
10 things I learned building Nomad packs
 
MySQL SQL Tutorial
MySQL SQL TutorialMySQL SQL Tutorial
MySQL SQL Tutorial
 

Plus de Libriotech

Contributing to Koha
Contributing to KohaContributing to Koha
Contributing to KohaLibriotech
 
Glitre og Öppna bibliotek
Glitre og Öppna bibliotekGlitre og Öppna bibliotek
Glitre og Öppna bibliotekLibriotech
 
Koha @ emtacl10
Koha @ emtacl10Koha @ emtacl10
Koha @ emtacl10Libriotech
 
Kort presentasjon av Koha i Norge
Kort presentasjon av Koha i NorgeKort presentasjon av Koha i Norge
Kort presentasjon av Koha i NorgeLibriotech
 
Bibliotek 2.0?
Bibliotek 2.0?Bibliotek 2.0?
Bibliotek 2.0?Libriotech
 
Google og bibliotekene
Google og bibliotekeneGoogle og bibliotekene
Google og bibliotekeneLibriotech
 
Libraries and social networks
Libraries and social networksLibraries and social networks
Libraries and social networksLibriotech
 
Sosiale nettverk og bibliotek
Sosiale nettverk og bibliotekSosiale nettverk og bibliotek
Sosiale nettverk og bibliotekLibriotech
 

Plus de Libriotech (9)

Contributing to Koha
Contributing to KohaContributing to Koha
Contributing to Koha
 
Glitre og Öppna bibliotek
Glitre og Öppna bibliotekGlitre og Öppna bibliotek
Glitre og Öppna bibliotek
 
Koha @ emtacl10
Koha @ emtacl10Koha @ emtacl10
Koha @ emtacl10
 
Koha
KohaKoha
Koha
 
Kort presentasjon av Koha i Norge
Kort presentasjon av Koha i NorgeKort presentasjon av Koha i Norge
Kort presentasjon av Koha i Norge
 
Bibliotek 2.0?
Bibliotek 2.0?Bibliotek 2.0?
Bibliotek 2.0?
 
Google og bibliotekene
Google og bibliotekeneGoogle og bibliotekene
Google og bibliotekene
 
Libraries and social networks
Libraries and social networksLibraries and social networks
Libraries and social networks
 
Sosiale nettverk og bibliotek
Sosiale nettverk og bibliotekSosiale nettverk og bibliotek
Sosiale nettverk og bibliotek
 

Dernier

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 

Dernier (20)

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 

Hvordan sette opp en OAI-PMH metadata-innhøster

  • 1. Hvordan sette opp en OAI-PMH metadata-innhøster (PKP Open Archives Harvester) Magnus Enger Biblioteklaboratorie-workshop Bergen, 12. - 13. november 2007
  • 2.
  • 3.
  • 4. System Requirements ● PHP >= 4.2.x (including PHP 5.x); Microsoft IIS requires PHP 5.x ● MySQL >= 3.23.23 (including MySQL 4.x/5.x) or PostgreSQL >= 7.1 (including PostgreSQL 8.x) ● Apache >= 1.3.2x or >= 2.0.4x or Microsoft IIS 6 (untested) ● Operating system: Any OS that supports the above software, including Linux, BSD, Solaris, Mac OS X, Windows
  • 5.
  • 7. Med SSH (PuTTY)/Telnet ● Logg på serveren ● Last ned fila $ wget http://pkp.sfu.ca/harvester2/download/harvester-2.0.1.tar.gz ● Pakk ut fila $ tar -xvf harvester-2.0.1.tar.gz ● Gå inn i mappa $ cd harvester-2.0.1 ● Flytt innholdet til ønsket plass $ mv * ~/subdomener/harvester/
  • 8. Med FTP ● Last ned fila til lokal maskin ● Pakk ut fila ● Logg på server med FTP-klient ● Last opp de utpakkede filene til ønsket plass på serveren
  • 9.
  • 10. Fremgangsmåte ● Er beskrevet i fila docs/README
  • 11. Endre filrettigheter ● Gjør følgende filer/mapper skrivbare: – config.inc.php (optional -- if not writable you will be prompted to manually overwrite this file during installation) – public – cache – cache/t_cache – cache/t_config – cache/t_compile – cache/_db
  • 12. Lag opplastingsmappe ● Lag en mappe for å lagre opplastede filer, gjerne utenfor server-rota ● Gjør denne mappa skrivbar
  • 13. Installasjon ● Via vevleser – http://yourdomain.com/path/to/harvester2/ ● Eller vi kommandolinja – php tools/install.php
  • 14.
  • 15. Rediger fila config.inc.php ; Use URL parameters instead of CGI PATH_INFO. This is useful for ; broken server setups that don't support the PATH_INFO environment ; variable. disable_path_info = Off Endre siste linja til: disable_path_info = On
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21. Problem ● «login»: http://harvester.collib.info/index.php/login ● Viser bare forsiden ● Klikk på «HOME»: http://harvester.collib.info/index.php?page=index ● Gå til URLen: http://harvester.collib.info/index.php?page=login
  • 22.
  • 23.
  • 24.
  • 25.
  • 26. Hvordan finne høstbare arkiver? ● «Registered Data Providers» fra OAI: http://www.openarchives.org/Register/BrowseSites
  • 28.
  • 29.
  • 30.
  • 31. Sets ● Document types ● Frequent occurrences of languages in the database ● Documents where online fulltext-versions are available ● Units at the university
  • 32. Document types ● Master thesis ● Dissertation ● Student thesis ● Series titles ● Report ● Monography ● Article
  • 33. Frequent occurrences of languages in the database ● Norwegian ● English ● French ● Norwegian Bokmål ● Norwegian Nynorsk ● Swedish ● German
  • 34. Documents where online fulltext- versions are available
  • 35. Units at the university ● Humanities(2437) ● HumanitiesMedia and Communication(358) ● HumanitiesMedia and CommunicationMedia studies(331) ● HumanitiesMedia and CommunicationJournalism(23) ● HumanitiesArcheology, Conservation and Historical Studies(459)
  • 36. «Update Metadata Index» Bruk tools/harvest.php !
  • 37.
  • 38.
  • 39.
  • 40. Nyttige «kommandoer» ● php harvest.php – lister opp diverse opsjoner ● php harvest.php list – liste over arkivene ● php harvest.php 1 – Høster metadata fra ett arkiv, tallet tilsvarer tallet i lista over ● php harvest.php 1 verbose – Som over, men med detaljerte meldinger om fremdriften
  • 41.
  • 42. Flere nyttige «kommandoer» ● php harvest.php all – Høster data fra alle arkivene ● php harvest.php all from=last – Høster alle nye metadata siden sist innhøsting – Kjør denne jevnlig ved hjelp av cron!
  • 43. Hacke databasen ● archive_settings ● records ● archives ● rt_contexts ● captchas ● rt_searches ● crosswalk_fields ● rt_versions ● crosswalks ● schema_plugins ● email_templates ● search_keyword_list ● email_templates_data ● search_object_keywords ● entries ● search_objects ● entry_attributes ● sessions ● plugin_settings ● site_settings ● raw_fields ● versions
  • 44. Tabell: records mysql> describe records; +------------------+--------------+------+-----+---------+----------------+ | Field | Type | Null | Key | Default | Extra | +------------------+--------------+------+-----+---------+----------------+ | record_id | int(11) | | PRI | NULL | auto_increment | | archive_id | int(11) | | | 0 | | | schema_plugin_id | int(11) | | | 0 | | | identifier | varchar(255) | YES | | NULL | | | datestamp | datetime | YES | | NULL | | +------------------+--------------+------+-----+---------+----------------+
  • 45. Tabell: entries mysql> describe entries; +--------------+---------+------+-----+---------+----------------+ | Field | Type | Null | Key | Default | Extra | +--------------+---------+------+-----+---------+----------------+ | entry_id | int(11) | | PRI | NULL | auto_increment | | record_id | int(11) | | MUL | 0 | | | raw_field_id | int(11) | | MUL | 0 | | | value | text | YES | | NULL | | +--------------+---------+------+-----+---------+----------------+
  • 46. Tabell: raw_fields mysql> describe raw_fields; +------------------+-------------+------+-----+---------+----------------+ | Field | Type | Null | Key | Default | Extra | +------------------+-------------+------+-----+---------+----------------+ | raw_field_id | int(11) | | PRI | NULL | auto_increment | | name | varchar(60) | | MUL | | | | schema_plugin_id | int(11) | | | 0 | | +------------------+-------------+------+-----+---------+----------------+
  • 47. Strukturen til en «record» ● Record – Entry a ● raw_field_id = x – raw_fields name = 'Title' ● value = 'Kasus før og nå' – Entry b ● raw_field_id = y – raw_fields name = 'Author' ● value = 'Hansen, Hans' – Entry c ● raw_field_id = z – raw_fields name = 'Publisher' ● value = 'Universitetet i Bodø'
  • 48. Hacke systemet ● Interessante kataloger – classes – locale - språk – pages – plugins – styles – CSS – templates – benytter Smarty
  • 49. Lære mer ● README – http://pkp.sfu.ca/harvester2/README ● Administrator's Guide (17 s.) – http://pkp.sfu.ca/harvester2/AdminGuide.pdf ● Technical Reference (50 s.) – http://pkp.sfu.ca/harvester2/TechnicalReference.pdf
  • 50. Spørsmål ● Bruk BibLab-wikien (Allmenningen)! ● eller ● magnus@enger.priv.no