SlideShare une entreprise Scribd logo
1  sur  53
Biodiversity Data Provider Software Hands-on exercises with TAPIR Stockholm Biodiversity Informatics Symposium 2008 (SBIS2008) Dag Terje Filip Endresen, Nordic Genetic Resources Center (Sweden) / Bioversity International (Italy)
Fallacies of Distributed Computing The network is reliable. Latency is zero. Bandwidth is infinite. The network is secure. Topology doesn't change. There is one administrator. Transport cost is zero. The network is homogeneous. This list of fallacies came about at Sun Microsystems around 1994. 2
TAPIR 3 Cartoon by Sasha Kopf (Creative Commons)
TAPIR TAPIR - TDWG Access Protocol for Information Retrieval. During the 2004 TDWG meeting in Christchurch, NZ, work started on a unified protocol and named TAPIR. TAPIR is based on the protocol from the two data provider software, BioCASE and DiGIR.  4
Provider software, wrappers DiGIR (2002, not active) http://digir.sourceforge.net BioCASE (2003, PyWrapper) http://www.biocase.org PyWrapper3 (2006, not active) http://trac.pywrapper.org/ TapirLink (2007) http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink GBIF Provider Toolkit (2009) http://code.google.com/p/gbif-providertoolkit 5
BioCASE 2.5.ORC 6 The BioCASE provider software is a product of the EU funded BioCASE project (2001-2004).  Developed at BGBM in Berlin.  Last updated in April 2008, with support for Python version 2.5 and less required external Implement the BioCASE provider to share data as ABCD 2.06. http://www.biocase.org
1. Make sure you have Python 2.5 installed  (command line: python –v) 2.Download the latest provider software from http://www.biocase.org 3. Uncompress the BioCASE provider software to a folder on your system [provider_software_2.5.0RC.tar.gz]     (tar –xzvf provider_...tar.gz) 4. Run setup.py, (python setup.py) 5. Configure your web server to mount biocase/www as http://localhost/biocase/ Hint: You will find an example for httpd.conf as the last terminal output from running setup.py 7 BioCASE 2.5.ORC
BioCASE 2.5.ORC 6. Visit the library test page:  http://localhost/biocase/utilities/testlibs.cgi 6a. Download latest 4 Suite  from http://4suite.org/ Uncompress and install  [4Suite-XML-1.0.2.tar.bz2] 6b. Install additional python  libraries, including  the desired  database driver. For each  python package:  (python setup.py install) 6c. Graphviz is useful to  visualize the database table structure. 8
BioCASE 2.5.ORC 7. Configuration ,[object Object]
 Database connection
 Database table structure
 Mapping of data model to standard schema9
BioCASE 2.5.ORC 8. Query Form The manual query form is illustrative for understanding exactly how the wrapper software works!   http://localhost/biocase/utilities/queryforms/qf_manual.cgi?dsa=sesto 10
11
PyWrapper3 Home:http://trac.pywrapper.org/ Primary developers: Markus Döring, Javier de la Torre 14/07/2008 - Development stalled We are sorry to inform you that development of the TAPIR branch of PyWrapper has been stalled. The latest 3.1 alpha version is not stable and not recommended for production!  (Message from the home page) ,[object Object]
PyWrapper 3.1.0 alpha (development version, works with Python 2.5)PyWrapper is tested and verified to work fine with Windows, Mac OS X and Linux. 12
Required configuration Web server: Any CGI compliant web server: Apache, IIS etc. (The built in CherryPy web server can also be used). Database: Major databases are supported, including MySQL, Oracle, SQLServer, Sybase, Access, PostgreSQL. Theoretically any database with a Python library should work.  Python PyWrapper is developed with the Python programming language. (The latest version from the SVN code repository works with Python version 2.5) 13 Apache, MySQL and Python are open source software, free to use - even for commercial products.
Installation http://trac.pywrapper.org/pywrapper/wiki/InstallationGuide 1. Download the latest PyWrapper3 installer. 	Use SVN export or checkout for Python 2.5 support 2. Uncompress to a folder of your choice. 	Example: “/usr/local/pywrapper3/” 	Example: “C:ywrapper Local installation: If you have a Subversion client installed, you may use the automatic installer. (Local Python and libraries are installed to your pywrapper folder) promt$  svn export svn://svn.pywrapper.org:80/pywrapper/trunk pywrapper promt$  cd pywrapper/tools promt$  /bin/shinstall.sh This will require that you have a bash shell, and probably that you have a Unix line system like e.g. FreeBSD, Linux or Mac OS X… 3. Execute: pywrapper/setup.py 	Example:  promt$  python setup.py (Mac OS X, Linux) 	On Windows locate setup.py and double-click 14
Start standalone server Execute start_server.py(default port is 8080) promt$ cdwebapp/  promt$ ./start_server.py 8088(example to start on port 8088) On a Windows system you may do this in a MS-DOS window (or double-click the file - if you accept the default port). Some messages will pass across your screen. Please be patient, this could take a minute. Wait for the message “start server …” and find find PyWrapper at: http://localhost:8088/pywrapper 15
Configuration After successful installation, you will need to configure your data provider. Follow the instructions from the PyWrapper documentation web page to configure. Data sources. If you provide more datasets or several databases they will be configured as individual data sources (dsa). Database connection. For PyWrapper to access your database. Database structure. Define the relevant database tables, the primary keys and foreign keys. Data model. Map your database model to the standard represented by the XML Schemas you choose. http://trac.pywrapper.org/pywrapper/wiki/Documentation 16
Screen examples PyWrapper comes with a graphical web based configuration tool For more information and more screen dumps from the configuration of PyWrapper, see: http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i 17
TapirLink 0.6.1 18
TapirLink 0.6.1 Home: http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink Primary developers: Renato De Giovanni, Dave Vieglais Download: http://sourceforge.net/project/showfiles.php?group_id=38190 Uncompress PHP source code Eg: /usr/local/tapir/tapirlink Mount admin and www directory for your web server. Example: Apache “httpd.conf” Alias /tapirlink   "/usr/local/tapir/tapirlink/www” Alias /tapirlink-admin   "/usr/local/tapir/tapirlink/admin"  <Location /tapirlink>  	Order allow,deny  	Allow from all </Location>  <Location /tapirlink-admin>  	Order allow,deny  	Allow from all </Location>  Read permissions on all directories Write on cache, config, log, statistics 19
TapirLink 0.6.1 Start by adding a new resource http://localhost/tapirlink-admin/ Step 1: Describe your new resource 20
TapirLink 0.6.1 Step 2: Data source, database connection Step 3: Table structure 21
TapirLink 0.6.1 Step 4: Filter Step 5: Select mapping standards to use 22
TapirLink 0.6.1 Step 5b: Mapping data schema (ABCD 2.06) etc… 23
TapirLink 0.6.1 Step 5c: Mapping data concepts (Darwin Core) etc… Step 5d: Remember that DwC have an extension for geospatial descriptors etc… 24
TapirLink 0.6.1 Step 6: Settings New resource successfully configured 25
TapirLink 0.6.1 Test resource with client form: http://localhost/tapirlink/tapir_client.php The XML Client form is very illustrative for understanding exactly how the wrapper software works!  26
Service interface 27
Example of a service request 	All exchanged data is formatted with XML tags. 28
Example of a service response 29 ...
Example TAPIR service request 30
Example TAPIR service response singer:/sourcename singer:/taxonomy/genus singer:/taxonomy/species singer:/taxonomy/subspecies singer:/holding/ID singer:/holding/name singer:/origin/collecting/countrysource singer:/origin/collecting/countrysourceID singer:/status/biologicalstatus singer:/status/biologicalstatusID ... 31
Example TAPIR service SEARCH request 32
Example TAPIR service Search response 33
Example of OAI-PMH service request OAI-PMH requests are expressed as HTTP requests. OAI-PMH requests must be submitted using either the HTTP GET or POST methods. http://an.oa.org/OAI-script?verb=GetRecord &identifier=oai:arXiv.org:hep-th/9901001 &metadataPrefix=oai_dc 34
Example of OAI-PMH service RESPONSE OAI-PMH responses are formatted as HTTP responses. With The Content-Type as text/xml. 35
OAI-PMH PROTOCOL, metadata formats 36 Request types (verb): Identify ListMetadataFormats ListSets GetRecord ListIdentifiers ListRecords For purposes of interoperability, the metadataPrefix `oai_dc’ is reserved for Dublin Core. Communities adopt own metadataPrefixesfor own metadata fomats. Relevant formats/schemas for Biodiversity Informatics are Darwin Core and ABCD.
Why sharing data? 37
[http://data.gbif.org/] 38 [http://data.gbif.org/datasets/resource/1487]
GBIF PGR Network 2 [http://data.gbif.org/datasets/network/2] 39
40 Distributed network The image is from the BioCASE web site
41 Decentralized network GBIF (Global Biodiversity Information Facility) ALIS (Accession Level  Information System) USER Svalbard Global Seed Vault (Safe Backup) (USDA ARS National  Germplasm  Repositories...) Web  Services USDA GRIN (USA) SINGER (CGIAR) (CGIAR International Future Harvest  gene banks...) EURISCO (Europe) MCPD IHAR (Poland) WUR CGN (Netherlands) NordGen (Northern Europe) IPK Gatersleben (Germany) (Other European  gene banks...)
42 Crop Wild Relatives LKA ARM BOL National Datasets are shared with  the central  CWR data index. The national  datasets as well as  access to other International  datasets are provided from the CWR data portal. MDG EURISCO UZB http://www.cropwildrelatives.org SINGER
Data portal example 43
44 http://wwwdev.ngb.se/portal/index.php?scope=demo
45
46
47
48
49 Outlook The compatibility of data standards between PGR and biodiversity collections made it possible to integrate the worldwide germplasm collections into the biodiversity community. Using GBIF technology (and contributing to its development), the PGR community can easily establish specific PGR networks without duplicating GBIF's work. Use of GBIF technology and integration of PGR collection data into GBIF allows PGR users to simultaneously search PGR collections and other biodiversity collections, and to get access to the data (and possibly the material) of relevant biodiversity collections. The establishment of new data portals on a specific crop, a regional thematic network or similar subset of the total global biodiversity datasets; can be done with rather few efforts! This requires only that all the relevant datasets are provided by GBIF compatible web services (like the BioCASE PyWrapper).

Contenu connexe

Tendances

Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopLearning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopSomeshwar Kale
 
Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012Hortonworks
 

Tendances (20)

NEON HDF5
NEON HDF5NEON HDF5
NEON HDF5
 
ICESat-2 Metadata and Status
ICESat-2 Metadata and StatusICESat-2 Metadata and Status
ICESat-2 Metadata and Status
 
Utilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud ComputingUtilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud Computing
 
Efficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAPEfficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAP
 
NetCDF and HDF5
NetCDF and HDF5NetCDF and HDF5
NetCDF and HDF5
 
May 2013 HUG: HCatalog/Hive Data Out
May 2013 HUG: HCatalog/Hive Data OutMay 2013 HUG: HCatalog/Hive Data Out
May 2013 HUG: HCatalog/Hive Data Out
 
HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)
 
HDF Update 2016
HDF Update 2016HDF Update 2016
HDF Update 2016
 
Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopLearning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
 
Product Designer Hub - Taking HPD to the Web
Product Designer Hub - Taking HPD to the WebProduct Designer Hub - Taking HPD to the Web
Product Designer Hub - Taking HPD to the Web
 
Hive hcatalog
Hive hcatalogHive hcatalog
Hive hcatalog
 
NASA HDF/HDF-EOS Data for Dummies (and Developers)
NASA HDF/HDF-EOS Data for Dummies (and Developers)NASA HDF/HDF-EOS Data for Dummies (and Developers)
NASA HDF/HDF-EOS Data for Dummies (and Developers)
 
Matlab, Big Data, and HDF Server
Matlab, Big Data, and HDF ServerMatlab, Big Data, and HDF Server
Matlab, Big Data, and HDF Server
 
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFViewHDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
 
Moving form HDF4 to HDF5/netCDF-4
Moving form HDF4 to HDF5/netCDF-4Moving form HDF4 to HDF5/netCDF-4
Moving form HDF4 to HDF5/netCDF-4
 
Implementing HDF5 in MATLAB
Implementing HDF5 in MATLABImplementing HDF5 in MATLAB
Implementing HDF5 in MATLAB
 
Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5
 
HDF Tools Tutorial
HDF Tools TutorialHDF Tools Tutorial
HDF Tools Tutorial
 
Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012
 
Hdf5 intro
Hdf5 introHdf5 intro
Hdf5 intro
 

Similaire à Data exchange alternatives, SBIS conference in Stockholm (2008)

Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Dag Endresen
 
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)Dag Endresen
 
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...Thomas K. Y. Lam
 
User-space Network Processing
User-space Network ProcessingUser-space Network Processing
User-space Network ProcessingRyousei Takano
 
Datastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobsDatastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobsshanker_uma
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataRobert Grossman
 
Dynamic Hadoop Clusters
Dynamic Hadoop ClustersDynamic Hadoop Clusters
Dynamic Hadoop ClustersSteve Loughran
 
PCIC Data Portal 2.0
PCIC Data Portal 2.0PCIC Data Portal 2.0
PCIC Data Portal 2.0James Hiebert
 
Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...Principled Technologies
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin Kuberton
 
Hadoop and big data training
Hadoop and big data trainingHadoop and big data training
Hadoop and big data trainingagiamas
 
Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016StackIQ
 
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014NoSQLmatters
 
Provisioning Servers Made Easy
Provisioning Servers Made EasyProvisioning Servers Made Easy
Provisioning Servers Made EasyAll Things Open
 
Programmability in spss 14
Programmability in spss 14Programmability in spss 14
Programmability in spss 14Armand Ruis
 
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopSurvey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopIRJET Journal
 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesNicola Ferraro
 
Kubernetes for the PHP developer
Kubernetes for the PHP developerKubernetes for the PHP developer
Kubernetes for the PHP developerPaul Czarkowski
 

Similaire à Data exchange alternatives, SBIS conference in Stockholm (2008) (20)

Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)
 
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
 
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
 
User-space Network Processing
User-space Network ProcessingUser-space Network Processing
User-space Network Processing
 
Datastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobsDatastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobs
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Dynamic Hadoop Clusters
Dynamic Hadoop ClustersDynamic Hadoop Clusters
Dynamic Hadoop Clusters
 
PCIC Data Portal 2.0
PCIC Data Portal 2.0PCIC Data Portal 2.0
PCIC Data Portal 2.0
 
Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin
 
Hadoop and big data training
Hadoop and big data trainingHadoop and big data training
Hadoop and big data training
 
Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016
 
Hadoop
HadoopHadoop
Hadoop
 
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
 
Provisioning Servers Made Easy
Provisioning Servers Made EasyProvisioning Servers Made Easy
Provisioning Servers Made Easy
 
Programmability in spss 14
Programmability in spss 14Programmability in spss 14
Programmability in spss 14
 
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopSurvey Paper on Big Data and Hadoop
Survey Paper on Big Data and Hadoop
 
Hadoop
HadoopHadoop
Hadoop
 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with Kubernetes
 
Kubernetes for the PHP developer
Kubernetes for the PHP developerKubernetes for the PHP developer
Kubernetes for the PHP developer
 

Plus de Dag Endresen

Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Dag Endresen
 
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Dag Endresen
 
Ontologies for biodiversity informatics, UiO DSC June 2023
 Ontologies for biodiversity informatics, UiO DSC June 2023 Ontologies for biodiversity informatics, UiO DSC June 2023
Ontologies for biodiversity informatics, UiO DSC June 2023Dag Endresen
 
Evacuation of the Kherson herbarium
Evacuation of the Kherson herbariumEvacuation of the Kherson herbarium
Evacuation of the Kherson herbariumDag Endresen
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC RomeDag Endresen
 
BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24Dag Endresen
 
Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Dag Endresen
 
BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022Dag Endresen
 
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20Dag Endresen
 
GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03Dag Endresen
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...Dag Endresen
 
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Dag Endresen
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18Dag Endresen
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021Dag Endresen
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portalsDag Endresen
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)Dag Endresen
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open ScienceDag Endresen
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementDag Endresen
 
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...Dag Endresen
 
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019Dag Endresen
 

Plus de Dag Endresen (20)

Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
 
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
 
Ontologies for biodiversity informatics, UiO DSC June 2023
 Ontologies for biodiversity informatics, UiO DSC June 2023 Ontologies for biodiversity informatics, UiO DSC June 2023
Ontologies for biodiversity informatics, UiO DSC June 2023
 
Evacuation of the Kherson herbarium
Evacuation of the Kherson herbariumEvacuation of the Kherson herbarium
Evacuation of the Kherson herbarium
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome
 
BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24
 
Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26
 
BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022
 
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
 
GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
 
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open Science
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data management
 
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
 
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
 

Dernier

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 

Dernier (20)

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 

Data exchange alternatives, SBIS conference in Stockholm (2008)

  • 1. Biodiversity Data Provider Software Hands-on exercises with TAPIR Stockholm Biodiversity Informatics Symposium 2008 (SBIS2008) Dag Terje Filip Endresen, Nordic Genetic Resources Center (Sweden) / Bioversity International (Italy)
  • 2. Fallacies of Distributed Computing The network is reliable. Latency is zero. Bandwidth is infinite. The network is secure. Topology doesn't change. There is one administrator. Transport cost is zero. The network is homogeneous. This list of fallacies came about at Sun Microsystems around 1994. 2
  • 3. TAPIR 3 Cartoon by Sasha Kopf (Creative Commons)
  • 4. TAPIR TAPIR - TDWG Access Protocol for Information Retrieval. During the 2004 TDWG meeting in Christchurch, NZ, work started on a unified protocol and named TAPIR. TAPIR is based on the protocol from the two data provider software, BioCASE and DiGIR. 4
  • 5. Provider software, wrappers DiGIR (2002, not active) http://digir.sourceforge.net BioCASE (2003, PyWrapper) http://www.biocase.org PyWrapper3 (2006, not active) http://trac.pywrapper.org/ TapirLink (2007) http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink GBIF Provider Toolkit (2009) http://code.google.com/p/gbif-providertoolkit 5
  • 6. BioCASE 2.5.ORC 6 The BioCASE provider software is a product of the EU funded BioCASE project (2001-2004). Developed at BGBM in Berlin. Last updated in April 2008, with support for Python version 2.5 and less required external Implement the BioCASE provider to share data as ABCD 2.06. http://www.biocase.org
  • 7. 1. Make sure you have Python 2.5 installed (command line: python –v) 2.Download the latest provider software from http://www.biocase.org 3. Uncompress the BioCASE provider software to a folder on your system [provider_software_2.5.0RC.tar.gz] (tar –xzvf provider_...tar.gz) 4. Run setup.py, (python setup.py) 5. Configure your web server to mount biocase/www as http://localhost/biocase/ Hint: You will find an example for httpd.conf as the last terminal output from running setup.py 7 BioCASE 2.5.ORC
  • 8. BioCASE 2.5.ORC 6. Visit the library test page: http://localhost/biocase/utilities/testlibs.cgi 6a. Download latest 4 Suite from http://4suite.org/ Uncompress and install [4Suite-XML-1.0.2.tar.bz2] 6b. Install additional python libraries, including the desired database driver. For each python package: (python setup.py install) 6c. Graphviz is useful to visualize the database table structure. 8
  • 9.
  • 11. Database table structure
  • 12. Mapping of data model to standard schema9
  • 13. BioCASE 2.5.ORC 8. Query Form The manual query form is illustrative for understanding exactly how the wrapper software works! http://localhost/biocase/utilities/queryforms/qf_manual.cgi?dsa=sesto 10
  • 14. 11
  • 15.
  • 16. PyWrapper 3.1.0 alpha (development version, works with Python 2.5)PyWrapper is tested and verified to work fine with Windows, Mac OS X and Linux. 12
  • 17. Required configuration Web server: Any CGI compliant web server: Apache, IIS etc. (The built in CherryPy web server can also be used). Database: Major databases are supported, including MySQL, Oracle, SQLServer, Sybase, Access, PostgreSQL. Theoretically any database with a Python library should work. Python PyWrapper is developed with the Python programming language. (The latest version from the SVN code repository works with Python version 2.5) 13 Apache, MySQL and Python are open source software, free to use - even for commercial products.
  • 18. Installation http://trac.pywrapper.org/pywrapper/wiki/InstallationGuide 1. Download the latest PyWrapper3 installer. Use SVN export or checkout for Python 2.5 support 2. Uncompress to a folder of your choice. Example: “/usr/local/pywrapper3/” Example: “C:ywrapper Local installation: If you have a Subversion client installed, you may use the automatic installer. (Local Python and libraries are installed to your pywrapper folder) promt$ svn export svn://svn.pywrapper.org:80/pywrapper/trunk pywrapper promt$ cd pywrapper/tools promt$ /bin/shinstall.sh This will require that you have a bash shell, and probably that you have a Unix line system like e.g. FreeBSD, Linux or Mac OS X… 3. Execute: pywrapper/setup.py Example: promt$ python setup.py (Mac OS X, Linux) On Windows locate setup.py and double-click 14
  • 19. Start standalone server Execute start_server.py(default port is 8080) promt$ cdwebapp/ promt$ ./start_server.py 8088(example to start on port 8088) On a Windows system you may do this in a MS-DOS window (or double-click the file - if you accept the default port). Some messages will pass across your screen. Please be patient, this could take a minute. Wait for the message “start server …” and find find PyWrapper at: http://localhost:8088/pywrapper 15
  • 20. Configuration After successful installation, you will need to configure your data provider. Follow the instructions from the PyWrapper documentation web page to configure. Data sources. If you provide more datasets or several databases they will be configured as individual data sources (dsa). Database connection. For PyWrapper to access your database. Database structure. Define the relevant database tables, the primary keys and foreign keys. Data model. Map your database model to the standard represented by the XML Schemas you choose. http://trac.pywrapper.org/pywrapper/wiki/Documentation 16
  • 21. Screen examples PyWrapper comes with a graphical web based configuration tool For more information and more screen dumps from the configuration of PyWrapper, see: http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i 17
  • 23. TapirLink 0.6.1 Home: http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink Primary developers: Renato De Giovanni, Dave Vieglais Download: http://sourceforge.net/project/showfiles.php?group_id=38190 Uncompress PHP source code Eg: /usr/local/tapir/tapirlink Mount admin and www directory for your web server. Example: Apache “httpd.conf” Alias /tapirlink "/usr/local/tapir/tapirlink/www” Alias /tapirlink-admin "/usr/local/tapir/tapirlink/admin" <Location /tapirlink> Order allow,deny Allow from all </Location> <Location /tapirlink-admin> Order allow,deny Allow from all </Location> Read permissions on all directories Write on cache, config, log, statistics 19
  • 24. TapirLink 0.6.1 Start by adding a new resource http://localhost/tapirlink-admin/ Step 1: Describe your new resource 20
  • 25. TapirLink 0.6.1 Step 2: Data source, database connection Step 3: Table structure 21
  • 26. TapirLink 0.6.1 Step 4: Filter Step 5: Select mapping standards to use 22
  • 27. TapirLink 0.6.1 Step 5b: Mapping data schema (ABCD 2.06) etc… 23
  • 28. TapirLink 0.6.1 Step 5c: Mapping data concepts (Darwin Core) etc… Step 5d: Remember that DwC have an extension for geospatial descriptors etc… 24
  • 29. TapirLink 0.6.1 Step 6: Settings New resource successfully configured 25
  • 30. TapirLink 0.6.1 Test resource with client form: http://localhost/tapirlink/tapir_client.php The XML Client form is very illustrative for understanding exactly how the wrapper software works! 26
  • 32. Example of a service request All exchanged data is formatted with XML tags. 28
  • 33. Example of a service response 29 ...
  • 34. Example TAPIR service request 30
  • 35. Example TAPIR service response singer:/sourcename singer:/taxonomy/genus singer:/taxonomy/species singer:/taxonomy/subspecies singer:/holding/ID singer:/holding/name singer:/origin/collecting/countrysource singer:/origin/collecting/countrysourceID singer:/status/biologicalstatus singer:/status/biologicalstatusID ... 31
  • 36. Example TAPIR service SEARCH request 32
  • 37. Example TAPIR service Search response 33
  • 38. Example of OAI-PMH service request OAI-PMH requests are expressed as HTTP requests. OAI-PMH requests must be submitted using either the HTTP GET or POST methods. http://an.oa.org/OAI-script?verb=GetRecord &identifier=oai:arXiv.org:hep-th/9901001 &metadataPrefix=oai_dc 34
  • 39. Example of OAI-PMH service RESPONSE OAI-PMH responses are formatted as HTTP responses. With The Content-Type as text/xml. 35
  • 40. OAI-PMH PROTOCOL, metadata formats 36 Request types (verb): Identify ListMetadataFormats ListSets GetRecord ListIdentifiers ListRecords For purposes of interoperability, the metadataPrefix `oai_dc’ is reserved for Dublin Core. Communities adopt own metadataPrefixesfor own metadata fomats. Relevant formats/schemas for Biodiversity Informatics are Darwin Core and ABCD.
  • 43. GBIF PGR Network 2 [http://data.gbif.org/datasets/network/2] 39
  • 44. 40 Distributed network The image is from the BioCASE web site
  • 45. 41 Decentralized network GBIF (Global Biodiversity Information Facility) ALIS (Accession Level Information System) USER Svalbard Global Seed Vault (Safe Backup) (USDA ARS National Germplasm Repositories...) Web Services USDA GRIN (USA) SINGER (CGIAR) (CGIAR International Future Harvest gene banks...) EURISCO (Europe) MCPD IHAR (Poland) WUR CGN (Netherlands) NordGen (Northern Europe) IPK Gatersleben (Germany) (Other European gene banks...)
  • 46. 42 Crop Wild Relatives LKA ARM BOL National Datasets are shared with the central CWR data index. The national datasets as well as access to other International datasets are provided from the CWR data portal. MDG EURISCO UZB http://www.cropwildrelatives.org SINGER
  • 49. 45
  • 50. 46
  • 51. 47
  • 52. 48
  • 53. 49 Outlook The compatibility of data standards between PGR and biodiversity collections made it possible to integrate the worldwide germplasm collections into the biodiversity community. Using GBIF technology (and contributing to its development), the PGR community can easily establish specific PGR networks without duplicating GBIF's work. Use of GBIF technology and integration of PGR collection data into GBIF allows PGR users to simultaneously search PGR collections and other biodiversity collections, and to get access to the data (and possibly the material) of relevant biodiversity collections. The establishment of new data portals on a specific crop, a regional thematic network or similar subset of the total global biodiversity datasets; can be done with rather few efforts! This requires only that all the relevant datasets are provided by GBIF compatible web services (like the BioCASE PyWrapper).
  • 54. Participation and the sharing of your institute datasets with global and national biodiversity projects is important for your public and scientific visibility, promoting the use (usefulness) of your data and ultimately for the continued funding of your institutional activities. 50
  • 55. Special thanks to Bioversity International [http://www.bioversityinternational.org] GBIF, Global Biodiversity Information Facility [http://www.gbif.org] BioCASE, The Biological Collection Access Service for Europe. [http://www.biocase.org] TDWG, Biodiversity Information Standards [http://www.tdwg.org] 51
  • 56. Special thanks to BioCASE and PyWrapper3 software Markus Döring Javier de la Torre DiGIR and TapirLink software Renato de Giovanni Dave Vieglais 52
  • 57. Thank you for listening! 53

Notes de l'éditeur

  1. Image source: University of Ottawa, Distributed Computing Research Group: http://www.genie.uottawa.ca/research/rsrch_site.php?lang=e&id=90 (Google Images).See also: http://en.wikipedia.org/wiki/Fallacies_of_Distributed_Computing
  2. IMAGE: http://blog.tapirtype.com/cartoons/ [Creative Commons License: http://creativecommons.org/licenses/by-nc-sa/3.0/us/]
  3. http://www.tdwg.org
  4. More details see:GBIF NODES meeting 2007 in Amsterdam.Agenda 09 Technical Training session - TAPIR/PyWrapper3:http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i
  5. More details see:GBIF NODES meeting 2007 in Amsterdam.Agenda 09 Technical Training session - TAPIR/PyWrapper3:http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i
  6. Apache HTTPD server:http://www.apache.org/MySQL database server:http://www.mysql.com/Python programming language: http://www.python.org
  7. In some cases with proxies:svn co svn://svn.pywrapper.org/pywrapper/trunk pywrapper
  8. More details see:GBIFNODES meeting 2007 in Amsterdam.Agenda 09 Technical Training session - TAPIR/PyWrapper3:http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i
  9. IMAGE source: http://commons.wikimedia.org/wiki/Image:Handshake_(Workshop_Cologne_%2706).jpeg; Copyright: GNU Public Licence
  10. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  11. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  12. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  13. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  14. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  15. Photographer Dag Terje Endresen (NordGen Picture Archive, image 002986)http://www.nordgen.org/sesto/index.php?scp=ngb&thm=pictures&mod=det&id=002986&img_size=768x512