SlideShare une entreprise Scribd logo
1  sur  55
CKAN 2 Introduction 
Presenter: 李承錱 Cheng-Jen Lee (Sol) 
Email: cjlee AT iis.sinica.edu.tw 
This work is licensed under a 
Creative Commons Attribution-ShareAlike 3.0 Taiwan License.
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 2
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 3
About CKAN 
● The Comprehensive Knowledge Archive 
Network 
● A powerful data management system 
● Publishing 
● Sharing 
● Finding 
● Using Data 
2014/5/22 4
About CKAN 
83 instances 
around the world 
in May 2014 
2014/5/22 5
CKAN 2 
2014/5/22 6
Feature Tour 
2014/5/22 7
Demo Site 
demo.ckan.org 
2014/5/22 8
Feature Tour (1) 
Publish & Find Datasets 
Add Dataset Basic 
Information 
2014/5/22 9
Feature Tour (1) 
Publish & Find Datasets 
Add Data 
Under the Dataset 
2014/5/22 10
Feature Tour (1) 
Publish & Find Datasets 
Add Metadata 
About the Dataset 
2014/5/22 11
Feature Tour (1) 
Publish & Find Datasets 
Filter By 
Keywords 
2014/5/22 12
Feature Tour (1) 
Publish & Find Datasets 
Filter By 
Geographical Features 
2014/5/22 13
Feature Tour (2) 
Store & Manage Data 
Data Explorer: 
recline_preview (csv, xls) 
json_preview 
pdf_preview 
ckanext-spatial 
2014/5/22 14
Feature Tour (2) 
Store & Manage Data 
Graphing data 
2014/5/22 15
Feature Tour (3) 
Engage with Users & Others 
Share 
2014/5/22 16
Feature Tour (3) 
Engage with Users & Others 
Organization 
2014/5/22 17
Feature Tour (3) 
Engage with Users & Others 
Manage Users of 
an Organization 
2014/5/22 18
Feature Tour (3) 
Engage with Users & Others 
Manage Role of Members 
Admin: edit datasets & members 
Editor: edit datasets 
Viewer: view (private) datasets 
Note: Public datasets are visible to 
everyone 
2014/5/22 19
Feature Tour (3) 
Engage with Users & Others 
Harvest and Federation 
2014/5/22 20
Feature Tour (3) 
Engage with Users & Others 
History 
2014/5/22 21
Feature Tour (4) 
Customise & Extend 
● RESTful JSON APIs 
– The Action API 
– The DataStore API 
– The FileStore API... 
● Extensions (over 60) 
– ckanext-harvest 
– ckanext-spatial 
Open source is good! 
● Themable 
● Integrates with other CMS (ex. Drupal) 
2014/5/22 22
CKAN and 5 ★ Open Data 
2014/5/22 23
CKAN and 5 ★ Open Data 
● ★ Make your stuff available on the Web (whatever 
format) under an open license 
2014/5/22 24
CKAN and 5 ★ Open Data 
● ★★ Make it available as structured data (e.g., 
Excel instead of image scan of a table) 
★★★ Use non-proprietary formats (e.g., CSV 
instead of Excel) 
– Accept any data format 
– Beautiful data demonstration 
– Datastore : Indexing for structure data 
2014/5/22 25
CKAN and 5 ★ Open Data 
● ★★★★ Use URIs to denote things, so that people 
can point at your stuff 
– Permanent link for each dataset 
– Get Dataset URI through API 
● ★★★★★ Link your data to other data to provide 
context 
– Linked data and RDF for metadata 
2014/5/22 26
CKAN and 5 ★ Open Data 
● RDF for metadata 
– DCAT and Dublin Core 
– curl -L -H "Accept:application/rdf+xml" 
http://thedatahub.org/dataset/gold-prices 
2014/5/22 27
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 28
United Kingdom 
DATA.GOV.UK 
2014/5/22 29
United States 
DATA.GOV 
2014/5/22 30
Brazil 
DADOS.GOV.BR 
2014/5/22 31
European Union 
PUBLICDATA.EU 
2014/5/22 32
Geospatial Data Explorer: 
Lat/Long field 
2014/5/22 33
Geospatial Data Explorer: 
GeoJSON 
2014/5/22 34
Geospatial Data Explorer: 
WMS 
2014/5/22 35
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 36
Issues 
● CJK Support 
– CJK Search 
– Some broken translations 
– Frequent updates 
● Extensions compatibilities 
● Tons of tweaks needed 
● Performance Issue 
● Complicated Architecture 
2014/5/22 37
System Architecture 
2014/5/22 38
Issues (Cont'd) 
● What You Should Know 
– Python & Pylons 
– ckan plugins toolkit 
– SQLAlchemy & SQL 
– HTML, JavaScript 
– Babel 
– Web Server (Nginx, uWSGI...) 
2014/5/22 39
Resources 
● Official Documents: 
– http://docs.ckan.org/en/latest/ 
● Installation Notes (in Chinese): 
– https://ckan-docs-tw.readthedocs.org/ 
● CKAN Development Discussions: 
– http://lists.okfn.org/mailman/listinfo/ckan-dev 
● CKAN Taiwan Interest Group: 
– https://groups.google.com/forum/#!forum/ckan-taiwan-interest-group 
2014/5/22 40
Thanks for your attention! 
Any Q? 
Email: u10313335 AT citi.sinica.edu.tw 
http://about.me/sollee 
2014/5/22 41
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 42
Install from Source 
● Virtual environment 
● Checkout the source (via GIT) 
– https://github.com/okfn/ckan 
● Create a CKAN config file 
● Setup Jetty & Solr 
● Initialize Database (user, db) 
● Link to who.ini 
● Create a Sysadmin User 
● Deployment (nginx + uWSGI) 
● Install other extensions... 
2014/5/22 43
Installation Notes 
● https://ckan-docs-tw.readthedocs.org/ 
2014/5/22 44
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 45
Harvesters 
● ckanext-harvest 
– Remote harvesting extension 
– https://github.com/okfn/ckanext-harvest 
● Source Type 
– CKAN (built-in) 
– CSW 
– WAF 
– Custom (csv/xls/website… etc) 
2014/5/22 46
Harvested from TGOS 
CSW service 
2014/5/22 47
Harvesters 
http://Mydomain.com/harvest 
2014/5/22 48
Harvesters 
Add a new harvest source 
2014/5/22 49
Harvesters 
Create a harvest job 
2014/5/22 50
Harvesters 
Overview of harvested datasets 
2014/5/22 51
Harvesters 
Background Process 
● Manually 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester gather_consumer -c 
/etc/ckan/default/production.ini 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester fetch_consumer -c 
/etc/ckan/default/production.ini 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester run -c 
/etc/ckan/default/production.ini 
2014/5/22 52
Harvesters 
Background Process 
● Automatically 
– Supervisor (for gather & fetch consumer) 
– Cron (for run) 
2014/5/22 53
Harvesters 
Custom harvester 
● Implement the harvester interface 
to perform harvesting operations 
● Three stages 
● gather: get the identification 
● fetch: fetch the contents 
● import: create ckan package(dataset) 
2014/5/22 54
Harvesters 
The harvesting interface 
from base import HarvesterBase 
class SRDAHarvester(HarvesterBase): 
def _set_config(self,config_str): 
def info(self): 
... 
def gather_stage(self, harvest_job): 
def fetch_stage(self, harvest_object): 
def import_stage(self, harvest_object): 
See the extension site for details 
An example (SRDA): http://goo.gl/ZMnND7 
2014/5/22 55

Contenu connexe

Tendances

ckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sourcesckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sources
Chengjen Lee
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
scorlosquet
 

Tendances (20)

ckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sourcesckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sources
 
CKAN - the open source data portal platform
CKAN - the open source data portal platformCKAN - the open source data portal platform
CKAN - the open source data portal platform
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
 
Apache Marmotta - Introduction
Apache Marmotta - IntroductionApache Marmotta - Introduction
Apache Marmotta - Introduction
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
 
Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
 
The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013
 
LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data
 
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
 
Towards Data Operations
Towards Data OperationsTowards Data Operations
Towards Data Operations
 
Presentation data collection and gtfs
Presentation data collection and gtfsPresentation data collection and gtfs
Presentation data collection and gtfs
 
Culture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data LandCulture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data Land
 
Graph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraGraph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandra
 
Drupal and RDF
Drupal and RDFDrupal and RDF
Drupal and RDF
 
Updates from Hungary (Jozsef Kovacs)
Updates from Hungary (Jozsef Kovacs)Updates from Hungary (Jozsef Kovacs)
Updates from Hungary (Jozsef Kovacs)
 
PhD Defense
PhD DefensePhD Defense
PhD Defense
 
Pinot: Realtime Distributed OLAP datastore
Pinot: Realtime Distributed OLAP datastorePinot: Realtime Distributed OLAP datastore
Pinot: Realtime Distributed OLAP datastore
 
Iceberg: a fast table format for S3
Iceberg: a fast table format for S3Iceberg: a fast table format for S3
Iceberg: a fast table format for S3
 
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
 

En vedette

En vedette (10)

EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan Broeder
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Catalogues
 
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
 
Getting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, SingaporeGetting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, Singapore
 
Introduction to CKAN
Introduction to CKANIntroduction to CKAN
Introduction to CKAN
 
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
 
20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結
 
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
 
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
 
Minicurso de CKAN
Minicurso de CKANMinicurso de CKAN
Minicurso de CKAN
 

Similaire à ckan 2.0 Introduction (20140522 updated)

DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
Digitised Manuscripts to Europeana
 
Spark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng ChenSpark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng Chen
Guancheng (G.C.) Chen
 
Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012
scorlosquet
 
Improving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch FixImproving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch Fix
Stitch Fix Algorithms
 
Infrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache SparkInfrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache Spark
Databricks
 

Similaire à ckan 2.0 Introduction (20140522 updated) (20)

DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
 
Open Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-MayOpen Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-May
 
Spark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng ChenSpark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng Chen
 
Drupal 7 and RDF
Drupal 7 and RDFDrupal 7 and RDF
Drupal 7 and RDF
 
Sparkler - Spark Crawler
Sparkler - Spark Crawler Sparkler - Spark Crawler
Sparkler - Spark Crawler
 
ODN - Technical introduction of the platform
ODN - Technical introduction of the platformODN - Technical introduction of the platform
ODN - Technical introduction of the platform
 
Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012
 
Sprint planning
Sprint planningSprint planning
Sprint planning
 
PostgreSQL ecosystem
PostgreSQL ecosystemPostgreSQL ecosystem
PostgreSQL ecosystem
 
PTG recap
PTG recapPTG recap
PTG recap
 
Introduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCFIntroduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCF
 
OpenMetadata Community Meeting - 14 Dec. 2023
OpenMetadata Community Meeting - 14 Dec. 2023OpenMetadata Community Meeting - 14 Dec. 2023
OpenMetadata Community Meeting - 14 Dec. 2023
 
[scala.by] Launching new application fast
[scala.by] Launching new application fast[scala.by] Launching new application fast
[scala.by] Launching new application fast
 
scale12x
scale12xscale12x
scale12x
 
Database Migrations with Gradle and Liquibase
Database Migrations with Gradle and LiquibaseDatabase Migrations with Gradle and Liquibase
Database Migrations with Gradle and Liquibase
 
Improving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch FixImproving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch Fix
 
Integrating Flink with Hive - Flink Forward SF 2019
Integrating Flink with Hive - Flink Forward SF 2019Integrating Flink with Hive - Flink Forward SF 2019
Integrating Flink with Hive - Flink Forward SF 2019
 
Efficient & effective data management for research projects : ILRI's Data Ma...
Efficient & effective  data management for research projects : ILRI's Data Ma...Efficient & effective  data management for research projects : ILRI's Data Ma...
Efficient & effective data management for research projects : ILRI's Data Ma...
 
Deploy Eclipse hawBit in Production
Deploy Eclipse hawBit in ProductionDeploy Eclipse hawBit in Production
Deploy Eclipse hawBit in Production
 
Infrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache SparkInfrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache Spark
 

Plus de Chengjen Lee (11)

Preserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary EventsPreserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary Events
 
Retooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.ioRetooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.io
 
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
 
“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN
 
CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)
 
CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)
 
CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例
 
Report 140227
Report 140227Report 140227
Report 140227
 
Report 140213
Report 140213Report 140213
Report 140213
 
Introduction to Pelican
Introduction to PelicanIntroduction to Pelican
Introduction to Pelican
 
ckan 2.0: a deeper look
ckan 2.0: a deeper lookckan 2.0: a deeper look
ckan 2.0: a deeper look
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

ckan 2.0 Introduction (20140522 updated)

  • 1. CKAN 2 Introduction Presenter: 李承錱 Cheng-Jen Lee (Sol) Email: cjlee AT iis.sinica.edu.tw This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Taiwan License.
  • 2. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 2
  • 3. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 3
  • 4. About CKAN ● The Comprehensive Knowledge Archive Network ● A powerful data management system ● Publishing ● Sharing ● Finding ● Using Data 2014/5/22 4
  • 5. About CKAN 83 instances around the world in May 2014 2014/5/22 5
  • 9. Feature Tour (1) Publish & Find Datasets Add Dataset Basic Information 2014/5/22 9
  • 10. Feature Tour (1) Publish & Find Datasets Add Data Under the Dataset 2014/5/22 10
  • 11. Feature Tour (1) Publish & Find Datasets Add Metadata About the Dataset 2014/5/22 11
  • 12. Feature Tour (1) Publish & Find Datasets Filter By Keywords 2014/5/22 12
  • 13. Feature Tour (1) Publish & Find Datasets Filter By Geographical Features 2014/5/22 13
  • 14. Feature Tour (2) Store & Manage Data Data Explorer: recline_preview (csv, xls) json_preview pdf_preview ckanext-spatial 2014/5/22 14
  • 15. Feature Tour (2) Store & Manage Data Graphing data 2014/5/22 15
  • 16. Feature Tour (3) Engage with Users & Others Share 2014/5/22 16
  • 17. Feature Tour (3) Engage with Users & Others Organization 2014/5/22 17
  • 18. Feature Tour (3) Engage with Users & Others Manage Users of an Organization 2014/5/22 18
  • 19. Feature Tour (3) Engage with Users & Others Manage Role of Members Admin: edit datasets & members Editor: edit datasets Viewer: view (private) datasets Note: Public datasets are visible to everyone 2014/5/22 19
  • 20. Feature Tour (3) Engage with Users & Others Harvest and Federation 2014/5/22 20
  • 21. Feature Tour (3) Engage with Users & Others History 2014/5/22 21
  • 22. Feature Tour (4) Customise & Extend ● RESTful JSON APIs – The Action API – The DataStore API – The FileStore API... ● Extensions (over 60) – ckanext-harvest – ckanext-spatial Open source is good! ● Themable ● Integrates with other CMS (ex. Drupal) 2014/5/22 22
  • 23. CKAN and 5 ★ Open Data 2014/5/22 23
  • 24. CKAN and 5 ★ Open Data ● ★ Make your stuff available on the Web (whatever format) under an open license 2014/5/22 24
  • 25. CKAN and 5 ★ Open Data ● ★★ Make it available as structured data (e.g., Excel instead of image scan of a table) ★★★ Use non-proprietary formats (e.g., CSV instead of Excel) – Accept any data format – Beautiful data demonstration – Datastore : Indexing for structure data 2014/5/22 25
  • 26. CKAN and 5 ★ Open Data ● ★★★★ Use URIs to denote things, so that people can point at your stuff – Permanent link for each dataset – Get Dataset URI through API ● ★★★★★ Link your data to other data to provide context – Linked data and RDF for metadata 2014/5/22 26
  • 27. CKAN and 5 ★ Open Data ● RDF for metadata – DCAT and Dublin Core – curl -L -H "Accept:application/rdf+xml" http://thedatahub.org/dataset/gold-prices 2014/5/22 27
  • 28. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 28
  • 30. United States DATA.GOV 2014/5/22 30
  • 33. Geospatial Data Explorer: Lat/Long field 2014/5/22 33
  • 34. Geospatial Data Explorer: GeoJSON 2014/5/22 34
  • 35. Geospatial Data Explorer: WMS 2014/5/22 35
  • 36. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 36
  • 37. Issues ● CJK Support – CJK Search – Some broken translations – Frequent updates ● Extensions compatibilities ● Tons of tweaks needed ● Performance Issue ● Complicated Architecture 2014/5/22 37
  • 39. Issues (Cont'd) ● What You Should Know – Python & Pylons – ckan plugins toolkit – SQLAlchemy & SQL – HTML, JavaScript – Babel – Web Server (Nginx, uWSGI...) 2014/5/22 39
  • 40. Resources ● Official Documents: – http://docs.ckan.org/en/latest/ ● Installation Notes (in Chinese): – https://ckan-docs-tw.readthedocs.org/ ● CKAN Development Discussions: – http://lists.okfn.org/mailman/listinfo/ckan-dev ● CKAN Taiwan Interest Group: – https://groups.google.com/forum/#!forum/ckan-taiwan-interest-group 2014/5/22 40
  • 41. Thanks for your attention! Any Q? Email: u10313335 AT citi.sinica.edu.tw http://about.me/sollee 2014/5/22 41
  • 42. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 42
  • 43. Install from Source ● Virtual environment ● Checkout the source (via GIT) – https://github.com/okfn/ckan ● Create a CKAN config file ● Setup Jetty & Solr ● Initialize Database (user, db) ● Link to who.ini ● Create a Sysadmin User ● Deployment (nginx + uWSGI) ● Install other extensions... 2014/5/22 43
  • 44. Installation Notes ● https://ckan-docs-tw.readthedocs.org/ 2014/5/22 44
  • 45. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 45
  • 46. Harvesters ● ckanext-harvest – Remote harvesting extension – https://github.com/okfn/ckanext-harvest ● Source Type – CKAN (built-in) – CSW – WAF – Custom (csv/xls/website… etc) 2014/5/22 46
  • 47. Harvested from TGOS CSW service 2014/5/22 47
  • 49. Harvesters Add a new harvest source 2014/5/22 49
  • 50. Harvesters Create a harvest job 2014/5/22 50
  • 51. Harvesters Overview of harvested datasets 2014/5/22 51
  • 52. Harvesters Background Process ● Manually – (pyenv) $ paster --plugin=ckanext-harvest harvester gather_consumer -c /etc/ckan/default/production.ini – (pyenv) $ paster --plugin=ckanext-harvest harvester fetch_consumer -c /etc/ckan/default/production.ini – (pyenv) $ paster --plugin=ckanext-harvest harvester run -c /etc/ckan/default/production.ini 2014/5/22 52
  • 53. Harvesters Background Process ● Automatically – Supervisor (for gather & fetch consumer) – Cron (for run) 2014/5/22 53
  • 54. Harvesters Custom harvester ● Implement the harvester interface to perform harvesting operations ● Three stages ● gather: get the identification ● fetch: fetch the contents ● import: create ckan package(dataset) 2014/5/22 54
  • 55. Harvesters The harvesting interface from base import HarvesterBase class SRDAHarvester(HarvesterBase): def _set_config(self,config_str): def info(self): ... def gather_stage(self, harvest_job): def fetch_stage(self, harvest_object): def import_stage(self, harvest_object): See the extension site for details An example (SRDA): http://goo.gl/ZMnND7 2014/5/22 55

Notes de l'éditeur

  1. 由 Open Knowledge Foundation 支持
  2. 由 Open Knowledge Foundation 支持
  3. 公布網址
  4. Data.gov.uk 即是 drupal+CKAN
  5. Store the raw data and metadata. Visualise structured data with interactive tables, graphs and maps.
  6. TGOS: 國土資訊圖資服務平台