SlideShare une entreprise Scribd logo
1  sur  49
Télécharger pour lire hors ligne
Helium makes Zeppelin Fly
Moon soo Lee (moon@zepl.com)
Hoon Park (1ambda@zepl.com)
Ahyoung Ryu (ahyoungryu@zepl.com)
@ZEPL
Who we are?
What is Apache Zeppelin?
A web-based notebook that enables interactive data analytics. You can make beautiful data-driven,
interactive and collaborative documents with SQL, Scala and more.
When do you need Zeppelin?
Engineers Data Scientists Business user
Visualizations
Access control
Report
When multiple tools and different peoples are involved in your data pipeline
JDBC
Notebook
Zeppelin allows use multiple interpreter
(language backend) at the same time
● Real-time collaboration
● Authentication
● Notebook ACL
● Interpreter ACL
● Fine grained interpreter session -
notebook/user mapping
● Built-in scheduler
● Pluggable notebook storage
● Pluggable interpreter
Zeppelin supports multi-user environment
Built-in Visualization
6 Basic visualizations are built-in, (in addition to matplotlib, ggplot integration)
(Table, Bar chart, Pie chart, Area chart, Scatter chart, Line chart)
liumHe
2
4.0026
Hoon Park ZEPL
linkedin.com/in/1ambda
@
Why isn’t it easy to add new visualizations?
- Dependent on Zeppelin release cycle
- Restricted License (e.g. commercial chart)
Problems: Built-in Visualizations
“What if I want to display things differently?”
Let’s add pluggable visualizations
- external add-ons
- can update frequently
- highly customizable
- shared via online registry
Solution?
Problems: Built-in Visualizations
Why isn’t it easy to add new visualizations?
- Restricted License (e.g. highchart)
- Dependent on Zeppelin release cycle
liumHe
2
4.0026
Apache Zeppelin: Helium
VISUALIZATION
DEMO
Helium Visualization Examples
“What about extending Interpreters?”
“Can we leverage Helium framework for interpreters?”
Backgrounds: Interpreter
Execute paragraph (code) and return output
- separated JVM process
- %spark, %jdbc, %python, ...
Apache Zeppelin: Helium
SPELL
It is not easy to add a new interpreter
- interpreters written in Java
- sometimes need to handle HTML dynamically
Can’t be combined with other interpreters
- e.g. Spark + Markdown
- Interpreter != Display System
Problems: Backend Interpreter
“Can we write interpreters easily?”
Backgrounds: Display System
customize interpreted outputs in frontend
- can be combined with interpreters
- %html, %table, %angular, ...
Problems: Backend Interpreter
“Frontend interpreter can be a display system”
Easy to create and handle HTML
- written in JS
- can utilize many existing JS libraries
(flowchart, sigmajs, vega, papaparse, ...)
Can be a display system like %html, %table
- e.g. Spark Interpreter + Markdown Display
- allows to customize output %myGraph
SPELL: Frontend Interpreter
Apache Zeppelin: Helium
SPELL
DEMO
Helium Spell Examples
lium Online RegistryHe
2
4.0026
Ahyoung Ryu @ ZEPL
linkedin.com/in/AhyoungRyu
How can we share
VISUALIZATION or
SPELL
packages?
ONLINE REGISTRIES
ONLINE REGISTRIES
ONLINE REGISTRIES
Online Registry for Helium packages?
THEN WE SHOULD CONSIDER
1. Who/ How can we build up the infra and operate the service?
- Need to set up the authentication system
- Need user/ package DB
- Versioning/ building / packaging
- …
2. External library Licenses
Too complicated..
Is there any other SIMPLE way to solve this?
A SOLUTION ALREADY EXISTED
A SOLUTION ALREADY EXISTED
Helium VISUALIZATION & SPELL packages are
package
A SOLUTION ALREADY EXISTED
The package information can be saved in registry!
Helium VISUALIZATION & SPELL packages are
package
Then, is there any way to fetch
ONLY Helium package information
from registry?
Create
a Helium package
HOW?
HOW?
Publish it to
Registry
http://registry.npmjs.org/
Create
a Helium package
HOW?
Publish it to
Registry
Filter Helium pkg
& fetch only
necessary metadata
Create
a Helium package
HOW?
Publish it to
Registry
Does the package has zeppelin-vis or
zeppelin-spell as its dependency?
Filter Helium pkg
& fetch only
necessary metadata
Create
a Helium package
HOW?
Publish it to
Registry
If so, take only necessary metadata:
- name
- description
- version
- license
- ...
Filter Helium pkg
& fetch only
necessary metadata
Create
a Helium package
HOW?
Publish it to
Registry
Integrate
whole data and create
helium.json
Filter Helium pkg
& fetch only
necessary metadata
Create
a Helium package
HOW?
Publish it to
Registry
Integrate
whole data and create
helium.json
Save the file in
Filter Helium pkg
& fetch only
necessary metadata
Create
a Helium package
HOW?
Publish it to
Registry
Save the file in
Trigger
Lambda function
every 1 hour using
Integrate
whole data and create
helium.json
Filter Helium pkg
& fetch only
necessary metadata
Create
a Helium package
HOW?
Publish it to
Registry
Read
helium.json
Integrate
whole data and create
helium.json
Trigger
Lambda function
every 1 hour using
Save the file in
Filter Helium pkg
& fetch only
necessary metadata
Create
a Helium package
JIRA ISSUES
- ZEPPELIN-1973 : List all available Helium packages in Zeppelin website
- ZEPPELIN-2004 : List helium packages in Zeppelin GUI by reading file
WHEN CAN I USE THIS?
- Not included in Zeppelin latest version 0.7.X
- Will be available in Zeppelin 0.8.0
- Release plan ?
extends Zeppelin eco-system
Interpreters
liumHe
2
4.0026
Visualizations
Spell
Map
Heatmap
Range
Bubble
Spline
Sigma
D3
Markdown Translator
Flowchart
Spark
Python
JDBC Groovy
Geode
FlinkCassandra
Kylin
Users / Developers
2833
3rd parties
Useful service
Zeppelin notebook online viewer
https://www.zeppelinhub.com/viewer
Notebook sharing and collaboration
https://www.zeppelinhub.com
Future Roadmap
0.7.2
- Maintenance release.
0.8.0
- Helium online registry.
- Interpreter Cluster mode.
1.0
- Finest, the most stable release
2Q
/2017
3Q
/2017
3-4Q
/2017
Hopefully
Thanks
Q&A

Contenu connexe

Tendances

AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜
AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜 AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜
AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜
崇之 清水
 

Tendances (20)

Building an API with Django and Django REST Framework
Building an API with Django and Django REST FrameworkBuilding an API with Django and Django REST Framework
Building an API with Django and Django REST Framework
 
Knowing Laravel 5 : The most popular PHP framework
Knowing Laravel 5 : The most popular PHP frameworkKnowing Laravel 5 : The most popular PHP framework
Knowing Laravel 5 : The most popular PHP framework
 
Apache Camel - The integration library
Apache Camel - The integration libraryApache Camel - The integration library
Apache Camel - The integration library
 
Microservices with Apache Camel
Microservices with Apache CamelMicroservices with Apache Camel
Microservices with Apache Camel
 
OpenFaaS JeffConf 2017 - Milan
OpenFaaS JeffConf 2017 - MilanOpenFaaS JeffConf 2017 - Milan
OpenFaaS JeffConf 2017 - Milan
 
AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜
AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜 AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜
AWS SDK for PHP のインストールから 始めるクラウドマスターへの道 〜 Promise による非同期オペレーション 〜
 
Interoperability at Apache Software Foundation
Interoperability at Apache Software FoundationInteroperability at Apache Software Foundation
Interoperability at Apache Software Foundation
 
Running Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on HadoopRunning Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on Hadoop
 
Apache Flink Hands-On
Apache Flink Hands-OnApache Flink Hands-On
Apache Flink Hands-On
 
OpenFaaS serverless framework for Docker and Kubernetes - London
OpenFaaS serverless framework for Docker and Kubernetes - LondonOpenFaaS serverless framework for Docker and Kubernetes - London
OpenFaaS serverless framework for Docker and Kubernetes - London
 
Presentation tim numann
Presentation tim numannPresentation tim numann
Presentation tim numann
 
Function as a Service
Function as a ServiceFunction as a Service
Function as a Service
 
Alfresco Development Framework Basic
Alfresco Development Framework BasicAlfresco Development Framework Basic
Alfresco Development Framework Basic
 
Microservices with apache_camel_barcelona
Microservices with apache_camel_barcelonaMicroservices with apache_camel_barcelona
Microservices with apache_camel_barcelona
 
Getting started with Apache Camel - jDays 2013
Getting started with Apache Camel - jDays 2013Getting started with Apache Camel - jDays 2013
Getting started with Apache Camel - jDays 2013
 
OpenFaaS 2019 Project Update
OpenFaaS 2019 Project UpdateOpenFaaS 2019 Project Update
OpenFaaS 2019 Project Update
 
Manage any AWS resources with Terraform 0.12 - April 2020
Manage any AWS resources with Terraform 0.12 - April 2020Manage any AWS resources with Terraform 0.12 - April 2020
Manage any AWS resources with Terraform 0.12 - April 2020
 
Integration using Apache Camel and Groovy
Integration using Apache Camel and GroovyIntegration using Apache Camel and Groovy
Integration using Apache Camel and Groovy
 
Getting started with Apache Camel - May 2013
Getting started with Apache Camel - May 2013Getting started with Apache Camel - May 2013
Getting started with Apache Camel - May 2013
 
Getting started with Apache Camel presentation at BarcelonaJUG, january 2014
Getting started with Apache Camel presentation at BarcelonaJUG, january 2014Getting started with Apache Camel presentation at BarcelonaJUG, january 2014
Getting started with Apache Camel presentation at BarcelonaJUG, january 2014
 

Similaire à Apache Zeppelin and Helium @ApacheCon 2017 may, FL

Cakefest higher education
Cakefest higher educationCakefest higher education
Cakefest higher education
therealphpdiddy
 
How to? Drupal developer toolkit. Dennis Povshedny.
How to? Drupal developer toolkit. Dennis Povshedny.How to? Drupal developer toolkit. Dennis Povshedny.
How to? Drupal developer toolkit. Dennis Povshedny.
DrupalCampDN
 

Similaire à Apache Zeppelin and Helium @ApacheCon 2017 may, FL (20)

Helium makes Zeppelin fly!
Helium makes Zeppelin fly!Helium makes Zeppelin fly!
Helium makes Zeppelin fly!
 
Flink in action
Flink in actionFlink in action
Flink in action
 
Apache Zeppelin, Helium and Beyond
Apache Zeppelin, Helium and BeyondApache Zeppelin, Helium and Beyond
Apache Zeppelin, Helium and Beyond
 
Data science lifecycle with Apache Zeppelin
Data science lifecycle with Apache ZeppelinData science lifecycle with Apache Zeppelin
Data science lifecycle with Apache Zeppelin
 
Introduction To Eclipse RCP
Introduction To Eclipse RCPIntroduction To Eclipse RCP
Introduction To Eclipse RCP
 
Cakefest higher education
Cakefest higher educationCakefest higher education
Cakefest higher education
 
Drupal development
Drupal development Drupal development
Drupal development
 
From content to search: speed-dating Apache Solr (ApacheCON 2018)
From content to search: speed-dating Apache Solr (ApacheCON 2018)From content to search: speed-dating Apache Solr (ApacheCON 2018)
From content to search: speed-dating Apache Solr (ApacheCON 2018)
 
Php Conference Brazil - Phalcon Giant Killer
Php Conference Brazil - Phalcon Giant KillerPhp Conference Brazil - Phalcon Giant Killer
Php Conference Brazil - Phalcon Giant Killer
 
Unified Batch and Real-Time Stream Processing Using Apache Flink
Unified Batch and Real-Time Stream Processing Using Apache FlinkUnified Batch and Real-Time Stream Processing Using Apache Flink
Unified Batch and Real-Time Stream Processing Using Apache Flink
 
Eclipse Overview
Eclipse Overview Eclipse Overview
Eclipse Overview
 
Lightweight web frameworks
Lightweight web frameworksLightweight web frameworks
Lightweight web frameworks
 
Apache Zeppelin Helium and Beyond
Apache Zeppelin Helium and BeyondApache Zeppelin Helium and Beyond
Apache Zeppelin Helium and Beyond
 
php basics
php basicsphp basics
php basics
 
Developing OpenResty Framework
Developing OpenResty FrameworkDeveloping OpenResty Framework
Developing OpenResty Framework
 
Flyr PHP micro-framework
Flyr PHP micro-frameworkFlyr PHP micro-framework
Flyr PHP micro-framework
 
Phalcon 2 - PHP Brazil Conference
Phalcon 2 - PHP Brazil ConferencePhalcon 2 - PHP Brazil Conference
Phalcon 2 - PHP Brazil Conference
 
How to? Drupal developer toolkit. Dennis Povshedny.
How to? Drupal developer toolkit. Dennis Povshedny.How to? Drupal developer toolkit. Dennis Povshedny.
How to? Drupal developer toolkit. Dennis Povshedny.
 
Introduction to node.js
Introduction to node.jsIntroduction to node.js
Introduction to node.js
 
Headless approach for offloading heavy tasks in Magento
Headless approach for offloading heavy tasks in MagentoHeadless approach for offloading heavy tasks in Magento
Headless approach for offloading heavy tasks in Magento
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Apache Zeppelin and Helium @ApacheCon 2017 may, FL