dataviz on d3.js + elasticsearch

Mathieu Elie
Mathieu Elieelasticsearch consultant à Mathieu Elie
dataviz on
                   bordeaux open data
                     + elasticsearch
                         + d3js
                      + angular.js + google refine

mardi 19 mars 13
goal
                   • full stack dataviz
                   • front js centric with angular.js and sass
                   • d3.js -> best lib for dataviz (a bit complex)
                   • elasticsearch scalable search engine access
                     form js ajax client
                   • grab open data and reformat with google
                     refine


mardi 19 mars 13
philosophy

                   • smarter and smarter browsers will grab a
                     huge part of the stack
                   • scalable NoSQL solution talk REST: super
                     easy access
                   • you always can enrich, refine, store, model
                     data from anysource: yes you can !



mardi 19 mars 13
go open data !
                   • go to http://opendata.bordeaux.fr/
                   • we focus ou capc contemporary museum
                     of bordeaux
                   • http://opendata.bordeaux.fr/content/
                     collections-du-capc-musee-dart-
                     contemporain
                   • curl https://
                     bdxconfigogdi.blob.core.windows.net/
                     converteddata/capc.csv -o capc.csv
mardi 19 mars 13
google refine
                   • data many times isn’t perfect: humans
                     input, unsuitable model...
                   • with google refine you can tidy data, enrich
                     with web services call and so on...
                   • i think you should never think you don’t
                     have the good data for the job
                   • be smart and be creativ: you have
                     everything you need, all the time !
mardi 19 mars 13
import data in
                            elasticsearch
                   • bulk api import for es is handy
                   • http://www.elasticsearch.org/guide/
                     reference/api/bulk.html
                   • we use templating in google refine to
                     export data to the correct format
                   • { "index" : { "_index" : "musees", "_type" :
                     "capc"} } [snipp]


mardi 19 mars 13
elasticsearch

                   • scalable search engine
                   • adding more power == adding more nodes
                   • sharding
                   • replication
                   • fault tolerant

mardi 19 mars 13
elasticsearch
                   • store unstructured document (json) in
                     indexes (NoSQL way)
                   • talk REST (api)
                   • advanced query langage
                   • multiple analyzers (tokens, languages, etc...)
                   • blazing fast !
                   • no alternativ solutions (to my advice) (and
                     kimshy advice too ;) )
mardi 19 mars 13
yeoman
                   • perfect tool for the new browser focus
                     stack coming soon
                   • yeoman init angular
                   • yeoman init angular:route   capc
                   • yeoman install d3
                   • yeoman install jquery
                   • yeoman server -> yeah !!!
mardi 19 mars 13
angular.js

                   • mvc framework for browser (js)
                   • by google
                   • rely better on html doc
                   • cleaner controller handling than backbone
                     (to my advice)



mardi 19 mars 13
d3.js

                   • best javascript lib for dataviz (to my advice)
                   • slow learning curve
                   • based on svg browser markup
                   • data and dom oriented
                   • generic (go low level with svg easily)

mardi 19 mars 13
dataviz
                   • make appears informations from pure data
                   • you should explore, analyse and be creativ
                     to grab the most value possible from the
                     data: go as far as you can
                   • tables (excell way) -> forgive
                   • visualization will reveal your data: data is
                     made to be know to take decision, manage,
                     understand

mardi 19 mars 13
open data

                   • today, wide range of datas come to the
                     public domain
                   • again, without computing and vizualisation,
                     data has a poor value
                   • open data without data scientists and data
                     visualization has no futur



mardi 19 mars 13
futur
                   • with low bandwith, storage and computing
                      cost, you can grab open data from a lot of
                      sources
                   • you can put them in big data store and
                      make analysis, relations between, with open
                      source technologies
                   • you can then share this to the word with
                      data visualization on your website, blogs...

mardi 19 mars 13
                   • amazing isn’t it ?? !!!!!
questions ? infos ?

                   • https://github.com/mathieue
                   • http://www.mathieu-elie.net/
                   • https://twitter.com/mathieuel
                   • https://www.facebook.com/mathieuelietech

mardi 19 mars 13
1 sur 15

Recommandé

elasticsearch basics workshop par
elasticsearch basics workshopelasticsearch basics workshop
elasticsearch basics workshopMathieu Elie
3.2K vues47 diapositives
Data Visualization on the Tech Side par
Data Visualization on the Tech SideData Visualization on the Tech Side
Data Visualization on the Tech SideMathieu Elie
5.8K vues36 diapositives
Data vizualisation: d3.js + sinatra + elasticsearch par
Data vizualisation: d3.js + sinatra + elasticsearchData vizualisation: d3.js + sinatra + elasticsearch
Data vizualisation: d3.js + sinatra + elasticsearchMathieu Elie
6.5K vues15 diapositives
Scrapy par
ScrapyScrapy
ScrapyFrancisco Sousa
3.1K vues41 diapositives
Appli légère avec d3.js, sinatra, elasticsearch et capucine par
Appli légère avec d3.js, sinatra, elasticsearch et capucineAppli légère avec d3.js, sinatra, elasticsearch et capucine
Appli légère avec d3.js, sinatra, elasticsearch et capucineyann ARMAND
978 vues15 diapositives
ElasticSearch - index server used as a document database par
ElasticSearch - index server used as a document databaseElasticSearch - index server used as a document database
ElasticSearch - index server used as a document databaseRobert Lujo
5.5K vues43 diapositives

Contenu connexe

Tendances

On Again; Off Again - Benjamin Young - ebookcraft 2017 par
On Again; Off Again - Benjamin Young - ebookcraft 2017On Again; Off Again - Benjamin Young - ebookcraft 2017
On Again; Off Again - Benjamin Young - ebookcraft 2017BookNet Canada
2.5K vues34 diapositives
CouchDB Open Source Bridge par
CouchDB Open Source BridgeCouchDB Open Source Bridge
CouchDB Open Source BridgeChris Anderson
3.4K vues73 diapositives
Mongo db php_shaken_not_stirred_joomlafrappe par
Mongo db php_shaken_not_stirred_joomlafrappeMongo db php_shaken_not_stirred_joomlafrappe
Mongo db php_shaken_not_stirred_joomlafrappeSpyros Passas
1.2K vues36 diapositives
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version) par
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)Sammy Fung
6.6K vues21 diapositives
NoSQL: Why, When, and How par
NoSQL: Why, When, and HowNoSQL: Why, When, and How
NoSQL: Why, When, and HowBigBlueHat
6.2K vues123 diapositives
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby... par
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...guest5b1607
7.4K vues31 diapositives

Tendances(20)

On Again; Off Again - Benjamin Young - ebookcraft 2017 par BookNet Canada
On Again; Off Again - Benjamin Young - ebookcraft 2017On Again; Off Again - Benjamin Young - ebookcraft 2017
On Again; Off Again - Benjamin Young - ebookcraft 2017
BookNet Canada2.5K vues
Mongo db php_shaken_not_stirred_joomlafrappe par Spyros Passas
Mongo db php_shaken_not_stirred_joomlafrappeMongo db php_shaken_not_stirred_joomlafrappe
Mongo db php_shaken_not_stirred_joomlafrappe
Spyros Passas1.2K vues
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version) par Sammy Fung
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Sammy Fung6.6K vues
NoSQL: Why, When, and How par BigBlueHat
NoSQL: Why, When, and HowNoSQL: Why, When, and How
NoSQL: Why, When, and How
BigBlueHat6.2K vues
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby... par guest5b1607
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
guest5b16077.4K vues
Introduction to Elasticsearch par Jason Austin
Introduction to ElasticsearchIntroduction to Elasticsearch
Introduction to Elasticsearch
Jason Austin7.8K vues
Apache CouchDB Presentation @ Sept. 2104 GTALUG Meeting par Myles Braithwaite
Apache CouchDB Presentation @ Sept. 2104 GTALUG MeetingApache CouchDB Presentation @ Sept. 2104 GTALUG Meeting
Apache CouchDB Presentation @ Sept. 2104 GTALUG Meeting
Simple search with elastic search par markstory
Simple search with elastic searchSimple search with elastic search
Simple search with elastic search
markstory6.6K vues
Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh... par BookNet Canada
Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh...Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh...
Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh...
BookNet Canada338 vues
NoSQL - An introduction to CouchDB par Jonathan Weiss
NoSQL - An introduction to CouchDBNoSQL - An introduction to CouchDB
NoSQL - An introduction to CouchDB
Jonathan Weiss1.8K vues
Web History 101, or How the Future is Unwritten par BookNet Canada
Web History 101, or How the Future is UnwrittenWeb History 101, or How the Future is Unwritten
Web History 101, or How the Future is Unwritten
BookNet Canada514 vues
An Introduction to Elastic Search. par Jurriaan Persyn
An Introduction to Elastic Search.An Introduction to Elastic Search.
An Introduction to Elastic Search.
Jurriaan Persyn71.1K vues
What I learnt: Elastic search & Kibana : introduction, installtion & configur... par Rahul K Chauhan
What I learnt: Elastic search & Kibana : introduction, installtion & configur...What I learnt: Elastic search & Kibana : introduction, installtion & configur...
What I learnt: Elastic search & Kibana : introduction, installtion & configur...
Rahul K Chauhan1.1K vues
FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ... par Elżbieta Bednarek
FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ...FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ...
FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ...
Elżbieta Bednarek1.2K vues

Similaire à dataviz on d3.js + elasticsearch

Big Data Analysis : Deciphering the haystack par
Big Data Analysis : Deciphering the haystack Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Srinath Perera
3.7K vues47 diapositives
Data collection & communication par
Data collection & communicationData collection & communication
Data collection & communicationGeckoboard
3.3K vues11 diapositives
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An... par
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...Benjamin Nussbaum
65.8K vues38 diapositives
Mechanical curator - Technical notes par
Mechanical curator - Technical notesMechanical curator - Technical notes
Mechanical curator - Technical notesbenosteen
674 vues37 diapositives
From Developer to Data Scientist par
From Developer to Data ScientistFrom Developer to Data Scientist
From Developer to Data ScientistGaines Kergosien
790 vues27 diapositives
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich par
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow ZurichHow to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow ZurichPatrick Baumgartner
1.2K vues18 diapositives

Similaire à dataviz on d3.js + elasticsearch(20)

Big Data Analysis : Deciphering the haystack par Srinath Perera
Big Data Analysis : Deciphering the haystack Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack
Srinath Perera3.7K vues
Data collection & communication par Geckoboard
Data collection & communicationData collection & communication
Data collection & communication
Geckoboard3.3K vues
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An... par Benjamin Nussbaum
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Benjamin Nussbaum65.8K vues
Mechanical curator - Technical notes par benosteen
Mechanical curator - Technical notesMechanical curator - Technical notes
Mechanical curator - Technical notes
benosteen674 vues
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich par Patrick Baumgartner
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow ZurichHow to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich
Wsrest 2013 par Caelum
Wsrest 2013Wsrest 2013
Wsrest 2013
Caelum2.1K vues
Pre-Aggregated Analytics And Social Feeds Using MongoDB par Rackspace
Pre-Aggregated Analytics And Social Feeds Using MongoDBPre-Aggregated Analytics And Social Feeds Using MongoDB
Pre-Aggregated Analytics And Social Feeds Using MongoDB
Rackspace1.1K vues
Data Science at Scale - The DevOps Approach par Mihai Criveti
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps Approach
Mihai Criveti126 vues
Real time data viz with Spark Streaming, Kafka and D3.js par Ben Laird
Real time data viz with Spark Streaming, Kafka and D3.jsReal time data viz with Spark Streaming, Kafka and D3.js
Real time data viz with Spark Streaming, Kafka and D3.js
Ben Laird13.3K vues
Trending with Purpose par Jason Dixon
Trending with PurposeTrending with Purpose
Trending with Purpose
Jason Dixon2.1K vues
Graph Databases par thai
Graph DatabasesGraph Databases
Graph Databases
thai166 vues
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P... par Wes McKinney
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Wes McKinney103.9K vues

dataviz on d3.js + elasticsearch

  • 1. dataviz on bordeaux open data + elasticsearch + d3js + angular.js + google refine mardi 19 mars 13
  • 2. goal • full stack dataviz • front js centric with angular.js and sass • d3.js -> best lib for dataviz (a bit complex) • elasticsearch scalable search engine access form js ajax client • grab open data and reformat with google refine mardi 19 mars 13
  • 3. philosophy • smarter and smarter browsers will grab a huge part of the stack • scalable NoSQL solution talk REST: super easy access • you always can enrich, refine, store, model data from anysource: yes you can ! mardi 19 mars 13
  • 4. go open data ! • go to http://opendata.bordeaux.fr/ • we focus ou capc contemporary museum of bordeaux • http://opendata.bordeaux.fr/content/ collections-du-capc-musee-dart- contemporain • curl https:// bdxconfigogdi.blob.core.windows.net/ converteddata/capc.csv -o capc.csv mardi 19 mars 13
  • 5. google refine • data many times isn’t perfect: humans input, unsuitable model... • with google refine you can tidy data, enrich with web services call and so on... • i think you should never think you don’t have the good data for the job • be smart and be creativ: you have everything you need, all the time ! mardi 19 mars 13
  • 6. import data in elasticsearch • bulk api import for es is handy • http://www.elasticsearch.org/guide/ reference/api/bulk.html • we use templating in google refine to export data to the correct format • { "index" : { "_index" : "musees", "_type" : "capc"} } [snipp] mardi 19 mars 13
  • 7. elasticsearch • scalable search engine • adding more power == adding more nodes • sharding • replication • fault tolerant mardi 19 mars 13
  • 8. elasticsearch • store unstructured document (json) in indexes (NoSQL way) • talk REST (api) • advanced query langage • multiple analyzers (tokens, languages, etc...) • blazing fast ! • no alternativ solutions (to my advice) (and kimshy advice too ;) ) mardi 19 mars 13
  • 9. yeoman • perfect tool for the new browser focus stack coming soon • yeoman init angular • yeoman init angular:route capc • yeoman install d3 • yeoman install jquery • yeoman server -> yeah !!! mardi 19 mars 13
  • 10. angular.js • mvc framework for browser (js) • by google • rely better on html doc • cleaner controller handling than backbone (to my advice) mardi 19 mars 13
  • 11. d3.js • best javascript lib for dataviz (to my advice) • slow learning curve • based on svg browser markup • data and dom oriented • generic (go low level with svg easily) mardi 19 mars 13
  • 12. dataviz • make appears informations from pure data • you should explore, analyse and be creativ to grab the most value possible from the data: go as far as you can • tables (excell way) -> forgive • visualization will reveal your data: data is made to be know to take decision, manage, understand mardi 19 mars 13
  • 13. open data • today, wide range of datas come to the public domain • again, without computing and vizualisation, data has a poor value • open data without data scientists and data visualization has no futur mardi 19 mars 13
  • 14. futur • with low bandwith, storage and computing cost, you can grab open data from a lot of sources • you can put them in big data store and make analysis, relations between, with open source technologies • you can then share this to the word with data visualization on your website, blogs... mardi 19 mars 13 • amazing isn’t it ?? !!!!!
  • 15. questions ? infos ? • https://github.com/mathieue • http://www.mathieu-elie.net/ • https://twitter.com/mathieuel • https://www.facebook.com/mathieuelietech mardi 19 mars 13