SlideShare une entreprise Scribd logo
1  sur  15
Télécharger pour lire hors ligne
dataviz on
                   bordeaux open data
                     + elasticsearch
                         + d3js
                      + angular.js + google refine

mardi 19 mars 13
goal
                   • full stack dataviz
                   • front js centric with angular.js and sass
                   • d3.js -> best lib for dataviz (a bit complex)
                   • elasticsearch scalable search engine access
                     form js ajax client
                   • grab open data and reformat with google
                     refine


mardi 19 mars 13
philosophy

                   • smarter and smarter browsers will grab a
                     huge part of the stack
                   • scalable NoSQL solution talk REST: super
                     easy access
                   • you always can enrich, refine, store, model
                     data from anysource: yes you can !



mardi 19 mars 13
go open data !
                   • go to http://opendata.bordeaux.fr/
                   • we focus ou capc contemporary museum
                     of bordeaux
                   • http://opendata.bordeaux.fr/content/
                     collections-du-capc-musee-dart-
                     contemporain
                   • curl https://
                     bdxconfigogdi.blob.core.windows.net/
                     converteddata/capc.csv -o capc.csv
mardi 19 mars 13
google refine
                   • data many times isn’t perfect: humans
                     input, unsuitable model...
                   • with google refine you can tidy data, enrich
                     with web services call and so on...
                   • i think you should never think you don’t
                     have the good data for the job
                   • be smart and be creativ: you have
                     everything you need, all the time !
mardi 19 mars 13
import data in
                            elasticsearch
                   • bulk api import for es is handy
                   • http://www.elasticsearch.org/guide/
                     reference/api/bulk.html
                   • we use templating in google refine to
                     export data to the correct format
                   • { "index" : { "_index" : "musees", "_type" :
                     "capc"} } [snipp]


mardi 19 mars 13
elasticsearch

                   • scalable search engine
                   • adding more power == adding more nodes
                   • sharding
                   • replication
                   • fault tolerant

mardi 19 mars 13
elasticsearch
                   • store unstructured document (json) in
                     indexes (NoSQL way)
                   • talk REST (api)
                   • advanced query langage
                   • multiple analyzers (tokens, languages, etc...)
                   • blazing fast !
                   • no alternativ solutions (to my advice) (and
                     kimshy advice too ;) )
mardi 19 mars 13
yeoman
                   • perfect tool for the new browser focus
                     stack coming soon
                   • yeoman init angular
                   • yeoman init angular:route   capc
                   • yeoman install d3
                   • yeoman install jquery
                   • yeoman server -> yeah !!!
mardi 19 mars 13
angular.js

                   • mvc framework for browser (js)
                   • by google
                   • rely better on html doc
                   • cleaner controller handling than backbone
                     (to my advice)



mardi 19 mars 13
d3.js

                   • best javascript lib for dataviz (to my advice)
                   • slow learning curve
                   • based on svg browser markup
                   • data and dom oriented
                   • generic (go low level with svg easily)

mardi 19 mars 13
dataviz
                   • make appears informations from pure data
                   • you should explore, analyse and be creativ
                     to grab the most value possible from the
                     data: go as far as you can
                   • tables (excell way) -> forgive
                   • visualization will reveal your data: data is
                     made to be know to take decision, manage,
                     understand

mardi 19 mars 13
open data

                   • today, wide range of datas come to the
                     public domain
                   • again, without computing and vizualisation,
                     data has a poor value
                   • open data without data scientists and data
                     visualization has no futur



mardi 19 mars 13
futur
                   • with low bandwith, storage and computing
                      cost, you can grab open data from a lot of
                      sources
                   • you can put them in big data store and
                      make analysis, relations between, with open
                      source technologies
                   • you can then share this to the word with
                      data visualization on your website, blogs...

mardi 19 mars 13
                   • amazing isn’t it ?? !!!!!
questions ? infos ?

                   • https://github.com/mathieue
                   • http://www.mathieu-elie.net/
                   • https://twitter.com/mathieuel
                   • https://www.facebook.com/mathieuelietech

mardi 19 mars 13

Contenu connexe

Tendances

CouchDB Open Source Bridge
CouchDB Open Source BridgeCouchDB Open Source Bridge
CouchDB Open Source Bridge
Chris Anderson
 
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Sammy Fung
 
NoSQL: Why, When, and How
NoSQL: Why, When, and HowNoSQL: Why, When, and How
NoSQL: Why, When, and How
BigBlueHat
 

Tendances (20)

On Again; Off Again - Benjamin Young - ebookcraft 2017
On Again; Off Again - Benjamin Young - ebookcraft 2017On Again; Off Again - Benjamin Young - ebookcraft 2017
On Again; Off Again - Benjamin Young - ebookcraft 2017
 
CouchDB Open Source Bridge
CouchDB Open Source BridgeCouchDB Open Source Bridge
CouchDB Open Source Bridge
 
Mongo db php_shaken_not_stirred_joomlafrappe
Mongo db php_shaken_not_stirred_joomlafrappeMongo db php_shaken_not_stirred_joomlafrappe
Mongo db php_shaken_not_stirred_joomlafrappe
 
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
Web scraping 1 2-3 with python + scrapy (Summer BarCampHK 2012 version)
 
NoSQL: Why, When, and How
NoSQL: Why, When, and HowNoSQL: Why, When, and How
NoSQL: Why, When, and How
 
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
 
Introduction to Elasticsearch
Introduction to ElasticsearchIntroduction to Elasticsearch
Introduction to Elasticsearch
 
Elasticsearch Introduction at BigData meetup
Elasticsearch Introduction at BigData meetupElasticsearch Introduction at BigData meetup
Elasticsearch Introduction at BigData meetup
 
Drupal 6 to 7 migration
Drupal 6 to 7 migrationDrupal 6 to 7 migration
Drupal 6 to 7 migration
 
Apache CouchDB Presentation @ Sept. 2104 GTALUG Meeting
Apache CouchDB Presentation @ Sept. 2104 GTALUG MeetingApache CouchDB Presentation @ Sept. 2104 GTALUG Meeting
Apache CouchDB Presentation @ Sept. 2104 GTALUG Meeting
 
Scrapy.for.dummies
Scrapy.for.dummiesScrapy.for.dummies
Scrapy.for.dummies
 
Simple search with elastic search
Simple search with elastic searchSimple search with elastic search
Simple search with elastic search
 
Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh...
Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh...Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh...
Spiders, Chatbots, and the Future of Metadata: A look inside the BNC BiblioSh...
 
NoSQL - An introduction to CouchDB
NoSQL - An introduction to CouchDBNoSQL - An introduction to CouchDB
NoSQL - An introduction to CouchDB
 
Web History 101, or How the Future is Unwritten
Web History 101, or How the Future is UnwrittenWeb History 101, or How the Future is Unwritten
Web History 101, or How the Future is Unwritten
 
Elasticsearch: You know, for search! and more!
Elasticsearch: You know, for search! and more!Elasticsearch: You know, for search! and more!
Elasticsearch: You know, for search! and more!
 
An Introduction to Elastic Search.
An Introduction to Elastic Search.An Introduction to Elastic Search.
An Introduction to Elastic Search.
 
Managing Your Content with Elasticsearch
Managing Your Content with ElasticsearchManaging Your Content with Elasticsearch
Managing Your Content with Elasticsearch
 
What I learnt: Elastic search & Kibana : introduction, installtion & configur...
What I learnt: Elastic search & Kibana : introduction, installtion & configur...What I learnt: Elastic search & Kibana : introduction, installtion & configur...
What I learnt: Elastic search & Kibana : introduction, installtion & configur...
 
FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ...
FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ...FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ...
FIFA fails, Guy Kawasaki and real estate in SF - find out about all three by ...
 

Similaire à dataviz on d3.js + elasticsearch

Mechanical curator - Technical notes
Mechanical curator - Technical notesMechanical curator - Technical notes
Mechanical curator - Technical notes
benosteen
 
Wsrest 2013
Wsrest 2013Wsrest 2013
Wsrest 2013
Caelum
 
neurisa_11_09_rosenthal
neurisa_11_09_rosenthalneurisa_11_09_rosenthal
neurisa_11_09_rosenthal
tutorialsruby
 
neurisa_11_09_rosenthal
neurisa_11_09_rosenthalneurisa_11_09_rosenthal
neurisa_11_09_rosenthal
tutorialsruby
 

Similaire à dataviz on d3.js + elasticsearch (20)

Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack
 
Data collection & communication
Data collection & communicationData collection & communication
Data collection & communication
 
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
 
Mechanical curator - Technical notes
Mechanical curator - Technical notesMechanical curator - Technical notes
Mechanical curator - Technical notes
 
From Developer to Data Scientist
From Developer to Data ScientistFrom Developer to Data Scientist
From Developer to Data Scientist
 
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow ZurichHow to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich
How to use NoSQL in Enterprise Java Applications - NoSQL Roadshow Zurich
 
Wsrest 2013
Wsrest 2013Wsrest 2013
Wsrest 2013
 
Lean & agile with MongoDB
Lean & agile with MongoDBLean & agile with MongoDB
Lean & agile with MongoDB
 
neurisa_11_09_rosenthal
neurisa_11_09_rosenthalneurisa_11_09_rosenthal
neurisa_11_09_rosenthal
 
neurisa_11_09_rosenthal
neurisa_11_09_rosenthalneurisa_11_09_rosenthal
neurisa_11_09_rosenthal
 
Pre-Aggregated Analytics And Social Feeds Using MongoDB
Pre-Aggregated Analytics And Social Feeds Using MongoDBPre-Aggregated Analytics And Social Feeds Using MongoDB
Pre-Aggregated Analytics And Social Feeds Using MongoDB
 
Sql saturday el salvador 2016 - Me, A Data Scientist?
Sql saturday el salvador 2016 - Me, A Data Scientist?Sql saturday el salvador 2016 - Me, A Data Scientist?
Sql saturday el salvador 2016 - Me, A Data Scientist?
 
Data Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps Approach
 
Big data 101
Big data 101Big data 101
Big data 101
 
Real time data viz with Spark Streaming, Kafka and D3.js
Real time data viz with Spark Streaming, Kafka and D3.jsReal time data viz with Spark Streaming, Kafka and D3.js
Real time data viz with Spark Streaming, Kafka and D3.js
 
Trending with Purpose
Trending with PurposeTrending with Purpose
Trending with Purpose
 
Graph Databases
Graph DatabasesGraph Databases
Graph Databases
 
Couchbase
CouchbaseCouchbase
Couchbase
 
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
 
NoSQL-Overview
NoSQL-OverviewNoSQL-Overview
NoSQL-Overview
 

dataviz on d3.js + elasticsearch

  • 1. dataviz on bordeaux open data + elasticsearch + d3js + angular.js + google refine mardi 19 mars 13
  • 2. goal • full stack dataviz • front js centric with angular.js and sass • d3.js -> best lib for dataviz (a bit complex) • elasticsearch scalable search engine access form js ajax client • grab open data and reformat with google refine mardi 19 mars 13
  • 3. philosophy • smarter and smarter browsers will grab a huge part of the stack • scalable NoSQL solution talk REST: super easy access • you always can enrich, refine, store, model data from anysource: yes you can ! mardi 19 mars 13
  • 4. go open data ! • go to http://opendata.bordeaux.fr/ • we focus ou capc contemporary museum of bordeaux • http://opendata.bordeaux.fr/content/ collections-du-capc-musee-dart- contemporain • curl https:// bdxconfigogdi.blob.core.windows.net/ converteddata/capc.csv -o capc.csv mardi 19 mars 13
  • 5. google refine • data many times isn’t perfect: humans input, unsuitable model... • with google refine you can tidy data, enrich with web services call and so on... • i think you should never think you don’t have the good data for the job • be smart and be creativ: you have everything you need, all the time ! mardi 19 mars 13
  • 6. import data in elasticsearch • bulk api import for es is handy • http://www.elasticsearch.org/guide/ reference/api/bulk.html • we use templating in google refine to export data to the correct format • { "index" : { "_index" : "musees", "_type" : "capc"} } [snipp] mardi 19 mars 13
  • 7. elasticsearch • scalable search engine • adding more power == adding more nodes • sharding • replication • fault tolerant mardi 19 mars 13
  • 8. elasticsearch • store unstructured document (json) in indexes (NoSQL way) • talk REST (api) • advanced query langage • multiple analyzers (tokens, languages, etc...) • blazing fast ! • no alternativ solutions (to my advice) (and kimshy advice too ;) ) mardi 19 mars 13
  • 9. yeoman • perfect tool for the new browser focus stack coming soon • yeoman init angular • yeoman init angular:route capc • yeoman install d3 • yeoman install jquery • yeoman server -> yeah !!! mardi 19 mars 13
  • 10. angular.js • mvc framework for browser (js) • by google • rely better on html doc • cleaner controller handling than backbone (to my advice) mardi 19 mars 13
  • 11. d3.js • best javascript lib for dataviz (to my advice) • slow learning curve • based on svg browser markup • data and dom oriented • generic (go low level with svg easily) mardi 19 mars 13
  • 12. dataviz • make appears informations from pure data • you should explore, analyse and be creativ to grab the most value possible from the data: go as far as you can • tables (excell way) -> forgive • visualization will reveal your data: data is made to be know to take decision, manage, understand mardi 19 mars 13
  • 13. open data • today, wide range of datas come to the public domain • again, without computing and vizualisation, data has a poor value • open data without data scientists and data visualization has no futur mardi 19 mars 13
  • 14. futur • with low bandwith, storage and computing cost, you can grab open data from a lot of sources • you can put them in big data store and make analysis, relations between, with open source technologies • you can then share this to the word with data visualization on your website, blogs... mardi 19 mars 13 • amazing isn’t it ?? !!!!!
  • 15. questions ? infos ? • https://github.com/mathieue • http://www.mathieu-elie.net/ • https://twitter.com/mathieuel • https://www.facebook.com/mathieuelietech mardi 19 mars 13