SlideShare une entreprise Scribd logo
1  sur  71
BIG DATA Open Source AnalyticsBIG DATA Open Source Analytics
Table of Contents
Stratebi
Introduction
Introduction to Big Data
Current issues
Scalability
Databases
Big Data history
Big Data diagram
Tools
Hadoop
Hbase
Hive
So… what else?
About us - Stratebi
Customers trusting in Open Source Business Intelligence
Private Sector
Public Sector
Open Source Big Data - Stratebi
Understanding information…Understanding information…
Open Source Big Data - Stratebi
Data was not stored
Beginning of the use of DBs
and basic reports
Business Intelligence.
Great variety of visual resources
to analyze data
Open Source Big Data- Stratebi
Data analysis profits:
Competitive advantages
Customer satisfaction evaluation
Business process improvement
Increase sales
…
Open Source Big Data - Stratebi
New data analysis techniques and processesNew data analysis techniques and processes
New BI solutions
New visual resources
New data sources
Cloud solutions
Latest trends
Social Intelligence
Mailing intelligence
…
Open Source Big Data - Stratebi
Corporations and organizations noticeCorporations and organizations notice
that…that…
Open Source Big Data - Stratebi
Open Source Big Data - Stratebi
Open Source Big Data - Stratebi
Data analysis to increase performance and be faster
Open Source Big Data - Stratebi
Lap telemetry Monaco Grand Prix (total 78 laps)
Open Source Big Data - Stratebi
So…So…
What is consideredWhat is considered
Big DataBig Data??
Open Source Big Data - Stratebi
Open Source Big Data - Stratebi
Big Data ArchitectureBig Data Architecture
Open Source Big Data - Stratebi
Scalability
Vertical
+ CPU
+ RAM
Data types
Structured
Unstructured
Current challengesCurrent challenges
Horizontal
More nodes
Open Source Big Data - Stratebi
Unstructured
Structured
Data typesData types
A data structure is a particular way of storing and organizing
data in a computer so that it can be used efficiently.
List: http://en.wikipedia.org/wiki/List_of_data_structures
Primitive data types: Boolean, chart, float, double …
Unstructured information refers to information that either does
not have a pre-defined data model or is not organized in a
pre-defined manner.
Open Source Big Data - Stratebi
Data read
High data read cost in
JOINS
Massive Joins
Relational model
Current challengesCurrent challenges
Transactional
Are transactions required
and consistent?
Can it be represented as a relational
model?
Open Source Big Data - Stratebi
Types of Big Data DBs. Not Only SQL (NoSQL)Types of Big Data DBs. Not Only SQL (NoSQL)
In response to these problems a NoSQL paradigm appeared.
NoSQL is not a substitute for relational databases
Instead it is used in other specific scenarios
Not all problems can be solved using a RDBMS
Developer has a range of possibilities and can select the best to deal
with a specific problem
There are several NoSQL systems focusing on typical issues (scaling,
increasing performance…) in a different way
Open Source Big Data - Stratebi
Types of Big Data DBs. Not Only SQL (NoSQL)Types of Big Data DBs. Not Only SQL (NoSQL)
Key-Value data stores
Columnar
databases
Document-oriented
databases
Graph databasesObject oriented
databases
Do not replace relational model. Specific scenarios.Do not replace relational model. Specific scenarios.
Open Source Big Data - Stratebi
Key-Value stores
Easy to use
Value stored in a collection of binary
data (BLOB)
Content is not relevant to database,
only the key and its associated value
are important
No schema required (columns, data
types) to store information
Scalability: from key X to X+100 in Server 1, from X+101 to X+200 in Server2
Open Source Big Data - Stratebi
Document-oriented databases
Key-value store with the special feature that store is not stored
with a predefined format and not as a binary field.
Open Source Big Data - Stratebi
Object oriented databases
Systems in which information is represented in the form
of objects
Based in OID and not in primary keys
Hierarchical relations can be represented
Object-oriented database management systems never had the expected
impact, but have several market niches such as some scientific applications
Open Source Big Data - Stratebi
Graph databases
Graph structures with nodes, edges, and properties used
to represent and store data
Compared with relational databases, graph databases
are often faster for associative data sets
Only useful if your data can be represented using a
network
Open Source Big Data - Stratebi
Columnar databases
Column databases store data tables as sections of columns of
data rather than as rows of data.
Reduce read time
Inefficient on writing operations
Used in data warehouses and
Business Intelligence systems
Ideal for calculating indicators
over aggregated data
Open Source Big Data - Stratebi
Are these DBs?Are these DBs?
Open Source Big Data - Stratebi
A brief historical review…
First Google implementations needed multiplying
huge matrices to calculate PageRanks
In order to manage big data sets algorithms and frameworks
capable of processing terabytes were created
An early application able
to carry out MapReduce
data processing paradigm
was implemented in
Hadoop, initially designed
by Doug Cutting
Open Source Big Data - Stratebi
Software framework that supports distributed applications,
licensed under the Apache v2 license.
Hadoop was derived from Google's MapReduce and
Google File System papers
is the largest contributor to the project
Written in the Java programming language
Hadoop is based in a file system and is not a database
About Apache HadoopAbout Apache Hadoop
Open Source Big Data - Stratebi
About Apache HadoopAbout Apache Hadoop
Open Source Big Data - Stratebi
Why use Hadoop?Why use Hadoop?
Need to compress data
Nodes fail every day
Common infrastructure
Efficient
Easy to use
Open Source
Open Source Big Data - Stratebi
Why use Hadoop?Why use Hadoop?
Open Source Big Data - Stratebi
Common usesCommon uses
Searches
Log processing
Recommendation systems
Analytics (Facebook, Linkedin)
Image and video processing (NASA)
Data retention
Open Source Big Data - Stratebi
Hadoop ComponentsHadoop Components
Open Source Big Data - Stratebi
HDFS file systemHDFS file system
Open Source Big Data - Stratebi
HDFS file systemHDFS file system
Hadoop Distributed File System (HDFS) is a distributed file
system
Each node in a Hadoop instance typically has a single
data node
Uses the TCP/IP layer for communication
Achieves reliability by replicating the data across
multiple hosts
Data nodes can talk to each other to rebalance data,
to move copies around, and to keep the replication of
data high
Open Source Big Data - Stratebi
MAP ReduceMAP Reduce
Consists in a Job
Tracker
Job Tracker assigns a
task to idle Task Tracker
nodes in the cluster
Open Source Big Data - Stratebi
How to do MapReduce?How to do MapReduce?
Map
The Map function is applied in parallel to every pair
in the input dataset and produces a list of pairs for
each call
Map (key1, value1) –> list (key2, value2)
Open Source Big Data - Stratebi
How to do MapReduce?How to do MapReduce?
Reduce
Reduce phase collects all pairs with the same key
from all lists and groups them together, creating
one group for each key
Reduce function is then applied in parallel to each
group created by Map() function and produces a
collection of values in the same domain
Thus the MapReduce framework converts a list of
(key, value) pairs into a list of values
Reduce (key2, list(value2)) –> list(value3)
Open Source Big Data - Stratebi
MapReduceMapReduce
Open Source Big Data - Stratebi
MapReduceMapReduce
Open Source Big Data - Stratebi
MapReduce WordCount exampleMapReduce WordCount example
Open Source Big Data - Stratebi
MapReduce WordCount exampleMapReduce WordCount example
bin/hadoop jar hadoop-*-examples.jar wordcount [-m <#maps>] [-r <#reducers>] <in-dir> <out-dir>
Open Source Big Data - Stratebi
MapReduce WordCount exampleMapReduce WordCount example
Open Source Big Data - Stratebi
Sounds difficultSounds difficult
Are there anyAre there any
tools to help us?tools to help us?
Open Source Big Data - Stratebi
What is HBase?What is HBase?
HBase is an open source distributed database modeled
after Google's BigTable
Hbase allows linear scaling by adding more servers to
the system
Runs on top of HDFS, providing BigTable-like capabilities
for Hadoop
HBase is written in Java
Open Source Big Data - Stratebi
What is HBase?What is HBase?
Hbase is suitable when you require high read/write
speeds in a BigData infrastructure.
HBase is able to store enormous tables (billions of rows
and millions of columns) in a cluster composed by basic
nodes
Working modes
Open Source Big Data - Stratebi
What is HBase?What is HBase?
Hbase commands
Open Source Big Data - Stratebi
What is Hive?What is Hive?
Hive is a data warehouse infrastructure built on top of
Hadoop for providing data summarization, query, and
analysis
Provides an SQL-like language called HiveQL while
maintaining full support for map/reduce
Built-in user defined functions (UDFs) to manipulate
dates, strings, and other data-mining tools.
Hive supports extending the UDF set to handle use-
cases not supported by built-in functions
Open Source Big Data - Stratebi
I am a complete JavaI am a complete Java
noob and need help…noob and need help…
What can I do?What can I do?
Open Source Big Data - Stratebi
Graphical ETL tool
included in Pentaho suite
Built to help in processes
of Extracting, Transporting,
Transforming and Loading
data.
Supports deployment on
single node computers as
well as on a cloud, or
cluster.
What is Kettle?What is Kettle?
Open Source Big Data - Stratebi
• View perspective:
• Database connections
• Steps
• Hops
• Slave server
• Kettle cluster schemas
• Design perspective:
• Inputs
• Outputs
• Lookups
• Transform
• Joins
• Scripting
• Data Warehouse
• Mapping
• Job
• Inline
• Experimental
Open Source Big Data - Stratebi
Main Big Data steps in KettleMain Big Data steps in Kettle
Open Source Big Data - Stratebi
Word Count exampleWord Count example
Open Source Big Data - Stratebi
Word Count exampleWord Count example
Configuring MapReduceConfiguring MapReduce
Open Source Big Data - Stratebi
Word Count exampleWord Count example
Configuring MapReduceConfiguring MapReduce
Open Source Big Data - Stratebi
Word Count exampleWord Count example
Open Source Big Data - Stratebi
Word Count exampleWord Count example
Open Source Big Data - Stratebi
Configuring MapReduce with HbaseConfiguring MapReduce with Hbase
Open Source Big Data - Stratebi
Configuring MapReduce with HbaseConfiguring MapReduce with Hbase
Open Source Big Data - Stratebi
Using Hive as data sourceUsing Hive as data source
Open Source Big Data - Stratebi
Big Data project and Business IntelligenceBig Data project and Business Intelligence
Open Source Big Data - Stratebi
Big Data project and Business IntelligenceBig Data project and Business Intelligence
Open Source Big Data - Stratebi
Big Data project and Business Intelligence.Big Data project and Business Intelligence.
Smart City Case StudySmart City Case Study
Open Source Big Data - Stratebi
Visualization – Social Media dashboardsVisualization – Social Media dashboards
Open Source Big Data - Stratebi
Visualization – Operational dashboardVisualization – Operational dashboard
Open Source Big Data - Stratebi
Visualization – Operational dashboardVisualization – Operational dashboard
Open Source Big Data - Stratebi
Visualization- Geographic dashboardVisualization- Geographic dashboard
Open Source Big Data - Stratebi
Visualization – Advanced charts (Treemap, Sunburst ...)Visualization – Advanced charts (Treemap, Sunburst ...)
Open Source Big Data - Stratebi
Open Source Big Data - Stratebi
Stratebi is a Spanish company located in Madrid,
Barcelona and with a delegation in Sao Paulo, we are
a group of professionals with a wide experience in
Information systems and Technologic solutions related
to the field of open source software and Business
Intelligence.
Contact details:
info@stratebi.com
www.stratebi.com
Phones: (+34) 917883410 - (+34) 931844325
About usAbout us

Contenu connexe

Tendances

Big Data Final Presentation
Big Data Final PresentationBig Data Final Presentation
Big Data Final Presentation17aroumougamh
 
BigData Analytics with Hadoop and BIRT
BigData Analytics with Hadoop and BIRTBigData Analytics with Hadoop and BIRT
BigData Analytics with Hadoop and BIRTAmrit Chhetri
 
Introduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-SystemIntroduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-SystemMd. Hasan Basri (Angel)
 
Introduction to Big Data Hadoop Training Online by www.itjobzone.biz
Introduction to Big Data Hadoop Training Online by www.itjobzone.bizIntroduction to Big Data Hadoop Training Online by www.itjobzone.biz
Introduction to Big Data Hadoop Training Online by www.itjobzone.bizITJobZone.biz
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry PerspectiveCloudera, Inc.
 
Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP vinoth kumar
 
Big Data Analytics with Hadoop, MongoDB and SQL Server
Big Data Analytics with Hadoop, MongoDB and SQL ServerBig Data Analytics with Hadoop, MongoDB and SQL Server
Big Data Analytics with Hadoop, MongoDB and SQL ServerMark Kromer
 
A Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - IntroductionA Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - Introductionsaisreealekhya
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataVipin Batra
 
Whatisbigdataandwhylearnhadoop
WhatisbigdataandwhylearnhadoopWhatisbigdataandwhylearnhadoop
WhatisbigdataandwhylearnhadoopEdureka!
 
Hadoop and big data
Hadoop and big dataHadoop and big data
Hadoop and big dataYukti Kaura
 
Big data introduction, Hadoop in details
Big data introduction, Hadoop in detailsBig data introduction, Hadoop in details
Big data introduction, Hadoop in detailsMahmoud Yassin
 
Big data analytics with hadoop volume 2
Big data analytics with hadoop volume 2Big data analytics with hadoop volume 2
Big data analytics with hadoop volume 2Imviplav
 
Introduction to BIg Data and Hadoop
Introduction to BIg Data and HadoopIntroduction to BIg Data and Hadoop
Introduction to BIg Data and HadoopAmir Shaikh
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 
Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabatinabati
 

Tendances (20)

Big Data Final Presentation
Big Data Final PresentationBig Data Final Presentation
Big Data Final Presentation
 
BigData Analytics with Hadoop and BIRT
BigData Analytics with Hadoop and BIRTBigData Analytics with Hadoop and BIRT
BigData Analytics with Hadoop and BIRT
 
Big data abstract
Big data abstractBig data abstract
Big data abstract
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Introduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-SystemIntroduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-System
 
Introduction to Big Data Hadoop Training Online by www.itjobzone.biz
Introduction to Big Data Hadoop Training Online by www.itjobzone.bizIntroduction to Big Data Hadoop Training Online by www.itjobzone.biz
Introduction to Big Data Hadoop Training Online by www.itjobzone.biz
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry Perspective
 
Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP
 
Big Data Analytics with Hadoop, MongoDB and SQL Server
Big Data Analytics with Hadoop, MongoDB and SQL ServerBig Data Analytics with Hadoop, MongoDB and SQL Server
Big Data Analytics with Hadoop, MongoDB and SQL Server
 
A Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - IntroductionA Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - Introduction
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
A data analyst view of Bigdata
A data analyst view of Bigdata A data analyst view of Bigdata
A data analyst view of Bigdata
 
Whatisbigdataandwhylearnhadoop
WhatisbigdataandwhylearnhadoopWhatisbigdataandwhylearnhadoop
Whatisbigdataandwhylearnhadoop
 
Hadoop and big data
Hadoop and big dataHadoop and big data
Hadoop and big data
 
Big data introduction, Hadoop in details
Big data introduction, Hadoop in detailsBig data introduction, Hadoop in details
Big data introduction, Hadoop in details
 
Big data analytics with hadoop volume 2
Big data analytics with hadoop volume 2Big data analytics with hadoop volume 2
Big data analytics with hadoop volume 2
 
Introduction to BIg Data and Hadoop
Introduction to BIg Data and HadoopIntroduction to BIg Data and Hadoop
Introduction to BIg Data and Hadoop
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Big data analytics - hadoop
Big data analytics - hadoopBig data analytics - hadoop
Big data analytics - hadoop
 
Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabati
 

Similaire à Big Data Analytics 2014

Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBhavya Gulati
 
Hadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHitendra Kumar
 
Haddop in Business Intelligence
Haddop in Business IntelligenceHaddop in Business Intelligence
Haddop in Business IntelligenceHGanesh
 
Hadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | SysforeHadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | SysforeSysfore Technologies
 
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and FacebookHow Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and FacebookAmr Awadallah
 
Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010nzhang
 
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune amrutupre
 
Managing Big data with Hadoop
Managing Big data with HadoopManaging Big data with Hadoop
Managing Big data with HadoopNalini Mehta
 
BIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdfBIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdfDIVYA370851
 
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...Cognizant
 

Similaire à Big Data Analytics 2014 (20)

Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edge
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data and tools
Big data and tools Big data and tools
Big data and tools
 
Hadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log Processing
 
Haddop in Business Intelligence
Haddop in Business IntelligenceHaddop in Business Intelligence
Haddop in Business Intelligence
 
Hadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | SysforeHadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | Sysfore
 
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and FacebookHow Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
 
Case study on big data
Case study on big dataCase study on big data
Case study on big data
 
Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
 
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
 
NoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and AnalyticsNoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and Analytics
 
Managing Big data with Hadoop
Managing Big data with HadoopManaging Big data with Hadoop
Managing Big data with Hadoop
 
Data science big data and analytics
Data science big data and analyticsData science big data and analytics
Data science big data and analytics
 
BIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdfBIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdf
 
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
 
paper
paperpaper
paper
 
Hadoop
HadoopHadoop
Hadoop
 
G017143640
G017143640G017143640
G017143640
 

Plus de Stratebi

Destinos turisticos inteligentes
Destinos turisticos inteligentesDestinos turisticos inteligentes
Destinos turisticos inteligentesStratebi
 
Azure Synapse
Azure SynapseAzure Synapse
Azure SynapseStratebi
 
Options for Dashboards with Python
Options for Dashboards with PythonOptions for Dashboards with Python
Options for Dashboards with PythonStratebi
 
Dashboards with Python
Dashboards with PythonDashboards with Python
Dashboards with PythonStratebi
 
PowerBI Tips y buenas practicas
PowerBI Tips y buenas practicasPowerBI Tips y buenas practicas
PowerBI Tips y buenas practicasStratebi
 
Machine Learning Meetup Spain
Machine Learning Meetup SpainMachine Learning Meetup Spain
Machine Learning Meetup SpainStratebi
 
LinceBI IIoT (Industrial Internet of Things)
LinceBI IIoT (Industrial Internet of Things)LinceBI IIoT (Industrial Internet of Things)
LinceBI IIoT (Industrial Internet of Things)Stratebi
 
SAP - PowerBI integration
SAP - PowerBI integrationSAP - PowerBI integration
SAP - PowerBI integrationStratebi
 
Aplicaciones Big Data Marketing
Aplicaciones Big Data MarketingAplicaciones Big Data Marketing
Aplicaciones Big Data MarketingStratebi
 
A federated information infrastructure that works
A federated information infrastructure that works A federated information infrastructure that works
A federated information infrastructure that works Stratebi
 
9 problemas en proyectos Data Analytics
9 problemas en proyectos Data Analytics9 problemas en proyectos Data Analytics
9 problemas en proyectos Data AnalyticsStratebi
 
PowerBI: Soluciones, Aplicaciones y Cursos
PowerBI: Soluciones, Aplicaciones y CursosPowerBI: Soluciones, Aplicaciones y Cursos
PowerBI: Soluciones, Aplicaciones y CursosStratebi
 
Sports Analytics
Sports AnalyticsSports Analytics
Sports AnalyticsStratebi
 
Vertica Extreme Analysis
Vertica Extreme AnalysisVertica Extreme Analysis
Vertica Extreme AnalysisStratebi
 
Businesss Intelligence con Vertica y PowerBI
Businesss Intelligence con Vertica y PowerBIBusinesss Intelligence con Vertica y PowerBI
Businesss Intelligence con Vertica y PowerBIStratebi
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overviewStratebi
 
Talend Cloud en detalle
Talend Cloud en detalleTalend Cloud en detalle
Talend Cloud en detalleStratebi
 
Master Data Management (MDM) con Talend
Master Data Management (MDM) con TalendMaster Data Management (MDM) con Talend
Master Data Management (MDM) con TalendStratebi
 
Talend Introducion
Talend IntroducionTalend Introducion
Talend IntroducionStratebi
 
Talent Analytics
Talent AnalyticsTalent Analytics
Talent AnalyticsStratebi
 

Plus de Stratebi (20)

Destinos turisticos inteligentes
Destinos turisticos inteligentesDestinos turisticos inteligentes
Destinos turisticos inteligentes
 
Azure Synapse
Azure SynapseAzure Synapse
Azure Synapse
 
Options for Dashboards with Python
Options for Dashboards with PythonOptions for Dashboards with Python
Options for Dashboards with Python
 
Dashboards with Python
Dashboards with PythonDashboards with Python
Dashboards with Python
 
PowerBI Tips y buenas practicas
PowerBI Tips y buenas practicasPowerBI Tips y buenas practicas
PowerBI Tips y buenas practicas
 
Machine Learning Meetup Spain
Machine Learning Meetup SpainMachine Learning Meetup Spain
Machine Learning Meetup Spain
 
LinceBI IIoT (Industrial Internet of Things)
LinceBI IIoT (Industrial Internet of Things)LinceBI IIoT (Industrial Internet of Things)
LinceBI IIoT (Industrial Internet of Things)
 
SAP - PowerBI integration
SAP - PowerBI integrationSAP - PowerBI integration
SAP - PowerBI integration
 
Aplicaciones Big Data Marketing
Aplicaciones Big Data MarketingAplicaciones Big Data Marketing
Aplicaciones Big Data Marketing
 
A federated information infrastructure that works
A federated information infrastructure that works A federated information infrastructure that works
A federated information infrastructure that works
 
9 problemas en proyectos Data Analytics
9 problemas en proyectos Data Analytics9 problemas en proyectos Data Analytics
9 problemas en proyectos Data Analytics
 
PowerBI: Soluciones, Aplicaciones y Cursos
PowerBI: Soluciones, Aplicaciones y CursosPowerBI: Soluciones, Aplicaciones y Cursos
PowerBI: Soluciones, Aplicaciones y Cursos
 
Sports Analytics
Sports AnalyticsSports Analytics
Sports Analytics
 
Vertica Extreme Analysis
Vertica Extreme AnalysisVertica Extreme Analysis
Vertica Extreme Analysis
 
Businesss Intelligence con Vertica y PowerBI
Businesss Intelligence con Vertica y PowerBIBusinesss Intelligence con Vertica y PowerBI
Businesss Intelligence con Vertica y PowerBI
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Talend Cloud en detalle
Talend Cloud en detalleTalend Cloud en detalle
Talend Cloud en detalle
 
Master Data Management (MDM) con Talend
Master Data Management (MDM) con TalendMaster Data Management (MDM) con Talend
Master Data Management (MDM) con Talend
 
Talend Introducion
Talend IntroducionTalend Introducion
Talend Introducion
 
Talent Analytics
Talent AnalyticsTalent Analytics
Talent Analytics
 

Dernier

Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...
Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...
Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...Associazione Digital Days
 
Pitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deckPitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deckHajeJanKamps
 
Planetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in LifePlanetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in LifeBhavana Pujan Kendra
 
Onemonitar Android Spy App Features: Explore Advanced Monitoring Capabilities
Onemonitar Android Spy App Features: Explore Advanced Monitoring CapabilitiesOnemonitar Android Spy App Features: Explore Advanced Monitoring Capabilities
Onemonitar Android Spy App Features: Explore Advanced Monitoring CapabilitiesOne Monitar
 
Memorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMMemorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMVoces Mineras
 
Entrepreneurship lessons in Philippines
Entrepreneurship lessons in  PhilippinesEntrepreneurship lessons in  Philippines
Entrepreneurship lessons in PhilippinesDavidSamuel525586
 
How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...
How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...
How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...Hector Del Castillo, CPM, CPMM
 
How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...
How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...
How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...SOFTTECHHUB
 
Welding Electrode Making Machine By Deccan Dynamics
Welding Electrode Making Machine By Deccan DynamicsWelding Electrode Making Machine By Deccan Dynamics
Welding Electrode Making Machine By Deccan DynamicsIndiaMART InterMESH Limited
 
Send Files | Sendbig.comSend Files | Sendbig.com
Send Files | Sendbig.comSend Files | Sendbig.comSend Files | Sendbig.comSend Files | Sendbig.com
Send Files | Sendbig.comSend Files | Sendbig.comSendBig4
 
Appkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxAppkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxappkodes
 
Jewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource CentreJewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource CentreNZSG
 
trending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdf
trending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdftrending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdf
trending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdfMintel Group
 
Supercharge Your eCommerce Stores-acowebs
Supercharge Your eCommerce Stores-acowebsSupercharge Your eCommerce Stores-acowebs
Supercharge Your eCommerce Stores-acowebsGOKUL JS
 
20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdf
20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdf20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdf
20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdfChris Skinner
 
BAILMENT & PLEDGE business law notes.pptx
BAILMENT & PLEDGE business law notes.pptxBAILMENT & PLEDGE business law notes.pptx
BAILMENT & PLEDGE business law notes.pptxran17april2001
 
WSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdfWSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdfJamesConcepcion7
 
WSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdfWSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdfJamesConcepcion7
 
Healthcare Feb. & Mar. Healthcare Newsletter
Healthcare Feb. & Mar. Healthcare NewsletterHealthcare Feb. & Mar. Healthcare Newsletter
Healthcare Feb. & Mar. Healthcare NewsletterJamesConcepcion7
 

Dernier (20)

Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...
Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...
Lucia Ferretti, Lead Business Designer; Matteo Meschini, Business Designer @T...
 
Pitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deckPitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deck
 
Planetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in LifePlanetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in Life
 
Onemonitar Android Spy App Features: Explore Advanced Monitoring Capabilities
Onemonitar Android Spy App Features: Explore Advanced Monitoring CapabilitiesOnemonitar Android Spy App Features: Explore Advanced Monitoring Capabilities
Onemonitar Android Spy App Features: Explore Advanced Monitoring Capabilities
 
Memorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMMemorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQM
 
Entrepreneurship lessons in Philippines
Entrepreneurship lessons in  PhilippinesEntrepreneurship lessons in  Philippines
Entrepreneurship lessons in Philippines
 
How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...
How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...
How Generative AI Is Transforming Your Business | Byond Growth Insights | Apr...
 
How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...
How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...
How To Simplify Your Scheduling with AI Calendarfly The Hassle-Free Online Bo...
 
Welding Electrode Making Machine By Deccan Dynamics
Welding Electrode Making Machine By Deccan DynamicsWelding Electrode Making Machine By Deccan Dynamics
Welding Electrode Making Machine By Deccan Dynamics
 
Send Files | Sendbig.comSend Files | Sendbig.com
Send Files | Sendbig.comSend Files | Sendbig.comSend Files | Sendbig.comSend Files | Sendbig.com
Send Files | Sendbig.comSend Files | Sendbig.com
 
The Bizz Quiz-E-Summit-E-Cell-IITPatna.pptx
The Bizz Quiz-E-Summit-E-Cell-IITPatna.pptxThe Bizz Quiz-E-Summit-E-Cell-IITPatna.pptx
The Bizz Quiz-E-Summit-E-Cell-IITPatna.pptx
 
Appkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxAppkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptx
 
Jewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource CentreJewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource Centre
 
trending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdf
trending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdftrending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdf
trending-flavors-and-ingredients-in-salty-snacks-us-2024_Redacted-V2.pdf
 
Supercharge Your eCommerce Stores-acowebs
Supercharge Your eCommerce Stores-acowebsSupercharge Your eCommerce Stores-acowebs
Supercharge Your eCommerce Stores-acowebs
 
20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdf
20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdf20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdf
20220816-EthicsGrade_Scorecard-JP_Morgan_Chase-Q2-63_57.pdf
 
BAILMENT & PLEDGE business law notes.pptx
BAILMENT & PLEDGE business law notes.pptxBAILMENT & PLEDGE business law notes.pptx
BAILMENT & PLEDGE business law notes.pptx
 
WSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdfWSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdf
 
WSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdfWSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdf
 
Healthcare Feb. & Mar. Healthcare Newsletter
Healthcare Feb. & Mar. Healthcare NewsletterHealthcare Feb. & Mar. Healthcare Newsletter
Healthcare Feb. & Mar. Healthcare Newsletter
 

Big Data Analytics 2014

  • 1. BIG DATA Open Source AnalyticsBIG DATA Open Source Analytics
  • 2. Table of Contents Stratebi Introduction Introduction to Big Data Current issues Scalability Databases Big Data history Big Data diagram Tools Hadoop Hbase Hive So… what else?
  • 3. About us - Stratebi
  • 4. Customers trusting in Open Source Business Intelligence Private Sector Public Sector
  • 5. Open Source Big Data - Stratebi Understanding information…Understanding information…
  • 6. Open Source Big Data - Stratebi Data was not stored Beginning of the use of DBs and basic reports Business Intelligence. Great variety of visual resources to analyze data
  • 7. Open Source Big Data- Stratebi Data analysis profits: Competitive advantages Customer satisfaction evaluation Business process improvement Increase sales …
  • 8. Open Source Big Data - Stratebi New data analysis techniques and processesNew data analysis techniques and processes New BI solutions New visual resources New data sources Cloud solutions Latest trends Social Intelligence Mailing intelligence …
  • 9. Open Source Big Data - Stratebi Corporations and organizations noticeCorporations and organizations notice that…that…
  • 10. Open Source Big Data - Stratebi
  • 11. Open Source Big Data - Stratebi
  • 12. Open Source Big Data - Stratebi Data analysis to increase performance and be faster
  • 13. Open Source Big Data - Stratebi Lap telemetry Monaco Grand Prix (total 78 laps)
  • 14. Open Source Big Data - Stratebi So…So… What is consideredWhat is considered Big DataBig Data??
  • 15. Open Source Big Data - Stratebi
  • 16. Open Source Big Data - Stratebi Big Data ArchitectureBig Data Architecture
  • 17. Open Source Big Data - Stratebi Scalability Vertical + CPU + RAM Data types Structured Unstructured Current challengesCurrent challenges Horizontal More nodes
  • 18. Open Source Big Data - Stratebi Unstructured Structured Data typesData types A data structure is a particular way of storing and organizing data in a computer so that it can be used efficiently. List: http://en.wikipedia.org/wiki/List_of_data_structures Primitive data types: Boolean, chart, float, double … Unstructured information refers to information that either does not have a pre-defined data model or is not organized in a pre-defined manner.
  • 19. Open Source Big Data - Stratebi Data read High data read cost in JOINS Massive Joins Relational model Current challengesCurrent challenges Transactional Are transactions required and consistent? Can it be represented as a relational model?
  • 20. Open Source Big Data - Stratebi Types of Big Data DBs. Not Only SQL (NoSQL)Types of Big Data DBs. Not Only SQL (NoSQL) In response to these problems a NoSQL paradigm appeared. NoSQL is not a substitute for relational databases Instead it is used in other specific scenarios Not all problems can be solved using a RDBMS Developer has a range of possibilities and can select the best to deal with a specific problem There are several NoSQL systems focusing on typical issues (scaling, increasing performance…) in a different way
  • 21. Open Source Big Data - Stratebi Types of Big Data DBs. Not Only SQL (NoSQL)Types of Big Data DBs. Not Only SQL (NoSQL) Key-Value data stores Columnar databases Document-oriented databases Graph databasesObject oriented databases Do not replace relational model. Specific scenarios.Do not replace relational model. Specific scenarios.
  • 22. Open Source Big Data - Stratebi Key-Value stores Easy to use Value stored in a collection of binary data (BLOB) Content is not relevant to database, only the key and its associated value are important No schema required (columns, data types) to store information Scalability: from key X to X+100 in Server 1, from X+101 to X+200 in Server2
  • 23. Open Source Big Data - Stratebi Document-oriented databases Key-value store with the special feature that store is not stored with a predefined format and not as a binary field.
  • 24. Open Source Big Data - Stratebi Object oriented databases Systems in which information is represented in the form of objects Based in OID and not in primary keys Hierarchical relations can be represented Object-oriented database management systems never had the expected impact, but have several market niches such as some scientific applications
  • 25. Open Source Big Data - Stratebi Graph databases Graph structures with nodes, edges, and properties used to represent and store data Compared with relational databases, graph databases are often faster for associative data sets Only useful if your data can be represented using a network
  • 26. Open Source Big Data - Stratebi Columnar databases Column databases store data tables as sections of columns of data rather than as rows of data. Reduce read time Inefficient on writing operations Used in data warehouses and Business Intelligence systems Ideal for calculating indicators over aggregated data
  • 27. Open Source Big Data - Stratebi Are these DBs?Are these DBs?
  • 28. Open Source Big Data - Stratebi A brief historical review… First Google implementations needed multiplying huge matrices to calculate PageRanks In order to manage big data sets algorithms and frameworks capable of processing terabytes were created An early application able to carry out MapReduce data processing paradigm was implemented in Hadoop, initially designed by Doug Cutting
  • 29. Open Source Big Data - Stratebi Software framework that supports distributed applications, licensed under the Apache v2 license. Hadoop was derived from Google's MapReduce and Google File System papers is the largest contributor to the project Written in the Java programming language Hadoop is based in a file system and is not a database About Apache HadoopAbout Apache Hadoop
  • 30. Open Source Big Data - Stratebi About Apache HadoopAbout Apache Hadoop
  • 31. Open Source Big Data - Stratebi Why use Hadoop?Why use Hadoop? Need to compress data Nodes fail every day Common infrastructure Efficient Easy to use Open Source
  • 32. Open Source Big Data - Stratebi Why use Hadoop?Why use Hadoop?
  • 33. Open Source Big Data - Stratebi Common usesCommon uses Searches Log processing Recommendation systems Analytics (Facebook, Linkedin) Image and video processing (NASA) Data retention
  • 34. Open Source Big Data - Stratebi Hadoop ComponentsHadoop Components
  • 35. Open Source Big Data - Stratebi HDFS file systemHDFS file system
  • 36. Open Source Big Data - Stratebi HDFS file systemHDFS file system Hadoop Distributed File System (HDFS) is a distributed file system Each node in a Hadoop instance typically has a single data node Uses the TCP/IP layer for communication Achieves reliability by replicating the data across multiple hosts Data nodes can talk to each other to rebalance data, to move copies around, and to keep the replication of data high
  • 37. Open Source Big Data - Stratebi MAP ReduceMAP Reduce Consists in a Job Tracker Job Tracker assigns a task to idle Task Tracker nodes in the cluster
  • 38. Open Source Big Data - Stratebi How to do MapReduce?How to do MapReduce? Map The Map function is applied in parallel to every pair in the input dataset and produces a list of pairs for each call Map (key1, value1) –> list (key2, value2)
  • 39. Open Source Big Data - Stratebi How to do MapReduce?How to do MapReduce? Reduce Reduce phase collects all pairs with the same key from all lists and groups them together, creating one group for each key Reduce function is then applied in parallel to each group created by Map() function and produces a collection of values in the same domain Thus the MapReduce framework converts a list of (key, value) pairs into a list of values Reduce (key2, list(value2)) –> list(value3)
  • 40. Open Source Big Data - Stratebi MapReduceMapReduce
  • 41. Open Source Big Data - Stratebi MapReduceMapReduce
  • 42. Open Source Big Data - Stratebi MapReduce WordCount exampleMapReduce WordCount example
  • 43. Open Source Big Data - Stratebi MapReduce WordCount exampleMapReduce WordCount example bin/hadoop jar hadoop-*-examples.jar wordcount [-m <#maps>] [-r <#reducers>] <in-dir> <out-dir>
  • 44. Open Source Big Data - Stratebi MapReduce WordCount exampleMapReduce WordCount example
  • 45. Open Source Big Data - Stratebi Sounds difficultSounds difficult Are there anyAre there any tools to help us?tools to help us?
  • 46. Open Source Big Data - Stratebi What is HBase?What is HBase? HBase is an open source distributed database modeled after Google's BigTable Hbase allows linear scaling by adding more servers to the system Runs on top of HDFS, providing BigTable-like capabilities for Hadoop HBase is written in Java
  • 47. Open Source Big Data - Stratebi What is HBase?What is HBase? Hbase is suitable when you require high read/write speeds in a BigData infrastructure. HBase is able to store enormous tables (billions of rows and millions of columns) in a cluster composed by basic nodes Working modes
  • 48. Open Source Big Data - Stratebi What is HBase?What is HBase? Hbase commands
  • 49. Open Source Big Data - Stratebi What is Hive?What is Hive? Hive is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis Provides an SQL-like language called HiveQL while maintaining full support for map/reduce Built-in user defined functions (UDFs) to manipulate dates, strings, and other data-mining tools. Hive supports extending the UDF set to handle use- cases not supported by built-in functions
  • 50. Open Source Big Data - Stratebi I am a complete JavaI am a complete Java noob and need help…noob and need help… What can I do?What can I do?
  • 51. Open Source Big Data - Stratebi Graphical ETL tool included in Pentaho suite Built to help in processes of Extracting, Transporting, Transforming and Loading data. Supports deployment on single node computers as well as on a cloud, or cluster. What is Kettle?What is Kettle?
  • 52. Open Source Big Data - Stratebi • View perspective: • Database connections • Steps • Hops • Slave server • Kettle cluster schemas • Design perspective: • Inputs • Outputs • Lookups • Transform • Joins • Scripting • Data Warehouse • Mapping • Job • Inline • Experimental
  • 53. Open Source Big Data - Stratebi Main Big Data steps in KettleMain Big Data steps in Kettle
  • 54. Open Source Big Data - Stratebi Word Count exampleWord Count example
  • 55. Open Source Big Data - Stratebi Word Count exampleWord Count example Configuring MapReduceConfiguring MapReduce
  • 56. Open Source Big Data - Stratebi Word Count exampleWord Count example Configuring MapReduceConfiguring MapReduce
  • 57. Open Source Big Data - Stratebi Word Count exampleWord Count example
  • 58. Open Source Big Data - Stratebi Word Count exampleWord Count example
  • 59. Open Source Big Data - Stratebi Configuring MapReduce with HbaseConfiguring MapReduce with Hbase
  • 60. Open Source Big Data - Stratebi Configuring MapReduce with HbaseConfiguring MapReduce with Hbase
  • 61. Open Source Big Data - Stratebi Using Hive as data sourceUsing Hive as data source
  • 62. Open Source Big Data - Stratebi Big Data project and Business IntelligenceBig Data project and Business Intelligence
  • 63. Open Source Big Data - Stratebi Big Data project and Business IntelligenceBig Data project and Business Intelligence
  • 64. Open Source Big Data - Stratebi Big Data project and Business Intelligence.Big Data project and Business Intelligence. Smart City Case StudySmart City Case Study
  • 65. Open Source Big Data - Stratebi Visualization – Social Media dashboardsVisualization – Social Media dashboards
  • 66. Open Source Big Data - Stratebi Visualization – Operational dashboardVisualization – Operational dashboard
  • 67. Open Source Big Data - Stratebi Visualization – Operational dashboardVisualization – Operational dashboard
  • 68. Open Source Big Data - Stratebi Visualization- Geographic dashboardVisualization- Geographic dashboard
  • 69. Open Source Big Data - Stratebi Visualization – Advanced charts (Treemap, Sunburst ...)Visualization – Advanced charts (Treemap, Sunburst ...)
  • 70. Open Source Big Data - Stratebi
  • 71. Open Source Big Data - Stratebi Stratebi is a Spanish company located in Madrid, Barcelona and with a delegation in Sao Paulo, we are a group of professionals with a wide experience in Information systems and Technologic solutions related to the field of open source software and Business Intelligence. Contact details: info@stratebi.com www.stratebi.com Phones: (+34) 917883410 - (+34) 931844325 About usAbout us