Echo State Hoeffding Tree Learning
Diego Marrón (dmarron@ac.upc.edu)
Jesse Read (jesse.read@telecom-paristech.fr)
Albert Bifet (albert.bifet@telecom-paristech.fr)
Talel Abdessalem (talel.abdessalem@telecom-paristech.fr)
Eduard Ayguadé (eduard.ayguade@bsc.es)
José R. Herrero (josepr@ac.upc.edu)
ACML 2016
Hamilton, New Zealand
Introduction ESHT Evaluations Conclusions
Introduction
• Real-time classification of Big Data streams is becoming
essential in a variety of application domains.
• Real-time classification imposes some challenges:
  • Dealing with potentially infinite streams
  • Strong temporal dependences
  • Reacting to changes in the stream
  • Bounded response time and memory
Real Time Classification
• In real-time classification:
  • The Hoeffding Tree (HT) is the state-of-the-art streaming decision tree
  • HTs are powerful and easy to deploy (no hyper-parameters to tune)
  • But they are unable to capture strong temporal dependences
• Recurrent Neural Networks (RNNs) are very popular nowadays
Recurrent Neural Networks
• Recurrent Neural Networks (RNNs) are the state of the art in
  handwriting recognition, speech recognition and natural language
  processing, among others
• They are able to capture time dependences
• But their use for data streams is not straightforward:
  • Very sensitive to hyper-parameter configuration
  • Training requires many iterations over the data...
  • ...and a large amount of time
RNN: Echo State Network
• A type of Recurrent Neural Network
• Echo State Layer (ESL):
  • Dynamics driven only by the input
  • Requires very few computations
  • Easy-to-understand hyper-parameters
  • Can capture time dependences
• The ESN also requires the hyper-parameters needed by the NN
• Gradient descent methods converge slowly
Contribution
• Objective:
  • Model the evolution of the stream over time
  • Reduce the number of hyper-parameters
  • Reduce the number of samples needed to learn
• In this work we present the ESHT:
  • A combination of HT + ESL
  • Learns temporal dependences in data streams in real time
  • Requires fewer hyper-parameters than the ESN
ESHT
• Echo State Layer (ESL):
  • Only needs two hyper-parameters:
    • Alpha (α): weights the importance of events already in X(n) against new ones
    • Density: Wres is a sparse matrix with the given density
  • Encodes time dependences
• FIMT-DD: Hoeffding tree for regression
  • Works out of the box: no hyper-parameter tuning
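The role of the two ESL hyper-parameters can be illustrated with a small sketch. The slides do not spell out the state update, so this assumes the textbook leaky-integrator echo state update, X(n) = (1 − α) X(n−1) + α tanh(Win U(n) + Wres X(n−1)). The class name EchoStateLayer, the weight range [-1, 1] and the seed are hypothetical, and the spectral-radius scaling usually applied to Wres is omitted for brevity. The FIMT-DD regressor that would consume the state is not shown.

```python
import math
import random


class EchoStateLayer:
    """Leaky-integrator reservoir (assumed textbook update, not the authors' code)."""

    def __init__(self, n_inputs, n_neurons, alpha, density, seed=0):
        rng = random.Random(seed)
        self.alpha = alpha
        # Dense random input weights in [-1, 1] (illustrative choice).
        self.w_in = [[rng.uniform(-1.0, 1.0) for _ in range(n_inputs)]
                     for _ in range(n_neurons)]
        # Sparse reservoir: each entry is non-zero with probability `density`.
        self.w_res = [[rng.uniform(-1.0, 1.0) if rng.random() < density else 0.0
                       for _ in range(n_neurons)]
                      for _ in range(n_neurons)]
        self.state = [0.0] * n_neurons

    def update(self, u):
        """Advance one time step with input vector u and return the new state X(n)."""
        new_state = []
        for i in range(len(self.state)):
            pre = sum(w * x for w, x in zip(self.w_in[i], u))
            pre += sum(w * x for w, x in zip(self.w_res[i], self.state))
            # alpha blends the previous state with the new activation.
            new_state.append((1.0 - self.alpha) * self.state[i]
                             + self.alpha * math.tanh(pre))
        self.state = new_state
        return self.state
```

Under this assumed update, α = 1.0 replaces the state with the new activation at every step, while smaller values let the previous state persist longer.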
ESHT: Evaluation Methodology
• We propose the ESHT to learn character-stream functions:
  • Counter (skipped in this presentation)
  • lastIndexOf
  • emailFilter
• lastIndexOf evaluation:
  • Study the effects of the hyper-parameters α and density
    • Alpha (α): weights the importance of events already in X(n) against new ones
    • Density: Wres is a sparse matrix with the given density
  • Use 1,000 neurons in the ESL
• emailFilter evaluation:
  • We focus on the speed of learning
  • Use the outcomes of the previous evaluations to configure the
    ESHT for this task
• Metrics:
  • Cumulative loss
  • We count an error if |y_t − ŷ_t| >= 0.5
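The two metrics can be sketched as follows. The error rule (an absolute difference of at least 0.5) comes from the slide; the per-sample loss is not defined there, so using the absolute error as the loss is an assumption, as is the function name evaluate.

```python
def evaluate(targets, predictions, threshold=0.5):
    """Return (cumulative_loss, accuracy_percent) over a stream of predictions.

    Assumption: the per-sample loss is the absolute error |y_t - y_hat_t|.
    A sample counts as an error when that difference is >= threshold.
    """
    if not targets or len(targets) != len(predictions):
        raise ValueError("targets and predictions must be non-empty and equal in length")
    cumulative_loss = 0.0
    errors = 0
    for y, y_hat in zip(targets, predictions):
        diff = abs(y - y_hat)
        cumulative_loss += diff
        if diff >= threshold:
            errors += 1
    accuracy = 100.0 * (1.0 - errors / len(targets))
    return cumulative_loss, accuracy
```

Because the targets in these tasks are counts, the 0.5 threshold amounts to checking whether the regression output rounds to the correct integer.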
Input format
• Input is a vector of floats
  • Number of attributes = number of input symbols
  • The attribute representing the current symbol is set to 0.5
  • All other attributes are set to zero
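A minimal sketch of this encoding, assuming the symbols are given as an ordered alphabet (the function name encode_symbol and its parameters are hypothetical):

```python
def encode_symbol(symbol, alphabet, active_value=0.5):
    """Encode one symbol as a float vector with one attribute per alphabet symbol.

    The attribute of the current symbol is set to active_value (0.5 on the
    slide); every other attribute is zero.
    """
    vector = [0.0] * len(alphabet)
    vector[alphabet.index(symbol)] = active_value
    return vector
```

For example, with the alphabet "abc" the symbol "b" becomes a three-attribute vector whose second entry is 0.5.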
LastIndexOf
• Counts the number of time steps since the current symbol was
last observed
• Input stream is randomly generated
• We use 2, 3 and 4 symbols
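The label generator for this task could look like the sketch below. The slide does not say what the target is the first time a symbol appears, so emitting 0 in that case is an assumption; the function names and the seed are hypothetical.

```python
import random


def random_stream(length, n_symbols, seed=0):
    """Generate a random stream of symbol indices in [0, n_symbols)."""
    rng = random.Random(seed)
    return [rng.randrange(n_symbols) for _ in range(length)]


def last_index_of_labels(stream):
    """For each position, count the time steps since the same symbol was last seen.

    Assumption: a symbol that has not been seen before gets label 0.
    """
    last_seen = {}
    labels = []
    for t, symbol in enumerate(stream):
        labels.append(t - last_seen[symbol] if symbol in last_seen else 0)
        last_seen[symbol] = t
    return labels
```

The stream "abab" yields the labels 0, 0, 2, 2: each repeated symbol was last observed two steps earlier.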
LastIndexOf: Vector vs Scalar Input
• Vector input improves accuracy in all cases
  • Especially with 4 symbols
[Figure: accuracy (%) vs α at density = 0.4 for 2, 3 and 4 symbols, scalar input vs vector input ("-vec")]
LastIndexOf: Alpha and Density vs Accuracy
• Lower values of alpha (α) give low accuracy
• There is no clear correlation between accuracy and density
[Figure: accuracy (%) vs alpha (α) for 2, 3 and 4 symbols at density = 0.1 and density = 0.4]
[Figure: accuracy (%) vs density for α = 0.2 to α = 1.0]
EmailFilter
• ESHT configuration:
  • ESL: 4,000 neurons
  • α = 1.0 and density = 0.1
• Outputs the length at the next space character
• Dataset: 20 newsgroups dataset
  • Extracted 590 characters and repeated them 8 times
  • To reduce memory usage, we used an input vector of 4 symbols
EmailFilter: Recurrence vs Non Recurrence
• Non-recurrent methods (FIMT-DD and NN) fail to capture
temporal dependences
• NN defaults to majority class
Algorithm  Density  α    Learning rate  Loss     Accuracy (%)
FIMT-DD    -        -    -              4,119.7  91.61
NN         -        -    0.8            2,760    97.80
ESN1       0.2      1.0  0.1            1,032    98.47
ESN2       0.7      1.0  0.1            850      98.47
ESHT       0.1      1.0  -              180      99.75
EmailFilter: ESN vs ESHT
• After 500 samples the ESHT loss is close to 0 (and it is 0
  after 1,000 samples)
[Figure: cumulative loss vs number of samples for ESN1, ESN2 and ESHT]
Conclusions and Future Work
• Conclusions:
  • We presented the ESHT to learn temporal dependences in data
    streams in real time
  • The ESHT requires fewer hyper-parameters than the ESN
  • Our proof-of-concept implementation is able to learn faster
    than an ESN (most of them at the first attempt)
• Future Work:
  • We are currently reimplementing our prototype so we can test
    larger input sequences
  • We need to study the effects of the initial state vanishing in
    long sequences
Thank you
ESHT: Module Architecture
• In each evaluation we use the following architecture
• Label generator implements the function to be learnt
Counter: Introduction
• Stream of randomly generated zeros and ones
• Input is a scalar
• Two variants:
  • Option 1: outputs the cumulative count
  • Option 2: outputs the total count on the next zero
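The two variants are only described briefly, so the sketch below is one plausible reading rather than the authors' definition. It assumes the quantity being counted is the current run of ones and that the count resets at each zero. Under that reading, Option 1 emits the running count at every step, while Option 2 emits 0 until a zero arrives and then emits the total of the run that just ended. The function name counter_labels is hypothetical.

```python
def counter_labels(stream, variant=1):
    """Generate Counter targets for a stream of 0/1 values.

    Assumed semantics (not confirmed by the slides):
      variant 1: running count of consecutive ones, 0 when a zero arrives
      variant 2: 0 while ones arrive, total run length when a zero arrives
    """
    if variant not in (1, 2):
        raise ValueError("variant must be 1 or 2")
    labels = []
    run = 0
    for bit in stream:
        if bit == 1:
            run += 1
            labels.append(run if variant == 1 else 0)
        else:
            labels.append(run if variant == 2 else 0)
            run = 0  # the count restarts after every zero
    return labels
```

Option 2 is the harder target under this reading, because the model must hold the count in its state without any feedback until the zero arrives.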
Counter: Cumulative Loss
• After 200 samples the loss is stable
[Figure: cumulative loss vs number of samples for Op1 (density=0.3, α=1.0), Op1 (density=1.0, α=0.7), Op2 (density=0.8, α=1.0) and Op2 (density=0.8, α=0.7)]
Counter: Alpha and Density vs Accuracy
[Figure: accuracy (%) vs alpha (α)]
[Figure: accuracy (%) vs density]
EmailFilter: ASCII to 4 symbols Table
ASCII Domain        4-Symbols Domain
Original Symbols    Target Symbol    Target Symbol Index
[\t \n \r]+         Single space     0
[a-zA-Z0-9]         x                1
@                   @                2
.                   .                3
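The mapping in the table can be sketched as a small function. The table does not say what happens to characters outside the four classes (other punctuation, for instance), so dropping them is an assumption here, and the function name to_four_symbols is hypothetical.

```python
import re
import string

ALPHANUMERIC = set(string.ascii_letters + string.digits)


def to_four_symbols(text):
    """Map ASCII text to the 4-symbol index sequence from the table.

    Runs of whitespace collapse to a single space (index 0), letters and
    digits map to "x" (index 1), "@" maps to 2 and "." maps to 3.
    Assumption: any other character is dropped.
    """
    collapsed = re.sub(r"[\t\n\r ]+", " ", text)
    indices = []
    for ch in collapsed:
        if ch == " ":
            indices.append(0)
        elif ch in ALPHANUMERIC:
            indices.append(1)
        elif ch == "@":
            indices.append(2)
        elif ch == ".":
            indices.append(3)
    return indices
```

Each index can then be turned into the 4-attribute input vector described on the "Input format" slide.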