SlideShare une entreprise Scribd logo
1  sur  27
Télécharger pour lire hors ligne
Generalization of Tensor Factorization
          and Applications

                            Kohei Hayashi
                                 Collaborators:
T. Takenouchi, T. Shibata, Y. Kamiya, D. Kato, K. Kunieda, K. Yamada, K. Ikeda,
                            R. Tomioka, H. Kashima


                            May 14, 2012



                                                                                  1 / 29
Relational data
is a collection of relationships among multiple objects.




         relationships of pair   ⇔   matrix




                                                           2 / 29
Relational data
is a collection of relationships among multiple objects.




         relationships of pair ⇔     matrix
       relationships of 3-tuple ⇔ 3 dim. array




                                                           3 / 29
Relational data
is a collection of relationships among multiple objects.




                                              
         relationships of pair ⇔     matrix   
       relationships of 3-tuple ⇔ 3 dim. array tensor
                   .
                   .                   .
                                       .      
                   .                   .

can represent as a tensor with missing values.
                                                           4 / 29
Issue of tensor representation


Tensor representation is generally high-dimensional and
large-scale
   • e.g. 1, 000 users × 1, 000 items × 1, 000 times
     = total 1, 000, 000, 000 relationships


Dimensional reduction techniques such as tensor
factorization is used.



                                                          5 / 29
Tensor factorization


Tucker decomposition [Tucker 1966]: a tensor factorization
method assuming
  1 observation noise is Gaussian
   .

  2 underlying tensor is low-dimensional
   .




These assumptions are general ... but are not always
true.



                                                             6 / 29
Contributions
Generalize Tucker decomposition and propose two new
models:
  1 Exponential family tensor factorization (ETF)
   .
       [Joint work with Takenouchi, Shibata, Kamiya, Kunieda, Yamada, and
       Ikeda]
          • Generalize the noise distribution.
          • Can handle a tensor containing mixed discrete and
             continuous values.
  2.   Full-rank tensor completion (FTC)
       [Joint work with Tomioka and Kashima]
          • Kernelize Tucker decomposition
          • Complete missing values without reducing the
             dimensionality.

                                                                            7 / 29
Outline



1.   Tucker decomposition
2.   Exponential family tensor factorization (ETF)
3.   Full-rank tensor completion (FTC)
4.   Conclusion




                                                     8 / 29
Outline



1.   Tucker decomposition
2.   Exponential family tensor factorization (ETF)
3.   Full-rank tensor completion (FTC)
4.   Conclusion




                                                     9 / 29
Tucker decomposition




                  K1   K2   K3
                                  (1) (2) (3)
         xijk =                  uiq ujr uks zqrs + εijk   (1)
                  q=1 r=1 s=1


• ε: i.i.d Gaussian noise

                                                           10 / 29
Tucker decomposition




                  K1   K2   K3
                                  (1) (2) (3)
         xijk =                  uiq ujr uks zqrs + εijk   (1)
                  q=1 r=1 s=1


• ε: i.i.d Gaussian noise

                                                           11 / 29
Tucker decomposition




                  K1   K2   K3
                                  (1) (2) (3)
         xijk =                  uiq ujr uks zqrs + εijk   (1)
                  q=1 r=1 s=1


• ε: i.i.d Gaussian noise

                                                           12 / 29
Vectorized form

Let x ∈ RD and z ∈ RK denote vectorized X and Z,
resp., then

   x = Wz + ε where W ≡ U(3) ⊗ U(2) ⊗ U(1) .

  • D ≡ D1 D2 D3 , K ≡ K1 K2 K3
  • ⊗: the Kronecker product

Tucker decomposition = A linear Gaussian model

                x ∼ N (x | Wz, σ 2 I)


                                                   13 / 29
Rank of tensor
Call dimensionalities of the core tensor rank of tensor .




  • Rank of X is (K1 , K2 , K3 ).


Rank of tensor represents its complexity
  • low-rank tensor has less information
                                                            14 / 29
Outline



1.   Tucker decomposition
2.   Exponential family tensor factorization (ETF)
3.   Full-rank tensor completion (FTC)
4.   Conclusion




                                                     15 / 29
Motivation: heterogeneous tensor




Tucker decomposition assumes Gaussian noise
  • not appropriate for such data




                                              16 / 29
Motivation: heterogeneous tensor




Tucker decomposition assumes Gaussian noise
   • not appropriate for such data
.
Approach
.
generalize Tucker decomposition, assuming a different
distribution for each element
.
                                                       17 / 29
Exponential family tensor factorization
.
Likelihood
.
                            D
        x∼                       Expond (xd | θd ),                    θ ≡ Wz
                           d=1           exp. family                  Tucker decomp.

.       where Expon(x | θ) ≡ exp [xθ − ψ(θ) + F (x)]

exponential family : a class of distributions
                           Gaussian                    Poisson                   Bernoulli
                     0.4




                                                                           0.7
    Density
           Density




                                                 0.2
                     0.2




                                                                           0.5
    (θ = 0)
                     0.0




                                                 0.0




                                                                           0.3

    ψ(θ)                   θ2−4 −2
                             /2      0   2   4
                                                       exp[θ] 4
                                                        0 2       6    8
                                                                                 ln(10.0 0.5 1.0 1.5
                                                                                 −0.5
                                                                                       + exp[θ])
                                     x

                                                                                                       18 / 29
.
Priors
.
  • for z: a Gaussian prior N (0, I)
          (m)                        −1
. • for U : a Gaussian prior N (0, αm I)

.
Joint log-likelihood
.
                       D
         L =x Wz −           ψhd (wd z)
                       d=1
                             M
                 1              αm        2
             −     ||z||2 −        U(m)         + const.
                 2          m=1
                                 2        Fro

.
Estimate parameters by Bayesian inference.
                                                           19 / 29
Bayesian inference
Marginal-MAP estimator:

        argmax           exp[L(z, U(1) , . . . , U(M ) )]dz
       U(1) ,...,U(M )


  • The integral is not analytical.

Develop efficient yet accurate approximation with
Laplace approximation and Gaussian process.
  • Computational cost is still higher than Tucker
    dcomp.
  • For a long and thin tensor (e.g. time series), online
    algorithm is applicable (see thesis.)
                                                              20 / 29
Experiments




              21 / 29
Anomaly detection by ETF
    Purpose Find irregular parts of tensor data
    Method Apply distance-based outlier
            DB (p, D) [Knorr+ VLDB’00] to the estimated
            factor matrix U(m)
.
Definition of DB (p, D)
.
“An object O is a DB (p, D) outlier if at least fraction p
. the objects lies at a distance greater than D from O.”
of




                                                          22 / 29
Data set

A time series of multisensor measurements
  • Each sensor recorded the human behavior (e.g.
     position) of researchers in NEC lab. for 8 month.
         • 6 (sensors) × 21 (persons) × 1927 (hours)

          Sensor                    Type           Min         Max
    X1    # of sent emails          Count          0.00       14.00
    X2    # of received emails      Count          0.00       15.00
    X3    # of typed keys           Count          0.00    50422.00
    X4    X coordinate               Real       −550.17     3444.15
    X5    Y coordinate               Real        128.71     2353.55
    X6    Movement distance      Non-negative      0.00   203136.96




                                                                      23 / 29
Evaluation
List of irregular events are provided.
                    Examples of irregular events
                Date    Time       Description
                Dec 21 All day Private incident
                Dec 22 15:00       Monthly seminar
                        16:00
                Jun 15 13:00       Visiting tour
                .
                .       .
                        .          .
                                   .
                .       .          .


  • Evaluate as a binary classification
  • 50% are missing, 10 trials
  • Apply ETF with online algorithm


                                                     24 / 29
Result: classification performance



          0.7



                                                      method
          0.6                                            PARAFAC
    AUC




                                                         Tucker
                                                         pTucker (EM)

          0.5                                            ETF online




          0.4


                2x2x2 3x3x3 4x4x4 5x5x5 6x6x6 6x7x7
                             Rank




                                                                        25 / 29
Result: classification performance



                 0.7



                                                             method
                 0.6                                            PARAFAC
           AUC




                                                                Tucker
                                                                pTucker (EM)

                 0.5                                            ETF online




                 0.4


                       2x2x2 3x3x3 4x4x4 5x5x5 6x6x6 6x7x7
                                    Rank




 ETF well detect anomaly events
                                                                               26 / 29
Skip the rest




                27 / 29

Contenu connexe

Tendances

はじパタ8章 svm
はじパタ8章 svmはじパタ8章 svm
はじパタ8章 svmtetsuro ito
 
Robust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionRobust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionActiveEon
 
Differential equation and Laplace Transform
Differential equation and Laplace TransformDifferential equation and Laplace Transform
Differential equation and Laplace TransformKalaiindhu
 
はじパタ 10章 クラスタリング 前半
はじパタ 10章 クラスタリング 前半はじパタ 10章 クラスタリング 前半
はじパタ 10章 クラスタリング 前半Katsushi Yamashita
 
[DL輪読会]Deep Learning for Sampling from Arbitrary Probability Distributions
[DL輪読会]Deep Learning for Sampling from Arbitrary Probability Distributions[DL輪読会]Deep Learning for Sampling from Arbitrary Probability Distributions
[DL輪読会]Deep Learning for Sampling from Arbitrary Probability DistributionsDeep Learning JP
 
03 truncation errors
03 truncation errors03 truncation errors
03 truncation errorsmaheej
 
[DL輪読会]Xception: Deep Learning with Depthwise Separable Convolutions
[DL輪読会]Xception: Deep Learning with Depthwise Separable Convolutions[DL輪読会]Xception: Deep Learning with Depthwise Separable Convolutions
[DL輪読会]Xception: Deep Learning with Depthwise Separable ConvolutionsDeep Learning JP
 
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...Preferred Networks
 
PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半Ohsawa Goodfellow
 
(DL hacks輪読) Deep Kernel Learning
(DL hacks輪読) Deep Kernel Learning(DL hacks輪読) Deep Kernel Learning
(DL hacks輪読) Deep Kernel LearningMasahiro Suzuki
 
05.第五章用Matlab计算积分
05.第五章用Matlab计算积分05.第五章用Matlab计算积分
05.第五章用Matlab计算积分Xin Zheng
 
Forward-Forward Algorithm
Forward-Forward AlgorithmForward-Forward Algorithm
Forward-Forward AlgorithmDong Heon Cho
 
Meta-learning and the ELBO
Meta-learning and the ELBOMeta-learning and the ELBO
Meta-learning and the ELBOYoonho Lee
 

Tendances (20)

はじパタ8章 svm
はじパタ8章 svmはじパタ8章 svm
はじパタ8章 svm
 
Contraction mapping
Contraction mappingContraction mapping
Contraction mapping
 
Robust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionRobust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object Detection
 
Differential equation and Laplace Transform
Differential equation and Laplace TransformDifferential equation and Laplace Transform
Differential equation and Laplace Transform
 
はじパタ 10章 クラスタリング 前半
はじパタ 10章 クラスタリング 前半はじパタ 10章 クラスタリング 前半
はじパタ 10章 クラスタリング 前半
 
[DL輪読会]Deep Learning for Sampling from Arbitrary Probability Distributions
[DL輪読会]Deep Learning for Sampling from Arbitrary Probability Distributions[DL輪読会]Deep Learning for Sampling from Arbitrary Probability Distributions
[DL輪読会]Deep Learning for Sampling from Arbitrary Probability Distributions
 
03 truncation errors
03 truncation errors03 truncation errors
03 truncation errors
 
主成分分析
主成分分析主成分分析
主成分分析
 
K-Means manual work
K-Means manual workK-Means manual work
K-Means manual work
 
Distributed ADMM
Distributed ADMMDistributed ADMM
Distributed ADMM
 
[DL輪読会]Xception: Deep Learning with Depthwise Separable Convolutions
[DL輪読会]Xception: Deep Learning with Depthwise Separable Convolutions[DL輪読会]Xception: Deep Learning with Depthwise Separable Convolutions
[DL輪読会]Xception: Deep Learning with Depthwise Separable Convolutions
 
Prml 1.3~1.6 ver3
Prml 1.3~1.6 ver3Prml 1.3~1.6 ver3
Prml 1.3~1.6 ver3
 
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
 
PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半PRML上巻勉強会 at 東京大学 資料 第1章後半
PRML上巻勉強会 at 東京大学 資料 第1章後半
 
(DL hacks輪読) Deep Kernel Learning
(DL hacks輪読) Deep Kernel Learning(DL hacks輪読) Deep Kernel Learning
(DL hacks輪読) Deep Kernel Learning
 
05.第五章用Matlab计算积分
05.第五章用Matlab计算积分05.第五章用Matlab计算积分
05.第五章用Matlab计算积分
 
Prml07
Prml07Prml07
Prml07
 
Fourier series
Fourier seriesFourier series
Fourier series
 
Forward-Forward Algorithm
Forward-Forward AlgorithmForward-Forward Algorithm
Forward-Forward Algorithm
 
Meta-learning and the ELBO
Meta-learning and the ELBOMeta-learning and the ELBO
Meta-learning and the ELBO
 

En vedette

Matrix and Tensor Tools for Computer Vision
Matrix and Tensor Tools for Computer VisionMatrix and Tensor Tools for Computer Vision
Matrix and Tensor Tools for Computer VisionActiveEon
 
A brief survey of tensors
A brief survey of tensorsA brief survey of tensors
A brief survey of tensorsBerton Earnshaw
 
Expected tensor decomposition with stochastic gradient descent
Expected tensor decomposition with stochastic gradient descentExpected tensor decomposition with stochastic gradient descent
Expected tensor decomposition with stochastic gradient descentKohei Hayashi
 
On the benchmark of Chainer
On the benchmark of ChainerOn the benchmark of Chainer
On the benchmark of ChainerKenta Oono
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsYONG ZHENG
 
Tensor Train decomposition in machine learning
Tensor Train decomposition in machine learningTensor Train decomposition in machine learning
Tensor Train decomposition in machine learningAlexander Novikov
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Xavier Amatriain
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFMLiangjie Hong
 
2014 3 13(テンソル分解の基礎)
2014 3 13(テンソル分解の基礎)2014 3 13(テンソル分解の基礎)
2014 3 13(テンソル分解の基礎)Tatsuya Yokota
 
Doubly Decomposing Nonparametric Tensor Regression
Doubly Decomposing Nonparametric Tensor RegressionDoubly Decomposing Nonparametric Tensor Regression
Doubly Decomposing Nonparametric Tensor RegressionMasaaki Imaizumi
 
Extending Preconditioned GMRES to Nonlinear Optimization
Extending Preconditioned GMRES to Nonlinear OptimizationExtending Preconditioned GMRES to Nonlinear Optimization
Extending Preconditioned GMRES to Nonlinear OptimizationHans De Sterck
 
Top-N Recommendation for Shared Accounts
Top-N Recommendation for Shared AccountsTop-N Recommendation for Shared Accounts
Top-N Recommendation for Shared Accountskoeverstrep
 
Originan
OriginanOriginan
OriginanTensor
 
Context-aware preference modeling with factorization
Context-aware preference modeling with factorizationContext-aware preference modeling with factorization
Context-aware preference modeling with factorizationBalázs Hidasi
 
Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...
Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...
Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...Shunsuke Ono
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization MachinesPavel Kalaidin
 
Recent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detectionRecent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detectionActiveEon
 
Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...
Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...
Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...Alexandros Karatzoglou
 

En vedette (20)

Matrix and Tensor Tools for Computer Vision
Matrix and Tensor Tools for Computer VisionMatrix and Tensor Tools for Computer Vision
Matrix and Tensor Tools for Computer Vision
 
A brief survey of tensors
A brief survey of tensorsA brief survey of tensors
A brief survey of tensors
 
Expected tensor decomposition with stochastic gradient descent
Expected tensor decomposition with stochastic gradient descentExpected tensor decomposition with stochastic gradient descent
Expected tensor decomposition with stochastic gradient descent
 
On the benchmark of Chainer
On the benchmark of ChainerOn the benchmark of Chainer
On the benchmark of Chainer
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender Systems
 
Tensor Train decomposition in machine learning
Tensor Train decomposition in machine learningTensor Train decomposition in machine learning
Tensor Train decomposition in machine learning
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
3. tensor calculus jan 2013
3. tensor calculus jan 20133. tensor calculus jan 2013
3. tensor calculus jan 2013
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
 
2014 3 13(テンソル分解の基礎)
2014 3 13(テンソル分解の基礎)2014 3 13(テンソル分解の基礎)
2014 3 13(テンソル分解の基礎)
 
Doubly Decomposing Nonparametric Tensor Regression
Doubly Decomposing Nonparametric Tensor RegressionDoubly Decomposing Nonparametric Tensor Regression
Doubly Decomposing Nonparametric Tensor Regression
 
Extending Preconditioned GMRES to Nonlinear Optimization
Extending Preconditioned GMRES to Nonlinear OptimizationExtending Preconditioned GMRES to Nonlinear Optimization
Extending Preconditioned GMRES to Nonlinear Optimization
 
Top-N Recommendation for Shared Accounts
Top-N Recommendation for Shared AccountsTop-N Recommendation for Shared Accounts
Top-N Recommendation for Shared Accounts
 
07 Tensor Visualization
07 Tensor Visualization07 Tensor Visualization
07 Tensor Visualization
 
Originan
OriginanOriginan
Originan
 
Context-aware preference modeling with factorization
Context-aware preference modeling with factorizationContext-aware preference modeling with factorization
Context-aware preference modeling with factorization
 
Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...
Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...
Ph.D. Thesis Presentation: A Study of Priors and Algorithms for Signal Recove...
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization Machines
 
Recent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detectionRecent advances on low-rank and sparse decomposition for moving object detection
Recent advances on low-rank and sparse decomposition for moving object detection
 
Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...
Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...
Multiverse Recommendation: N-dimensional Tensor Factorization for Context-awa...
 

Similaire à Generalization of Tensor Factorization and Applications for Anomaly Detection

Moment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsMoment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsColin Gillespie
 
Moment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsMoment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsColin Gillespie
 
Fluctuations and rare events in stochastic aggregation
Fluctuations and rare events in stochastic aggregationFluctuations and rare events in stochastic aggregation
Fluctuations and rare events in stochastic aggregationColm Connaughton
 
Pinning and facetting in multiphase LBMs
Pinning and facetting in multiphase LBMsPinning and facetting in multiphase LBMs
Pinning and facetting in multiphase LBMsTim Reis
 
On estimating the integrated co volatility using
On estimating the integrated co volatility usingOn estimating the integrated co volatility using
On estimating the integrated co volatility usingkkislas
 
The Multivariate Gaussian Probability Distribution
The Multivariate Gaussian Probability DistributionThe Multivariate Gaussian Probability Distribution
The Multivariate Gaussian Probability DistributionPedro222284
 
Fixed point theorem of discontinuity and weak compatibility in non complete n...
Fixed point theorem of discontinuity and weak compatibility in non complete n...Fixed point theorem of discontinuity and weak compatibility in non complete n...
Fixed point theorem of discontinuity and weak compatibility in non complete n...Alexander Decker
 
11.fixed point theorem of discontinuity and weak compatibility in non complet...
11.fixed point theorem of discontinuity and weak compatibility in non complet...11.fixed point theorem of discontinuity and weak compatibility in non complet...
11.fixed point theorem of discontinuity and weak compatibility in non complet...Alexander Decker
 
Solution of a subclass of singular second order
Solution of a subclass of singular second orderSolution of a subclass of singular second order
Solution of a subclass of singular second orderAlexander Decker
 
11.solution of a subclass of singular second order
11.solution of a subclass of singular second order11.solution of a subclass of singular second order
11.solution of a subclass of singular second orderAlexander Decker
 
computational stochastic phase-field
computational stochastic phase-fieldcomputational stochastic phase-field
computational stochastic phase-fieldcerniagigante
 
Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...
Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...
Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...Colm Connaughton
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...
The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...
The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...Colm Connaughton
 
Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Matthew Leingang
 
Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Matthew Leingang
 
Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Mel Anthony Pepito
 

Similaire à Generalization of Tensor Factorization and Applications for Anomaly Detection (20)

Moment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsMoment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
 
Moment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsMoment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic models
 
Fluctuations and rare events in stochastic aggregation
Fluctuations and rare events in stochastic aggregationFluctuations and rare events in stochastic aggregation
Fluctuations and rare events in stochastic aggregation
 
Pinning and facetting in multiphase LBMs
Pinning and facetting in multiphase LBMsPinning and facetting in multiphase LBMs
Pinning and facetting in multiphase LBMs
 
Berans qm overview
Berans qm overviewBerans qm overview
Berans qm overview
 
On estimating the integrated co volatility using
On estimating the integrated co volatility usingOn estimating the integrated co volatility using
On estimating the integrated co volatility using
 
The Multivariate Gaussian Probability Distribution
The Multivariate Gaussian Probability DistributionThe Multivariate Gaussian Probability Distribution
The Multivariate Gaussian Probability Distribution
 
1 - Linear Regression
1 - Linear Regression1 - Linear Regression
1 - Linear Regression
 
Fixed point theorem of discontinuity and weak compatibility in non complete n...
Fixed point theorem of discontinuity and weak compatibility in non complete n...Fixed point theorem of discontinuity and weak compatibility in non complete n...
Fixed point theorem of discontinuity and weak compatibility in non complete n...
 
11.fixed point theorem of discontinuity and weak compatibility in non complet...
11.fixed point theorem of discontinuity and weak compatibility in non complet...11.fixed point theorem of discontinuity and weak compatibility in non complet...
11.fixed point theorem of discontinuity and weak compatibility in non complet...
 
Solution of a subclass of singular second order
Solution of a subclass of singular second orderSolution of a subclass of singular second order
Solution of a subclass of singular second order
 
11.solution of a subclass of singular second order
11.solution of a subclass of singular second order11.solution of a subclass of singular second order
11.solution of a subclass of singular second order
 
computational stochastic phase-field
computational stochastic phase-fieldcomputational stochastic phase-field
computational stochastic phase-field
 
Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...
Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...
Instantaneous Gelation in Smoluchwski's Coagulation Equation Revisited, Confe...
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...
The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...
The Inverse Smoluchowski Problem, Particles In Turbulence 2011, Potsdam, Marc...
 
Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)
 
Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)
 
Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)Lesson 26: The Fundamental Theorem of Calculus (slides)
Lesson 26: The Fundamental Theorem of Calculus (slides)
 
Chapter 3 (maths 3)
Chapter 3 (maths 3)Chapter 3 (maths 3)
Chapter 3 (maths 3)
 

Dernier

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 

Dernier (20)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 

Generalization of Tensor Factorization and Applications for Anomaly Detection

  • 1. Generalization of Tensor Factorization and Applications Kohei Hayashi Collaborators: T. Takenouchi, T. Shibata, Y. Kamiya, D. Kato, K. Kunieda, K. Yamada, K. Ikeda, R. Tomioka, H. Kashima May 14, 2012 1 / 29
  • 2. Relational data is a collection of relationships among multiple objects. relationships of pair ⇔ matrix 2 / 29
  • 3. Relational data is a collection of relationships among multiple objects. relationships of pair ⇔ matrix relationships of 3-tuple ⇔ 3 dim. array 3 / 29
  • 4. Relational data is a collection of relationships among multiple objects.  relationships of pair ⇔ matrix  relationships of 3-tuple ⇔ 3 dim. array tensor . . . .  . . can represent as a tensor with missing values. 4 / 29
  • 5. Issue of tensor representation Tensor representation is generally high-dimensional and large-scale • e.g. 1, 000 users × 1, 000 items × 1, 000 times = total 1, 000, 000, 000 relationships Dimensional reduction techniques such as tensor factorization is used. 5 / 29
  • 6. Tensor factorization Tucker decomposition [Tucker 1966]: a tensor factorization method assuming 1 observation noise is Gaussian . 2 underlying tensor is low-dimensional . These assumptions are general ... but are not always true. 6 / 29
  • 7. Contributions Generalize Tucker decomposition and propose two new models: 1 Exponential family tensor factorization (ETF) . [Joint work with Takenouchi, Shibata, Kamiya, Kunieda, Yamada, and Ikeda] • Generalize the noise distribution. • Can handle a tensor containing mixed discrete and continuous values. 2. Full-rank tensor completion (FTC) [Joint work with Tomioka and Kashima] • Kernelize Tucker decomposition • Complete missing values without reducing the dimensionality. 7 / 29
  • 8. Outline 1. Tucker decomposition 2. Exponential family tensor factorization (ETF) 3. Full-rank tensor completion (FTC) 4. Conclusion 8 / 29
  • 9. Outline 1. Tucker decomposition 2. Exponential family tensor factorization (ETF) 3. Full-rank tensor completion (FTC) 4. Conclusion 9 / 29
  • 10. Tucker decomposition K1 K2 K3 (1) (2) (3) xijk = uiq ujr uks zqrs + εijk (1) q=1 r=1 s=1 • ε: i.i.d Gaussian noise 10 / 29
  • 11. Tucker decomposition K1 K2 K3 (1) (2) (3) xijk = uiq ujr uks zqrs + εijk (1) q=1 r=1 s=1 • ε: i.i.d Gaussian noise 11 / 29
  • 12. Tucker decomposition K1 K2 K3 (1) (2) (3) xijk = uiq ujr uks zqrs + εijk (1) q=1 r=1 s=1 • ε: i.i.d Gaussian noise 12 / 29
  • 13. Vectorized form Let x ∈ RD and z ∈ RK denote vectorized X and Z, resp., then x = Wz + ε where W ≡ U(3) ⊗ U(2) ⊗ U(1) . • D ≡ D1 D2 D3 , K ≡ K1 K2 K3 • ⊗: the Kronecker product Tucker decomposition = A linear Gaussian model x ∼ N (x | Wz, σ 2 I) 13 / 29
  • 14. Rank of tensor Call dimensionalities of the core tensor rank of tensor . • Rank of X is (K1 , K2 , K3 ). Rank of tensor represents its complexity • low-rank tensor has less information 14 / 29
  • 15. Outline 1. Tucker decomposition 2. Exponential family tensor factorization (ETF) 3. Full-rank tensor completion (FTC) 4. Conclusion 15 / 29
  • 16. Motivation: heterogeneous tensor Tucker decomposition assumes Gaussian noise • not appropriate for such data 16 / 29
  • 17. Motivation: heterogeneous tensor Tucker decomposition assumes Gaussian noise • not appropriate for such data . Approach . generalize Tucker decomposition, assuming a different distribution for each element . 17 / 29
  • 18. Exponential family tensor factorization . Likelihood . D x∼ Expond (xd | θd ), θ ≡ Wz d=1 exp. family Tucker decomp. . where Expon(x | θ) ≡ exp [xθ − ψ(θ) + F (x)] exponential family : a class of distributions Gaussian Poisson Bernoulli 0.4 0.7 Density Density 0.2 0.2 0.5 (θ = 0) 0.0 0.0 0.3 ψ(θ) θ2−4 −2 /2 0 2 4 exp[θ] 4 0 2 6 8 ln(10.0 0.5 1.0 1.5 −0.5 + exp[θ]) x 18 / 29
  • 19. . Priors . • for z: a Gaussian prior N (0, I) (m) −1 . • for U : a Gaussian prior N (0, αm I) . Joint log-likelihood . D L =x Wz − ψhd (wd z) d=1 M 1 αm 2 − ||z||2 − U(m) + const. 2 m=1 2 Fro . Estimate parameters by Bayesian inference. 19 / 29
  • 20. Bayesian inference Marginal-MAP estimator: argmax exp[L(z, U(1) , . . . , U(M ) )]dz U(1) ,...,U(M ) • The integral is not analytical. Develop efficient yet accurate approximation with Laplace approximation and Gaussian process. • Computational cost is still higher than Tucker dcomp. • For a long and thin tensor (e.g. time series), online algorithm is applicable (see thesis.) 20 / 29
  • 21. Experiments 21 / 29
  • 22. Anomaly detection by ETF Purpose Find irregular parts of tensor data Method Apply distance-based outlier DB (p, D) [Knorr+ VLDB’00] to the estimated factor matrix U(m) . Definition of DB (p, D) . “An object O is a DB (p, D) outlier if at least fraction p . the objects lies at a distance greater than D from O.” of 22 / 29
  • 23. Data set A time series of multisensor measurements • Each sensor recorded the human behavior (e.g. position) of researchers in NEC lab. for 8 month. • 6 (sensors) × 21 (persons) × 1927 (hours) Sensor Type Min Max X1 # of sent emails Count 0.00 14.00 X2 # of received emails Count 0.00 15.00 X3 # of typed keys Count 0.00 50422.00 X4 X coordinate Real −550.17 3444.15 X5 Y coordinate Real 128.71 2353.55 X6 Movement distance Non-negative 0.00 203136.96 23 / 29
  • 24. Evaluation List of irregular events are provided. Examples of irregular events Date Time Description Dec 21 All day Private incident Dec 22 15:00 Monthly seminar 16:00 Jun 15 13:00 Visiting tour . . . . . . . . . • Evaluate as a binary classification • 50% are missing, 10 trials • Apply ETF with online algorithm 24 / 29
  • 25. Result: classification performance 0.7 method 0.6 PARAFAC AUC Tucker pTucker (EM) 0.5 ETF online 0.4 2x2x2 3x3x3 4x4x4 5x5x5 6x6x6 6x7x7 Rank 25 / 29
  • 26. Result: classification performance 0.7 method 0.6 PARAFAC AUC Tucker pTucker (EM) 0.5 ETF online 0.4 2x2x2 3x3x3 4x4x4 5x5x5 6x6x6 6x7x7 Rank ETF well detect anomaly events 26 / 29
  • 27. Skip the rest 27 / 29