SlideShare une entreprise Scribd logo
1  sur  43
Télécharger pour lire hors ligne
Sparse Coding and Dictionary Learning
                              for Image Analysis
                       Part II: Dictionary Learning for signal reconstruction


           Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro


                        ICCV’09 tutorial, Kyoto, 28th September 2009




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   1/43
What this part is about
           The learning of compact representations of images
           adapted to restoration tasks.
           A fast online algorithm for learning dictionaries and
           factorizing matrices in general.
           Various formulations for image and video processing.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   2/43
The Image Denoising Problem




                               y                  =            xorig                  + w
                 measurements                            original image                        noise


Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   3/43
Sparse representations for image restoration

                                             y          =          xorig        + w
                                     measurements              original image       noise


   Energy minimization problem - MAP estimation
                                 E (x) =               ||y − x||2
                                                                2                + Pr (x)
                                              relation to measurements                 prior


   Some classical priors
           Smoothness λ||Lx||2
                             2
           Total variation λ|| x||2
                                  1
           Wavelet sparsity λ||Wx||1
           ...

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro       Dictionary Learning for signal reconstruction   4/43
Sparse representations for image restoration
   Sparsity and redundancy
                 Pr (x) = λ||α||0 for x ≈ Dα
                                               
                                      α[1]
                                          α[2]
            x =  d1 d2 · · · dp   .  . 
                                           . 
                                            α[p]
                      x∈Rm                             D∈Rm×p
                                                                                   α∈Rp ,sparse




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   5/43
Sparse representations for image restoration
   Designed dictionaries
   [Haar, 1910], [Zweig, Morlet, Grossman ∼70s], [Meyer, Mallat,
   Daubechies, Coifman, Donoho, Candes ∼80s-today]. . . (see [Mallat,
   1999])
   Wavelets, Curvelets, Wedgelets, Bandlets, . . . lets

   Learned dictionaries of patches
   [Olshausen and Field, 1997], [Engan et al., 1999], [Lewicki and
   Sejnowski, 2000], [Aharon et al., 2006] , [Roth and Black, 2005], [Lee
   et al., 2007]
                                1
                       min        ||xi − Dαi ||2 + λψ(αi )
                                               2
                      αi ,D∈C   2
                                               i
                                                                               sparsity
                                                      reconstruction
           ψ(α) = ||α||0 (“               0   pseudo-norm”)
           ψ(α) = ||α||1 (            1   norm)

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   6/43
Sparse representations for image restoration
   Solving the denoising problem
   [Elad and Aharon, 2006]
       Extract all overlapping 8 × 8 patches yi .
       Solve a matrix factorization problem:
                                             n
                                                   1
                               min                   ||yi − Dαi ||2 + λψ(αi ),
                                                                  2
                             αi ,D∈C               2
                                           i=1                                          sparsity
                                                        reconstruction

           with n > 100, 000
           Average the reconstruction of each patch.


Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   7/43
Sparse representations for image restoration
K-SVD: [Elad and Aharon, 2006]




   Figure:     Dictionary trained on a noisy version of the image
   boat.

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   8/43
Sparse representations for image restoration
   Inpainting, Demosaicking
                                             1
                             min               ||β ⊗ (yi − Dαi )||2 + λi ψ(αi )
                           D∈C,α             2 i                  2
                                        i

   RAW Image Processing (see our poster)

                                               White
                                              balance.
                                                                                   Denoising
                                               Black
                                            substraction.
                                            Conversion
                                             to sRGB.
                                                                               Demosaicking
                                              Gamma
                                            correction.

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   9/43
Sparse representations for image restoration
[Mairal, Bach, Ponce, Sapiro, and Zisserman, 2009c]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   10/43
Sparse representations for image restoration
[Mairal, Sapiro, and Elad, 2008b]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   11/43
Sparse representations for image restoration
Inpainting, [Mairal, Elad, and Sapiro, 2008a]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   12/43
Sparse representations for image restoration
Inpainting, [Mairal, Elad, and Sapiro, 2008a]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   13/43
Sparse representations for video restoration
   Key ideas for video processing
   [Protter and Elad, 2009]
       Using a 3D dictionary.
       Processing of many frames at the same time.
       Dictionary propagation.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   14/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   15/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   16/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   17/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   18/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   19/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   20/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   21/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   22/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   23/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   24/43
Optimization for Dictionary Learning

                                          n
                                                1
                            min                   ||xi − Dαi ||2 + λ||αi ||1
                                                               2
                         α∈R  p×n               2
                          D∈C           i=1

         C = {D ∈ Rm×p s.t. ∀j = 1, . . . , p, ||dj ||2 ≤ 1}.


           Classical optimization alternates between D and α.
           Good results, but very slow!




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   25/43
Optimization for Dictionary Learning
[Mairal, Bach, Ponce, and Sapiro, 2009a]

   Classical formulation of dictionary learning
                                                                     n
                                                                1
                                     min fn (D) = min                     l(xi , D),
                                     D∈C                  D∈C   n
                                                                    i=1

   where
                                             1
                               l(x, D) = minp ||x − Dα||2 + λ||α||1 .
                                                        2
                                         α∈R 2

   Which formulation are we interested in?
                                                                                    n
                                                     1
                      min f (D) = Ex [l(x, D)] ≈ lim                                    l(xi , D)
                      D∈C                       n→+∞ n
                                                                                  i=1

   [Bottou and Bousquet, 2008]: Online learning can
           handle potentially infinite or dynamic datasets,
           be dramatically faster than batch algorithms.
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro    Dictionary Learning for signal reconstruction   26/43
Optimization for Dictionary Learning
   Require: D0 ∈ Rm×p (initial dictionary); λ ∈ R
    1: A0 = 0, B0 = 0.
    2: for t=1,. . . ,T do
    3: Draw xt
    4: Sparse Coding

                                             1
                                 αt ← arg min ||xt − Dt−1 α||2 + λ||α||1 ,
                                                             2
                                       α∈Rp 2

     5:     Aggregate sufficient statistics
            At ← At−1 + αt αT , Bt ← Bt−1 + xt αT
                              t                    t
     6:     Dictionary Update (block-coordinate descent)
                                                        t
                                                   1           1
                           Dt ← arg min                          ||xi − Dαi ||2 + λ||αi ||1 .
                                                                              2
                                         D∈C       t           2
                                                       i=1

     7:   end for
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro    Dictionary Learning for signal reconstruction   27/43
Optimization for Dictionary Learning
   Which guarantees do we have?
   Under a few reasonable assumptions,
                                     ˆ
       we build a surrogate function ft of the expected cost f verifying
                                                   ˆ
                                               lim ft (Dt ) − f (Dt ) = 0,
                                             t→+∞

           Dt is asymptotically close to a stationary point.

   Extensions (all implemented in SPAMS)
           non-negative matrix decompositions.
           sparse PCA (sparse dictionaries).
           fused-lasso regularizations (piecewise constant dictionaries)




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   28/43
Optimization for Dictionary Learning
Experimental results, batch vs online




                                             m = 8 × 8, p = 256
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   29/43
Optimization for Dictionary Learning
Experimental results, batch vs online




                                       m = 12 × 12 × 3, p = 512
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   30/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   31/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   32/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   33/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   34/43
Extension to NMF and sparse PCA
[Mairal, Bach, Ponce, and Sapiro, 2009b]

   NMF extension
                                    n
                                         1
                         min               ||xi − Dαi ||2 s.t. αi ≥ 0,
                                                        2                                  D ≥ 0.
                      α∈Rp×n             2
                       D∈C i=1


   SPCA extension
                                               n
                                                   1
                                   min               ||xi − Dαi ||2 + λ||α1 ||1
                                                                  2
                                 α∈Rp×n            2
                                   D∈C       i=1

                      C = {D ∈ Rm×p s.t. ∀j ||dj ||2 + γ||dj ||1 ≤ 1}.
                                                   2




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   35/43
Extension to NMF and sparse PCA
Faces: Extended Yale Database B




                 (a) PCA                               (b) NNMF                                  (c) DL




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   36/43
Extension to NMF and sparse PCA
Faces: Extended Yale Database B




         (d) SPCA, τ = 70%                       (e) SPCA, τ = 30%                     (f) SPCA, τ = 10%




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   37/43
Extension to NMF and sparse PCA
Natural Patches




                 (a) PCA                               (b) NNMF                                  (c) DL




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   38/43
Extension to NMF and sparse PCA
Natural Patches




         (d) SPCA, τ = 70%                       (e) SPCA, τ = 30%                     (f) SPCA, τ = 10%




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   39/43
Summary of this part
           The dictionary learning framework leads to
           state-of-the-art results for many image . . .
           . . . and video processing tasks.
           Online learning techniques are well-suited for this
           problem and allows training sets with millions of
           patches.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   40/43
References I
   M. Aharon, M. Elad, and A. M. Bruckstein. The K-SVD: An algorithm for designing
     of overcomplete dictionaries for sparse representations. IEEE Transactions on
     Signal Processing, 54(11):4311–4322, November 2006.
   L. Bottou and O. Bousquet. The trade-offs of large scale learning. In J.C. Platt,
      D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information
      Processing Systems, volume 20, pages 161–168. MIT Press, Cambridge, MA, 2008.
   M. Elad and M. Aharon. Image denoising via sparse and redundant representations
     over learned dictionaries. IEEE Transactions on Image Processing, 54(12):
     3736–3745, December 2006.
   K. Engan, S. O. Aase, and J. H. Husoy. Frame based signal compression using
      method of optimal directions (MOD). In Proceedings of the 1999 IEEE
      International Symposium on Circuits Systems, volume 4, 1999.
   A. Haar. Zur theorie der orthogonalen funktionensysteme. Mathematische Annalen,
      69:331–371, 1910.
   H. Lee, A. Battle, R. Raina, and A. Y. Ng. Efficient sparse coding algorithms. In
      B. Sch¨lkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information
            o
      Processing Systems, volume 19, pages 801–808. MIT Press, Cambridge, MA, 2007.


Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   41/43
References II
   M. S. Lewicki and T. J. Sejnowski. Learning overcomplete representations. Neural
     Computation, 12(2):337–365, 2000.
   J. Mairal, M. Elad, and G. Sapiro. Sparse representation for color image restoration.
      IEEE Transactions on Image Processing, 17(1):53–69, January 2008a.
   J. Mairal, G. Sapiro, and M. Elad. Learning multiscale sparse representations for
      image and video restoration. SIAM Multiscale Modelling and Simulation, 7(1):
      214–241, April 2008b.
   J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online dictionary learning for sparse
      coding. In Proceedings of the International Conference on Machine Learning
      (ICML), 2009a.
   J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online learning for matrix factorization
      and sparse coding. ArXiv:0908.0050v1, 2009b. submitted.
   J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman. Non-local sparse models
      for image restoration. In Proceedings of the IEEE International Conference on
      Computer Vision (ICCV), 2009c.
   S. Mallat. A Wavelet Tour of Signal Processing, Second Edition. Academic Press,
      New York, September 1999.



Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   42/43
References III
   B. A. Olshausen and D. J. Field. Sparse coding with an overcomplete basis set: A
      strategy employed by V1? Vision Research, 37:3311–3325, 1997.
   M. Protter and M. Elad. Image sequence denoising via sparse and redundant
     representations. IEEE Transactions on Image Processing, 18(1):27–36, 2009.
   S. Roth and M. J. Black. Fields of experts: A framework for learning image priors. In
      Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
      (CVPR), 2005.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   43/43

Contenu connexe

Tendances

Sparsity and Compressed Sensing
Sparsity and Compressed SensingSparsity and Compressed Sensing
Sparsity and Compressed SensingGabriel Peyré
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)Jia-Bin Huang
 
Object Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based ModelsObject Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based Modelszukun
 
Signal Processing Course : Compressed Sensing
Signal Processing Course : Compressed SensingSignal Processing Course : Compressed Sensing
Signal Processing Course : Compressed SensingGabriel Peyré
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Object Recognition with Deformable Models
Object Recognition with Deformable ModelsObject Recognition with Deformable Models
Object Recognition with Deformable Modelszukun
 
Digital image processing - Image Enhancement (MATERIAL)
Digital image processing  - Image Enhancement (MATERIAL)Digital image processing  - Image Enhancement (MATERIAL)
Digital image processing - Image Enhancement (MATERIAL)Mathankumar S
 
Compressed Sensing and Tomography
Compressed Sensing and TomographyCompressed Sensing and Tomography
Compressed Sensing and TomographyGabriel Peyré
 
Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...zukun
 
Image pre processing-restoration
Image pre processing-restorationImage pre processing-restoration
Image pre processing-restorationAshish Kumar
 
Introduction to digital image processing
Introduction to digital image processingIntroduction to digital image processing
Introduction to digital image processingHossain Md Shakhawat
 
Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...IJECEIAES
 
Digital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainDigital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainMalik obeisat
 
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUERP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUEprj_publication
 

Tendances (18)

Sparsity and Compressed Sensing
Sparsity and Compressed SensingSparsity and Compressed Sensing
Sparsity and Compressed Sensing
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
 
Object Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based ModelsObject Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based Models
 
Signal Processing Course : Compressed Sensing
Signal Processing Course : Compressed SensingSignal Processing Course : Compressed Sensing
Signal Processing Course : Compressed Sensing
 
It 603
It 603It 603
It 603
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Object Recognition with Deformable Models
Object Recognition with Deformable ModelsObject Recognition with Deformable Models
Object Recognition with Deformable Models
 
Digital image processing - Image Enhancement (MATERIAL)
Digital image processing  - Image Enhancement (MATERIAL)Digital image processing  - Image Enhancement (MATERIAL)
Digital image processing - Image Enhancement (MATERIAL)
 
ICPR 2012
ICPR 2012ICPR 2012
ICPR 2012
 
Compressed Sensing and Tomography
Compressed Sensing and TomographyCompressed Sensing and Tomography
Compressed Sensing and Tomography
 
Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...
 
Image pre processing-restoration
Image pre processing-restorationImage pre processing-restoration
Image pre processing-restoration
 
In2414961500
In2414961500In2414961500
In2414961500
 
Introduction to digital image processing
Introduction to digital image processingIntroduction to digital image processing
Introduction to digital image processing
 
Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...
 
Digital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainDigital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domain
 
Gg2411291135
Gg2411291135Gg2411291135
Gg2411291135
 
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUERP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
 

Similaire à Tuto part2

Tuto cvpr part4
Tuto cvpr part4Tuto cvpr part4
Tuto cvpr part4Bo Li
 
01 graphical models
01 graphical models01 graphical models
01 graphical modelszukun
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slidesCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slideszukun
 
Image Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation ModelImage Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation ModelIJERA Editor
 
Digital Image Processing - Image Restoration
Digital Image Processing - Image RestorationDigital Image Processing - Image Restoration
Digital Image Processing - Image RestorationMathankumar S
 
Gi2429352937
Gi2429352937Gi2429352937
Gi2429352937IJMER
 
Habilitation à diriger des recherches
Habilitation à diriger des recherchesHabilitation à diriger des recherches
Habilitation à diriger des recherchesPierre Pudlo
 
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error normRobust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error normTuan Q. Pham
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Titlebutest
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Titlebutest
 
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...Md. Shohel Rana
 
Talk icml
Talk icmlTalk icml
Talk icmlBo Li
 
Macrocanonical models for texture synthesis
Macrocanonical models for texture synthesisMacrocanonical models for texture synthesis
Macrocanonical models for texture synthesisValentin De Bortoli
 
Ip lectures 1 and 2
Ip lectures 1 and 2 Ip lectures 1 and 2
Ip lectures 1 and 2 samarthgec
 
Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...Frederic Perez
 

Similaire à Tuto part2 (19)

Tuto cvpr part4
Tuto cvpr part4Tuto cvpr part4
Tuto cvpr part4
 
01 graphical models
01 graphical models01 graphical models
01 graphical models
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slidesCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
 
BMC 2012
BMC 2012BMC 2012
BMC 2012
 
Image Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation ModelImage Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation Model
 
Digital Image Processing - Image Restoration
Digital Image Processing - Image RestorationDigital Image Processing - Image Restoration
Digital Image Processing - Image Restoration
 
Gi2429352937
Gi2429352937Gi2429352937
Gi2429352937
 
Habilitation à diriger des recherches
Habilitation à diriger des recherchesHabilitation à diriger des recherches
Habilitation à diriger des recherches
 
Lec-3 DIP.pptx
Lec-3 DIP.pptxLec-3 DIP.pptx
Lec-3 DIP.pptx
 
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error normRobust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Title
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Title
 
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
 
Talk icml
Talk icmlTalk icml
Talk icml
 
Macrocanonical models for texture synthesis
Macrocanonical models for texture synthesisMacrocanonical models for texture synthesis
Macrocanonical models for texture synthesis
 
Lecture 11
Lecture 11Lecture 11
Lecture 11
 
Ip lectures 1 and 2
Ip lectures 1 and 2 Ip lectures 1 and 2
Ip lectures 1 and 2
 
Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...
 
DIP-Questions.pdf
DIP-Questions.pdfDIP-Questions.pdf
DIP-Questions.pdf
 

Dernier

Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesShubhangi Sonawane
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 

Dernier (20)

Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 

Tuto part2

  • 1. Sparse Coding and Dictionary Learning for Image Analysis Part II: Dictionary Learning for signal reconstruction Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro ICCV’09 tutorial, Kyoto, 28th September 2009 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 1/43
  • 2. What this part is about The learning of compact representations of images adapted to restoration tasks. A fast online algorithm for learning dictionaries and factorizing matrices in general. Various formulations for image and video processing. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 2/43
  • 3. The Image Denoising Problem y = xorig + w measurements original image noise Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 3/43
  • 4. Sparse representations for image restoration y = xorig + w measurements original image noise Energy minimization problem - MAP estimation E (x) = ||y − x||2 2 + Pr (x) relation to measurements prior Some classical priors Smoothness λ||Lx||2 2 Total variation λ|| x||2 1 Wavelet sparsity λ||Wx||1 ... Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 4/43
  • 5. Sparse representations for image restoration Sparsity and redundancy Pr (x) = λ||α||0 for x ≈ Dα       α[1] α[2] x =  d1 d2 · · · dp   .  .   .  α[p] x∈Rm D∈Rm×p α∈Rp ,sparse Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 5/43
  • 6. Sparse representations for image restoration Designed dictionaries [Haar, 1910], [Zweig, Morlet, Grossman ∼70s], [Meyer, Mallat, Daubechies, Coifman, Donoho, Candes ∼80s-today]. . . (see [Mallat, 1999]) Wavelets, Curvelets, Wedgelets, Bandlets, . . . lets Learned dictionaries of patches [Olshausen and Field, 1997], [Engan et al., 1999], [Lewicki and Sejnowski, 2000], [Aharon et al., 2006] , [Roth and Black, 2005], [Lee et al., 2007] 1 min ||xi − Dαi ||2 + λψ(αi ) 2 αi ,D∈C 2 i sparsity reconstruction ψ(α) = ||α||0 (“ 0 pseudo-norm”) ψ(α) = ||α||1 ( 1 norm) Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 6/43
  • 7. Sparse representations for image restoration Solving the denoising problem [Elad and Aharon, 2006] Extract all overlapping 8 × 8 patches yi . Solve a matrix factorization problem: n 1 min ||yi − Dαi ||2 + λψ(αi ), 2 αi ,D∈C 2 i=1 sparsity reconstruction with n > 100, 000 Average the reconstruction of each patch. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 7/43
  • 8. Sparse representations for image restoration K-SVD: [Elad and Aharon, 2006] Figure: Dictionary trained on a noisy version of the image boat. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 8/43
  • 9. Sparse representations for image restoration Inpainting, Demosaicking 1 min ||β ⊗ (yi − Dαi )||2 + λi ψ(αi ) D∈C,α 2 i 2 i RAW Image Processing (see our poster) White balance. Denoising Black substraction. Conversion to sRGB. Demosaicking Gamma correction. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 9/43
  • 10. Sparse representations for image restoration [Mairal, Bach, Ponce, Sapiro, and Zisserman, 2009c] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 10/43
  • 11. Sparse representations for image restoration [Mairal, Sapiro, and Elad, 2008b] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 11/43
  • 12. Sparse representations for image restoration Inpainting, [Mairal, Elad, and Sapiro, 2008a] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 12/43
  • 13. Sparse representations for image restoration Inpainting, [Mairal, Elad, and Sapiro, 2008a] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 13/43
  • 14. Sparse representations for video restoration Key ideas for video processing [Protter and Elad, 2009] Using a 3D dictionary. Processing of many frames at the same time. Dictionary propagation. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 14/43
  • 15. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 15/43
  • 16. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 16/43
  • 17. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 17/43
  • 18. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 18/43
  • 19. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 19/43
  • 20. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 20/43
  • 21. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 21/43
  • 22. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 22/43
  • 23. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 23/43
  • 24. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 24/43
  • 25. Optimization for Dictionary Learning n 1 min ||xi − Dαi ||2 + λ||αi ||1 2 α∈R p×n 2 D∈C i=1 C = {D ∈ Rm×p s.t. ∀j = 1, . . . , p, ||dj ||2 ≤ 1}. Classical optimization alternates between D and α. Good results, but very slow! Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 25/43
  • 26. Optimization for Dictionary Learning [Mairal, Bach, Ponce, and Sapiro, 2009a] Classical formulation of dictionary learning n 1 min fn (D) = min l(xi , D), D∈C D∈C n i=1 where 1 l(x, D) = minp ||x − Dα||2 + λ||α||1 . 2 α∈R 2 Which formulation are we interested in? n 1 min f (D) = Ex [l(x, D)] ≈ lim l(xi , D) D∈C n→+∞ n i=1 [Bottou and Bousquet, 2008]: Online learning can handle potentially infinite or dynamic datasets, be dramatically faster than batch algorithms. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 26/43
  • 27. Optimization for Dictionary Learning Require: D0 ∈ Rm×p (initial dictionary); λ ∈ R 1: A0 = 0, B0 = 0. 2: for t=1,. . . ,T do 3: Draw xt 4: Sparse Coding 1 αt ← arg min ||xt − Dt−1 α||2 + λ||α||1 , 2 α∈Rp 2 5: Aggregate sufficient statistics At ← At−1 + αt αT , Bt ← Bt−1 + xt αT t t 6: Dictionary Update (block-coordinate descent) t 1 1 Dt ← arg min ||xi − Dαi ||2 + λ||αi ||1 . 2 D∈C t 2 i=1 7: end for Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 27/43
  • 28. Optimization for Dictionary Learning Which guarantees do we have? Under a few reasonable assumptions, ˆ we build a surrogate function ft of the expected cost f verifying ˆ lim ft (Dt ) − f (Dt ) = 0, t→+∞ Dt is asymptotically close to a stationary point. Extensions (all implemented in SPAMS) non-negative matrix decompositions. sparse PCA (sparse dictionaries). fused-lasso regularizations (piecewise constant dictionaries) Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 28/43
  • 29. Optimization for Dictionary Learning Experimental results, batch vs online m = 8 × 8, p = 256 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 29/43
  • 30. Optimization for Dictionary Learning Experimental results, batch vs online m = 12 × 12 × 3, p = 512 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 30/43
  • 31. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 31/43
  • 32. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 32/43
  • 33. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 33/43
  • 34. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 34/43
  • 35. Extension to NMF and sparse PCA [Mairal, Bach, Ponce, and Sapiro, 2009b] NMF extension n 1 min ||xi − Dαi ||2 s.t. αi ≥ 0, 2 D ≥ 0. α∈Rp×n 2 D∈C i=1 SPCA extension n 1 min ||xi − Dαi ||2 + λ||α1 ||1 2 α∈Rp×n 2 D∈C i=1 C = {D ∈ Rm×p s.t. ∀j ||dj ||2 + γ||dj ||1 ≤ 1}. 2 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 35/43
  • 36. Extension to NMF and sparse PCA Faces: Extended Yale Database B (a) PCA (b) NNMF (c) DL Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 36/43
  • 37. Extension to NMF and sparse PCA Faces: Extended Yale Database B (d) SPCA, τ = 70% (e) SPCA, τ = 30% (f) SPCA, τ = 10% Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 37/43
  • 38. Extension to NMF and sparse PCA Natural Patches (a) PCA (b) NNMF (c) DL Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 38/43
  • 39. Extension to NMF and sparse PCA Natural Patches (d) SPCA, τ = 70% (e) SPCA, τ = 30% (f) SPCA, τ = 10% Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 39/43
  • 40. Summary of this part The dictionary learning framework leads to state-of-the-art results for many image . . . . . . and video processing tasks. Online learning techniques are well-suited for this problem and allows training sets with millions of patches. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 40/43
  • 41. References I M. Aharon, M. Elad, and A. M. Bruckstein. The K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representations. IEEE Transactions on Signal Processing, 54(11):4311–4322, November 2006. L. Bottou and O. Bousquet. The trade-offs of large scale learning. In J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems, volume 20, pages 161–168. MIT Press, Cambridge, MA, 2008. M. Elad and M. Aharon. Image denoising via sparse and redundant representations over learned dictionaries. IEEE Transactions on Image Processing, 54(12): 3736–3745, December 2006. K. Engan, S. O. Aase, and J. H. Husoy. Frame based signal compression using method of optimal directions (MOD). In Proceedings of the 1999 IEEE International Symposium on Circuits Systems, volume 4, 1999. A. Haar. Zur theorie der orthogonalen funktionensysteme. Mathematische Annalen, 69:331–371, 1910. H. Lee, A. Battle, R. Raina, and A. Y. Ng. Efficient sparse coding algorithms. In B. Sch¨lkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information o Processing Systems, volume 19, pages 801–808. MIT Press, Cambridge, MA, 2007. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 41/43
  • 42. References II M. S. Lewicki and T. J. Sejnowski. Learning overcomplete representations. Neural Computation, 12(2):337–365, 2000. J. Mairal, M. Elad, and G. Sapiro. Sparse representation for color image restoration. IEEE Transactions on Image Processing, 17(1):53–69, January 2008a. J. Mairal, G. Sapiro, and M. Elad. Learning multiscale sparse representations for image and video restoration. SIAM Multiscale Modelling and Simulation, 7(1): 214–241, April 2008b. J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online dictionary learning for sparse coding. In Proceedings of the International Conference on Machine Learning (ICML), 2009a. J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online learning for matrix factorization and sparse coding. ArXiv:0908.0050v1, 2009b. submitted. J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman. Non-local sparse models for image restoration. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009c. S. Mallat. A Wavelet Tour of Signal Processing, Second Edition. Academic Press, New York, September 1999. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 42/43
  • 43. References III B. A. Olshausen and D. J. Field. Sparse coding with an overcomplete basis set: A strategy employed by V1? Vision Research, 37:3311–3325, 1997. M. Protter and M. Elad. Image sequence denoising via sparse and redundant representations. IEEE Transactions on Image Processing, 18(1):27–36, 2009. S. Roth and M. J. Black. Fields of experts: A framework for learning image priors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2005. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 43/43