SlideShare a Scribd company logo
1 of 34
Sparse Coding  for Image and Video Understanding Jean Ponce http://www.di.ens.fr/willow/ Willow team, LIENS, UMR 8548 Ecolenormalesupérieure, Paris Joint work with JulienMairal, Francis Bach,  Guillermo Sapiro and Andrew Zisserman
What this is all about..                                                                      (Courtesy Ivan Laptev) Object class recognition 3D scene reconstruction Face recognition Action recognition (Furukawa & Ponce’07) (Sivic & Zisserman’03) (Laptev & Perez’07) Drinking
What this is all about..                                                                      (Courtesy Ivan Laptev) Object class recognition 3D scene reconstruction Face recognition Action recognition (Sivic & Zisserman’03) (Laptev & Perez’07) Drinking
Outline What this is all about A quick glance at Willow Sparse linear models Learning to classify image features Learning to detect edges On-line sparse matrix factorization Learning to restorean image
Willow tenet: ,[object Object]
 Representational issues must be addressed.Scientific challenges: ,[object Object]
Category-level object and scene recognition
Human activity capture and classification
Machine learningApplications: ,[object Object]
Quantitative image analysis in archaeology, 	anthropology, and cultural heritage preservation ,[object Object]
Others in an opportunistic manner,[object Object]
 Y. Boureau (INRIA)
 F. Couzinie-Devy (ENSC)
 O. Duchenne (ENS)
 L. Février (ENS)
 R. Jenatton (DGA)
 A. Joulin (Polytechnique)
 J. Mairal (INRIA)
 M. Sturzel (EADS)
 O. Whyte (ANR)Invited professors: ,[object Object]
 A. Efros (CMU/INRIA) Faculty: ,[object Object]
 J.-Y. Audibert (ENPC)
 F. Bach (INRIA)
 I. Laptev (INRIA)
 J. Ponce (ENS)
 J. Sivic (INRIA)
 A. Zisserman (Oxford/ENS - EADS)Post-docs: ,[object Object]
 J. van Gemert (DGA)
 Kong H. (ANR)
 N. Cherniavsky (MSR/INRIA)
 T. Cour (INRIA)
 G. Obozinski (ANR),[object Object]
Finding human actions in videos (O. Duchenne, I. Laptev, J. Sivic, F. Bach, J. Ponce, ICCV’09)
Sparse linear models Dictionary:  D=[d1,...,dp]2Rm x p Signal: x2Rm D may be overcomplete, i.e. p> m 	     x ≈ ®1d1 + ®2d2 + ... + ®pdp

More Related Content

What's hot

CG OpenGL line & area-course 3
CG OpenGL line & area-course 3CG OpenGL line & area-course 3
CG OpenGL line & area-course 3fungfung Chen
 
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Hirokatsu Kataoka
 
Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...
Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...
Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...Debdoot Sheet
 
Cross-domain complementary learning with synthetic data for multi-person part...
Cross-domain complementary learning with synthetic data for multi-person part...Cross-domain complementary learning with synthetic data for multi-person part...
Cross-domain complementary learning with synthetic data for multi-person part...哲东 郑
 
Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...
Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...
Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...IDES Editor
 
Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...
Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...
Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...Debdoot Sheet
 
Real Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsReal Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsWassim Filali
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Large scale object recognition (AMMAI presentation)
Large scale object recognition (AMMAI presentation)Large scale object recognition (AMMAI presentation)
Large scale object recognition (AMMAI presentation)Po-Jen Lai
 
Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game
Detection Tracking and Recognition of Human Poses for a Real Time Spatial GameDetection Tracking and Recognition of Human Poses for a Real Time Spatial Game
Detection Tracking and Recognition of Human Poses for a Real Time Spatial GameWolfgang Hürst
 
Ultrasonic Histology
Ultrasonic HistologyUltrasonic Histology
Ultrasonic HistologyDebdoot Sheet
 

What's hot (12)

CG OpenGL line & area-course 3
CG OpenGL line & area-course 3CG OpenGL line & area-course 3
CG OpenGL line & area-course 3
 
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
 
Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...
Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...
Detection of Retinal Vessels in Fundus Images through Transfer Learning of Ti...
 
Cross-domain complementary learning with synthetic data for multi-person part...
Cross-domain complementary learning with synthetic data for multi-person part...Cross-domain complementary learning with synthetic data for multi-person part...
Cross-domain complementary learning with synthetic data for multi-person part...
 
Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...
Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...
Shift Invariant Ear Feature Extraction using Dual Tree Complex Wavelet Transf...
 
Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...
Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...
Deep Learning of Tissue Specific Speckle Representations in Optical Coherence...
 
Real Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsReal Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth Sensors
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Large scale object recognition (AMMAI presentation)
Large scale object recognition (AMMAI presentation)Large scale object recognition (AMMAI presentation)
Large scale object recognition (AMMAI presentation)
 
Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game
Detection Tracking and Recognition of Human Poses for a Real Time Spatial GameDetection Tracking and Recognition of Human Poses for a Real Time Spatial Game
Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game
 
BMC 2012
BMC 2012BMC 2012
BMC 2012
 
Ultrasonic Histology
Ultrasonic HistologyUltrasonic Histology
Ultrasonic Histology
 

Similar to No Slide Title

Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Frank Nielsen
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest Points
CVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest PointsCVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest Points
CVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest Pointszukun
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfhemangppatel
 
Particle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer VisionParticle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer Visionzukun
 
Dictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix FactorizationDictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix FactorizationArthur Mensch
 
Weeks 1 Introductions_V1_1.ppt
Weeks 1 Introductions_V1_1.pptWeeks 1 Introductions_V1_1.ppt
Weeks 1 Introductions_V1_1.pptssusera0a371
 
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...Tomoyuki Suzuki
 
Workshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael MartinWorkshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael MartinChristian Robert
 
Macrocanonical models for texture synthesis
Macrocanonical models for texture synthesisMacrocanonical models for texture synthesis
Macrocanonical models for texture synthesisValentin De Bortoli
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdfHODIT12
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1MostafaHazemMostafaa
 
Introduction
IntroductionIntroduction
Introductionbutest
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemiaemedu
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemIAEME Publication
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemiaemedu
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemiaemedu
 

Similar to No Slide Title (20)

Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest Points
CVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest PointsCVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest Points
CVPR2010: Advanced ITinCVPR in a Nutshell: part 2: Interest Points
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdf
 
Particle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer VisionParticle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer Vision
 
SIAM CSE 2017 talk
SIAM CSE 2017 talkSIAM CSE 2017 talk
SIAM CSE 2017 talk
 
Blurclassification
BlurclassificationBlurclassification
Blurclassification
 
Dictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix FactorizationDictionary Learning for Massive Matrix Factorization
Dictionary Learning for Massive Matrix Factorization
 
Weeks 1 Introductions_V1_1.ppt
Weeks 1 Introductions_V1_1.pptWeeks 1 Introductions_V1_1.ppt
Weeks 1 Introductions_V1_1.ppt
 
Lecture1.pptx
Lecture1.pptxLecture1.pptx
Lecture1.pptx
 
Machine learning
Machine learningMachine learning
Machine learning
 
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
 
Workshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael MartinWorkshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael Martin
 
Macrocanonical models for texture synthesis
Macrocanonical models for texture synthesisMacrocanonical models for texture synthesis
Macrocanonical models for texture synthesis
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdf
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
 
Introduction
IntroductionIntroduction
Introduction
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance system
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance system
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance system
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance system
 

More from butest

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEbutest
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jacksonbutest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer IIbutest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazzbutest
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.docbutest
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1butest
 
Facebook
Facebook Facebook
Facebook butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...butest
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTbutest
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docbutest
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docbutest
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.docbutest
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!butest
 

More from butest (20)

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBE
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jackson
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer II
 
PPT
PPTPPT
PPT
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.doc
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1
 
Facebook
Facebook Facebook
Facebook
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENT
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.doc
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.doc
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.doc
 
hier
hierhier
hier
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!
 

No Slide Title

  • 1. Sparse Coding for Image and Video Understanding Jean Ponce http://www.di.ens.fr/willow/ Willow team, LIENS, UMR 8548 Ecolenormalesupérieure, Paris Joint work with JulienMairal, Francis Bach, Guillermo Sapiro and Andrew Zisserman
  • 2. What this is all about.. (Courtesy Ivan Laptev) Object class recognition 3D scene reconstruction Face recognition Action recognition (Furukawa & Ponce’07) (Sivic & Zisserman’03) (Laptev & Perez’07) Drinking
  • 3. What this is all about.. (Courtesy Ivan Laptev) Object class recognition 3D scene reconstruction Face recognition Action recognition (Sivic & Zisserman’03) (Laptev & Perez’07) Drinking
  • 4. Outline What this is all about A quick glance at Willow Sparse linear models Learning to classify image features Learning to detect edges On-line sparse matrix factorization Learning to restorean image
  • 5.
  • 6.
  • 7. Category-level object and scene recognition
  • 8. Human activity capture and classification
  • 9.
  • 10.
  • 11.
  • 12. Y. Boureau (INRIA)
  • 17. A. Joulin (Polytechnique)
  • 18. J. Mairal (INRIA)
  • 19. M. Sturzel (EADS)
  • 20.
  • 21.
  • 23. F. Bach (INRIA)
  • 24. I. Laptev (INRIA)
  • 25. J. Ponce (ENS)
  • 26. J. Sivic (INRIA)
  • 27.
  • 28. J. van Gemert (DGA)
  • 29. Kong H. (ANR)
  • 30. N. Cherniavsky (MSR/INRIA)
  • 31. T. Cour (INRIA)
  • 32.
  • 33. Finding human actions in videos (O. Duchenne, I. Laptev, J. Sivic, F. Bach, J. Ponce, ICCV’09)
  • 34. Sparse linear models Dictionary: D=[d1,...,dp]2Rm x p Signal: x2Rm D may be overcomplete, i.e. p> m x ≈ ®1d1 + ®2d2 + ... + ®pdp
  • 35. Sparse linear models Dictionary: D=[d1,...,dp]2Rm x p Signal: x2Rm D is adapted to x when x admits a sparse decomposition on D, i.e., x ≈ j2J®jdjwhere |J| = |®|0is small
  • 36. Sparse linear models Dictionary: D=[d1,...,dp]2Rm x p Signal: x2Rm A priori dictionaries such as wavelets and learned dictionaries are adapted to sparse modeling of audio signals and natural images (see, e.g., [Donoho, Bruckstein, Elad, 2009]).
  • 37. Sparse coding and dictionary learning: A hierarchy of problems min®| x – D® |22 min®| x – D® |22 + ¸ |®|0 min®| x – D® |22 + ¸Ã(®) minDєC,®1,..., ®n1≤i≤n [ 1/2 | xi – D®i |22 + ¸Ã(®i) ] minDєC,®1,..., ®n1≤i≤n [ f (xi, D, ®i) + ¸Ã(®i) ] minDєC,®1,..., ®n1≤i≤n [ f (xi, D, ®i) + ¸1≤k≤q Ã(dk) ] Least squares Sparse coding Dictionary learning Learning for a task Learning structures
  • 38. Discriminative dictionaries for local image analysis (Mairal, Bach, Ponce, Sapiro, Zisserman, CVPR’08) *(x,D) = Argmin | x - D |22 s.t. ||0 ≤ L R*(x,D) = | x – D*|22 Reconstruction (MOD: Engan, Aase, Husoy’99; K-SVD: Aharon, Elad, Bruckstein’06): min l R*(xl,D) Discrimination: min i,l Ci [R*(xl,D1),…,R*(xl,Dn)] +  R*(xl,Di) (Both MOD and K-SVD version with truncated Newton iterations.) Orthogonal matching pursuit (Mallat & Zhang’93, Tropp’04) D D1,…,Dn
  • 40. Pixel-level classification results Qualitative results, Graz 02 data Quantitative results Comparaison with Pantofaru et al. (2006) and Tuytelaars & Schmid (2007).
  • 41. L1 local sparse image representations (Mairal, Leordeanu, Bach, Hebert, Ponce, ECCV’08) *(x,D) = Argmin | x - D |22s.t. ||1 ≤ L R*(x,D) = | x – D*|22 Reconstruction (Lee, Battle, Rajat, Ng’07): min l R*(xl,D) Discrimination: min i,lCi [R*(xl,D1),…,R*(xl,Dn)] +  R*(xl,Di) (Partial dictionary update with Newtown iterations on the dual problem; partial fast sparse coding with projected gradient descent.) Lasso: Convex optimization (LARS: Efron et al.’04) D D1,…,Dn
  • 42. Edge detection results Quantitative results on the Berkeley segmentation dataset and benchmark (Martin et al., ICCV’01)
  • 43. Pascal 07 data L’07 Us + L’07 Comparaison with Leordeanu et al. (2007) on Pascal’07 benchmark. Mean error rate reduction: 33%. Input edges Bike edges Bottle edges People edges
  • 44.
  • 45. Online sparse matrix factorization (Mairal, Bach, Ponce, Sapiro, ICML’09) Problem: min DєC,®1,..., ®n1≤i≤n [ 1/2 | x – D®i |22 + ¸ |®i|1 ] min DєC, A1≤i≤n [ 1/2 | X – DA |F2 + ¸ |A|1 ] Algorithm: Iteratively draw one random training sample xt and minimize the quadratic surrogate function: gt ( D ) = 1/t 1≤i≤t[ 1/2 | x – D®i |22 + ¸ |®i|1 ] (Lars/Lasso for sparse coding, block-coordinate descent with warm restarts for dictionary updates, mini-batch extensions, etc.)
  • 46.
  • 47. Non negative sparse coding (Hoyer’02)
  • 48. Sparse principal component analysis (Jolliffe et al.’03; Zou et al.’06; Zass& Shashua’07; d’Aspremont et al.’08; Witten et al.’09)
  • 49.
  • 50. B: 12£16£3 color patches, 512 atoms.
  • 51.
  • 52. Batch version on different subsets of training data.Online vsbatch Online vsstochastic gradient descent
  • 53.
  • 54. E: 2414 192£168 images from extended Yale B.
  • 55.
  • 56. Hoyer’s Matlab implementation of NNSC (Hoyer’02).
  • 57. Our C++/Matlab implementation of SPCA (elastic net on D).SPCA vsNNMF SPCA vsNNSC
  • 58. Faces
  • 59. Inpainting a 12MP image with a dictionary learned from 7x106 patches (Mairal et al., 2009)
  • 60. State of the art in image denoising Non-local means filtering (Buades et al.’05) Dictionary learning for denoising (Elad & Aharon’06; Mairal, Elad & Sapiro’08) min DєC,®1,..., ®n1≤i≤n [ 1/2 | yi – D®i |22 + ¸ |®i|1 ] x = 1/n 1≤i≤n RiD®i
  • 61. State of the art in image denoising BM3D (Dabov et al.’07) Non-local means filtering (Buades et al.’05) Dictionary learning for denoising (Elad & Aharon’06; Mairal, Elad & Sapiro’08) min DєC,®1,..., ®n1≤i≤n [ 1/2 | yi – D®i |22 + ¸ |®i|1 ] x = 1/n 1≤i≤n RiD®i
  • 62. Non-local SparseModels for Image Restoration (Mairal, Bach, Ponce, Sapiro, Zisserman, ICCV’09) Sparsityvs Joint sparsity min  [1/2 | yj – D®ij |F2] + ¸ |Ai|p,q i j2Si D2 C A1,...,An |A|p,q= 1≤i≤k |®i|qp (p,q) = (1,2) or (0,1)
  • 63.
  • 64. PSNR comparison between our method (LSSC) and Portilla et al.’03 [23]; Roth & Black’05 [25]; Elad& Aharon’06 [12]; and Dabov et al.’07 [8].
  • 65. Demosaicking experiments LSSC LSC Bayer pattern ……………………………………………...…………… PSNR comparison between our method (LSSC) and Gunturk et al.’02 [AP]; Zhang & Wu’05 [DL]; and Paliy et al.’07 [LPA] on the Kodak PhotoCD data.
  • 66. Real noise (Canon Powershot G9, 1600 ISO) Raw camera jpeg output Adobe Photoshop DxO Optics Pro LSSC
  • 67.