SlideShare une entreprise Scribd logo
1  sur  25
Télécharger pour lire hors ligne
Social-sparsity brain decoders:
faster spatial sparsity
G. Varoquaux, M. Kowalski, B. Thirion
Brain decoding with linear models
Design
matrix
× Coefficients =
Coefficients are
brain maps
Target
G Varoquaux 2
Brain decoding with linear models
Design
matrix
× Coefficients =
Coefficients are
brain maps
Minimize the error:
l(y − Xw)
Target
G Varoquaux 2
Brain decoder maps and prediction accuracy
Face vs house visual recognition [Haxby... 2001]
SVM
error: 26%
G Varoquaux 3
Brain decoder maps and prediction accuracy
Face vs house visual recognition [Haxby... 2001]
Ridge
error: 15%
G Varoquaux 3
Brain decoder maps and prediction accuracy
Face vs house visual recognition [Haxby... 2001]
Sparse model
error: 19%
Which decoder predicts best?
How to get good decoder maps?
G Varoquaux 3
Sparse models
Ill-posed inverse problem
minw
l(y − Xw) + λ w 1
A priori:
A small fraction of the voxels are predictive
Sparse models to select relevant regions?
[Yamashita... 2008, Carroll... 2009]
G Varoquaux 4
Sparse models
Ill-posed inverse problem ⇒ regularization
minw
l(y − Xw) + λ w 1
sparsity inducing norm 1 norm,
or elastic net
Elastic Net
G Varoquaux 4
Sparse models
Ill-posed inverse problem ⇒ regularization
minw
l(y − Xw) + λ w 1
sparsity inducing norm 1 norm,
or elastic net
Elastic Net Can only select a
subset of relevant
voxels.
[Varoquaux... 2012]
G Varoquaux 4
Spatial sparse penalties
Spatial regularization, total variation
minw
l(y − Xw) + λ
i
( w)i 2
12 norm: 1 norm of the
gradient magnitudePenalize the image gradient:
Shrinks jointly x , y , and z
Elastic Net TV + 1
[Gramfort... 2013]
G Varoquaux 5
Spatial sparse penalties
Spatial regularization, total variation
minw
l(y − Xw) + λ
i
( w)i 2
More generally: analysis sparsity [Eickenberg... 2015]
Sparse in a transformation of the weights:
minw
l(y − Xw) + λ K w 21
For instance: overlapping blocks
(Kw)1 ↔ G1
(Kw)2 ↔ G2
... G2
G1
G Varoquaux 6
Good convergence of solvers is important
Spatial regularization, total variation
minw
l(y − Xw) + λ
i
( w)i 2
x=17
L R
z=-17
Stopping: ∆E < 10−1
x=17
L R
z=-17
Stopping: ∆E < 10−5
[Dohmatob... 2014]
G Varoquaux 7
Sparse solvers
Iterative Shrinkage-Thresholding Algorithm
minw
l(y − Xw) + λ
i
Kw 1
Settings: min l + p; l smooth, p non-smooth
Minimize successively: (quadratic approx of l) + p
1. Gradient descent on smooth term
FISTA loop
2. Proximal operator
proxpx = miny
1
2 x − y
2
2
+ p(y)
G Varoquaux 8
Sparse solvers
Iterative Shrinkage-Thresholding Algorithm
minw
l(y − Xw) + λ
i
w 1
Settings: min l + p; l smooth, p non-smooth
Minimize successively: (quadratic approx of l) + p
1. Gradient descent on smooth term
FISTA loop
2. Proximal operator
proxpx = miny
1
2 x − y
2
2
+ p(y)
1 penalty: “soft thresholding”:
prox 1
: ∀i wi ← wi

1 −
λ
|wi|


+
G Varoquaux 8
Sparse solvers: proximals and co.
1 penalty: “soft thresholding”:
prox 1
: ∀i wi ← wi

1 −
λ
|wi|


+
Group sparsity:
prox 21
on G:
∀i ∈ G wi ← wi

1−
λ
j∈G w2
j


+
G2
G1
G Varoquaux 9
Sparse solvers: proximals and co.
1 penalty: “soft thresholding”:
prox 1
: ∀i wi ← wi

1 −
λ
|wi|


+
Group sparsity:
prox 21
on G:
∀i ∈ G wi ← wi

1−
λ
j∈G w2
j


+
G2
G1
Overlapping groups, TV:
Inner loop iterative solver
G2
G1
G Varoquaux 9
Sparse solvers: proximals and co.
Group sparsity:
prox 21
on G:
∀i ∈ G wi ← wi

1−
λ
j∈G w2
j


+
G2
G1
Overlapping groups, TV:
Inner loop iterative solver
G2
G1
Social sparsity shrinkage:
∀i wi ← wi

1−
λ
j∈N(i) w2
j


+
N1
x1
N2
x2
G Varoquaux 9
Social sparsity: “soft-threshold” neighboring voxels
Sparsity must be combined with spatial structure
Convex solvers for non-local sparsity are expensive
Not separable
Social sparsity:
forget the coupling between
soft thresholding
[Kowalski... 2013]
N1
x1
N2
x2
G Varoquaux 10
Empirical evaluation for decoding
­25% ­10%  0% +10%
TV­l1
graph
­net
social
sparsity
SVM
 + anova
Prediction accuracy 1
20x 1
5x 1
2x 1x 2x 5x
Run time
bottle/scramble
bottle/shoe
cat/bottle
cat/chair
cat/face
cat/house
cat/scramble
cat/shoe
chair/scramble
chair/shoe
face/house
face/scissors
scissors/scramble
shoe/scramble
OASIS VBM
 male vs femaleG Varoquaux 11
Empirical evaluation for decoding
­25% ­10%  0% +10%
TV­l1
graph
­net
social
sparsity
SVM
 + anova
Prediction accuracy 1
20x 1
5x 1
2x 1x 2x 5x
Run time
bottle/scramble
bottle/shoe
cat/bottle
cat/chair
cat/face
cat/house
cat/scramble
cat/shoe
chair/scramble
chair/shoe
face/house
face/scissors
scissors/scramble
shoe/scramble
OASIS VBM
 male vs femaleG Varoquaux 11
Social sparsity maps
L R
z=­16
y=­34
face vs house
TV- 1 Graph-net Social sparsity
G Varoquaux 12
Social sparsity maps
L R
z=­16
y=­34
face vs house
TV- 1 Graph-net Social sparsity
G Varoquaux 12
@GaelVaroquaux
Social-sparsity brain decoders: faster spatial sparsity
Spatial sparsity improves prediction
and denoises maps
TV- 1 “space-net” very successful, but slow
Social-sparsity: heuristic that forgoes couplings
10× faster than TV- 1 almost as accurate
3× faster than graph-net more accurate
Maps segment well regions
ni
References I
M. K. Carroll, G. A. Cecchi, I. Rish, R. Garg, and A. R. Rao.
Prediction and interpretation of distributed neural activity with
sparse models. NeuroImage, 44(1):112 – 122, 2009.
E. Dohmatob, A. Gramfort, B. Thirion, and G. Varoquaux.
Benchmarking solvers for TV-l1 least-squares and logistic
regression in brain imaging. PRNI, 2014.
M. Eickenberg, E. Dohmatob, B. Thirion, and G. Varoquaux.
Total variation meets sparsity: statistical learning with
segmenting penalties. MICCAI, 2015.
A. Gramfort, B. Thirion, and G. Varoquaux. Identifying predictive
regions from fMRI with TV-L1 prior. In PRNI, pages 17–20,
2013.
J. Haxby, I. Gobbini, M. Furey, ... Distributed and overlapping
representations of faces and objects in ventral temporal cortex.
Science, 293:2425, 2001.
References II
M. Kowalski, K. Siedenburg, and M. Dorfler. Social sparsity!
neighborhood systems enrich structured shrinkage operators.
Transactions on Signal Processing, 61:2498, 2013.
G. Varoquaux, A. Gramfort, and B. Thirion. Small-sample brain
mapping: sparse recovery on spatially correlated designs with
randomization and clustering. In ICML, page 1375, 2012.
O. Yamashita, M. aki Sato, T. Yoshioka, F. Tong, and
Y. Kamitani. Sparse estimation automatically selects voxels
relevant for the decoding of fMRI activity patterns. NeuroImage,
42(4):1414 – 1429, 2008.

Contenu connexe

Tendances

Dirty data science machine learning on non-curated data
Dirty data science machine learning on non-curated dataDirty data science machine learning on non-curated data
Dirty data science machine learning on non-curated dataGael Varoquaux
 
Similarity encoding for learning on dirty categorical variables
Similarity encoding for learning on dirty categorical variablesSimilarity encoding for learning on dirty categorical variables
Similarity encoding for learning on dirty categorical variablesGael Varoquaux
 
Machine learning and cognitive neuroimaging: new tools can answer new questions
Machine learning and cognitive neuroimaging: new tools can answer new questionsMachine learning and cognitive neuroimaging: new tools can answer new questions
Machine learning and cognitive neuroimaging: new tools can answer new questionsGael Varoquaux
 
Bayesian Deep Learning
Bayesian Deep LearningBayesian Deep Learning
Bayesian Deep LearningRayKim51
 
Scikit-learn for easy machine learning: the vision, the tool, and the project
Scikit-learn for easy machine learning: the vision, the tool, and the projectScikit-learn for easy machine learning: the vision, the tool, and the project
Scikit-learn for easy machine learning: the vision, the tool, and the projectGael Varoquaux
 
Scikit-learn and nilearn: Democratisation of machine learning for brain imaging
Scikit-learn and nilearn: Democratisation of machine learning for brain imagingScikit-learn and nilearn: Democratisation of machine learning for brain imaging
Scikit-learn and nilearn: Democratisation of machine learning for brain imagingGael Varoquaux
 
Data-driven Hypothesis Management
Data-driven Hypothesis ManagementData-driven Hypothesis Management
Data-driven Hypothesis Managementbgoncalves2
 
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...David Gleich
 
NeuralArt 電腦作畫
NeuralArt 電腦作畫NeuralArt 電腦作畫
NeuralArt 電腦作畫Mark Chang
 
Multi-scalar multiplication: state of the art and new ideas
Multi-scalar multiplication: state of the art and new ideasMulti-scalar multiplication: state of the art and new ideas
Multi-scalar multiplication: state of the art and new ideasGus Gutoski
 
Neural Art (English Version)
Neural Art (English Version)Neural Art (English Version)
Neural Art (English Version)Mark Chang
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsDavid Gleich
 
2018.01.12 AHClab SD-study paper reading
2018.01.12 AHClab SD-study paper reading2018.01.12 AHClab SD-study paper reading
2018.01.12 AHClab SD-study paper readingShinagawa Seitaro
 
Large Scale Recommendation: a view from the Trenches
Large Scale Recommendation: a view from the TrenchesLarge Scale Recommendation: a view from the Trenches
Large Scale Recommendation: a view from the TrenchesAnne-Marie Tousch
 
PAC-Bayesian Bound for Deep Learning
PAC-Bayesian Bound for Deep LearningPAC-Bayesian Bound for Deep Learning
PAC-Bayesian Bound for Deep LearningMark Chang
 
KDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solutionKDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solution志明 陳
 
How to calculate back propagation
How to calculate back propagationHow to calculate back propagation
How to calculate back propagationShinagawa Seitaro
 

Tendances (20)

Dirty data science machine learning on non-curated data
Dirty data science machine learning on non-curated dataDirty data science machine learning on non-curated data
Dirty data science machine learning on non-curated data
 
Similarity encoding for learning on dirty categorical variables
Similarity encoding for learning on dirty categorical variablesSimilarity encoding for learning on dirty categorical variables
Similarity encoding for learning on dirty categorical variables
 
Machine learning and cognitive neuroimaging: new tools can answer new questions
Machine learning and cognitive neuroimaging: new tools can answer new questionsMachine learning and cognitive neuroimaging: new tools can answer new questions
Machine learning and cognitive neuroimaging: new tools can answer new questions
 
Bayesian Deep Learning
Bayesian Deep LearningBayesian Deep Learning
Bayesian Deep Learning
 
Scikit-learn for easy machine learning: the vision, the tool, and the project
Scikit-learn for easy machine learning: the vision, the tool, and the projectScikit-learn for easy machine learning: the vision, the tool, and the project
Scikit-learn for easy machine learning: the vision, the tool, and the project
 
Scikit-learn and nilearn: Democratisation of machine learning for brain imaging
Scikit-learn and nilearn: Democratisation of machine learning for brain imagingScikit-learn and nilearn: Democratisation of machine learning for brain imaging
Scikit-learn and nilearn: Democratisation of machine learning for brain imaging
 
Data-driven Hypothesis Management
Data-driven Hypothesis ManagementData-driven Hypothesis Management
Data-driven Hypothesis Management
 
Backpropagation for Deep Learning
Backpropagation for Deep LearningBackpropagation for Deep Learning
Backpropagation for Deep Learning
 
Backpropagation for Neural Networks
Backpropagation for Neural NetworksBackpropagation for Neural Networks
Backpropagation for Neural Networks
 
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
 
NeuralArt 電腦作畫
NeuralArt 電腦作畫NeuralArt 電腦作畫
NeuralArt 電腦作畫
 
Multi-scalar multiplication: state of the art and new ideas
Multi-scalar multiplication: state of the art and new ideasMulti-scalar multiplication: state of the art and new ideas
Multi-scalar multiplication: state of the art and new ideas
 
Neural Art (English Version)
Neural Art (English Version)Neural Art (English Version)
Neural Art (English Version)
 
Big data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphsBig data matrix factorizations and Overlapping community detection in graphs
Big data matrix factorizations and Overlapping community detection in graphs
 
2018.01.12 AHClab SD-study paper reading
2018.01.12 AHClab SD-study paper reading2018.01.12 AHClab SD-study paper reading
2018.01.12 AHClab SD-study paper reading
 
DTLC-GAN
DTLC-GANDTLC-GAN
DTLC-GAN
 
Large Scale Recommendation: a view from the Trenches
Large Scale Recommendation: a view from the TrenchesLarge Scale Recommendation: a view from the Trenches
Large Scale Recommendation: a view from the Trenches
 
PAC-Bayesian Bound for Deep Learning
PAC-Bayesian Bound for Deep LearningPAC-Bayesian Bound for Deep Learning
PAC-Bayesian Bound for Deep Learning
 
KDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solutionKDD CUP 2015 - 9th solution
KDD CUP 2015 - 9th solution
 
How to calculate back propagation
How to calculate back propagationHow to calculate back propagation
How to calculate back propagation
 

Similaire à Faster spatial sparsity brain decoders using social sparsity heuristic

MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsElvis DOHMATOB
 
More on randomization semi-definite programming and derandomization
More on randomization semi-definite programming and derandomizationMore on randomization semi-definite programming and derandomization
More on randomization semi-definite programming and derandomizationAbner Chih Yi Huang
 
Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18
Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18
Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18Olga Zinkevych
 
A non-stiff numerical method for 3D interfacial flow of inviscid fluids.
A non-stiff numerical method for 3D interfacial flow of inviscid fluids.A non-stiff numerical method for 3D interfacial flow of inviscid fluids.
A non-stiff numerical method for 3D interfacial flow of inviscid fluids.Alex (Oleksiy) Varfolomiyev
 
Algorithmic Data Science = Theory + Practice
Algorithmic Data Science = Theory + PracticeAlgorithmic Data Science = Theory + Practice
Algorithmic Data Science = Theory + PracticeTwo Sigma
 
MinFill_Presentation
MinFill_PresentationMinFill_Presentation
MinFill_PresentationAnna Lasota
 
Real-Time Data Mining for Event Streams
Real-Time Data Mining for Event StreamsReal-Time Data Mining for Event Streams
Real-Time Data Mining for Event StreamsSylvain Hallé
 
Q-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeQ-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeMagdi Mohamed
 
Q-Metrics in Theory And Practice
Q-Metrics in Theory And PracticeQ-Metrics in Theory And Practice
Q-Metrics in Theory And Practiceguest3550292
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function홍배 김
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large ImagesChiamin Hsu
 
Node Unique Label Cover
Node Unique Label CoverNode Unique Label Cover
Node Unique Label Covermsramanujan
 
Fast relaxation methods for the matrix exponential
Fast relaxation methods for the matrix exponential Fast relaxation methods for the matrix exponential
Fast relaxation methods for the matrix exponential David Gleich
 

Similaire à Faster spatial sparsity brain decoders using social sparsity heuristic (20)

MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priors
 
More on randomization semi-definite programming and derandomization
More on randomization semi-definite programming and derandomizationMore on randomization semi-definite programming and derandomization
More on randomization semi-definite programming and derandomization
 
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
 
cswiercz-general-presentation
cswiercz-general-presentationcswiercz-general-presentation
cswiercz-general-presentation
 
Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18
Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18
Variational autoencoders for speech processing d.bielievtsov dataconf 21 04 18
 
A non-stiff numerical method for 3D interfacial flow of inviscid fluids.
A non-stiff numerical method for 3D interfacial flow of inviscid fluids.A non-stiff numerical method for 3D interfacial flow of inviscid fluids.
A non-stiff numerical method for 3D interfacial flow of inviscid fluids.
 
Algorithmic Data Science = Theory + Practice
Algorithmic Data Science = Theory + PracticeAlgorithmic Data Science = Theory + Practice
Algorithmic Data Science = Theory + Practice
 
MinFill_Presentation
MinFill_PresentationMinFill_Presentation
MinFill_Presentation
 
1
11
1
 
Real-Time Data Mining for Event Streams
Real-Time Data Mining for Event StreamsReal-Time Data Mining for Event Streams
Real-Time Data Mining for Event Streams
 
Q-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeQ-Metrics in Theory and Practice
Q-Metrics in Theory and Practice
 
Q-Metrics in Theory And Practice
Q-Metrics in Theory And PracticeQ-Metrics in Theory And Practice
Q-Metrics in Theory And Practice
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function
 
Triggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphsTriggering patterns of topology changes in dynamic attributed graphs
Triggering patterns of topology changes in dynamic attributed graphs
 
QMC: Transition Workshop - Density Estimation by Randomized Quasi-Monte Carlo...
QMC: Transition Workshop - Density Estimation by Randomized Quasi-Monte Carlo...QMC: Transition Workshop - Density Estimation by Randomized Quasi-Monte Carlo...
QMC: Transition Workshop - Density Estimation by Randomized Quasi-Monte Carlo...
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
 
Week6.ppt
Week6.pptWeek6.ppt
Week6.ppt
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
Node Unique Label Cover
Node Unique Label CoverNode Unique Label Cover
Node Unique Label Cover
 
Fast relaxation methods for the matrix exponential
Fast relaxation methods for the matrix exponential Fast relaxation methods for the matrix exponential
Fast relaxation methods for the matrix exponential
 

Plus de Gael Varoquaux

Evaluating machine learning models and their diagnostic value
Evaluating machine learning models and their diagnostic valueEvaluating machine learning models and their diagnostic value
Evaluating machine learning models and their diagnostic valueGael Varoquaux
 
Measuring mental health with machine learning and brain imaging
Measuring mental health with machine learning and brain imagingMeasuring mental health with machine learning and brain imaging
Measuring mental health with machine learning and brain imagingGael Varoquaux
 
Machine learning with missing values
Machine learning with missing valuesMachine learning with missing values
Machine learning with missing valuesGael Varoquaux
 
Representation learning in limited-data settings
Representation learning in limited-data settingsRepresentation learning in limited-data settings
Representation learning in limited-data settingsGael Varoquaux
 
Better neuroimaging data processing: driven by evidence, open communities, an...
Better neuroimaging data processing: driven by evidence, open communities, an...Better neuroimaging data processing: driven by evidence, open communities, an...
Better neuroimaging data processing: driven by evidence, open communities, an...Gael Varoquaux
 
Functional-connectome biomarkers to meet clinical needs?
Functional-connectome biomarkers to meet clinical needs?Functional-connectome biomarkers to meet clinical needs?
Functional-connectome biomarkers to meet clinical needs?Gael Varoquaux
 
Atlases of cognition with large-scale human brain mapping
Atlases of cognition with large-scale human brain mappingAtlases of cognition with large-scale human brain mapping
Atlases of cognition with large-scale human brain mappingGael Varoquaux
 
Towards psychoinformatics with machine learning and brain imaging
Towards psychoinformatics with machine learning and brain imagingTowards psychoinformatics with machine learning and brain imaging
Towards psychoinformatics with machine learning and brain imagingGael Varoquaux
 
Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities Gael Varoquaux
 
A tutorial on Machine Learning, with illustrations for MR imaging
A tutorial on Machine Learning, with illustrations for MR imagingA tutorial on Machine Learning, with illustrations for MR imaging
A tutorial on Machine Learning, with illustrations for MR imagingGael Varoquaux
 
Computational practices for reproducible science
Computational practices for reproducible scienceComputational practices for reproducible science
Computational practices for reproducible scienceGael Varoquaux
 
Coding for science and innovation
Coding for science and innovationCoding for science and innovation
Coding for science and innovationGael Varoquaux
 
On the code of data science
On the code of data scienceOn the code of data science
On the code of data scienceGael Varoquaux
 
Scientist meets web dev: how Python became the language of data
Scientist meets web dev: how Python became the language of dataScientist meets web dev: how Python became the language of data
Scientist meets web dev: how Python became the language of dataGael Varoquaux
 
Scikit-learn: the state of the union 2016
Scikit-learn: the state of the union 2016Scikit-learn: the state of the union 2016
Scikit-learn: the state of the union 2016Gael Varoquaux
 
Succeeding in academia despite doing good_software
Succeeding in academia despite doing good_softwareSucceeding in academia despite doing good_software
Succeeding in academia despite doing good_softwareGael Varoquaux
 
Building a cutting-edge data processing environment on a budget
Building a cutting-edge data processing environment on a budgetBuilding a cutting-edge data processing environment on a budget
Building a cutting-edge data processing environment on a budgetGael Varoquaux
 
Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...
Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...
Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...Gael Varoquaux
 
Open Source Scientific Software
Open Source Scientific SoftwareOpen Source Scientific Software
Open Source Scientific SoftwareGael Varoquaux
 
Scikit learn: apprentissage statistique en Python
Scikit learn: apprentissage statistique en PythonScikit learn: apprentissage statistique en Python
Scikit learn: apprentissage statistique en PythonGael Varoquaux
 

Plus de Gael Varoquaux (20)

Evaluating machine learning models and their diagnostic value
Evaluating machine learning models and their diagnostic valueEvaluating machine learning models and their diagnostic value
Evaluating machine learning models and their diagnostic value
 
Measuring mental health with machine learning and brain imaging
Measuring mental health with machine learning and brain imagingMeasuring mental health with machine learning and brain imaging
Measuring mental health with machine learning and brain imaging
 
Machine learning with missing values
Machine learning with missing valuesMachine learning with missing values
Machine learning with missing values
 
Representation learning in limited-data settings
Representation learning in limited-data settingsRepresentation learning in limited-data settings
Representation learning in limited-data settings
 
Better neuroimaging data processing: driven by evidence, open communities, an...
Better neuroimaging data processing: driven by evidence, open communities, an...Better neuroimaging data processing: driven by evidence, open communities, an...
Better neuroimaging data processing: driven by evidence, open communities, an...
 
Functional-connectome biomarkers to meet clinical needs?
Functional-connectome biomarkers to meet clinical needs?Functional-connectome biomarkers to meet clinical needs?
Functional-connectome biomarkers to meet clinical needs?
 
Atlases of cognition with large-scale human brain mapping
Atlases of cognition with large-scale human brain mappingAtlases of cognition with large-scale human brain mapping
Atlases of cognition with large-scale human brain mapping
 
Towards psychoinformatics with machine learning and brain imaging
Towards psychoinformatics with machine learning and brain imagingTowards psychoinformatics with machine learning and brain imaging
Towards psychoinformatics with machine learning and brain imaging
 
Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities
 
A tutorial on Machine Learning, with illustrations for MR imaging
A tutorial on Machine Learning, with illustrations for MR imagingA tutorial on Machine Learning, with illustrations for MR imaging
A tutorial on Machine Learning, with illustrations for MR imaging
 
Computational practices for reproducible science
Computational practices for reproducible scienceComputational practices for reproducible science
Computational practices for reproducible science
 
Coding for science and innovation
Coding for science and innovationCoding for science and innovation
Coding for science and innovation
 
On the code of data science
On the code of data scienceOn the code of data science
On the code of data science
 
Scientist meets web dev: how Python became the language of data
Scientist meets web dev: how Python became the language of dataScientist meets web dev: how Python became the language of data
Scientist meets web dev: how Python became the language of data
 
Scikit-learn: the state of the union 2016
Scikit-learn: the state of the union 2016Scikit-learn: the state of the union 2016
Scikit-learn: the state of the union 2016
 
Succeeding in academia despite doing good_software
Succeeding in academia despite doing good_softwareSucceeding in academia despite doing good_software
Succeeding in academia despite doing good_software
 
Building a cutting-edge data processing environment on a budget
Building a cutting-edge data processing environment on a budgetBuilding a cutting-edge data processing environment on a budget
Building a cutting-edge data processing environment on a budget
 
Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...
Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...
Scikit-learn: apprentissage statistique en Python. Créer des machines intelli...
 
Open Source Scientific Software
Open Source Scientific SoftwareOpen Source Scientific Software
Open Source Scientific Software
 
Scikit learn: apprentissage statistique en Python
Scikit learn: apprentissage statistique en PythonScikit learn: apprentissage statistique en Python
Scikit learn: apprentissage statistique en Python
 

Dernier

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Dernier (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Faster spatial sparsity brain decoders using social sparsity heuristic

  • 1. Social-sparsity brain decoders: faster spatial sparsity G. Varoquaux, M. Kowalski, B. Thirion
  • 2. Brain decoding with linear models Design matrix × Coefficients = Coefficients are brain maps Target G Varoquaux 2
  • 3. Brain decoding with linear models Design matrix × Coefficients = Coefficients are brain maps Minimize the error: l(y − Xw) Target G Varoquaux 2
  • 4. Brain decoder maps and prediction accuracy Face vs house visual recognition [Haxby... 2001] SVM error: 26% G Varoquaux 3
  • 5. Brain decoder maps and prediction accuracy Face vs house visual recognition [Haxby... 2001] Ridge error: 15% G Varoquaux 3
  • 6. Brain decoder maps and prediction accuracy Face vs house visual recognition [Haxby... 2001] Sparse model error: 19% Which decoder predicts best? How to get good decoder maps? G Varoquaux 3
  • 7. Sparse models Ill-posed inverse problem minw l(y − Xw) + λ w 1 A priori: A small fraction of the voxels are predictive Sparse models to select relevant regions? [Yamashita... 2008, Carroll... 2009] G Varoquaux 4
  • 8. Sparse models Ill-posed inverse problem ⇒ regularization minw l(y − Xw) + λ w 1 sparsity inducing norm 1 norm, or elastic net Elastic Net G Varoquaux 4
  • 9. Sparse models Ill-posed inverse problem ⇒ regularization minw l(y − Xw) + λ w 1 sparsity inducing norm 1 norm, or elastic net Elastic Net Can only select a subset of relevant voxels. [Varoquaux... 2012] G Varoquaux 4
  • 10. Spatial sparse penalties Spatial regularization, total variation minw l(y − Xw) + λ i ( w)i 2 12 norm: 1 norm of the gradient magnitudePenalize the image gradient: Shrinks jointly x , y , and z Elastic Net TV + 1 [Gramfort... 2013] G Varoquaux 5
  • 11. Spatial sparse penalties Spatial regularization, total variation minw l(y − Xw) + λ i ( w)i 2 More generally: analysis sparsity [Eickenberg... 2015] Sparse in a transformation of the weights: minw l(y − Xw) + λ K w 21 For instance: overlapping blocks (Kw)1 ↔ G1 (Kw)2 ↔ G2 ... G2 G1 G Varoquaux 6
  • 12. Good convergence of solvers is important Spatial regularization, total variation minw l(y − Xw) + λ i ( w)i 2 x=17 L R z=-17 Stopping: ∆E < 10−1 x=17 L R z=-17 Stopping: ∆E < 10−5 [Dohmatob... 2014] G Varoquaux 7
  • 13. Sparse solvers Iterative Shrinkage-Thresholding Algorithm minw l(y − Xw) + λ i Kw 1 Settings: min l + p; l smooth, p non-smooth Minimize successively: (quadratic approx of l) + p 1. Gradient descent on smooth term FISTA loop 2. Proximal operator proxpx = miny 1 2 x − y 2 2 + p(y) G Varoquaux 8
  • 14. Sparse solvers Iterative Shrinkage-Thresholding Algorithm minw l(y − Xw) + λ i w 1 Settings: min l + p; l smooth, p non-smooth Minimize successively: (quadratic approx of l) + p 1. Gradient descent on smooth term FISTA loop 2. Proximal operator proxpx = miny 1 2 x − y 2 2 + p(y) 1 penalty: “soft thresholding”: prox 1 : ∀i wi ← wi  1 − λ |wi|   + G Varoquaux 8
  • 15. Sparse solvers: proximals and co. 1 penalty: “soft thresholding”: prox 1 : ∀i wi ← wi  1 − λ |wi|   + Group sparsity: prox 21 on G: ∀i ∈ G wi ← wi  1− λ j∈G w2 j   + G2 G1 G Varoquaux 9
  • 16. Sparse solvers: proximals and co. 1 penalty: “soft thresholding”: prox 1 : ∀i wi ← wi  1 − λ |wi|   + Group sparsity: prox 21 on G: ∀i ∈ G wi ← wi  1− λ j∈G w2 j   + G2 G1 Overlapping groups, TV: Inner loop iterative solver G2 G1 G Varoquaux 9
  • 17. Sparse solvers: proximals and co. Group sparsity: prox 21 on G: ∀i ∈ G wi ← wi  1− λ j∈G w2 j   + G2 G1 Overlapping groups, TV: Inner loop iterative solver G2 G1 Social sparsity shrinkage: ∀i wi ← wi  1− λ j∈N(i) w2 j   + N1 x1 N2 x2 G Varoquaux 9
  • 18. Social sparsity: “soft-threshold” neighboring voxels Sparsity must be combined with spatial structure Convex solvers for non-local sparsity are expensive Not separable Social sparsity: forget the coupling between soft thresholding [Kowalski... 2013] N1 x1 N2 x2 G Varoquaux 10
  • 19. Empirical evaluation for decoding ­25% ­10%  0% +10% TV­l1 graph ­net social sparsity SVM  + anova Prediction accuracy 1 20x 1 5x 1 2x 1x 2x 5x Run time bottle/scramble bottle/shoe cat/bottle cat/chair cat/face cat/house cat/scramble cat/shoe chair/scramble chair/shoe face/house face/scissors scissors/scramble shoe/scramble OASIS VBM  male vs femaleG Varoquaux 11
  • 20. Empirical evaluation for decoding ­25% ­10%  0% +10% TV­l1 graph ­net social sparsity SVM  + anova Prediction accuracy 1 20x 1 5x 1 2x 1x 2x 5x Run time bottle/scramble bottle/shoe cat/bottle cat/chair cat/face cat/house cat/scramble cat/shoe chair/scramble chair/shoe face/house face/scissors scissors/scramble shoe/scramble OASIS VBM  male vs femaleG Varoquaux 11
  • 21. Social sparsity maps L R z=­16 y=­34 face vs house TV- 1 Graph-net Social sparsity G Varoquaux 12
  • 22. Social sparsity maps L R z=­16 y=­34 face vs house TV- 1 Graph-net Social sparsity G Varoquaux 12
  • 23. @GaelVaroquaux Social-sparsity brain decoders: faster spatial sparsity Spatial sparsity improves prediction and denoises maps TV- 1 “space-net” very successful, but slow Social-sparsity: heuristic that forgoes couplings 10× faster than TV- 1 almost as accurate 3× faster than graph-net more accurate Maps segment well regions ni
  • 24. References I M. K. Carroll, G. A. Cecchi, I. Rish, R. Garg, and A. R. Rao. Prediction and interpretation of distributed neural activity with sparse models. NeuroImage, 44(1):112 – 122, 2009. E. Dohmatob, A. Gramfort, B. Thirion, and G. Varoquaux. Benchmarking solvers for TV-l1 least-squares and logistic regression in brain imaging. PRNI, 2014. M. Eickenberg, E. Dohmatob, B. Thirion, and G. Varoquaux. Total variation meets sparsity: statistical learning with segmenting penalties. MICCAI, 2015. A. Gramfort, B. Thirion, and G. Varoquaux. Identifying predictive regions from fMRI with TV-L1 prior. In PRNI, pages 17–20, 2013. J. Haxby, I. Gobbini, M. Furey, ... Distributed and overlapping representations of faces and objects in ventral temporal cortex. Science, 293:2425, 2001.
  • 25. References II M. Kowalski, K. Siedenburg, and M. Dorfler. Social sparsity! neighborhood systems enrich structured shrinkage operators. Transactions on Signal Processing, 61:2498, 2013. G. Varoquaux, A. Gramfort, and B. Thirion. Small-sample brain mapping: sparse recovery on spatially correlated designs with randomization and clustering. In ICML, page 1375, 2012. O. Yamashita, M. aki Sato, T. Yoshioka, F. Tong, and Y. Kamitani. Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns. NeuroImage, 42(4):1414 – 1429, 2008.