SlideShare une entreprise Scribd logo
1  sur  38
1




MACHINE LEARNING FOR
MEDICAL IMAGING DATA
Yiou (Leo) Li
Background
2



    Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory,
    University of California San Francisco
    •   Developed unsupervised learning method for feature extraction of brain
        imaging data
    •   Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive
        modeling of brain trauma
    •   Designed batch data processing protocol to perform image registration,
        segmentation, band-pass filtering, smoothing, and linear model fitting


    Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal
    processing Laboratory, University of Maryland Baltimore County
    •   Developed the effective degrees of freedom of random process and applied it
        to the model order selection by Information Theoretic Criteria
    •   Developed a linear filtering mechanism in independent component analysis for
        feature enhancement
    •   Analyzed canonical correlation analysis for multiple datasets
Outline
3




     Independent component analysis (ICA) and its
      application to sparse feature extraction from
      multivariate dataset


     Multi-set canonical correlation analysis and its
      application to joint pattern extraction from a group of
      datasets


     Order selection of principal component analysis (PCA)
      and its application to data dimension reduction
PCA vs ICA
4




                  PCA                               ICA
           Linear projection                 Linear projection
             (Orthogonal)
       Uncorrelated components           Independent components
             (non sparse)             (sparse, “long tail” distribution)
      Typically analytical solution     Typically iterative solution
                 (SVD)                    (Iterative optimization)
ICA detects independent factors with
    long tails in multivariate dataset
5
Long tail factors are sparse features in
    data samples
6



                                          Weights of
                                           features
                  Data points (N)

                                    ICA
    Sensors (M)         X            =       A         .         S




                                                           Sparse features



                                     X= AS
ICA model
7


           x1   a11 a12 ... a1M   s1 
          x  a       a 22   a 2M   s 2 
           2    21               
           ...        ...          ... 
                                   
           x M  a M1 a M2   a MM  s M 


                x : Observed variables
                A : Mixing matrix
                s : Latent factors



                  x= As -> s =A-1x
ICA by maximum likelihood estimation
8


    Transformation of multivariate random variable: x = As
                              p(s 1, s2 , ... , sM )
    p(x 1,x 2 , ... , x M )                                 (1)
                                    det(A)
    Statistical independence condition of s:

    p(s 1, s2 , ... , sM )  i 1 p(si )
                                          M
                                                             (2)

    Log likelihood function of x with parameter A:

    log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A)
                                                      -1

                                      i
ICA Application: Sparse feature extraction from
    multivariate dataset
9
Functional MRI experiment
10
Analyze functional MRI data of resting
     state brain
11




                              Sparse features




                    ICA
Feature 1. Primary visual network
12



 +
                                         A




 -
Feature 2. “Default mode network”
13
Feature 3. Attention control network
14
Hierarchical clustering shows link
15
     between features (brain regions)
Predicative modeling of brain trauma
16

                                                 Pattern weights
                                 N

       Healthy
                                 X           =      A        .                 S
       Patients

                                                                               Sparse
                                                                           spatial features


     Subject 1

                             …
                 Subject 2
          16                                                                       Pattern 2
                                                                   Feature 1                   …
                                 Subject M                                         Feature 2

                                                           Y.-O. Li, et al., HBM, 2011
ICA Pattern classification for predictive
     modeling of brain trauma
17




      • 29 healthy + 29 trauma, 10-fold cross-validation


             Classifier                9 patterns                14 patterns
                                   Classification error      Classification error
            Naïve Bayes                0.35+/-0.03               0.32+/-0.03

        K nearest neighbor             0.29+/-0.02               0.30+/-0.03

      Support vector classifier         0.36+/-0.02               0.30 +/-0.02
                                  (c=1, number of SV: 46)   (c=1, number of SV: 20)
Outline
18




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to dimension reduction
Joint pattern extraction requires coherency
     on extracted patterns across datasets
19


               Model:   x k =Aksk , k=1,2,...,M




                                     Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation analysis
20




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation
     analysis
21




          Correlation matrix of [S1,S2, … SM]

                                        Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Application: joint pattern extraction from a
     group of datasets
22




     •   Analyze group functional MRI data from
         simulated driving experiment
Simulated driving experiment
23

     •   Forty subjects, three repeated sessions (120 datasets)
     •   Experiment paradigm:




     •   Behavioral records:
          •   Average speed (AS)
          •   Differential of speed (DS)
          •   Average steering offset (AR)
          •   Differential steering offset (DR)
          •   Differential pedal offset (DP)
          •   Occurrence of yellow line crossing (YLC)
          •   Occurrence of white passenger-side line crossing (WPLC)

                                                    Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step I: M-CCA for joint feature extraction
24




                           Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step II: PCA and behavioral association
25




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Pattern 1: Primary visual function
26




                                                   D = 0:85
                                                   W = 0:42




                        95% CI of behavioral association
Pattern 2: “default mode network”
27




                                                  D = -0.63
                                                  W = -0.39




                      95% CI of behavioral association
Pattern 3: Motor coordination
28




                                                  D = 0.86
                                                  W = 0.15




                       95% CI of behavioral association
Pattern 4: Executive control network
29




                                                 D = 0.64
                                                 W = 0.61




                       95% CI of behavioral association
Cross correlation of Pattern 1
30




                       Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Outline
31




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to data dimension reduction
Decreased reproducibility of independent
     component on high-dimensional dataset
32

          •    Functional MRI with 120 time points
          •    Twenty Monte Carlo trials of ICA algorithm
          •    Clustering the IC estimates
          •    Reproducible ICs: compact and separated clusters




        K=20                           K=40                             K=90

                                                     Y.-O. Li, et al., HBM, 2007
Dimension reduction of high-dimensional
     data by PCA
33


                                       ICA              N
               N


       M       X               =            A   .       S


                                       MxM

                       PCA dimension reduction + ICA


                           .       A    .           S
           X   =   E                                        +      N


                                       K-largest PCs            M-k PCs
Failure of information-theoretic criteria with
     uncorrected degrees of freedom
34



                        AIC, MDL
            ˆ
            k  arg min k {l ( x | k )  g (k )}

                                                                            ( M k)
                                                                    
                                                i k 1 
                                                     M       1/ ( M  k )

                          l(x |  k )  N ln  M             i
                                                                     
                                             
                                              i k 1 i / ( M  k) 
                                                                     

                                           AIC : k (2M  k )  1
                            g ( k )  
                                       MDL : 0.5  ln N (k (2M  k )  1)

                                           Y.-O. Li, et al., HBM, 2007
Estimation of degrees of freedom by
     entropy rate
35


                 Entropy rate of a Gaussian process
                                        1       
                     h( x)  ln 2 e 
                                       4        ln s()d
                                               


            h( x)  ln 2 e     iff x[n] is an i.i.d. random process




       h(x) = 0.40               h(x) = 1.28                  h(x) = 1.41
                                                     Y.-O. Li, et al., HBM, 2007
Application: Order selection of high-
     dimensional dataset
36
Corrected order selection criteria
     significantly improves order selection
37




            Original       With correction on degrees of freedom


                                    Y.-O. Li, et al., HBM, 2007
Summary
38




     •   ICA extracts useful patterns from high dimensional imaging data for
         predictive modeling

     •   M-CCA reveals patterns from several datasets in a coherent order

     •   Dimension reduction by PCA improves the reproducibility of ICA extracted
         patterns


         Exploratory multivariate analysis are promising tools for
         data mining applications

Contenu connexe

Tendances

7 - Model Assessment and Selection
7 - Model Assessment and Selection7 - Model Assessment and Selection
7 - Model Assessment and SelectionNikita Zhiltsov
 
Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Armando Vieira
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tenconDr. Jai Sakthi
 
Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Juan Luis Nieves
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix FactorizationTatsuya Yokota
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentJordan McBain
 
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...BigMC
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentationEnrico Scapin
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Tatsuya Yokota
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registrationzukun
 
IISc Internship Report
IISc Internship ReportIISc Internship Report
IISc Internship ReportHarshilJain26
 
08 linear classification_2
08 linear classification_208 linear classification_2
08 linear classification_2nep_test_account
 
Pres110811
Pres110811Pres110811
Pres110811shotlub
 
Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...ijma
 
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...Yusuf Bhujwalla
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filtersSandip Joardar
 

Tendances (17)

7 - Model Assessment and Selection
7 - Model Assessment and Selection7 - Model Assessment and Selection
7 - Model Assessment and Selection
 
Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
 
Eurogen v
Eurogen vEurogen v
Eurogen v
 
Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating Equipment
 
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentation
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
 
IISc Internship Report
IISc Internship ReportIISc Internship Report
IISc Internship Report
 
08 linear classification_2
08 linear classification_208 linear classification_2
08 linear classification_2
 
Pres110811
Pres110811Pres110811
Pres110811
 
Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...
 
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filters
 

En vedette

Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsArtificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsSimon Harris
 
Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Miguel Cabrer
 
Machine Learning
Machine LearningMachine Learning
Machine Learningbutest
 
Explaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOExplaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOThomas Roth-Berghofer
 
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...Debdoot Sheet
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2Hossam Hassan
 
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?Debdoot Sheet
 
Artificial intelligence in medical image processing
Artificial intelligence in medical image processingArtificial intelligence in medical image processing
Artificial intelligence in medical image processingFarzad Jahedi
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzerbutest
 
Multivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaMultivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaAlexander Decker
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...NSEAkure
 
Multivariate adaptive regression splines
Multivariate adaptive regression splinesMultivariate adaptive regression splines
Multivariate adaptive regression splinesEklavya Gupta
 
What determines Sales of a Product?
What determines Sales of a Product?What determines Sales of a Product?
What determines Sales of a Product?PRIYAJNVCTC
 
Multivariate Data Analysis
Multivariate Data AnalysisMultivariate Data Analysis
Multivariate Data AnalysisMerul Romadhani
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysisDushyant Bheda
 
Reverse Logistics in Different Industries
Reverse Logistics in Different IndustriesReverse Logistics in Different Industries
Reverse Logistics in Different IndustriesPRIYAJNVCTC
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_financeStefan Duprey
 
Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionTe-Yen Liu
 

En vedette (20)

Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsArtificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
 
Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Explaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOExplaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICO
 
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2
 
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?
 
Artificial intelligence in medical image processing
Artificial intelligence in medical image processingArtificial intelligence in medical image processing
Artificial intelligence in medical image processing
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
 
Multivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaMultivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeria
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
 
Multivariate adaptive regression splines
Multivariate adaptive regression splinesMultivariate adaptive regression splines
Multivariate adaptive regression splines
 
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLSMULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
 
What determines Sales of a Product?
What determines Sales of a Product?What determines Sales of a Product?
What determines Sales of a Product?
 
Multivariate Data Analysis
Multivariate Data AnalysisMultivariate Data Analysis
Multivariate Data Analysis
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysis
 
Reverse Logistics in Different Industries
Reverse Logistics in Different IndustriesReverse Logistics in Different Industries
Reverse Logistics in Different Industries
 
Audio mining
Audio miningAudio mining
Audio mining
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis Introduction
 

Similaire à Machine Learning for Medical Imaging Data Analysis

Independent Component Analysis
Independent Component Analysis Independent Component Analysis
Independent Component Analysis Ibrahim Amer
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Mostafa G. M. Mostafa
 
DimensionalityReduction.pptx
DimensionalityReduction.pptxDimensionalityReduction.pptx
DimensionalityReduction.pptx36rajneekant
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleHakka Labs
 
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationUnited States Air Force Academy
 
Principal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionPrincipal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionJordan McBain
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxSivam Chinna
 
Distributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedDistributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedPei-Che Chang
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdfNarenRajVivek
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...ICAC09
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 

Similaire à Machine Learning for Medical Imaging Data Analysis (20)

Independent Component Analysis
Independent Component Analysis Independent Component Analysis
Independent Component Analysis
 
Sp18_P1.pptx
Sp18_P1.pptxSp18_P1.pptx
Sp18_P1.pptx
 
Understandig PCA and LDA
Understandig PCA and LDAUnderstandig PCA and LDA
Understandig PCA and LDA
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
 
Presentation on machine learning
Presentation on machine learningPresentation on machine learning
Presentation on machine learning
 
DimensionalityReduction.pptx
DimensionalityReduction.pptxDimensionalityReduction.pptx
DimensionalityReduction.pptx
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
 
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
 
Ph.D. Presentation
Ph.D. PresentationPh.D. Presentation
Ph.D. Presentation
 
Dbm630 lecture09
Dbm630 lecture09Dbm630 lecture09
Dbm630 lecture09
 
Principal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionPrincipal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty Detection
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptx
 
Distributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedDistributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and Related
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdf
 
tutorial.ppt
tutorial.ppttutorial.ppt
tutorial.ppt
 
Spatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understandingSpatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understanding
 
190 195
190 195190 195
190 195
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
 
Sunum
SunumSunum
Sunum
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 

Dernier

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 

Dernier (20)

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 

Machine Learning for Medical Imaging Data Analysis

  • 1. 1 MACHINE LEARNING FOR MEDICAL IMAGING DATA Yiou (Leo) Li
  • 2. Background 2 Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory, University of California San Francisco • Developed unsupervised learning method for feature extraction of brain imaging data • Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive modeling of brain trauma • Designed batch data processing protocol to perform image registration, segmentation, band-pass filtering, smoothing, and linear model fitting Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal processing Laboratory, University of Maryland Baltimore County • Developed the effective degrees of freedom of random process and applied it to the model order selection by Information Theoretic Criteria • Developed a linear filtering mechanism in independent component analysis for feature enhancement • Analyzed canonical correlation analysis for multiple datasets
  • 3. Outline 3  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 4. PCA vs ICA 4 PCA ICA Linear projection Linear projection (Orthogonal) Uncorrelated components Independent components (non sparse) (sparse, “long tail” distribution) Typically analytical solution Typically iterative solution (SVD) (Iterative optimization)
  • 5. ICA detects independent factors with long tails in multivariate dataset 5
  • 6. Long tail factors are sparse features in data samples 6 Weights of features Data points (N) ICA Sensors (M) X = A . S Sparse features X= AS
  • 7. ICA model 7  x1   a11 a12 ... a1M   s1  x  a a 22 a 2M   s 2   2    21    ...   ...   ...         x M  a M1 a M2 a MM  s M  x : Observed variables A : Mixing matrix s : Latent factors x= As -> s =A-1x
  • 8. ICA by maximum likelihood estimation 8 Transformation of multivariate random variable: x = As p(s 1, s2 , ... , sM ) p(x 1,x 2 , ... , x M )  (1) det(A) Statistical independence condition of s: p(s 1, s2 , ... , sM )  i 1 p(si ) M (2) Log likelihood function of x with parameter A: log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A) -1 i
  • 9. ICA Application: Sparse feature extraction from multivariate dataset 9
  • 11. Analyze functional MRI data of resting state brain 11 Sparse features ICA
  • 12. Feature 1. Primary visual network 12 + A -
  • 13. Feature 2. “Default mode network” 13
  • 14. Feature 3. Attention control network 14
  • 15. Hierarchical clustering shows link 15 between features (brain regions)
  • 16. Predicative modeling of brain trauma 16 Pattern weights N Healthy X = A . S Patients Sparse spatial features Subject 1 … Subject 2 16 Pattern 2 Feature 1 … Subject M Feature 2 Y.-O. Li, et al., HBM, 2011
  • 17. ICA Pattern classification for predictive modeling of brain trauma 17 • 29 healthy + 29 trauma, 10-fold cross-validation Classifier 9 patterns 14 patterns Classification error Classification error Naïve Bayes 0.35+/-0.03 0.32+/-0.03 K nearest neighbor 0.29+/-0.02 0.30+/-0.03 Support vector classifier 0.36+/-0.02 0.30 +/-0.02 (c=1, number of SV: 46) (c=1, number of SV: 20)
  • 18. Outline 18  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to dimension reduction
  • 19. Joint pattern extraction requires coherency on extracted patterns across datasets 19 Model: x k =Aksk , k=1,2,...,M Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 20. Multi-set canonical correlation analysis 20 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 21. Multi-set canonical correlation analysis 21 Correlation matrix of [S1,S2, … SM] Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 22. Application: joint pattern extraction from a group of datasets 22 • Analyze group functional MRI data from simulated driving experiment
  • 23. Simulated driving experiment 23 • Forty subjects, three repeated sessions (120 datasets) • Experiment paradigm: • Behavioral records: • Average speed (AS) • Differential of speed (DS) • Average steering offset (AR) • Differential steering offset (DR) • Differential pedal offset (DP) • Occurrence of yellow line crossing (YLC) • Occurrence of white passenger-side line crossing (WPLC) Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 24. Step I: M-CCA for joint feature extraction 24 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 25. Step II: PCA and behavioral association 25 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 26. Pattern 1: Primary visual function 26 D = 0:85 W = 0:42 95% CI of behavioral association
  • 27. Pattern 2: “default mode network” 27 D = -0.63 W = -0.39 95% CI of behavioral association
  • 28. Pattern 3: Motor coordination 28 D = 0.86 W = 0.15 95% CI of behavioral association
  • 29. Pattern 4: Executive control network 29 D = 0.64 W = 0.61 95% CI of behavioral association
  • 30. Cross correlation of Pattern 1 30 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 31. Outline 31  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 32. Decreased reproducibility of independent component on high-dimensional dataset 32 • Functional MRI with 120 time points • Twenty Monte Carlo trials of ICA algorithm • Clustering the IC estimates • Reproducible ICs: compact and separated clusters K=20 K=40 K=90 Y.-O. Li, et al., HBM, 2007
  • 33. Dimension reduction of high-dimensional data by PCA 33 ICA N N M X = A . S MxM PCA dimension reduction + ICA . A . S X = E + N K-largest PCs M-k PCs
  • 34. Failure of information-theoretic criteria with uncorrected degrees of freedom 34 AIC, MDL ˆ k  arg min k {l ( x | k )  g (k )} ( M k)    i k 1  M 1/ ( M  k ) l(x |  k )  N ln  M i    i k 1 i / ( M  k)    AIC : k (2M  k )  1 g ( k )   MDL : 0.5  ln N (k (2M  k )  1) Y.-O. Li, et al., HBM, 2007
  • 35. Estimation of degrees of freedom by entropy rate 35 Entropy rate of a Gaussian process 1  h( x)  ln 2 e  4   ln s()d  h( x)  ln 2 e iff x[n] is an i.i.d. random process h(x) = 0.40 h(x) = 1.28 h(x) = 1.41 Y.-O. Li, et al., HBM, 2007
  • 36. Application: Order selection of high- dimensional dataset 36
  • 37. Corrected order selection criteria significantly improves order selection 37 Original With correction on degrees of freedom Y.-O. Li, et al., HBM, 2007
  • 38. Summary 38 • ICA extracts useful patterns from high dimensional imaging data for predictive modeling • M-CCA reveals patterns from several datasets in a coherent order • Dimension reduction by PCA improves the reproducibility of ICA extracted patterns Exploratory multivariate analysis are promising tools for data mining applications