SlideShare une entreprise Scribd logo
1  sur  15
BRANCH AND BOUND BASED FEATURE
ELIMINATION FOR SUPPORT VECTOR
MACHINE BASED CLASSIFICATION OF
HYPERSPECTRAL IMAGES
Sathishkumar Samiappan
Saurabh Prasad
Lori M. Bruce
&
Eric Hansen
Mississippi State University
INTRODUCTION
• Hyperspectral Images (HSI) are widely used for ground cover
classification problems.
• The problem is very challenging because,
1) High dimensional feature space
2) High correlation between successive features
• In last decade, Support Vector Machines(SVM) shown to perform well for
the problem
• Traditional View about SVMs:
They can handle higher dimensionality, Hence Feature Selection (FS) is
not required
• Recently Waske et al showed that FS can improve the classification
performance of SVMs
MOTIVATION
FS based on Metrics such as
Bhattacharya Distance,
Mutual Information
& Correlation etc..
FS based on Search Approach
Feature Selection Algorithms
What is good or Bad about this?
• Easy computation
• Often Sub - Optimal
Exhaustive Search Search based on
Intelligence
Can both of them get married?
A HYBRID approach?
SELECTION OF ALGORITHMS
• Rank Based Approach : Feature selection based on Bhattacharya
distance(BD) and correlation
• Features are ranked according to descending order of their BDs
and Correlation
• Select the first m features
• BDs are generally used in selecting features for hyperspectral
image classification
Bhattacharya Distance
µi & µj are the means of two classes
∑i & ∑j be their covariance
SELECTION OF ALGORITHMS
• Search Approach : Branch and Bound (B&B) Search and Genetic
Algorithms(GA)
• Branch and Bound is a modification of simple tree search with
back tracking.
• With a good estimate of upper bound, B&B almost converges to
the solution of exhaustive search
• Genetic Algorithm is a very popular optimization procedure
inspired from human evolution.
• GA is basically a random search but guaranteed to converge to the
optimal solution
PROPOSED HYBRIDAPPROACH
OBJECTIVE
• To remove a subset of features such that the remaining features
achieves the best performance during SVM classification
• Bhattacharya Distance and Correlation:
To rank the features based on their usefulness in discriminating
between classes
• Branch and Bound or Genetic Algorithms:
To select a subset of lower ranked features to remove from
feature set.
• To create a elimination strategy in different combinations among
the lower ranked features.
BRANCH AND BOUND
• B&B is a general algorithmic strategy used to solve optimization
problems
• It divides the problem to be solved as number of sub-problems
• Instead of solving all the sub problems, B&B tries to find one viable
solution and notes its value as Upper Bound
• All following calculations are terminated as soon as its cost reaches
upper bound
• If a better solution is found then Upper Bound will be updated
• In this way many sub problems can be left unsolved safely
EXAMPLE : BRANCH AND BOUND
Step 1: Rank the features in
descending order
of its importance
and select the first
+1m
B1
Total Number of Features q = 6
Features to be Removed m = 4
Features Selected = (q-m) = 2
Initial Upper Bound B = Bo
m
Step 2: Compute the
cost B1 at node 1
If B1<B
Goto Step 1
else
Back track
& select B2
B1
The entire subtree can be
discarded
EXAMPLE : BRANCH AND BOUND
S1
S2
S3
S4
Maximum Depth of the tree = m
So if depth reaches m,
Compute new Bound for the path
If new bound is better
Update bound B
else
backtrack
ALGORITHM
The parameters used for GA
Fitness Function - Multiclass Spider SVM Implementation
using RBF kernal with sigma = 0.5
Number of generations = 20
Length of chromosome = 50
Population size = 30
Crossover Probability = 0.6
Mutation Probability = 0.003
GENETIC ALGORITHMS
RESULTS
• Dataset used – AVIRIS Indian
Pines with 220 features.
• It’s a 7 class data with 200
training samples from each
class.
• The classes are corn no-till,
corn min-till, grass pasture,
hay windrowed, soybeans no-
till, soybeans clean and
woods.
• Pros
- Compromise between rank based FS and
exhaustive search
- Computationally efficient when compared to
GA and other search techniques
- Potential to provide significant increase in
performance of SVMs
- Robust with small sample sizes(few training
samples)
• Cons
- potential of overtraining
DISCUSSION
Thank you
Queries – Comments - suggestions
Sathishkumar Samiappan
sathish@gri.msstate.edu

Contenu connexe

Tendances

Tendances (20)

Paper study: Attention, learn to solve routing problems!
Paper study: Attention, learn to solve routing problems!Paper study: Attention, learn to solve routing problems!
Paper study: Attention, learn to solve routing problems!
 
Svm algorithm
Svm algorithmSvm algorithm
Svm algorithm
 
Linear discriminant analysis
Linear discriminant analysisLinear discriminant analysis
Linear discriminant analysis
 
Paper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelinePaper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipeline
 
Support vector machine
Support vector machineSupport vector machine
Support vector machine
 
K means and dbscan
K means and dbscanK means and dbscan
K means and dbscan
 
Paper Study: OptNet: Differentiable Optimization as a Layer in Neural Networks
Paper Study: OptNet: Differentiable Optimization as a Layer in Neural NetworksPaper Study: OptNet: Differentiable Optimization as a Layer in Neural Networks
Paper Study: OptNet: Differentiable Optimization as a Layer in Neural Networks
 
Paper Study: Transformer dissection
Paper Study: Transformer dissectionPaper Study: Transformer dissection
Paper Study: Transformer dissection
 
Implementation of K-Nearest Neighbor Algorithm
Implementation of K-Nearest Neighbor AlgorithmImplementation of K-Nearest Neighbor Algorithm
Implementation of K-Nearest Neighbor Algorithm
 
Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)Sharpness-aware minimization (SAM)
Sharpness-aware minimization (SAM)
 
Implementing the Perceptron Algorithm for Finding the weights of a Linear Dis...
Implementing the Perceptron Algorithm for Finding the weights of a Linear Dis...Implementing the Perceptron Algorithm for Finding the weights of a Linear Dis...
Implementing the Perceptron Algorithm for Finding the weights of a Linear Dis...
 
Learning a nonlinear embedding by preserving class neibourhood structure 최종
Learning a nonlinear embedding by preserving class neibourhood structure   최종Learning a nonlinear embedding by preserving class neibourhood structure   최종
Learning a nonlinear embedding by preserving class neibourhood structure 최종
 
Tensor Spectral Clustering
Tensor Spectral ClusteringTensor Spectral Clustering
Tensor Spectral Clustering
 
K-means Clustering
K-means ClusteringK-means Clustering
K-means Clustering
 
PCA and SVD in brief
PCA and SVD in briefPCA and SVD in brief
PCA and SVD in brief
 
Spectral clustering
Spectral clusteringSpectral clustering
Spectral clustering
 
Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering
 
Notes on Spectral Clustering
Notes on Spectral ClusteringNotes on Spectral Clustering
Notes on Spectral Clustering
 
Meta-Learning with Implicit Gradients
Meta-Learning with Implicit GradientsMeta-Learning with Implicit Gradients
Meta-Learning with Implicit Gradients
 
Pattern Mining in large time series databases
Pattern Mining in large time series databasesPattern Mining in large time series databases
Pattern Mining in large time series databases
 

En vedette

Three types of wireless technology
Three types of wireless technologyThree types of wireless technology
Three types of wireless technology
Rashmi Joaa
 
Wireless communication
Wireless communicationWireless communication
Wireless communication
Darshan Maru
 
Basic concepts of wireless communication system
Basic concepts of wireless communication systemBasic concepts of wireless communication system
Basic concepts of wireless communication system
Bogs De Castro
 

En vedette (9)

Wireless
WirelessWireless
Wireless
 
HYPERSPECTRAL IMAGERY CLASSIFICATION USING TECHNOLOGIES OF COMPUTATIONAL INTE...
HYPERSPECTRAL IMAGERY CLASSIFICATION USING TECHNOLOGIES OF COMPUTATIONAL INTE...HYPERSPECTRAL IMAGERY CLASSIFICATION USING TECHNOLOGIES OF COMPUTATIONAL INTE...
HYPERSPECTRAL IMAGERY CLASSIFICATION USING TECHNOLOGIES OF COMPUTATIONAL INTE...
 
Branch & bound
Branch & boundBranch & bound
Branch & bound
 
Branch and bound technique
Branch and bound techniqueBranch and bound technique
Branch and bound technique
 
Branch and bound technique
Branch and bound techniqueBranch and bound technique
Branch and bound technique
 
Backtracking
Backtracking  Backtracking
Backtracking
 
Three types of wireless technology
Three types of wireless technologyThree types of wireless technology
Three types of wireless technology
 
Wireless communication
Wireless communicationWireless communication
Wireless communication
 
Basic concepts of wireless communication system
Basic concepts of wireless communication systemBasic concepts of wireless communication system
Basic concepts of wireless communication system
 

Similaire à Branch and Bound Feature Selection for Hyperspectral Image Classification

few common Feature of Size Datum Features are bores, cylinders, slots, or tab...
few common Feature of Size Datum Features are bores, cylinders, slots, or tab...few common Feature of Size Datum Features are bores, cylinders, slots, or tab...
few common Feature of Size Datum Features are bores, cylinders, slots, or tab...
DrPArivalaganASSTPRO
 
Machine Learning Unit-5 Decesion Trees & Random Forest.pdf
Machine Learning Unit-5 Decesion Trees & Random Forest.pdfMachine Learning Unit-5 Decesion Trees & Random Forest.pdf
Machine Learning Unit-5 Decesion Trees & Random Forest.pdf
AdityaSoraut
 
Machine Learning Workshop
Machine Learning WorkshopMachine Learning Workshop
Machine Learning Workshop
Osman Ramadan
 

Similaire à Branch and Bound Feature Selection for Hyperspectral Image Classification (20)

Nbvtalkonfeatureselection
NbvtalkonfeatureselectionNbvtalkonfeatureselection
Nbvtalkonfeatureselection
 
Predicting Moscow Real Estate Prices with Azure Machine Learning
Predicting Moscow Real Estate Prices with Azure Machine LearningPredicting Moscow Real Estate Prices with Azure Machine Learning
Predicting Moscow Real Estate Prices with Azure Machine Learning
 
Predicting Moscow Real Estate Prices with Azure Machine Learning
Predicting Moscow Real Estate Prices with Azure Machine LearningPredicting Moscow Real Estate Prices with Azure Machine Learning
Predicting Moscow Real Estate Prices with Azure Machine Learning
 
Predicting Moscow Real Estate Prices with Azure Machine Learning
Predicting Moscow Real Estate Prices with Azure Machine LearningPredicting Moscow Real Estate Prices with Azure Machine Learning
Predicting Moscow Real Estate Prices with Azure Machine Learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Automatic pruning method for random forest
Automatic pruning method for random forestAutomatic pruning method for random forest
Automatic pruning method for random forest
 
Support Vector machine(SVM) and Random Forest
Support Vector machine(SVM) and Random ForestSupport Vector machine(SVM) and Random Forest
Support Vector machine(SVM) and Random Forest
 
How Machine Learning Helps Organizations to Work More Efficiently?
How Machine Learning Helps Organizations to Work More Efficiently?How Machine Learning Helps Organizations to Work More Efficiently?
How Machine Learning Helps Organizations to Work More Efficiently?
 
Data discretization
Data discretizationData discretization
Data discretization
 
background.pptx
background.pptxbackground.pptx
background.pptx
 
An introduction to variable and feature selection
An introduction to variable and feature selectionAn introduction to variable and feature selection
An introduction to variable and feature selection
 
few common Feature of Size Datum Features are bores, cylinders, slots, or tab...
few common Feature of Size Datum Features are bores, cylinders, slots, or tab...few common Feature of Size Datum Features are bores, cylinders, slots, or tab...
few common Feature of Size Datum Features are bores, cylinders, slots, or tab...
 
random forest.pptx
random forest.pptxrandom forest.pptx
random forest.pptx
 
Kaggle Higgs Boson Machine Learning Challenge
Kaggle Higgs Boson Machine Learning ChallengeKaggle Higgs Boson Machine Learning Challenge
Kaggle Higgs Boson Machine Learning Challenge
 
Machine learning for Data Science
Machine learning for Data ScienceMachine learning for Data Science
Machine learning for Data Science
 
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
 
Machine Learning Unit-5 Decesion Trees & Random Forest.pdf
Machine Learning Unit-5 Decesion Trees & Random Forest.pdfMachine Learning Unit-5 Decesion Trees & Random Forest.pdf
Machine Learning Unit-5 Decesion Trees & Random Forest.pdf
 
Algorithm explanations
Algorithm explanationsAlgorithm explanations
Algorithm explanations
 
Intro to Machine Learning by Microsoft Ventures
Intro to Machine Learning by Microsoft VenturesIntro to Machine Learning by Microsoft Ventures
Intro to Machine Learning by Microsoft Ventures
 
Machine Learning Workshop
Machine Learning WorkshopMachine Learning Workshop
Machine Learning Workshop
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Branch and Bound Feature Selection for Hyperspectral Image Classification

  • 1. BRANCH AND BOUND BASED FEATURE ELIMINATION FOR SUPPORT VECTOR MACHINE BASED CLASSIFICATION OF HYPERSPECTRAL IMAGES Sathishkumar Samiappan Saurabh Prasad Lori M. Bruce & Eric Hansen Mississippi State University
  • 2. INTRODUCTION • Hyperspectral Images (HSI) are widely used for ground cover classification problems. • The problem is very challenging because, 1) High dimensional feature space 2) High correlation between successive features • In last decade, Support Vector Machines(SVM) shown to perform well for the problem • Traditional View about SVMs: They can handle higher dimensionality, Hence Feature Selection (FS) is not required • Recently Waske et al showed that FS can improve the classification performance of SVMs
  • 3. MOTIVATION FS based on Metrics such as Bhattacharya Distance, Mutual Information & Correlation etc.. FS based on Search Approach Feature Selection Algorithms What is good or Bad about this? • Easy computation • Often Sub - Optimal Exhaustive Search Search based on Intelligence Can both of them get married? A HYBRID approach?
  • 4. SELECTION OF ALGORITHMS • Rank Based Approach : Feature selection based on Bhattacharya distance(BD) and correlation • Features are ranked according to descending order of their BDs and Correlation • Select the first m features • BDs are generally used in selecting features for hyperspectral image classification Bhattacharya Distance µi & µj are the means of two classes ∑i & ∑j be their covariance
  • 5. SELECTION OF ALGORITHMS • Search Approach : Branch and Bound (B&B) Search and Genetic Algorithms(GA) • Branch and Bound is a modification of simple tree search with back tracking. • With a good estimate of upper bound, B&B almost converges to the solution of exhaustive search • Genetic Algorithm is a very popular optimization procedure inspired from human evolution. • GA is basically a random search but guaranteed to converge to the optimal solution
  • 7. OBJECTIVE • To remove a subset of features such that the remaining features achieves the best performance during SVM classification • Bhattacharya Distance and Correlation: To rank the features based on their usefulness in discriminating between classes • Branch and Bound or Genetic Algorithms: To select a subset of lower ranked features to remove from feature set. • To create a elimination strategy in different combinations among the lower ranked features.
  • 8. BRANCH AND BOUND • B&B is a general algorithmic strategy used to solve optimization problems • It divides the problem to be solved as number of sub-problems • Instead of solving all the sub problems, B&B tries to find one viable solution and notes its value as Upper Bound • All following calculations are terminated as soon as its cost reaches upper bound • If a better solution is found then Upper Bound will be updated • In this way many sub problems can be left unsolved safely
  • 9. EXAMPLE : BRANCH AND BOUND Step 1: Rank the features in descending order of its importance and select the first +1m B1 Total Number of Features q = 6 Features to be Removed m = 4 Features Selected = (q-m) = 2 Initial Upper Bound B = Bo m Step 2: Compute the cost B1 at node 1 If B1<B Goto Step 1 else Back track & select B2 B1 The entire subtree can be discarded
  • 10. EXAMPLE : BRANCH AND BOUND S1 S2 S3 S4 Maximum Depth of the tree = m So if depth reaches m, Compute new Bound for the path If new bound is better Update bound B else backtrack
  • 12. The parameters used for GA Fitness Function - Multiclass Spider SVM Implementation using RBF kernal with sigma = 0.5 Number of generations = 20 Length of chromosome = 50 Population size = 30 Crossover Probability = 0.6 Mutation Probability = 0.003 GENETIC ALGORITHMS
  • 13. RESULTS • Dataset used – AVIRIS Indian Pines with 220 features. • It’s a 7 class data with 200 training samples from each class. • The classes are corn no-till, corn min-till, grass pasture, hay windrowed, soybeans no- till, soybeans clean and woods.
  • 14. • Pros - Compromise between rank based FS and exhaustive search - Computationally efficient when compared to GA and other search techniques - Potential to provide significant increase in performance of SVMs - Robust with small sample sizes(few training samples) • Cons - potential of overtraining DISCUSSION
  • 15. Thank you Queries – Comments - suggestions Sathishkumar Samiappan sathish@gri.msstate.edu