SlideShare a Scribd company logo
1 of 6
Download to read offline
Example of a right floating element.
my awesome super cool team
(by joe mogannam, ben levinsky, blair bickel)
CS 385 : TextonBoost - Parking Lots
Introduction
TextonBoost is an approach to multi-class classification that attempts to segment photographs into
various classes. This model seeks to use texture-layout filters, or features based on textons.
Classification and feature selection is achieved using boosting to give an efficent classifier which can be
applied to a large number of classes.
The aim of our project was to apply TextonBoost to determine where there are occupied and vacant
spaces in a parking lot.
Initially, the codebase worked on 24 classes, but we had to sift through the codebase and instead add
two new categories, occupied and unoccupied.
Our team leveraged two datasets to train our model for this purpose. The PKLot and UIUC datasets, each housing many images that show cars
and parking spaces in different poses and/or lighting conditions.
When using the UIUC dataset, the images had varying results, but when using the PKLot dataset with default parameters, We were able to
determine where some occupied spots were with relatively little difficulty.
Approach/Algorithm
A simplified explanation of TextonBoost is:
Learn a boosting classifier based on relative texture locations for each class. The classification is then refined by combining various weak classifiers
and then combining them to construct a classifier that has a higher accuracy than any of the individual classifiers would produce on their own.
Pre-existing Algorithm
Given an image and for each pixel:
Texture-Layout features are calculated1.
A boosting classifier gives the probability of the pixel belonging to each class2.
The model combines the boosting output with shape, location, color and edge information3.
Pixel receives final label.4.
Image receives final label.5.
Extensions to existing TextonBoost framework
The original algorithm used a filter bank with 3 Gaussians, 4 Laplacian of Gaussians and 4 first order derivatives of Gaussians. In addition to
these filters, we augmented the filter bank to also use dense SIFT. Dense SIFT differs from regular SIFT in that dense SIFT assumes every pixel
is a point of interest. We also modified the approach of the framework to only care about 3 classes, void, car, unoccupied.
When working with the data set UIUC, the algorithm needed to be altered to account for gray scale images. To process a gray scale image they
first need to converted to a 3 dimensional image matrix to simulate an RGB image matrix. Than the code needs to be altered, so that Hue and
Saturation are not used when textonizing images. Removing Hue and Saturation results in decreasing the number of features per pixel from 17
to 11.
Tweaking the Parameters
The pixel feature descriptors that were aggregated from applying the various filters are clustered using k-means. We varied the value of k for
k-means clustering to determine the optimal number of clusters for our model.
Additionally, to balance the trade-off between too much noise in the image from image textonization, keeping enough information for accuracy
and speed of training, the image in testing and the training set are sampled with a filter. We also varied the size of the filter used in the various
filter banks that already existed in the framework.
When using the UIUC data set, the number of features decreases from 17 to 11. So the parameter numFeatures in the file imgTextonization.m
must be changed to 11. This parameter change will account for less features used, and result in a smaller code book generated.
CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p...
1 of 6 3/28/17, 12:42 AM
Results
Problems with Input Images1.
All of the Input Images in the PKLot data set span a large parking lot. Since the input images are not taken from the zenith of the center of
the parking lot, the resulting images will result in objects of varying size because parking spaces at one of end of the lot can be much
smaller than spaces at another end of the lot. The objects (e.g. cars, unoccupied spots, etc.) in the foreground will be much larger and more
defined than the objects in the background. To account for this problem we cropped the original images to capture a small location, where
the object's have less varying sizes.
Problems with Ground Truth Images2.
In a perfect world the Ground Truth Images should have every pixel colored as either void, car, or unoccupied. Unfortunately because the
PKLot metadata associated with the images did not describe complex polygons that exactly match the contours of a car in a parking spot.
Instead the PKLot data set simply provided rectangles that approximately cover a parking spot. This metadata causes issues because there
is extraneous information used to describe a car. This means that if, for example, there is a crowded parking lot in the training set and we
have a given spot as empty, the model will train on ground truth images that have shadows from adjacent cars and think that these
shadows characterize an empty spot.
In addition, the Ground Truth metadata provided by PKLot also has a small subset of polygons that would overlap with each other. This
will cause issues in the training phase because pixels associated with a given class can be classified inaccurately. For example, consider an
empty parking spot with two cars on each side of it. If the Ground Truth image for an occupied spot covers a portion of the two cars,
unwanted textures will be introduced to describe the unoccupied spot's class category.
our new color classes
Output Images
First Pipeline with only SIFT features
First pipeline with both SIFT and filter bank features
CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p...
2 of 6 3/28/17, 12:42 AM
UIUC dataset with only filter bank features
UIUC dataset with only SIFT features
PKLot dataset trained with segmented images with only SIFT features
PKLot dataset trained with segmented images with only filter bank features
UIUC dataset trained with segmented images with both filter bank and SIFT features and 5 clusters in k-means clustering
UIUC dataset trained with segmented images with both filter bank and SIFT features and 10 clusters in k-means clustering
Some figures
CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p...
3 of 6 3/28/17, 12:42 AM
Graph plotting False Negative vs. True Positive in varying outputs for unoccupied parking spots
Graph plotting False Negative vs. True Positive in varying outputs for unoccupied parking spots
RPC curve for unoccupied classification when varying number of clusters
Legend:
Point 10: Number of Clusters = 10.
CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p...
4 of 6 3/28/17, 12:42 AM
Point 3: Number of Clusters = 3.
Point 5: Number of Clusters = 5.
Point 20: Number of Clusters = 20.
RPC curve for occupied classification when varying number of clusters
Legend:
Point 10: Number of Clusters = 10.
Point 3: Number of Clusters = 3.
Point 5: Number of Clusters = 5.
Point 20: Number of Clusters = 20.
Discussion and Conclusions
In hindsight, TextonBoost was limited by our data set, the code base, and the similarity of the classes.
The codebase seemed to use k-means clustering at an odd time. We posited that k-means clustering should be used on all of the data at one
time. Instead, the codebase seemed to use k-means clustering on each image independent of the other images.
Understanding all the various pieces that made the code run, from textonization to training of the model, all involved multiple files and
assumed the user of the framework had an intimate understanding of the approach.
Introducing dense SIFT was a challenge because the filter-bank used initially on each pixel worked on all pixels. Unfortunately, dense SIFT
cannot operate on pixels too close to a boundary. Our team overcame this challenge by adjusting the size that the filter-bank dealt with to the
same dimensions as the dense-SIFT portion of the image.
Trying to find a good data set was difficult. While testing the functionality of our code we came across many problems caused by input images.
With the PKLot data set one of the most obvious problems we noticed was that objects' size varied based on location within the image. In order
to get input images with objects of relatively similar size we cropped a small area out of the original images in the PKLot data set. Cropping the
original images also had the benefit of speeding up the code because there would be less pixels to train on.
We also tried cropping individual objects of interest out of the PKLot data set images. Cropping individual images provided an additional data
set, where we could train on either cars or empty parking spots. The difference between these segmented images and the original images with
an entire parking lot in view is that the original images included other objects like grass, roads, & people that could interfere with training. The
result of these segmented images provided input images with less noise.
References
MSRC framework from Microsoft
Research paper from Microsoft
TextonBoost framework in Matlab documentation
CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p...
5 of 6 3/28/17, 12:42 AM
CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p...
6 of 6 3/28/17, 12:42 AM

More Related Content

What's hot

Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
MLAI2
 
Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset Pooling
MLAI2
 
Report: Test49 Geant4 Monte-Carlo Models Testing Tools
Report: Test49 Geant4 Monte-Carlo Models Testing ToolsReport: Test49 Geant4 Monte-Carlo Models Testing Tools
Report: Test49 Geant4 Monte-Carlo Models Testing Tools
Roman Atachiants
 

What's hot (20)

Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
 
Karnaugh map or K-map method
Karnaugh map or K-map methodKarnaugh map or K-map method
Karnaugh map or K-map method
 
CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...
CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...
CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...
 
Floating point butterfly architecture based on binary signed-digit representa...
Floating point butterfly architecture based on binary signed-digit representa...Floating point butterfly architecture based on binary signed-digit representa...
Floating point butterfly architecture based on binary signed-digit representa...
 
KARNAUGH MAP using OpenGL (KMAP)
KARNAUGH MAP using OpenGL (KMAP)KARNAUGH MAP using OpenGL (KMAP)
KARNAUGH MAP using OpenGL (KMAP)
 
Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset Pooling
 
GTC 2009 OpenGL Barthold
GTC 2009 OpenGL BartholdGTC 2009 OpenGL Barthold
GTC 2009 OpenGL Barthold
 
Optimization for iterative queries on Mapreduce
Optimization for iterative queries on MapreduceOptimization for iterative queries on Mapreduce
Optimization for iterative queries on Mapreduce
 
RWCap ASCION2011
RWCap ASCION2011RWCap ASCION2011
RWCap ASCION2011
 
Report: Test49 Geant4 Monte-Carlo Models Testing Tools
Report: Test49 Geant4 Monte-Carlo Models Testing ToolsReport: Test49 Geant4 Monte-Carlo Models Testing Tools
Report: Test49 Geant4 Monte-Carlo Models Testing Tools
 
Implementation of Low Power and Area Efficient Carry Select Adder
Implementation of Low Power and Area Efficient Carry Select AdderImplementation of Low Power and Area Efficient Carry Select Adder
Implementation of Low Power and Area Efficient Carry Select Adder
 
Map-Side Merge Joins for Scalable SPARQL BGP Processing
Map-Side Merge Joins for Scalable SPARQL BGP ProcessingMap-Side Merge Joins for Scalable SPARQL BGP Processing
Map-Side Merge Joins for Scalable SPARQL BGP Processing
 
Ds36715716
Ds36715716Ds36715716
Ds36715716
 
Real Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCLReal Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCL
 
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
Implementation of D* Path Planning Algorithm with NXT LEGO Mindstorms Kit for...
 
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving SystemsPRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
 
Design and implementation of Parallel Prefix Adders using FPGAs
Design and implementation of Parallel Prefix Adders using FPGAsDesign and implementation of Parallel Prefix Adders using FPGAs
Design and implementation of Parallel Prefix Adders using FPGAs
 
Geographic Information System unit 4
Geographic Information System   unit 4Geographic Information System   unit 4
Geographic Information System unit 4
 
ADAPTIVE MAP FOR SIMPLIFYING BOOLEAN EXPRESSIONS
ADAPTIVE MAP FOR SIMPLIFYING BOOLEAN EXPRESSIONSADAPTIVE MAP FOR SIMPLIFYING BOOLEAN EXPRESSIONS
ADAPTIVE MAP FOR SIMPLIFYING BOOLEAN EXPRESSIONS
 
Mask R-CNN
Mask R-CNNMask R-CNN
Mask R-CNN
 

Similar to Object Detection & Machine Learning Paper

Paper id 25201467
Paper id 25201467Paper id 25201467
Paper id 25201467
IJRAT
 
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarksVauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
Mumtaz Hannah Vauhkonen
 

Similar to Object Detection & Machine Learning Paper (20)

Number Plate Recognition of Still Images in Vehicular Parking System
Number Plate Recognition of Still Images in Vehicular Parking SystemNumber Plate Recognition of Still Images in Vehicular Parking System
Number Plate Recognition of Still Images in Vehicular Parking System
 
Traffic sign classification
Traffic sign classificationTraffic sign classification
Traffic sign classification
 
Point cloud mesh-investigation_report-lihang
Point cloud mesh-investigation_report-lihangPoint cloud mesh-investigation_report-lihang
Point cloud mesh-investigation_report-lihang
 
Partial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsPartial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather Conditions
 
IRJET - Automatic Licence Plate Detection and Recognition
IRJET -  	  Automatic Licence Plate Detection and RecognitionIRJET -  	  Automatic Licence Plate Detection and Recognition
IRJET - Automatic Licence Plate Detection and Recognition
 
IEEE 2014 Matlab Projects
IEEE 2014 Matlab ProjectsIEEE 2014 Matlab Projects
IEEE 2014 Matlab Projects
 
IEEE 2014 Matlab Projects
IEEE 2014 Matlab ProjectsIEEE 2014 Matlab Projects
IEEE 2014 Matlab Projects
 
Generative AI for Reengineering Variants into Software Product Lines: An Expe...
Generative AI for Reengineering Variants into Software Product Lines: An Expe...Generative AI for Reengineering Variants into Software Product Lines: An Expe...
Generative AI for Reengineering Variants into Software Product Lines: An Expe...
 
IRJET- Finding Dominant Color in the Artistic Painting using Data Mining ...
IRJET-  	  Finding Dominant Color in the Artistic Painting using Data Mining ...IRJET-  	  Finding Dominant Color in the Artistic Painting using Data Mining ...
IRJET- Finding Dominant Color in the Artistic Painting using Data Mining ...
 
A Novel Low Complexity Histogram Algorithm for High Performance Image Process...
A Novel Low Complexity Histogram Algorithm for High Performance Image Process...A Novel Low Complexity Histogram Algorithm for High Performance Image Process...
A Novel Low Complexity Histogram Algorithm for High Performance Image Process...
 
PointNet
PointNetPointNet
PointNet
 
Paper id 25201467
Paper id 25201467Paper id 25201467
Paper id 25201467
 
Design of a Novel Multiplier and Accumulator using Modified Booth Algorithm w...
Design of a Novel Multiplier and Accumulator using Modified Booth Algorithm w...Design of a Novel Multiplier and Accumulator using Modified Booth Algorithm w...
Design of a Novel Multiplier and Accumulator using Modified Booth Algorithm w...
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 Projects
 
IRJET - Single Image Super Resolution using Machine Learning
IRJET - Single Image Super Resolution using Machine LearningIRJET - Single Image Super Resolution using Machine Learning
IRJET - Single Image Super Resolution using Machine Learning
 
kanimozhi2019.pdf
kanimozhi2019.pdfkanimozhi2019.pdf
kanimozhi2019.pdf
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 Projects
 
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarksVauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
 
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
 
Final year cse,eee projects in bangalore
Final year cse,eee projects in bangaloreFinal year cse,eee projects in bangalore
Final year cse,eee projects in bangalore
 

More from Joseph Mogannam

More from Joseph Mogannam (6)

Scalable Google Cloud Payroll Project - Architecture Model
Scalable Google Cloud Payroll Project - Architecture ModelScalable Google Cloud Payroll Project - Architecture Model
Scalable Google Cloud Payroll Project - Architecture Model
 
Scalable Google Cloud Payroll Project- Presentation
Scalable Google Cloud Payroll Project- PresentationScalable Google Cloud Payroll Project- Presentation
Scalable Google Cloud Payroll Project- Presentation
 
Scalable Google Cloud Payroll Project - Paper
Scalable Google Cloud Payroll Project - PaperScalable Google Cloud Payroll Project - Paper
Scalable Google Cloud Payroll Project - Paper
 
Statistically Predicting an election from polling results - Paper
Statistically Predicting an election from polling results - PaperStatistically Predicting an election from polling results - Paper
Statistically Predicting an election from polling results - Paper
 
Statistically Predicting an election from polling results - Presentation
Statistically Predicting an election from polling results - Presentation Statistically Predicting an election from polling results - Presentation
Statistically Predicting an election from polling results - Presentation
 
Object Detection & Machine Learning
Object Detection & Machine LearningObject Detection & Machine Learning
Object Detection & Machine Learning
 

Recently uploaded

Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
MsecMca
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 

Object Detection & Machine Learning Paper

  • 1. Example of a right floating element. my awesome super cool team (by joe mogannam, ben levinsky, blair bickel) CS 385 : TextonBoost - Parking Lots Introduction TextonBoost is an approach to multi-class classification that attempts to segment photographs into various classes. This model seeks to use texture-layout filters, or features based on textons. Classification and feature selection is achieved using boosting to give an efficent classifier which can be applied to a large number of classes. The aim of our project was to apply TextonBoost to determine where there are occupied and vacant spaces in a parking lot. Initially, the codebase worked on 24 classes, but we had to sift through the codebase and instead add two new categories, occupied and unoccupied. Our team leveraged two datasets to train our model for this purpose. The PKLot and UIUC datasets, each housing many images that show cars and parking spaces in different poses and/or lighting conditions. When using the UIUC dataset, the images had varying results, but when using the PKLot dataset with default parameters, We were able to determine where some occupied spots were with relatively little difficulty. Approach/Algorithm A simplified explanation of TextonBoost is: Learn a boosting classifier based on relative texture locations for each class. The classification is then refined by combining various weak classifiers and then combining them to construct a classifier that has a higher accuracy than any of the individual classifiers would produce on their own. Pre-existing Algorithm Given an image and for each pixel: Texture-Layout features are calculated1. A boosting classifier gives the probability of the pixel belonging to each class2. The model combines the boosting output with shape, location, color and edge information3. Pixel receives final label.4. Image receives final label.5. Extensions to existing TextonBoost framework The original algorithm used a filter bank with 3 Gaussians, 4 Laplacian of Gaussians and 4 first order derivatives of Gaussians. In addition to these filters, we augmented the filter bank to also use dense SIFT. Dense SIFT differs from regular SIFT in that dense SIFT assumes every pixel is a point of interest. We also modified the approach of the framework to only care about 3 classes, void, car, unoccupied. When working with the data set UIUC, the algorithm needed to be altered to account for gray scale images. To process a gray scale image they first need to converted to a 3 dimensional image matrix to simulate an RGB image matrix. Than the code needs to be altered, so that Hue and Saturation are not used when textonizing images. Removing Hue and Saturation results in decreasing the number of features per pixel from 17 to 11. Tweaking the Parameters The pixel feature descriptors that were aggregated from applying the various filters are clustered using k-means. We varied the value of k for k-means clustering to determine the optimal number of clusters for our model. Additionally, to balance the trade-off between too much noise in the image from image textonization, keeping enough information for accuracy and speed of training, the image in testing and the training set are sampled with a filter. We also varied the size of the filter used in the various filter banks that already existed in the framework. When using the UIUC data set, the number of features decreases from 17 to 11. So the parameter numFeatures in the file imgTextonization.m must be changed to 11. This parameter change will account for less features used, and result in a smaller code book generated. CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p... 1 of 6 3/28/17, 12:42 AM
  • 2. Results Problems with Input Images1. All of the Input Images in the PKLot data set span a large parking lot. Since the input images are not taken from the zenith of the center of the parking lot, the resulting images will result in objects of varying size because parking spaces at one of end of the lot can be much smaller than spaces at another end of the lot. The objects (e.g. cars, unoccupied spots, etc.) in the foreground will be much larger and more defined than the objects in the background. To account for this problem we cropped the original images to capture a small location, where the object's have less varying sizes. Problems with Ground Truth Images2. In a perfect world the Ground Truth Images should have every pixel colored as either void, car, or unoccupied. Unfortunately because the PKLot metadata associated with the images did not describe complex polygons that exactly match the contours of a car in a parking spot. Instead the PKLot data set simply provided rectangles that approximately cover a parking spot. This metadata causes issues because there is extraneous information used to describe a car. This means that if, for example, there is a crowded parking lot in the training set and we have a given spot as empty, the model will train on ground truth images that have shadows from adjacent cars and think that these shadows characterize an empty spot. In addition, the Ground Truth metadata provided by PKLot also has a small subset of polygons that would overlap with each other. This will cause issues in the training phase because pixels associated with a given class can be classified inaccurately. For example, consider an empty parking spot with two cars on each side of it. If the Ground Truth image for an occupied spot covers a portion of the two cars, unwanted textures will be introduced to describe the unoccupied spot's class category. our new color classes Output Images First Pipeline with only SIFT features First pipeline with both SIFT and filter bank features CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p... 2 of 6 3/28/17, 12:42 AM
  • 3. UIUC dataset with only filter bank features UIUC dataset with only SIFT features PKLot dataset trained with segmented images with only SIFT features PKLot dataset trained with segmented images with only filter bank features UIUC dataset trained with segmented images with both filter bank and SIFT features and 5 clusters in k-means clustering UIUC dataset trained with segmented images with both filter bank and SIFT features and 10 clusters in k-means clustering Some figures CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p... 3 of 6 3/28/17, 12:42 AM
  • 4. Graph plotting False Negative vs. True Positive in varying outputs for unoccupied parking spots Graph plotting False Negative vs. True Positive in varying outputs for unoccupied parking spots RPC curve for unoccupied classification when varying number of clusters Legend: Point 10: Number of Clusters = 10. CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p... 4 of 6 3/28/17, 12:42 AM
  • 5. Point 3: Number of Clusters = 3. Point 5: Number of Clusters = 5. Point 20: Number of Clusters = 20. RPC curve for occupied classification when varying number of clusters Legend: Point 10: Number of Clusters = 10. Point 3: Number of Clusters = 3. Point 5: Number of Clusters = 5. Point 20: Number of Clusters = 20. Discussion and Conclusions In hindsight, TextonBoost was limited by our data set, the code base, and the similarity of the classes. The codebase seemed to use k-means clustering at an odd time. We posited that k-means clustering should be used on all of the data at one time. Instead, the codebase seemed to use k-means clustering on each image independent of the other images. Understanding all the various pieces that made the code run, from textonization to training of the model, all involved multiple files and assumed the user of the framework had an intimate understanding of the approach. Introducing dense SIFT was a challenge because the filter-bank used initially on each pixel worked on all pixels. Unfortunately, dense SIFT cannot operate on pixels too close to a boundary. Our team overcame this challenge by adjusting the size that the filter-bank dealt with to the same dimensions as the dense-SIFT portion of the image. Trying to find a good data set was difficult. While testing the functionality of our code we came across many problems caused by input images. With the PKLot data set one of the most obvious problems we noticed was that objects' size varied based on location within the image. In order to get input images with objects of relatively similar size we cropped a small area out of the original images in the PKLot data set. Cropping the original images also had the benefit of speeding up the code because there would be less pixels to train on. We also tried cropping individual objects of interest out of the PKLot data set images. Cropping individual images provided an additional data set, where we could train on either cars or empty parking spots. The difference between these segmented images and the original images with an entire parking lot in view is that the original images included other objects like grass, roads, & people that could interfere with training. The result of these segmented images provided input images with less noise. References MSRC framework from Microsoft Research paper from Microsoft TextonBoost framework in Matlab documentation CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p... 5 of 6 3/28/17, 12:42 AM
  • 6. CS 385 Final Project file:///Users/joemogannam/Desktop/FINALCS385Project/385 final p... 6 of 6 3/28/17, 12:42 AM