A random forest approach to skin detection with r

•Télécharger en tant que PPTX, PDF•

0 j'aime•1,932 vues

The document discusses using random forests for skin detection from images. It provides an agenda for a presentation which includes an overview of the skin detection scheme using random forests, a refresher on random forests and R support, and results and continuing work. Code and the dataset are available online. Random forests show the best performance for skin detection compared to other models. The presenter's results are incomplete due to a small training set and they plan to use a parallel computing cluster going forward.

Technologie

Auro Tripathy
auro@shatterline.com

*Random Forests are registered trademarks of Leo Breiman and Adele Cutler

 Attributions, code and dataset location (1
minute)
 Overview of the scheme (2 minutes)
 Refresher on Random Forest and R
Support (2 minutes)
 Results and continuing work (1 minute)
 Q&A (1 minute and later)

ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5651638

 R code available here; my contribution
 http://www.shatterline.com/SkinDetection.html
 Data set available here
 http://www.feeval.org/Data-sets/Skin_Colors.html
 Permission to use may be required

 All training sets organized as a two-movie
sequence
1. A movies sequence of frames in color
2. A corresponding sequence of frames in binary
black-and-white, the ground-truth
 Extract individual frames in jpeg format
using ffmpeg, a transcoding tool
ffmpeg -i 14.avi -f image2 -ss 1.000 -vframes 1
14_500offset10s.jpeg

ffmpeg -i 14_gt_500frames.avi -f image2 -ss 1.000 -vframes 1
14_gt_500frames_offset10s.jpeg

Image Ground-truth

The original authors used 8991 such image-pairs, the image along with
its manually annotated pixel-level ground-truth.

 Skin-color classification/segmentation
 Uses Improved Hue, Saturation, Luminance
(IHLS) color-space
 RBG values transformed to HLS
 HLS used as feature-vectors
 Original authors also experimented with
 Bayesian network,
 Multilayer Perceptron,
 SVM,
 AdaBoost (Adaptive Boosting),
 Naive Bayes,
 RBF network

“Random Forest shows the best performance in terms of accuracy,
precision and recall”

The most important property of this [IHLS] space is a “well-
behaved” saturation coordinate which, in contrast to commonly
used ones, always has a small numerical value for near-
achromatic colours, and is completely independent of the
brightness function
A 3D-polar Coordinate Colour Representation Suitable for
Image, Analysis Allan Hanbury and Jean Serra

MATLAB routines implementing the RGB-to-IHLS and IHLS-to-RGB are
available at http://www.prip.tuwien.ac.at/˜hanbury.

R routines implementing the RGB-to-IHLS and IHLS-to-RGB are
available at http://www.shatterline.com/SkinDetection.html

 Package „ReadImages‟
 This package provides functions for reading
JPEG and PNG ﬁles
 Package „randomForest‟
 Breiman and Cutler‟s Classification and
regression based on a forest of trees using
random inputs.
 Package „foreach‟
 Support for the foreach looping construct
 Stretch goal to use %dopar%

set.seed(371)
skin.rf <- foreach(i = c(1:nrow(training.frames.list)), .combine=combine,
.packages='randomForest') %do%
{
#Read the Image
#transform from RGB to IHLS
#Read the corresponding ground-truth image
#data is ready, now apply random forest #not using the formula interface
randomForest(table.data, y=table.truth, mtry = 2, importance = FALSE,
proximity = FALSE, ntree=10, do.trace = 100)

}

table.pred.truth <- predict(skin.rf, test.table.data)

 Have lots of decision-tree learners
 Each learner‟s training set is sampled
independently – with replacement
 Add more randomness – at each node of
the tree, the splitting attribute is selected
from a randomly chosen sample of
attributes

Each decision tree votes
for a classification

Forest chooses a
classification with the
most votes

 Quick training phase
 Trees can grow in parallel
 Trees have attractive computing
properties
 For example…
 Computation cost of making a binary tree is
low O(N Log N)
 Cost of using a tree is even lower – O(Log N)
 N is the number of data points
 Applies to balanced binary trees; decision
trees often not balanced

ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5651638
My Results? OK, but incomplete due to very small training set.
Need parallel computing cluster

Recommandé

Hands-on Tutorial of Machine Learning in PythonChun-Ming Chang

1873 1878Editor IJARCET

TensorFlow in 3 sentencesBarbara Fusinska

Presentation on experimental setup for verigying - "Slow Learners are F...Robin Srivastava

Discrete cosine transform Rashmi Karkra

An Approach for Image Deblurring: Based on Sparse Representation and Regulari...IRJET Journal

Neural Learning to RankBhaskar Mitra

Image Resolution Enhancement using DWT and Spatial Domain Interpolation Techn...IJERA Editor

Recommandé

Hands-on Tutorial of Machine Learning in PythonChun-Ming Chang

1873 1878Editor IJARCET

TensorFlow in 3 sentencesBarbara Fusinska

Presentation on experimental setup for verigying - "Slow Learners are F...Robin Srivastava

Discrete cosine transform Rashmi Karkra

An Approach for Image Deblurring: Based on Sparse Representation and Regulari...IRJET Journal

Neural Learning to RankBhaskar Mitra

Image Resolution Enhancement using DWT and Spatial Domain Interpolation Techn...IJERA Editor

Neural Learning to RankBhaskar Mitra

Introduction to Convolutional Neural NetworksHannes Hapke

Non-Blind Deblurring Using Partial Differential Equation MethodEditor IJCATR

IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor

Workshop - Introduction to Machine Learning with RShirin Elsinghorst

APPLIED MACHINE LEARNINGRevanth Kumar

Case Study of Convolutional Neural NetworkNamHyuk Ahn

Image compression 14_04_2020 (1)Joel P

03 image transformations_iankit_ppt

DCTaniruddh Tyagi

Image classification with Deep Neural NetworksYogendra Tamang

F0533134IOSR Journals

Learning to Rank with Neural NetworksBhaskar Mitra

캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)NAVER Engineering

Thesisjoseangl

GBM package in rmark_landry

07 learningankit_ppt

Investigations on the role of analysis window shape parameter in speech enhan...karthik annam

2021 03-02-transformer interpretabilityJAEMINJEONG5

Deep neural networks & computational graphsRevanth Kumar

Why Graphics Is Fast, and What It Can Teach Us About Parallel ProgrammingJonathan Ragan-Kelley

Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves MabialaSpark Summit

Contenu connexe

Tendances

Neural Learning to RankBhaskar Mitra

Introduction to Convolutional Neural NetworksHannes Hapke

Non-Blind Deblurring Using Partial Differential Equation MethodEditor IJCATR

IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor

Workshop - Introduction to Machine Learning with RShirin Elsinghorst

APPLIED MACHINE LEARNINGRevanth Kumar

Case Study of Convolutional Neural NetworkNamHyuk Ahn

Image compression 14_04_2020 (1)Joel P

03 image transformations_iankit_ppt

DCTaniruddh Tyagi

Image classification with Deep Neural NetworksYogendra Tamang

F0533134IOSR Journals

Learning to Rank with Neural NetworksBhaskar Mitra

캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)NAVER Engineering

Thesisjoseangl

GBM package in rmark_landry

07 learningankit_ppt

Investigations on the role of analysis window shape parameter in speech enhan...karthik annam

2021 03-02-transformer interpretabilityJAEMINJEONG5

Deep neural networks & computational graphsRevanth Kumar

Tendances (20)

Neural Learning to Rank

Introduction to Convolutional Neural Networks

Non-Blind Deblurring Using Partial Differential Equation Method

IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...

Workshop - Introduction to Machine Learning with R

APPLIED MACHINE LEARNING

Case Study of Convolutional Neural Network

Image compression 14_04_2020 (1)

03 image transformations_i

DCT

Image classification with Deep Neural Networks

F0533134

Learning to Rank with Neural Networks

캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)

Thesis

GBM package in r

07 learning

Investigations on the role of analysis window shape parameter in speech enhan...

2021 03-02-transformer interpretability

Deep neural networks & computational graphs

Similaire à A random forest approach to skin detection with r

Why Graphics Is Fast, and What It Can Teach Us About Parallel ProgrammingJonathan Ragan-Kelley

Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves MabialaSpark Summit

Deep Learning Based Voice Activity Detection and Speech EnhancementNAVER Engineering

PT-4054, "OpenCL™ Accelerated Compute Libraries" by John MelonakosAMD Developer Central

Generating super resolution images using transformersNEERAJ BAGHEL

530 535Editor IJARCET

Separating Hype from Reality in Deep Learning with Sameer FarooquiDatabricks

ECCV2010: feature learning for image classification, part 4zukun

Deep Learning and Watson StudioSasha Lazarevic

A Platform for Accelerating Machine Learning ApplicationsNVIDIA Taiwan

CudaTree (GTC 2014)Alex Rubinsteyn

机器学习AdaboostShocky1

Fann tool users_guideBirol Kuyumcu

Massive Matrix Factorization : Applications to collaborative filteringArthur Mensch

Dictionary Learning for Massive Matrix Factorizationrecsysfr

Why Image compression is Necessary?Prabhat Kumar

Image De-Noising Using Deep Neural Networkaciijournal

Super Resolution with OCR OptimizationniveditJain

Classification of chestnuts with feature selection by noise resilient classif...Elena Roglia

OpenAI Retro ContestKIYONARI HARIGAE

Similaire à A random forest approach to skin detection with r (20)

Why Graphics Is Fast, and What It Can Teach Us About Parallel Programming

Deep Recurrent Neural Networks for Sequence Learning in Spark by Yves Mabiala

Deep Learning Based Voice Activity Detection and Speech Enhancement

PT-4054, "OpenCL™ Accelerated Compute Libraries" by John Melonakos

Generating super resolution images using transformers

530 535

Separating Hype from Reality in Deep Learning with Sameer Farooqui

ECCV2010: feature learning for image classification, part 4

Deep Learning and Watson Studio

A Platform for Accelerating Machine Learning Applications

CudaTree (GTC 2014)

机器学习Adaboost

Fann tool users_guide

Massive Matrix Factorization : Applications to collaborative filtering

Dictionary Learning for Massive Matrix Factorization

Why Image compression is Necessary?

Image De-Noising Using Deep Neural Network

Super Resolution with OCR Optimization

Classification of chestnuts with feature selection by noise resilient classif...

OpenAI Retro Contest

Plus de Dmitry Makarchuk

Linzer slides-barugDmitry Makarchuk

2012 11-28 rich web data modeling with graphs-1Dmitry Makarchuk

Hadoop and mysql by Chris SchneiderDmitry Makarchuk

"Your script just killed my site" by Steve SoudersDmitry Makarchuk

RBrowserPlugin Project (Gabriel Becker)Dmitry Makarchuk

Bridge to rDmitry Makarchuk

Builiding analytical apps on HadoopDmitry Makarchuk

Jesse Yates: Hbase snapshots patchDmitry Makarchuk

Phoenix h basemeetupDmitry Makarchuk

Mongo DB in gaming industryDmitry Makarchuk

Plus de Dmitry Makarchuk (11)

Linzer slides-barug

2012 11-28 rich web data modeling with graphs-1

Hadoop and mysql by Chris Schneider

"Your script just killed my site" by Steve Souders

RBrowserPlugin Project (Gabriel Becker)

Bridge to r

Builiding analytical apps on Hadoop

Jesse Yates: Hbase snapshots patch

Phoenix h basemeetup

Mongo DB in gaming industry

Dernier

Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi

Slack Application Development 101 Slidespraypatel2

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Histor y of HAM Radio presentation slidevu2urc

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Scaling API-first – The story of a global engineering organizationRadu Cotescu

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

How to convert PDF to text with Nanonetsnaman860154

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Salesforce Community Group Quito, Salesforce 101Paola De la Torre

Dernier (20)

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

Slack Application Development 101 Slides

Presentation on how to chat with PDF using ChatGPT code interpreter

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Histor y of HAM Radio presentation slide

Driving Behavioral Change for Information Management through Data-Driven Gree...

CNv6 Instructor Chapter 6 Quality of Service

Axa Assurance Maroc - Insurer Innovation Award 2024

Scaling API-first – The story of a global engineering organization

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Boost PC performance: How more available memory can improve productivity

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

Breaking the Kubernetes Kill Chain: Host Path Mount

How to convert PDF to text with Nanonets

08448380779 Call Girls In Friends Colony Women Seeking Men

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Salesforce Community Group Quito, Salesforce 101

A random forest approach to skin detection with r

1. Auro Tripathy auro@shatterline.com *Random Forests are registered trademarks of Leo Breiman and Adele Cutler

2.  Attributions, code and dataset location (1 minute)  Overview of the scheme (2 minutes)  Refresher on Random Forest and R Support (2 minutes)  Results and continuing work (1 minute)  Q&A (1 minute and later)

3. ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5651638

4.  R code available here; my contribution  http://www.shatterline.com/SkinDetection.html  Data set available here  http://www.feeval.org/Data-sets/Skin_Colors.html  Permission to use may be required

5.  All training sets organized as a two-movie sequence 1. A movies sequence of frames in color 2. A corresponding sequence of frames in binary black-and-white, the ground-truth  Extract individual frames in jpeg format using ffmpeg, a transcoding tool ffmpeg -i 14.avi -f image2 -ss 1.000 -vframes 1 14_500offset10s.jpeg ffmpeg -i 14_gt_500frames.avi -f image2 -ss 1.000 -vframes 1 14_gt_500frames_offset10s.jpeg

6. Image Ground-truth The original authors used 8991 such image-pairs, the image along with its manually annotated pixel-level ground-truth.

7.  Attributions, code and dataset location (1 minute)  Overview of the scheme (2 minutes)  Refresher on Random Forest and R Support (2 minutes)  Results and continuing work (1 minute)  Q&A (1 minute and later)

8.  Skin-color classification/segmentation  Uses Improved Hue, Saturation, Luminance (IHLS) color-space  RBG values transformed to HLS  HLS used as feature-vectors  Original authors also experimented with  Bayesian network,  Multilayer Perceptron,  SVM,  AdaBoost (Adaptive Boosting),  Naive Bayes,  RBF network “Random Forest shows the best performance in terms of accuracy, precision and recall”

9. The most important property of this [IHLS] space is a “well- behaved” saturation coordinate which, in contrast to commonly used ones, always has a small numerical value for near- achromatic colours, and is completely independent of the brightness function A 3D-polar Coordinate Colour Representation Suitable for Image, Analysis Allan Hanbury and Jean Serra MATLAB routines implementing the RGB-to-IHLS and IHLS-to-RGB are available at http://www.prip.tuwien.ac.at/˜hanbury. R routines implementing the RGB-to-IHLS and IHLS-to-RGB are available at http://www.shatterline.com/SkinDetection.html

10.  Package „ReadImages‟  This package provides functions for reading JPEG and PNG ﬁles  Package „randomForest‟  Breiman and Cutler‟s Classification and regression based on a forest of trees using random inputs.  Package „foreach‟  Support for the foreach looping construct  Stretch goal to use %dopar%

11. set.seed(371) skin.rf <- foreach(i = c(1:nrow(training.frames.list)), .combine=combine, .packages='randomForest') %do% { #Read the Image #transform from RGB to IHLS #Read the corresponding ground-truth image #data is ready, now apply random forest #not using the formula interface randomForest(table.data, y=table.truth, mtry = 2, importance = FALSE, proximity = FALSE, ntree=10, do.trace = 100) } table.pred.truth <- predict(skin.rf, test.table.data)

12.  Attributions, code and dataset location (1 minute)  Overview of the scheme (2 minutes)  Refresher on Random Forest and R Support (2 minutes)  Results and continuing work (1 minute)  Q&A (1 minute and later)

13.  Have lots of decision-tree learners  Each learner‟s training set is sampled independently – with replacement  Add more randomness – at each node of the tree, the splitting attribute is selected from a randomly chosen sample of attributes

14. Each decision tree votes for a classification Forest chooses a classification with the most votes

15.  Quick training phase  Trees can grow in parallel  Trees have attractive computing properties  For example…  Computation cost of making a binary tree is low O(N Log N)  Cost of using a tree is even lower – O(Log N)  N is the number of data points  Applies to balanced binary trees; decision trees often not balanced

16.  Attributions, code and dataset location (1 minute)  Overview of the scheme (2 minutes)  Refresher on Random Forest and R Support (2 minutes)  Results and continuing work (1 minute)  Q&A (1 minute and later)

17. ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5651638 My Results? OK, but incomplete due to very small training set. Need parallel computing cluster

18.  Attributions, code and dataset location (1 minute)  Overview of the scheme (2 minutes)  Refresher on Random Forest and R Support (2 minutes)  Results and continuing work (1 minute)  Q&A (1 minute and later)

Notes de l'éditeur

I’m the opening act before the real show. An opening act or warm-up act (in British English and Australia, supporting act) is an entertainer, musician, band, or entertainment act that performs at a concert before the featured (or headline) entertainer/musician(s). Rarely, an opening act may perform again at the end of the concert.The opening act's performance serves to "warm up" the audience, making it appropriately excited and enthusiastic for the headliner.
How many of you were in the previous MeetUp? Thank the organizers
Original implementation, probably in MATLAB,used in the paper.
R provides libraries to read JPEG – no surprise there
How many of you were in the previous MeetUp? Thank the organizers
Random forest is an ensemble classifier having a quicktraining phase and a very high generalization accuracy [10,11, 12]. It is successfully used in image classification [13],image matching [14], segmentation [15] and gesture recognition[16].
Why do you need the IHLS-to-RGB?
Anyone aware of a color-space conversion library
How many of you were in the previous MeetUp? Thank the organizers
What’s the theory? If we take a large collection of very poor learners (weak learners, in the jargon), each performing only better than chance, then by “putting them together”, it is possible to make an ensemble learner that can perform arbitrarily well.For growing trees, if the number of cases in the trainingset is N, sample N cases at random - but with replacement,from the original data. This sample will be the training setfor growing the tree. If there are M input variables, a numberm <<M is specified such that at each node, m variables areselected at random out of the M and the best split on thesem is used to split the node. The value of m is held constantduring the forest growing. Each tree is grown to the largestextent possible. There is no pruning. For classification, thefinal selection by the forest is based on the maximum votingamong the trees.
For classification, thefinal selection by the forest is based on the maximum votingamong the trees.
How many of you were in the previous MeetUp? Thank the organizers
How many of you were in the previous MeetUp? Thank the organizers