SlideShare une entreprise Scribd logo
1  sur  2
Clarkson Honors Program Thesis Proposal

                   Altering the AdaBoost algorithm to produce a new boosting method
                  yielding more accurate results under the same number of repetitions.

April 5, 2000
Daniel Lawry
Professor Christino Tamon, Advisor

Topics:


          Boosting is a method used implicitly to improve the accuracy of learning algorithms. Boosting's
roots lie in a theoretical framework for studying machine learning called "PAC" learning model. The
creators of this model: Kearns and Valiant, presented the hypothesis that a "weak" learning algorithm, an
algorithm which produces results slightly better than random guessing, in the PAC model can be boosted,
increasing the weak learning algorithm's accuracy and creating a "strong" learning algorithm. Currently, a
boosting algorithm called AdaBoost produces the desired increase in accuracy given a weak learning
algorithm. The AdaBoost method utilizes the weak learning method it is given and a training set (xl,yl),...,
(xm,ym) where xi belongs to a domain X and each label yi is in some label set Y. AdaBoost then calls the
weak learning algorithm repeatedly in a series of T rounds giving weights to the training sets and
updating the weights of these sets each repetition by utilizing data from the last run of the weak
predictor and current weights. These weights will increase or decrease as they are run through the
method, yielding more accurate results based on these new weights each time the training set is run
through the weak learning algorithm. It is believed that by eliminating the last k runs of the weak learning
algorithm where k < t where t is the number of times the weak learning algorithm has been used so far
will force this method to produce more accurate results with the same amount of repetitions. The
elimination of the last k runs forces the current run's data to draw on a smaller set of output from the weak
hypothesis repetitions. By doing this, the hope is that the algorithm will place more of an emphasis on the
runs left in the hypothesis repetitions forcing it to have to become more accurate faster. The parameters to
investigate include the appropriate value of k based on the weak learning algorithm and the number of
repetitions, T, of that algorithm. The investigation will also involve developing this new boosting method
and testing it against the AdaBoost method.


Methodology:


The new boosting method will be developed and constructed for testing purposes using the C
programming language. Likewise the AdaBoost method will be constructed using the C programming
language. A formula to maximize the value of k using the number of repetitions, T, and the efficiency of
the weak learning method will be formed. This formula will then be tested in conjuncture with the new
boosting method and variations on the variable k. Once the optimal value of k is achieved, the two
methods will be run on the same training sets and the resulting data will be compared as to see which
method yields more accurate results.

Contenu connexe

En vedette

Lead To Win Bootcamp - Day 3
Lead To Win Bootcamp - Day 3Lead To Win Bootcamp - Day 3
Lead To Win Bootcamp - Day 3Lead To Win
 
State Owned Enterprises (SOEs)
State Owned Enterprises (SOEs)State Owned Enterprises (SOEs)
State Owned Enterprises (SOEs)ISLs Unisa Library
 
Human action recognition using local space time features and adaboost svm
Human action recognition using local space time features and adaboost svmHuman action recognition using local space time features and adaboost svm
Human action recognition using local space time features and adaboost svmeSAT Publishing House
 
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...I3E Technologies
 
Handout14
Handout14Handout14
Handout14butest
 
HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...
HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...
HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...ijdpsjournal
 
Download It
Download ItDownload It
Download Itbutest
 
A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...
A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...
A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...Hadi Santoso
 
Poggi analytics - ensamble - 1b
Poggi   analytics - ensamble - 1bPoggi   analytics - ensamble - 1b
Poggi analytics - ensamble - 1bGaston Liberman
 
Ada boost brown boost performance with noisy data
Ada boost brown boost performance with noisy dataAda boost brown boost performance with noisy data
Ada boost brown boost performance with noisy dataShadhin Rahman
 
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...butest
 
Kato Mivule: An Overview of Adaptive Boosting – AdaBoost
Kato Mivule: An Overview of  Adaptive Boosting – AdaBoostKato Mivule: An Overview of  Adaptive Boosting – AdaBoost
Kato Mivule: An Overview of Adaptive Boosting – AdaBoostKato Mivule
 
Datamining 4th Adaboost
Datamining 4th AdaboostDatamining 4th Adaboost
Datamining 4th Adaboostsesejun
 
Cliffs Notes on Computer Vision
Cliffs Notes on Computer VisionCliffs Notes on Computer Vision
Cliffs Notes on Computer VisionNatalie Kunstadter
 
A neural ada boost based facial expression recogniton System
A neural ada boost based facial expression recogniton SystemA neural ada boost based facial expression recogniton System
A neural ada boost based facial expression recogniton SystemInternational Islamic University
 
Cleantech Innovations (CTEK) files Injunctive Relief Against NASDAQ
Cleantech Innovations (CTEK) files Injunctive Relief Against NASDAQCleantech Innovations (CTEK) files Injunctive Relief Against NASDAQ
Cleantech Innovations (CTEK) files Injunctive Relief Against NASDAQGeoInvesting LLC
 

En vedette (20)

Lead To Win Bootcamp - Day 3
Lead To Win Bootcamp - Day 3Lead To Win Bootcamp - Day 3
Lead To Win Bootcamp - Day 3
 
Guidance & counseling report
Guidance & counseling reportGuidance & counseling report
Guidance & counseling report
 
State Owned Enterprises (SOEs)
State Owned Enterprises (SOEs)State Owned Enterprises (SOEs)
State Owned Enterprises (SOEs)
 
Human action recognition using local space time features and adaboost svm
Human action recognition using local space time features and adaboost svmHuman action recognition using local space time features and adaboost svm
Human action recognition using local space time features and adaboost svm
 
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
 
Handout14
Handout14Handout14
Handout14
 
Adaboost Clustering In Defining Los Criteria of Mumbai City
Adaboost Clustering In Defining Los Criteria of Mumbai CityAdaboost Clustering In Defining Los Criteria of Mumbai City
Adaboost Clustering In Defining Los Criteria of Mumbai City
 
HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...
HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...
HIGHLY SCALABLE, PARALLEL AND DISTRIBUTED ADABOOST ALGORITHM USING LIGHT WEIG...
 
Download It
Download ItDownload It
Download It
 
A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...
A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...
A Parallel Architecture for Multiple-Face Detection Technique Using AdaBoost ...
 
Poggi analytics - ensamble - 1b
Poggi   analytics - ensamble - 1bPoggi   analytics - ensamble - 1b
Poggi analytics - ensamble - 1b
 
Ada boost brown boost performance with noisy data
Ada boost brown boost performance with noisy dataAda boost brown boost performance with noisy data
Ada boost brown boost performance with noisy data
 
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...
 
Kato Mivule: An Overview of Adaptive Boosting – AdaBoost
Kato Mivule: An Overview of  Adaptive Boosting – AdaBoostKato Mivule: An Overview of  Adaptive Boosting – AdaBoost
Kato Mivule: An Overview of Adaptive Boosting – AdaBoost
 
Datamining 4th Adaboost
Datamining 4th AdaboostDatamining 4th Adaboost
Datamining 4th Adaboost
 
Cliffs Notes on Computer Vision
Cliffs Notes on Computer VisionCliffs Notes on Computer Vision
Cliffs Notes on Computer Vision
 
A neural ada boost based facial expression recogniton System
A neural ada boost based facial expression recogniton SystemA neural ada boost based facial expression recogniton System
A neural ada boost based facial expression recogniton System
 
Ada boost
Ada boostAda boost
Ada boost
 
18.02.2011, NEWSWIRE, Issue 155
18.02.2011, NEWSWIRE, Issue 15518.02.2011, NEWSWIRE, Issue 155
18.02.2011, NEWSWIRE, Issue 155
 
Cleantech Innovations (CTEK) files Injunctive Relief Against NASDAQ
Cleantech Innovations (CTEK) files Injunctive Relief Against NASDAQCleantech Innovations (CTEK) files Injunctive Relief Against NASDAQ
Cleantech Innovations (CTEK) files Injunctive Relief Against NASDAQ
 

Similaire à Lawry-Daniel.doc

SemiBoost: Boosting for Semi-supervised Learning
SemiBoost: Boosting for Semi-supervised LearningSemiBoost: Boosting for Semi-supervised Learning
SemiBoost: Boosting for Semi-supervised Learningbutest
 
Learning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification DataLearning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification Data萍華 楊
 
Ensemble hybrid learning technique
Ensemble hybrid learning techniqueEnsemble hybrid learning technique
Ensemble hybrid learning techniqueDishaSinha9
 
A HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MININGA HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MININGcscpconf
 
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...ijistjournal
 
Presentation
PresentationPresentation
Presentationbutest
 
Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)Hayim Makabee
 
PPT SLIDES
PPT SLIDESPPT SLIDES
PPT SLIDESbutest
 
PPT SLIDES
PPT SLIDESPPT SLIDES
PPT SLIDESbutest
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsDinusha Dilanka
 
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...cscpconf
 
Lazy Association Classification
Lazy Association ClassificationLazy Association Classification
Lazy Association ClassificationJason Yang
 
Ensemble learning
Ensemble learningEnsemble learning
Ensemble learningHaris Jamil
 
Artificial Intelligence.pptx
Artificial Intelligence.pptxArtificial Intelligence.pptx
Artificial Intelligence.pptxKaviya452563
 
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...ijistjournal
 
Machine learning (5)
Machine learning (5)Machine learning (5)
Machine learning (5)NYversity
 

Similaire à Lawry-Daniel.doc (20)

SemiBoost: Boosting for Semi-supervised Learning
SemiBoost: Boosting for Semi-supervised LearningSemiBoost: Boosting for Semi-supervised Learning
SemiBoost: Boosting for Semi-supervised Learning
 
Learning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification DataLearning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification Data
 
Ensemble hybrid learning technique
Ensemble hybrid learning techniqueEnsemble hybrid learning technique
Ensemble hybrid learning technique
 
Machine learning
Machine learningMachine learning
Machine learning
 
A HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MININGA HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MINING
 
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
 
Presentation
PresentationPresentation
Presentation
 
Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)
 
PPT SLIDES
PPT SLIDESPPT SLIDES
PPT SLIDES
 
PPT SLIDES
PPT SLIDESPPT SLIDES
PPT SLIDES
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning Algorithms
 
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
 
Lazy Association Classification
Lazy Association ClassificationLazy Association Classification
Lazy Association Classification
 
Ensemble learning
Ensemble learningEnsemble learning
Ensemble learning
 
Ensemblelearning 181220105413
Ensemblelearning 181220105413Ensemblelearning 181220105413
Ensemblelearning 181220105413
 
One Graduate Paper
One Graduate PaperOne Graduate Paper
One Graduate Paper
 
18.1 combining models
18.1 combining models18.1 combining models
18.1 combining models
 
Artificial Intelligence.pptx
Artificial Intelligence.pptxArtificial Intelligence.pptx
Artificial Intelligence.pptx
 
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
Implementation of Naive Bayesian Classifier and Ada-Boost Algorithm Using Mai...
 
Machine learning (5)
Machine learning (5)Machine learning (5)
Machine learning (5)
 

Plus de butest

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEbutest
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jacksonbutest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer IIbutest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazzbutest
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.docbutest
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1butest
 
Facebook
Facebook Facebook
Facebook butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...butest
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTbutest
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docbutest
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docbutest
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.docbutest
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!butest
 

Plus de butest (20)

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBE
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jackson
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer II
 
PPT
PPTPPT
PPT
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.doc
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1
 
Facebook
Facebook Facebook
Facebook
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENT
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.doc
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.doc
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.doc
 
hier
hierhier
hier
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!
 

Lawry-Daniel.doc

  • 1. Clarkson Honors Program Thesis Proposal Altering the AdaBoost algorithm to produce a new boosting method yielding more accurate results under the same number of repetitions. April 5, 2000 Daniel Lawry Professor Christino Tamon, Advisor Topics: Boosting is a method used implicitly to improve the accuracy of learning algorithms. Boosting's roots lie in a theoretical framework for studying machine learning called "PAC" learning model. The creators of this model: Kearns and Valiant, presented the hypothesis that a "weak" learning algorithm, an algorithm which produces results slightly better than random guessing, in the PAC model can be boosted, increasing the weak learning algorithm's accuracy and creating a "strong" learning algorithm. Currently, a boosting algorithm called AdaBoost produces the desired increase in accuracy given a weak learning algorithm. The AdaBoost method utilizes the weak learning method it is given and a training set (xl,yl),..., (xm,ym) where xi belongs to a domain X and each label yi is in some label set Y. AdaBoost then calls the weak learning algorithm repeatedly in a series of T rounds giving weights to the training sets and updating the weights of these sets each repetition by utilizing data from the last run of the weak predictor and current weights. These weights will increase or decrease as they are run through the method, yielding more accurate results based on these new weights each time the training set is run through the weak learning algorithm. It is believed that by eliminating the last k runs of the weak learning algorithm where k < t where t is the number of times the weak learning algorithm has been used so far will force this method to produce more accurate results with the same amount of repetitions. The elimination of the last k runs forces the current run's data to draw on a smaller set of output from the weak hypothesis repetitions. By doing this, the hope is that the algorithm will place more of an emphasis on the runs left in the hypothesis repetitions forcing it to have to become more accurate faster. The parameters to investigate include the appropriate value of k based on the weak learning algorithm and the number of repetitions, T, of that algorithm. The investigation will also involve developing this new boosting method and testing it against the AdaBoost method. Methodology: The new boosting method will be developed and constructed for testing purposes using the C programming language. Likewise the AdaBoost method will be constructed using the C programming
  • 2. language. A formula to maximize the value of k using the number of repetitions, T, and the efficiency of the weak learning method will be formed. This formula will then be tested in conjuncture with the new boosting method and variations on the variable k. Once the optimal value of k is achieved, the two methods will be run on the same training sets and the resulting data will be compared as to see which method yields more accurate results.