SlideShare a Scribd company logo
1 of 80
Download to read offline
Introduction to Genetics-based
             Machine Learning
                                Mahalingam.P.R
             Semester III, M.Tech CSESIS, RSET
Contents
       Background of Machine Learning
           About Machine Learning
           GA in Machine Learning
       Evolution of GBML
       Classifier System
           Rule and Message System
           Apportionment of Credit System
           Genetic Algorithm
       A simple classifier system in Pascal
       Conclusion


    2                                Introduction to GBML
Introduction




3   Introduction to GBML
Machine Learning
       A machine learns whenever it changes its structure,
        program, or data (based on its inputs or in response
        to external information) in such a manner that its
        expected future performance improves.
       Machine learning usually refers to the changes in
        systems that perform tasks associated with artificial
        intelligence (AI).
       Such tasks involve recognition, diagnosis, planning,
        robot control, prediction, etc. The “changes" might be
        either enhancements to already performing systems
        or ab-initio synthesis of new systems.

    4                             Introduction to GBML
A typical AI system

Here, the model progressively learns from the experience it earns from the environment




 5                                                 Introduction to GBML
Need for Machine Learning
       Some tasks cannot be defined well except by
        example.
           We might be able to specify input/output pairs but not a
            concise relationship between inputs and desired outputs.
            We would like machines to be able to adjust their internal
            structure to produce correct outputs for a large number of
            sample inputs
               suitably constrain their input/output function to approximate the
                relationship implicit in the examples.
       It is possible that hidden among large piles of data
        are important relationships and correlations.
            Machine learning methods can often be used to extract
            these relationships (data mining).

    6                                        Introduction to GBML
       Machine learning methods can be used for on-the-job
        improvement of existing machine designs.
       The amount of knowledge available about certain tasks
        might be too large for explicit encoding by humans.
           Machines that learn this knowledge gradually might be able to
            capture more of it than humans would want to write down.
       Environments change over time.
           Machines that can adapt to a changing environment would
            reduce the need for constant redesign.
       New knowledge about tasks is constantly                   being
        discovered by humans (Vocabulary changes).
           There is a constant stream of new events in the world.
            Continuing redesign of AI systems to conform to new
            knowledge is impractical.
           Machine learning methods might be able to track much of it.

    7                                   Introduction to GBML
More on Machine Learning
Contributions to Machine           Varieties of Machine
Learning                           Learning

       Statistics                    Functions
       Brain Models                  Logic Programs and
       Adaptive Control Theory        Rule sets
       Psychological Models          Finite-state machines
       Artificial Intelligence       Grammars
       Evolutionary Models           Problem solving
                                       systems



    8                             Introduction to GBML
GA in Machine Learning
       Conventional GA systems work with the following
        properties:
           Probabilistic
           Random
           Enumerative
       The structures adapted cater to the human-like
        mechanism adopted in the methodology.
       This adaptation itself is the biggest hindrance when
        incorporating GA into more complex, less completely
        defined environments.
           Too much “guesses” when it comes to searching
           Methodology safe under the “sandbox” of searching

    9                                 Introduction to GBML
Coming Up
                                                   Origins of GBML
   Study Machine Learning systems                 Systems
    that use genetic search as their
                                                   Classifier System
    primary discovery heuristic.
                                                   Operation of Classifier
                                                   Systems
   Adapting GA structures to work in              Implementation of a
    complex environments like Machine              Simple Classifier System
                                                   in Pascal
    Learning
                                                   Testing classifier in a
                                                   problem domain –
                                                   Learning a Boolean
                                                   function




    10                      Introduction to GBML
Origin of GBML systems




11         Introduction to GBML
    In nature, not only do individual animals learn to
     perform better, but species evolve to be better fit in
     their individual niches.
    Since the distinction between evolving and learning
     can be blurred in computer systems, techniques that
     model certain aspects of biological evolution have
     been proposed as learning methods to improve the
     performance of computer programs.
    Genetic algorithms [Holland, 1975] and genetic
     programming [Koza, 1992, Koza, 1994] are the most
     prominent computational techniques for evolution.

    12                         Introduction to GBML
Theoretical Foundation for GBML
    Laid by Holland (1962)
    Outline for Adaptive Systems Theory
        role of program replication as a method of emphasizing
         past programs.
    Fundamental role of recombination
        Holland (1965)
    Schemata Processors
        Holland (1968-1971)
    Application of a classifier system
        Holland and Reitman (1978)


    13                            Introduction to GBML
    Modern classifier systems resemble                 schemata
     processors in both outline and detail

    Holland suggested four prototypes in the initial
     proposal.
        No experiments or implementation have been reported
         yet!


    Proposal coincided with the development of the
     theory of schemata.


    14                           Introduction to GBML
Prototype 1                    Prototype 2

    Stimulus-response (SR)       Extend type 1 by adding
     processor that would          internal effectors
     link environmental            (internal states).
     schemata (conditions)
     with particular action
     effectors.




    15                        Introduction to GBML
Prototype 3                      Prototype 4

    Build upon types 1 and         Extend the other types
     2 by including explicit         by incorporating the
     environmental      state        capability to modify its
     prediction (a model of          own      effectors    and
     the real world), and an         detectors,      permitting
     internal      evaluation        greater (or lesser) range
     mechanism.                      of data detection and a
                                     larger        behavioural
                                     response.

    16                          Introduction to GBML
Broadcast language
    Derived from early proposals.
        Broad
        Unimplemented
    Creating broadcast units (production rules) from a 10-
     letter alphabet.
    The alphabet added wild card (single and multiple match)
     characters to an underlying binary alphabet.
    If included, the following would have given sufficient
     power      for   computational      completeness    and
     representational convenience.
        A fundamental punctuation mark, a persistence symbol for
         continued broadcast of a message.
        A quotation character for taking the next character literally.


    17                                Introduction to GBML
    The proposal for broadcast language was
     instrumental in unifying the earlier suggestions for
     schemata operators by:
        theoretically permitting a consistent representation of all
         operators, data, and rules or instructions.
    The generality gained in theory has not been
     realized in practice.




    18                              Introduction to GBML
First practical implementation
    Three years following the broadcast language
     proposal (Holland and Reitman, 1978).
    Cognitive System Level One (CS-1)
        Trained to learn two maze-running tasks.
        Performance system with message list and simple string
         rules (classifiers).
        GA composed of reproduction, crossover, mutation and
         crowding.
        Epochal learning mechanism where reward is
         apportioned to all classifiers active between successive
         payoff events.
    Learning mechanism has largely been supplanted by
     another mechanism – a bucket brigade.

    19                             Introduction to GBML
GBML Applications




20                  Introduction to GBML
21   Introduction to GBML
22   Introduction to GBML
23   Introduction to GBML
24   Introduction to GBML
Classifier System




25   Introduction to GBML
    A classifier system is a machine learning system that
     learns syntactically simple string rules (called
     classifiers) to guide its performance in an arbitrary
     environment.
    A classifier system consists of three main
     components:
        Rule and message system
        Apportionment of credit system
        Genetic algorithm




    26                             Introduction to GBML
    The rule and message system of a classifier system
     is a special kind of production system.
    A production system is a computational scheme that
     uses rules as its only algorithmic device. The general
     syntax of such systems are as follows:

                if <condition> then <action>

    The production means that the action may be taken
     (rule is “fired”) when the condition is satisfied.


    27                         Introduction to GBML
    It has been shown that production systems are
     computationally complete, as well as convenient.
    A simple rule or set of rules can represent a complex
     set of thoughts compactly.
    But when it comes to situations in need of learning,
     this is not advisable due to the complex rule syntax.
    Many production systems permit involved
     grammatical constructions for the condition and
     action part of the rule.



    28                         Introduction to GBML
    Classifier systems depart from the mainstream by
     restricting a rule to a fixed length representation.
    This restriction has two benefits.
        All strings under the permissible alphabet are syntactically
         meaningful
        A fixed string representation permits string operators of
         the genetic kind. This leaves the door open for a Genetic
         Algorithm search of the space of permissible rules.




    29                               Introduction to GBML
    Classifier systems use parallel rule activation,
     whereas traditional systems use serial rule
     activation.
    They permit multiple activities to be coordinated
     simultaneously. When choices must be made
     between mutually exclusive environmental actions,
     or size of matched rule set must be pruned to
     accommodate the fixed length message list, the
     choices are postponed to the last possible moment,
     and the arbitration is then performed competitively.



    30                         Introduction to GBML
    In traditional expert systems, the value or rating of a rule
     relative to other is fixed by the programmer in conjunction
     with the expert or group of experts to be emulated. But
     this is not possible in rule learning system.
    In such cases, the relative importance has to be
     “learned”. So, classifiers are forced to coexist in an
     information-based service economy. A competition is held
     among classifiers where the right to answer relevant
     messages goes to the highest bidder. Subsequent bids
     serve as the income for previously successful message
     senders.
    This competitive nature ensures that good rules
     (profitable ones) survive and the bad rules (unprofitable)
     die off.

    31                            Introduction to GBML
Internal Currency
    The exchange and accumulation of an internal
     currency provides a natural figure of merit for
     applying GA.
    Using the “bank balance” as a fitness function,
     classifiers may be reproduced, crossed, and
     mutated.
    So, it can rank extant rules, and discover new,
     possibly better rules by innovative combinations of
     old ones.
    Here, the stress is on “who gets replaced”, not
     “replace entire populations”.

    32                         Introduction to GBML
   Thus, apportionment of credit via
    competition and rule discovery                   Next…
    using GA form a reasonable basis                 Each component is
                                                     discussed in detail and
    for constructing a machine learning
                                                     the interconnections
    system atop the computationally                  studied.
    convenient and complete                          •Ruleand message
    framework of classifiers.                        system

                                                     •Apportionment   of credit

                                                     •Genetic Algorithm


                                                     Implementation in Pascal

                                                     Testing on real world
                                                     problem – learning a
                                                     Boolean function



    33                        Introduction to GBML
Rule and Message System




34          Introduction to GBML
This is the schematic of a
                            complete classifier
                            system.




35   Introduction to GBML
    The rule and message system forms the backbone
     of the silicon beast.
    Information flows from the environment through the
     detectors, where it is decoded to one or more finite
     length messages.
    The environmental messages are fed into a finite
     length message list where the messages activate
     string rules called classifiers.
    When activated, the classifier posts a message on to
     the message list, which may in-turn invoke other
     classifiers, or cause an action to be taken through
     the system’s action triggers called effectors.

    36                        Introduction to GBML
    Classifiers combine environmental cues and internal
     thoughts to determine what the system should do
     and think next.
    Coordinates the flow of information from where it is
     sensed (detectors) to where it is processed
     (message list and classifier store) to where it is
     called to action (effectors).
    Informational units
        Messages
        Classifiers



    37                        Introduction to GBML
    A message within a classifier system is simply a
     finite-length string over some finite alphabet. If we
     limit to the binary alphabet, we get
                      <message> ::= {0, 1}L
    This means taking the concatenation of 0s or 1s for
     “L” times.
    Messages are the basic token of information
     exchange in a classifier system.
    The messages on the message list may match one
     or more classifiers or string rules.


    38                         Introduction to GBML
    A classifier is a production rule with the syntax
              <classifier> ::= <condition>:<message>
    The condition is a simple pattern recognition device
     where a wild card (#) is added to the underlying alphabet.
                       <condition> ::= {0, 1, #}L
    So, the condition matches a message if at every position,
     a 0 in the message matches a 0 in the condition, 1
     matches a 1, or a # matches either a 0 or a 1.
        For example, #01# matches 0010, 0011, 1010 and 1011. But it
         doesn’t match 0000.
        Similar to schema in GA


    39                               Introduction to GBML
    Once the classifier’s condition is matched, it
     becomes a candidate to post its message to the
     message list in the next time step.
    Whether the candidate classifier posts its message
     is determined by the outcome of an activation
     auction, which in turn depends on evaluation of the
     classifier’s value or weighting.




    40                         Introduction to GBML
Sample simulation by hand
    Initial message list is as follows.




    41                           Introduction to GBML
42   Introduction to GBML
43   Introduction to GBML
44   Introduction to GBML
45   Introduction to GBML
46   Introduction to GBML
47   Introduction to GBML
48   Introduction to GBML
49   Introduction to GBML
Apportionment of Credit
                 Algorithm
        THE BUCKET BRIGADE




50         Introduction to GBML
    Many classifiers attempt to rank or rate the individual
     classifiers according to a classifier’s role in achieving
     reward from the environment.

    Most prevalent method
        Bucket Brigade Algorithm
            John Holland




    51                              Introduction to GBML
About the algorithm
    An information economy where the right to trade
     information is bought and sold by classifiers.
    Classifiers form a chain of middlemen from
     information manufacturer (the environment) to
     information consumer (the effectors).

    Components of service economy
        Auction
        ClearingHouse




    52                        Introduction to GBML
    When classifiers are matched, they don’t directly
     post their messages.
    A matching message entitles a classifier to
     participate in an activation auction.
        Each classifier maintains a record of its net worth.
            Called Strength (S).
        Each matched classifier makes a bid B.
            Bid proportional to its strength.
    In this way, rules that are highly fit (have
     accumulated a net worth) are given preference over
     other rules.

    53                                      Introduction to GBML
    Once a classifier is selected for activation, it must clear
     its payment through the clearinghouse, paying its bid to
     other classifiers for matching messages rendered.
    A matched and activated classifier sends its bid B to
     those classifiers responsible for sending the messages
     that matched the bidder classifier’s condition.
    Bid amount divided among the matching classifiers.
    Division of payoff among contributing classifiers helps
     ensure the formation of an appropriately sized
     subpopulation of rules.
    Different types of rules can cover different types of
     behavioral requirements without undue interspecies
     competition.

    54                             Introduction to GBML
    In a rule-learning system of any experience, we
      cannot search for one master rule.
     We must instead search for a co-adapted set of rules
      that together cover a range of behavior that provides
      ample payoff to the learning system.



Consider the
classifiers as before.


     55                         Introduction to GBML
    Now, lets follow the payments, with initial strength of
     200.




    56                          Introduction to GBML
57   Introduction to GBML
58   Introduction to GBML
59   Introduction to GBML
60   Introduction to GBML
61   Introduction to GBML
62   Introduction to GBML
    For steady receipts, the bid value approaches the
     receipt. For time-varying receipt values, we see that
     the bid is a geometrically weighted average of the
     input.

    As such, it acts as a filter of the possibly intermittent
     and noisy receipt values.




    63                           Introduction to GBML
Genetic Algorithm




64   Introduction to GBML
    Bucket brigade
        Clean procedure
            Evaluation of rules
            Decide among competing alternatives
    But we have to devise a way of injecting new rules
     into the system.
    Similar to SGA, we can inject new rules using the
     tripartite rules
        Reproduction
        Crossover
        Mutation

    65                                 Introduction to GBML
    The rules are placed in the population and
     processed by the auction, payment and
     reinforcement mechanism to properly evaluate their
     role in the system.
    Pay attention to “who replaces whom”
        Not replacing the entire population
    GA in classifier systems strongly resemble those
     used in search and optimization.
    Main difference in Machine Learning
        Non-overlapping population model not acceptable here.
            In non-overlapping generations, complete generations are
             selected and replaced by a new population at every run.


    66                                   Introduction to GBML
Machine Learning              Search and optimization

    High level of on-line       Convergence
     performance.                    Offline performance
    Learn to perform more
     proficiently.




    67                       Introduction to GBML
De Jong’s experiments              Machine Learning

    Conventional system              Whole population
    GA Parameter                      should not be replaced.
        Generation Gap (G)           Quantity
        Implement and test               Selection Proportion
         overlapping population            (proportion)
         Genetic Algorithms.              Replace that proportion
                                           of the population at a
                                           given algorithm run.
                                          Coupled with a number
                                           of other parameters.
    68                            Introduction to GBML
Other Parameters
    GA Period
        Represented as Tga
        Specifies number of time steps (rule and message cycles)
         between GA calls.
        Period can be treated deterministically
            GA is called every Tga cycles
        Or stochastically
            GA is called probabilistically with average period Tga
    Invocation of GA learning may be conditioned on
     particular events such as:
        Lack of match
        Poor performance
    69                                       Introduction to GBML
Selection
    Roulette Wheel Selection
        Classifier’s strength S used as the fitness.


    No longer generating entire populations
        Careful when choosing population members for
         replacement.


    De Jong’s crowding procedure
        Encourage replacement of similar population memebers




    70                               Introduction to GBML
Mutation
    Modified procedure
        Here, ternary alphabet is used
        SGA used a binary alphabet
    Mutation probability pm defined as before
    When a mutation is called for, we change the
     mutated character to one of the other two with equal
     probability.
        0  { 1, # }
        1  { 0, # }
        #  { 0, 1 }



    71                              Introduction to GBML
Next-
   With all these changes to the                     A simple Classifier
    normal SGA routine, GA may be                     System in Pascal
    dropped into the classifier system                •Simple Classifier System
                                                      Data Structure
    and used in a manner not too
    different from normal search and                  •The   Performance System

    optimization applications.                        •Apportionment    of credit
                                                      algorithm

                                                      •Geneticsearch within the
                                                      Simple Classifier System

                                                      •Real-world   testing

                                                      •Results


                                                      •Comparison    with and
                                                      without GA




    72                         Introduction to GBML
A Simple Classifier System in
                            Pascal




73               Introduction to GBML
    Construct a system designed to learn a boolean
     function
        A multiplexer
    Collapse the finite-length message list to a single
     message (the environmental message)
        Immediate feedback
        Simple payoff




    74                          Introduction to GBML
Components
    Simple Classifier System Data Structure
        Adapt to learning strategies
    Performance System
        Heart of SCS
        Matching procedures are the heart of the performance system
    Apportionment of Credit
        Procedures
            Auction
            Clearinghouse
            Taxcollector
    Genetic Search within Simple Classifier System
        Similar to SGA
    Learning the multiplexing system
    Main procedure
        Reinforcement

    75                                  Introduction to GBML
Six – bit multiplexing system




76                  Introduction to GBML
Results using the Simple Classifier System

Without GA             With GA




77                    Introduction to GBML
Conclusion
    The following were discussed
        Machine Learning
        Role of GA in Machine Learning
        The evolution of GA concepts in Machine Learning
        Some applications of GBML
        Classifier System
        Components of Classifier System
            Rule and Message System
            Apportionment of Credit (The Bucket Brigade)
            Genetic Algorithm
        A practical implementation in Pascal
        Increase in output efficiency when using GA
    78                                  Introduction to GBML
References
    “Introduction to Machine Learning”, Nils J. Nilsson,
     Robotics Laboratory, Department of Computer
     Science, Stanford University

    “Genetic Algorithms in Search, Optimization and
     Machine Learning”, David E. Goldberg, pp 217-260




    79                         Introduction to GBML
THANK YOU




80       Introduction to GBML

More Related Content

What's hot

Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Simplilearn
 

What's hot (20)

Artificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKSArtificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKS
 
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual RepresentationsPR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networks
 
Random forest
Random forestRandom forest
Random forest
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural Network
 
Random forest algorithm
Random forest algorithmRandom forest algorithm
Random forest algorithm
 
Simulated Annealing
Simulated AnnealingSimulated Annealing
Simulated Annealing
 
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
 
Adaline madaline
Adaline madalineAdaline madaline
Adaline madaline
 
Policy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionPolicy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detection
 
artificial neural network
artificial neural networkartificial neural network
artificial neural network
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
 
Meta learning with memory augmented neural network
Meta learning with memory augmented neural networkMeta learning with memory augmented neural network
Meta learning with memory augmented neural network
 
Intro to modelling-supervised learning
Intro to modelling-supervised learningIntro to modelling-supervised learning
Intro to modelling-supervised learning
 
Machine learning Lecture 1
Machine learning Lecture 1Machine learning Lecture 1
Machine learning Lecture 1
 
Understanding Bagging and Boosting
Understanding Bagging and BoostingUnderstanding Bagging and Boosting
Understanding Bagging and Boosting
 
Associative memory network
Associative memory networkAssociative memory network
Associative memory network
 
Introduction to CNN
Introduction to CNNIntroduction to CNN
Introduction to CNN
 
Brief Introduction to Boltzmann Machine
Brief Introduction to Boltzmann MachineBrief Introduction to Boltzmann Machine
Brief Introduction to Boltzmann Machine
 
Artificial nueral network slideshare
Artificial nueral network slideshareArtificial nueral network slideshare
Artificial nueral network slideshare
 

Viewers also liked

2016_eecsRetreat_fettes-v1
2016_eecsRetreat_fettes-v12016_eecsRetreat_fettes-v1
2016_eecsRetreat_fettes-v1
Quintin Fettes
 
AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...
AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...
AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...
Amazon Web Services
 
Classification And Diagnosis Of Diabetes
Classification And Diagnosis Of DiabetesClassification And Diagnosis Of Diabetes
Classification And Diagnosis Of Diabetes
PeninsulaEndocrine
 
Diagnosis of diabetes mellitus
Diagnosis of diabetes mellitus  Diagnosis of diabetes mellitus
Diagnosis of diabetes mellitus
Dilek Gogas Yavuz
 

Viewers also liked (17)

2016_eecsRetreat_fettes-v1
2016_eecsRetreat_fettes-v12016_eecsRetreat_fettes-v1
2016_eecsRetreat_fettes-v1
 
Classification ANN
Classification ANNClassification ANN
Classification ANN
 
Machine Learning - Breast Cancer Diagnosis
Machine Learning - Breast Cancer DiagnosisMachine Learning - Breast Cancer Diagnosis
Machine Learning - Breast Cancer Diagnosis
 
Alex Tellez, Deep Learning Applications
Alex Tellez, Deep Learning ApplicationsAlex Tellez, Deep Learning Applications
Alex Tellez, Deep Learning Applications
 
AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...
AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...
AWS re:Invent 2016: Automatic Grading of Diabetic Retinopathy through Deep Le...
 
New Developments in Machine Learning - Prof. Dr. Max Welling
New Developments in Machine Learning - Prof. Dr. Max WellingNew Developments in Machine Learning - Prof. Dr. Max Welling
New Developments in Machine Learning - Prof. Dr. Max Welling
 
Soumith Chintala, Artificial Intelligence Research Engineer, Facebook at MLco...
Soumith Chintala, Artificial Intelligence Research Engineer, Facebook at MLco...Soumith Chintala, Artificial Intelligence Research Engineer, Facebook at MLco...
Soumith Chintala, Artificial Intelligence Research Engineer, Facebook at MLco...
 
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAIDeep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
 
DRS Presentation
DRS PresentationDRS Presentation
DRS Presentation
 
Classification And Diagnosis Of Diabetes
Classification And Diagnosis Of DiabetesClassification And Diagnosis Of Diabetes
Classification And Diagnosis Of Diabetes
 
Diagnosis of diabetes mellitus
Diagnosis of diabetes mellitus  Diagnosis of diabetes mellitus
Diagnosis of diabetes mellitus
 
Tutorial on Deep Learning
Tutorial on Deep LearningTutorial on Deep Learning
Tutorial on Deep Learning
 
Classification Rule Discovery Using Ant-Miner Algorithm: An Application Of N...
Classification Rule Discovery Using Ant-Miner Algorithm: An  Application Of N...Classification Rule Discovery Using Ant-Miner Algorithm: An  Application Of N...
Classification Rule Discovery Using Ant-Miner Algorithm: An Application Of N...
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
 
Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016
 
Deep Learning - The Past, Present and Future of Artificial Intelligence
Deep Learning - The Past, Present and Future of Artificial IntelligenceDeep Learning - The Past, Present and Future of Artificial Intelligence
Deep Learning - The Past, Present and Future of Artificial Intelligence
 
Deep Learning through Examples
Deep Learning through ExamplesDeep Learning through Examples
Deep Learning through Examples
 

Similar to Gbml - Genetics Based Machine Learning

Applying Machine Learning to Software Clustering
Applying Machine Learning to Software ClusteringApplying Machine Learning to Software Clustering
Applying Machine Learning to Software Clustering
butest
 
Optimization technique genetic algorithm
Optimization technique genetic algorithmOptimization technique genetic algorithm
Optimization technique genetic algorithm
Uday Wankar
 
Introducing Evolutionary Component Capabilities in Agent ...
Introducing Evolutionary Component Capabilities in Agent ...Introducing Evolutionary Component Capabilities in Agent ...
Introducing Evolutionary Component Capabilities in Agent ...
butest
 
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher ManningDeep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
BigDataCloud
 
MultiObjective(11) - Copy
MultiObjective(11) - CopyMultiObjective(11) - Copy
MultiObjective(11) - Copy
AMIT KUMAR
 

Similar to Gbml - Genetics Based Machine Learning (20)

Applying Machine Learning to Software Clustering
Applying Machine Learning to Software ClusteringApplying Machine Learning to Software Clustering
Applying Machine Learning to Software Clustering
 
Everything you need to know about AutoML
Everything you need to know about AutoMLEverything you need to know about AutoML
Everything you need to know about AutoML
 
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
 
A brief primer on OpenAI's GPT-3
A brief primer on OpenAI's GPT-3A brief primer on OpenAI's GPT-3
A brief primer on OpenAI's GPT-3
 
ML DL AI DS BD - An Introduction
ML DL AI DS BD - An IntroductionML DL AI DS BD - An Introduction
ML DL AI DS BD - An Introduction
 
IJCSI-2015-12-2-10138 (1) (2)
IJCSI-2015-12-2-10138 (1) (2)IJCSI-2015-12-2-10138 (1) (2)
IJCSI-2015-12-2-10138 (1) (2)
 
mapReduce for machine learning
mapReduce for machine learning mapReduce for machine learning
mapReduce for machine learning
 
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
 
Common ground between modelers and simulation software: the Systems Biology M...
Common ground between modelers and simulation software: the Systems Biology M...Common ground between modelers and simulation software: the Systems Biology M...
Common ground between modelers and simulation software: the Systems Biology M...
 
Optimization technique genetic algorithm
Optimization technique genetic algorithmOptimization technique genetic algorithm
Optimization technique genetic algorithm
 
Introducing Evolutionary Component Capabilities in Agent ...
Introducing Evolutionary Component Capabilities in Agent ...Introducing Evolutionary Component Capabilities in Agent ...
Introducing Evolutionary Component Capabilities in Agent ...
 
An Iterative Improved k-means Clustering
An Iterative Improved k-means ClusteringAn Iterative Improved k-means Clustering
An Iterative Improved k-means Clustering
 
Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems Biology
 
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher ManningDeep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
 
OO Development 1 - Introduction to Object-Oriented Development
OO Development 1 - Introduction to Object-Oriented DevelopmentOO Development 1 - Introduction to Object-Oriented Development
OO Development 1 - Introduction to Object-Oriented Development
 
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
IRJET-Performance Enhancement in Machine Learning System using Hybrid Bee Col...
 
MultiObjective(11) - Copy
MultiObjective(11) - CopyMultiObjective(11) - Copy
MultiObjective(11) - Copy
 
Parallel Evolutionary Algorithms for Feature Selection in High Dimensional Da...
Parallel Evolutionary Algorithms for Feature Selection in High Dimensional Da...Parallel Evolutionary Algorithms for Feature Selection in High Dimensional Da...
Parallel Evolutionary Algorithms for Feature Selection in High Dimensional Da...
 
Biology-Derived Algorithms in Engineering Optimization
Biology-Derived Algorithms in Engineering OptimizationBiology-Derived Algorithms in Engineering Optimization
Biology-Derived Algorithms in Engineering Optimization
 
IEEE 2014 JAVA DATA MINING PROJECTS A fast clustering based feature subset se...
IEEE 2014 JAVA DATA MINING PROJECTS A fast clustering based feature subset se...IEEE 2014 JAVA DATA MINING PROJECTS A fast clustering based feature subset se...
IEEE 2014 JAVA DATA MINING PROJECTS A fast clustering based feature subset se...
 

Recently uploaded

Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
fonyou31
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 

Gbml - Genetics Based Machine Learning

  • 1. Introduction to Genetics-based Machine Learning Mahalingam.P.R Semester III, M.Tech CSESIS, RSET
  • 2. Contents  Background of Machine Learning  About Machine Learning  GA in Machine Learning  Evolution of GBML  Classifier System  Rule and Message System  Apportionment of Credit System  Genetic Algorithm  A simple classifier system in Pascal  Conclusion 2 Introduction to GBML
  • 3. Introduction 3 Introduction to GBML
  • 4. Machine Learning  A machine learns whenever it changes its structure, program, or data (based on its inputs or in response to external information) in such a manner that its expected future performance improves.  Machine learning usually refers to the changes in systems that perform tasks associated with artificial intelligence (AI).  Such tasks involve recognition, diagnosis, planning, robot control, prediction, etc. The “changes" might be either enhancements to already performing systems or ab-initio synthesis of new systems. 4 Introduction to GBML
  • 5. A typical AI system Here, the model progressively learns from the experience it earns from the environment 5 Introduction to GBML
  • 6. Need for Machine Learning  Some tasks cannot be defined well except by example.  We might be able to specify input/output pairs but not a concise relationship between inputs and desired outputs.  We would like machines to be able to adjust their internal structure to produce correct outputs for a large number of sample inputs  suitably constrain their input/output function to approximate the relationship implicit in the examples.  It is possible that hidden among large piles of data are important relationships and correlations.  Machine learning methods can often be used to extract these relationships (data mining). 6 Introduction to GBML
  • 7. Machine learning methods can be used for on-the-job improvement of existing machine designs.  The amount of knowledge available about certain tasks might be too large for explicit encoding by humans.  Machines that learn this knowledge gradually might be able to capture more of it than humans would want to write down.  Environments change over time.  Machines that can adapt to a changing environment would reduce the need for constant redesign.  New knowledge about tasks is constantly being discovered by humans (Vocabulary changes).  There is a constant stream of new events in the world.  Continuing redesign of AI systems to conform to new knowledge is impractical.  Machine learning methods might be able to track much of it. 7 Introduction to GBML
  • 8. More on Machine Learning Contributions to Machine Varieties of Machine Learning Learning  Statistics  Functions  Brain Models  Logic Programs and  Adaptive Control Theory Rule sets  Psychological Models  Finite-state machines  Artificial Intelligence  Grammars  Evolutionary Models  Problem solving systems 8 Introduction to GBML
  • 9. GA in Machine Learning  Conventional GA systems work with the following properties:  Probabilistic  Random  Enumerative  The structures adapted cater to the human-like mechanism adopted in the methodology.  This adaptation itself is the biggest hindrance when incorporating GA into more complex, less completely defined environments.  Too much “guesses” when it comes to searching  Methodology safe under the “sandbox” of searching 9 Introduction to GBML
  • 10. Coming Up Origins of GBML  Study Machine Learning systems Systems that use genetic search as their Classifier System primary discovery heuristic. Operation of Classifier Systems  Adapting GA structures to work in Implementation of a complex environments like Machine Simple Classifier System in Pascal Learning Testing classifier in a problem domain – Learning a Boolean function 10 Introduction to GBML
  • 11. Origin of GBML systems 11 Introduction to GBML
  • 12. In nature, not only do individual animals learn to perform better, but species evolve to be better fit in their individual niches.  Since the distinction between evolving and learning can be blurred in computer systems, techniques that model certain aspects of biological evolution have been proposed as learning methods to improve the performance of computer programs.  Genetic algorithms [Holland, 1975] and genetic programming [Koza, 1992, Koza, 1994] are the most prominent computational techniques for evolution. 12 Introduction to GBML
  • 13. Theoretical Foundation for GBML  Laid by Holland (1962)  Outline for Adaptive Systems Theory  role of program replication as a method of emphasizing past programs.  Fundamental role of recombination  Holland (1965)  Schemata Processors  Holland (1968-1971)  Application of a classifier system  Holland and Reitman (1978) 13 Introduction to GBML
  • 14. Modern classifier systems resemble schemata processors in both outline and detail  Holland suggested four prototypes in the initial proposal.  No experiments or implementation have been reported yet!  Proposal coincided with the development of the theory of schemata. 14 Introduction to GBML
  • 15. Prototype 1 Prototype 2  Stimulus-response (SR)  Extend type 1 by adding processor that would internal effectors link environmental (internal states). schemata (conditions) with particular action effectors. 15 Introduction to GBML
  • 16. Prototype 3 Prototype 4  Build upon types 1 and  Extend the other types 2 by including explicit by incorporating the environmental state capability to modify its prediction (a model of own effectors and the real world), and an detectors, permitting internal evaluation greater (or lesser) range mechanism. of data detection and a larger behavioural response. 16 Introduction to GBML
  • 17. Broadcast language  Derived from early proposals.  Broad  Unimplemented  Creating broadcast units (production rules) from a 10- letter alphabet.  The alphabet added wild card (single and multiple match) characters to an underlying binary alphabet.  If included, the following would have given sufficient power for computational completeness and representational convenience.  A fundamental punctuation mark, a persistence symbol for continued broadcast of a message.  A quotation character for taking the next character literally. 17 Introduction to GBML
  • 18. The proposal for broadcast language was instrumental in unifying the earlier suggestions for schemata operators by:  theoretically permitting a consistent representation of all operators, data, and rules or instructions.  The generality gained in theory has not been realized in practice. 18 Introduction to GBML
  • 19. First practical implementation  Three years following the broadcast language proposal (Holland and Reitman, 1978).  Cognitive System Level One (CS-1)  Trained to learn two maze-running tasks.  Performance system with message list and simple string rules (classifiers).  GA composed of reproduction, crossover, mutation and crowding.  Epochal learning mechanism where reward is apportioned to all classifiers active between successive payoff events.  Learning mechanism has largely been supplanted by another mechanism – a bucket brigade. 19 Introduction to GBML
  • 20. GBML Applications 20 Introduction to GBML
  • 21. 21 Introduction to GBML
  • 22. 22 Introduction to GBML
  • 23. 23 Introduction to GBML
  • 24. 24 Introduction to GBML
  • 25. Classifier System 25 Introduction to GBML
  • 26. A classifier system is a machine learning system that learns syntactically simple string rules (called classifiers) to guide its performance in an arbitrary environment.  A classifier system consists of three main components:  Rule and message system  Apportionment of credit system  Genetic algorithm 26 Introduction to GBML
  • 27. The rule and message system of a classifier system is a special kind of production system.  A production system is a computational scheme that uses rules as its only algorithmic device. The general syntax of such systems are as follows: if <condition> then <action>  The production means that the action may be taken (rule is “fired”) when the condition is satisfied. 27 Introduction to GBML
  • 28. It has been shown that production systems are computationally complete, as well as convenient.  A simple rule or set of rules can represent a complex set of thoughts compactly.  But when it comes to situations in need of learning, this is not advisable due to the complex rule syntax.  Many production systems permit involved grammatical constructions for the condition and action part of the rule. 28 Introduction to GBML
  • 29. Classifier systems depart from the mainstream by restricting a rule to a fixed length representation.  This restriction has two benefits.  All strings under the permissible alphabet are syntactically meaningful  A fixed string representation permits string operators of the genetic kind. This leaves the door open for a Genetic Algorithm search of the space of permissible rules. 29 Introduction to GBML
  • 30. Classifier systems use parallel rule activation, whereas traditional systems use serial rule activation.  They permit multiple activities to be coordinated simultaneously. When choices must be made between mutually exclusive environmental actions, or size of matched rule set must be pruned to accommodate the fixed length message list, the choices are postponed to the last possible moment, and the arbitration is then performed competitively. 30 Introduction to GBML
  • 31. In traditional expert systems, the value or rating of a rule relative to other is fixed by the programmer in conjunction with the expert or group of experts to be emulated. But this is not possible in rule learning system.  In such cases, the relative importance has to be “learned”. So, classifiers are forced to coexist in an information-based service economy. A competition is held among classifiers where the right to answer relevant messages goes to the highest bidder. Subsequent bids serve as the income for previously successful message senders.  This competitive nature ensures that good rules (profitable ones) survive and the bad rules (unprofitable) die off. 31 Introduction to GBML
  • 32. Internal Currency  The exchange and accumulation of an internal currency provides a natural figure of merit for applying GA.  Using the “bank balance” as a fitness function, classifiers may be reproduced, crossed, and mutated.  So, it can rank extant rules, and discover new, possibly better rules by innovative combinations of old ones.  Here, the stress is on “who gets replaced”, not “replace entire populations”. 32 Introduction to GBML
  • 33. Thus, apportionment of credit via competition and rule discovery Next… using GA form a reasonable basis Each component is discussed in detail and for constructing a machine learning the interconnections system atop the computationally studied. convenient and complete •Ruleand message framework of classifiers. system •Apportionment of credit •Genetic Algorithm Implementation in Pascal Testing on real world problem – learning a Boolean function 33 Introduction to GBML
  • 34. Rule and Message System 34 Introduction to GBML
  • 35. This is the schematic of a complete classifier system. 35 Introduction to GBML
  • 36. The rule and message system forms the backbone of the silicon beast.  Information flows from the environment through the detectors, where it is decoded to one or more finite length messages.  The environmental messages are fed into a finite length message list where the messages activate string rules called classifiers.  When activated, the classifier posts a message on to the message list, which may in-turn invoke other classifiers, or cause an action to be taken through the system’s action triggers called effectors. 36 Introduction to GBML
  • 37. Classifiers combine environmental cues and internal thoughts to determine what the system should do and think next.  Coordinates the flow of information from where it is sensed (detectors) to where it is processed (message list and classifier store) to where it is called to action (effectors).  Informational units  Messages  Classifiers 37 Introduction to GBML
  • 38. A message within a classifier system is simply a finite-length string over some finite alphabet. If we limit to the binary alphabet, we get <message> ::= {0, 1}L  This means taking the concatenation of 0s or 1s for “L” times.  Messages are the basic token of information exchange in a classifier system.  The messages on the message list may match one or more classifiers or string rules. 38 Introduction to GBML
  • 39. A classifier is a production rule with the syntax <classifier> ::= <condition>:<message>  The condition is a simple pattern recognition device where a wild card (#) is added to the underlying alphabet. <condition> ::= {0, 1, #}L  So, the condition matches a message if at every position, a 0 in the message matches a 0 in the condition, 1 matches a 1, or a # matches either a 0 or a 1.  For example, #01# matches 0010, 0011, 1010 and 1011. But it doesn’t match 0000.  Similar to schema in GA 39 Introduction to GBML
  • 40. Once the classifier’s condition is matched, it becomes a candidate to post its message to the message list in the next time step.  Whether the candidate classifier posts its message is determined by the outcome of an activation auction, which in turn depends on evaluation of the classifier’s value or weighting. 40 Introduction to GBML
  • 41. Sample simulation by hand  Initial message list is as follows. 41 Introduction to GBML
  • 42. 42 Introduction to GBML
  • 43. 43 Introduction to GBML
  • 44. 44 Introduction to GBML
  • 45. 45 Introduction to GBML
  • 46. 46 Introduction to GBML
  • 47. 47 Introduction to GBML
  • 48. 48 Introduction to GBML
  • 49. 49 Introduction to GBML
  • 50. Apportionment of Credit Algorithm THE BUCKET BRIGADE 50 Introduction to GBML
  • 51. Many classifiers attempt to rank or rate the individual classifiers according to a classifier’s role in achieving reward from the environment.  Most prevalent method  Bucket Brigade Algorithm  John Holland 51 Introduction to GBML
  • 52. About the algorithm  An information economy where the right to trade information is bought and sold by classifiers.  Classifiers form a chain of middlemen from information manufacturer (the environment) to information consumer (the effectors).  Components of service economy  Auction  ClearingHouse 52 Introduction to GBML
  • 53. When classifiers are matched, they don’t directly post their messages.  A matching message entitles a classifier to participate in an activation auction.  Each classifier maintains a record of its net worth.  Called Strength (S).  Each matched classifier makes a bid B.  Bid proportional to its strength.  In this way, rules that are highly fit (have accumulated a net worth) are given preference over other rules. 53 Introduction to GBML
  • 54. Once a classifier is selected for activation, it must clear its payment through the clearinghouse, paying its bid to other classifiers for matching messages rendered.  A matched and activated classifier sends its bid B to those classifiers responsible for sending the messages that matched the bidder classifier’s condition.  Bid amount divided among the matching classifiers.  Division of payoff among contributing classifiers helps ensure the formation of an appropriately sized subpopulation of rules.  Different types of rules can cover different types of behavioral requirements without undue interspecies competition. 54 Introduction to GBML
  • 55. In a rule-learning system of any experience, we cannot search for one master rule.  We must instead search for a co-adapted set of rules that together cover a range of behavior that provides ample payoff to the learning system. Consider the classifiers as before. 55 Introduction to GBML
  • 56. Now, lets follow the payments, with initial strength of 200. 56 Introduction to GBML
  • 57. 57 Introduction to GBML
  • 58. 58 Introduction to GBML
  • 59. 59 Introduction to GBML
  • 60. 60 Introduction to GBML
  • 61. 61 Introduction to GBML
  • 62. 62 Introduction to GBML
  • 63. For steady receipts, the bid value approaches the receipt. For time-varying receipt values, we see that the bid is a geometrically weighted average of the input.  As such, it acts as a filter of the possibly intermittent and noisy receipt values. 63 Introduction to GBML
  • 64. Genetic Algorithm 64 Introduction to GBML
  • 65. Bucket brigade  Clean procedure  Evaluation of rules  Decide among competing alternatives  But we have to devise a way of injecting new rules into the system.  Similar to SGA, we can inject new rules using the tripartite rules  Reproduction  Crossover  Mutation 65 Introduction to GBML
  • 66. The rules are placed in the population and processed by the auction, payment and reinforcement mechanism to properly evaluate their role in the system.  Pay attention to “who replaces whom”  Not replacing the entire population  GA in classifier systems strongly resemble those used in search and optimization.  Main difference in Machine Learning  Non-overlapping population model not acceptable here.  In non-overlapping generations, complete generations are selected and replaced by a new population at every run. 66 Introduction to GBML
  • 67. Machine Learning Search and optimization  High level of on-line  Convergence performance.  Offline performance  Learn to perform more proficiently. 67 Introduction to GBML
  • 68. De Jong’s experiments Machine Learning  Conventional system  Whole population  GA Parameter should not be replaced.  Generation Gap (G)  Quantity  Implement and test  Selection Proportion overlapping population (proportion) Genetic Algorithms.  Replace that proportion of the population at a given algorithm run.  Coupled with a number of other parameters. 68 Introduction to GBML
  • 69. Other Parameters  GA Period  Represented as Tga  Specifies number of time steps (rule and message cycles) between GA calls.  Period can be treated deterministically  GA is called every Tga cycles  Or stochastically  GA is called probabilistically with average period Tga  Invocation of GA learning may be conditioned on particular events such as:  Lack of match  Poor performance 69 Introduction to GBML
  • 70. Selection  Roulette Wheel Selection  Classifier’s strength S used as the fitness.  No longer generating entire populations  Careful when choosing population members for replacement.  De Jong’s crowding procedure  Encourage replacement of similar population memebers 70 Introduction to GBML
  • 71. Mutation  Modified procedure  Here, ternary alphabet is used  SGA used a binary alphabet  Mutation probability pm defined as before  When a mutation is called for, we change the mutated character to one of the other two with equal probability.  0  { 1, # }  1  { 0, # }  #  { 0, 1 } 71 Introduction to GBML
  • 72. Next-  With all these changes to the A simple Classifier normal SGA routine, GA may be System in Pascal dropped into the classifier system •Simple Classifier System Data Structure and used in a manner not too different from normal search and •The Performance System optimization applications. •Apportionment of credit algorithm •Geneticsearch within the Simple Classifier System •Real-world testing •Results •Comparison with and without GA 72 Introduction to GBML
  • 73. A Simple Classifier System in Pascal 73 Introduction to GBML
  • 74. Construct a system designed to learn a boolean function  A multiplexer  Collapse the finite-length message list to a single message (the environmental message)  Immediate feedback  Simple payoff 74 Introduction to GBML
  • 75. Components  Simple Classifier System Data Structure  Adapt to learning strategies  Performance System  Heart of SCS  Matching procedures are the heart of the performance system  Apportionment of Credit  Procedures  Auction  Clearinghouse  Taxcollector  Genetic Search within Simple Classifier System  Similar to SGA  Learning the multiplexing system  Main procedure  Reinforcement 75 Introduction to GBML
  • 76. Six – bit multiplexing system 76 Introduction to GBML
  • 77. Results using the Simple Classifier System Without GA With GA 77 Introduction to GBML
  • 78. Conclusion  The following were discussed  Machine Learning  Role of GA in Machine Learning  The evolution of GA concepts in Machine Learning  Some applications of GBML  Classifier System  Components of Classifier System  Rule and Message System  Apportionment of Credit (The Bucket Brigade)  Genetic Algorithm  A practical implementation in Pascal  Increase in output efficiency when using GA 78 Introduction to GBML
  • 79. References  “Introduction to Machine Learning”, Nils J. Nilsson, Robotics Laboratory, Department of Computer Science, Stanford University  “Genetic Algorithms in Search, Optimization and Machine Learning”, David E. Goldberg, pp 217-260 79 Introduction to GBML
  • 80. THANK YOU 80 Introduction to GBML