SlideShare a Scribd company logo
1 of 21
Download to read offline
Using Free Cloud Storage
   Services For Distributed
   Evolutionary Algorithms
Maribel García-Arenas,
Juan-J. Merelo,
Antonio M. Mora,
Pedro Castillo
Outline
1) Idea and how to test it
2) Dropbox features
3) Putting in practice with Evolutionary Computation
4) File-individuals
5) Island Algorithm
6) Goals
7) Problems
8) Results


                   2
IDEA
• What do you know about cloud storage
   services?
• Why not use them for computing?
• How can we use all our computers to
   make a multicomputer?
  – Desktop computer
  – Portable computer
  – Home computer
  – Any other computers...
                3
How to test the idea
• Look for some free storage
    services and test them: What
    are their features and what is
    the availability for storing,
    sharing and synchronizing
    information
• After that, We have selected
  Dropbox


                4
TM
Dropbox            features
• It is free up to a certain level of use
     (measured in traffic and usage)
• It is popular, so many people use it,
     and we may found many
     volunteers for computation
• It monitors the local filesystem and
     uploads information
     asynchronously
• It looks like a local directory
               5
Putting in practice with
Evolutionary Computation
  • What do we need to build Evolutionary
     Distributed Algorithms?
    – Exchange individuals among populations:
      Phenotype and Genotype
  • We can exchange this information using
    files. So the name of the file
    represents the phenotype and
    genotype and all connected PCs share
    it with Dropbox
                  6
Let's go
• File distribution via Dropbox
• It synchronizes the file-individuals with
  other computers
• Each computer evolves an island
• Dropbox folder contains a pool of
  individuals and each computer adds
  and gets file-individuals from it


                7
Let's go (II)
• Each computer connected or
  synchronized by Dropbox is part of a
  multi-computer
• Each Island-computer evolves a
  population of individuals and exchanges
  with the pool file-individuals when the
  migration process must be done


               8
File-individuals
• How to include phenotype and genotype
  into a file
  – As the contents of the file? It is not a good
    idea because we have to open and close
    files and Dropbox has to synchonize them.
  – Into the filesystem attributes? Dropbox is
    working on that and we will be testing in the
    future
  – Into the filename? It is our approach

                 9
File-individuals (II)
• The filename problem
  – How many gens can we include into the
    name?
  – We have to code the genotype into base 32
  – Ex: 00000 → 0, 00001-> 1, 01010->A ...
    111111->V
• The filename includes: Fitness,
  genotypeBase32codification and the id
  of the computer which generates the
  individual
                10
Island Algorithm
1.Creates and evaluates the initial population
2.Until to reach a number of evaluations into the
multi-computer
  • Breed the population


  • Evaluate


  • Generational replacement with 1-elitism


  • After a fixed number of generations, Immigrate

    (gets one file-individual from the pool and
    incorporates it to the population)
  • After a fixed number of generations, Migrate (adds

    the best or a random file-individual to the pool)
3.Adds the best individual to the pool

                11
Control of the number of
      evaluations
• Each computer creates a file whose
  name is the number of evaluations
  performed and its identification (random
  initial seed)
• Each computer looks for this kind of file
  within the Dropbox folder and adds the
  total of evaluations.
• When the sum of this evaluations is
  greater than the fixed minimum, the
  evolution of this island ends.
                   12
Goals
• What do we want to test?
  – We want test if we save time when use the
    multi-computer for computing a fixed
    number of evaluations.
• How can we test it?
  – Making a distributed evolutionary algorithm
    based on pool and testing that the time for
    reaching the fixed evaluations decreases
    when you add new nodes to our multi-
    computer linked by Dropbox.
                 13
Problems: MMDP
• Multimodal Deceptive Problem
• It is composed of k (k=80)
  subproblems of 6 bits each
  one called si for i=0 to 79.
• Depending of the number of
  ones si takes the values                  ones     fitness

                                            0 or 6   1
  detailed into the table                   5 or 1   0

                        k                   2 or 4   0,360384
    Fitnessindividual =∑i=1 fitness s
                                       i
                                            3        0,640576




                            14
Problems: TRAP
• It is defined for the unitation function (number of
  ones in a binary string) using the following function.

                        {                         if u     z
                             a
                                   z−u  ,
                                        x                  x
           trap u  =
                    x       b
                              z

                            l − z u − z  ,
                                     x               otherwise

• For our problem, the trap is defined for l=4, a=3,
  b=4 and z = 3
• With 30 traps
  into the genome



                                  15
Parameters
• We use as multi-computer one, two or four
  heterogeneous computers so we use one, two,
  three or four island
• Population size: 1000 individuals
• Selection: Tournament
• Crossover: uniform
• Mutation: bit-flit
• Replacement: Generational with 1-elitism
• Stop criteria: minimum number of evaluations for
  the multi-computer
• WiFi with WPA/Enterprise encryption.
                       16
Results for MMDP




        17
Results for TRAP




        18
Conclusions
• The Dropbox File-storage and sharing
  system, can be used as a migration
  device for distributed evolutionary
  computation experiments without
  needing to acquire or set up complicated
  cloud or grid infrastructure.
• With this approach everyone can use a
  multicomputer running an evolutionary
  algorithm with a good scaling behavior.
               19
Others results for MMDP
                                                                                          Time to find the solution
                          Success Rate

                                                                                                MMDP Problem
             120

                                                                             300000


             100

                                                                             250000


             80
                                                                             200000                                       100
                                                     1
                                                                                                                          200
                                                     2
                                                                                                                          400
                                                         Time(miliseconds)
                                                     4
             60
Percentage




                                                                             150000



             40
                                                                             100000




             20
                                                                              50000




               0                                                                  0
                   100               200       400                                    1                    2          4
                         Migration frecuency
                                                                                                     Islands


                                                                    20
Questions




     21

More Related Content

Similar to Presentation dropbox

Data flow super computing valentina balas
Data flow super computing   valentina balasData flow super computing   valentina balas
Data flow super computing valentina balas
Valentina Emilia Balas
 
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Matthew Lease
 
Self Organinising neural networks
Self Organinising  neural networksSelf Organinising  neural networks
Self Organinising neural networks
ESCOM
 

Similar to Presentation dropbox (20)

Nearest Neighbor Customer Insight
Nearest Neighbor Customer InsightNearest Neighbor Customer Insight
Nearest Neighbor Customer Insight
 
Evaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated DatabasesEvaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated Databases
 
Data flow super computing valentina balas
Data flow super computing   valentina balasData flow super computing   valentina balas
Data flow super computing valentina balas
 
Model-based Testing: Today And Tomorrow
Model-based Testing: Today And TomorrowModel-based Testing: Today And Tomorrow
Model-based Testing: Today And Tomorrow
 
Opportunistic Routing Based on Daily Routines
Opportunistic Routing Based on Daily RoutinesOpportunistic Routing Based on Daily Routines
Opportunistic Routing Based on Daily Routines
 
Paris data-geeks-2013-03-28
Paris data-geeks-2013-03-28Paris data-geeks-2013-03-28
Paris data-geeks-2013-03-28
 
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
 
Hadoop Tutorial with @techmilind
Hadoop Tutorial with @techmilindHadoop Tutorial with @techmilind
Hadoop Tutorial with @techmilind
 
ACM 2013-02-25
ACM 2013-02-25ACM 2013-02-25
ACM 2013-02-25
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
OpenPOWER Workshop in Silicon Valley
OpenPOWER Workshop in Silicon ValleyOpenPOWER Workshop in Silicon Valley
OpenPOWER Workshop in Silicon Valley
 
Self Organinising neural networks
Self Organinising  neural networksSelf Organinising  neural networks
Self Organinising neural networks
 
Artificial Intelligence, Machine Learning and Deep Learning
Artificial Intelligence, Machine Learning and Deep LearningArtificial Intelligence, Machine Learning and Deep Learning
Artificial Intelligence, Machine Learning and Deep Learning
 
Presentation dropbox
Presentation dropboxPresentation dropbox
Presentation dropbox
 
Oxford 05-oct-2012
Oxford 05-oct-2012Oxford 05-oct-2012
Oxford 05-oct-2012
 
Discovering Your AI Super Powers - Tips and Tricks to Jumpstart your AI Projects
Discovering Your AI Super Powers - Tips and Tricks to Jumpstart your AI ProjectsDiscovering Your AI Super Powers - Tips and Tricks to Jumpstart your AI Projects
Discovering Your AI Super Powers - Tips and Tricks to Jumpstart your AI Projects
 
Using Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsUsing Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical Applications
 
Database Sharding the Right Way: Easy, Reliable, and Open source - HighLoad++...
Database Sharding the Right Way: Easy, Reliable, and Open source - HighLoad++...Database Sharding the Right Way: Easy, Reliable, and Open source - HighLoad++...
Database Sharding the Right Way: Easy, Reliable, and Open source - HighLoad++...
 
MLIP - Chapter 5 - Detection, Segmentation, Captioning
MLIP - Chapter 5 - Detection, Segmentation, CaptioningMLIP - Chapter 5 - Detection, Segmentation, Captioning
MLIP - Chapter 5 - Detection, Segmentation, Captioning
 
Serving predictive models with Redis
Serving predictive models with RedisServing predictive models with Redis
Serving predictive models with Redis
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Presentation dropbox

  • 1. Using Free Cloud Storage Services For Distributed Evolutionary Algorithms Maribel García-Arenas, Juan-J. Merelo, Antonio M. Mora, Pedro Castillo
  • 2. Outline 1) Idea and how to test it 2) Dropbox features 3) Putting in practice with Evolutionary Computation 4) File-individuals 5) Island Algorithm 6) Goals 7) Problems 8) Results 2
  • 3. IDEA • What do you know about cloud storage services? • Why not use them for computing? • How can we use all our computers to make a multicomputer? – Desktop computer – Portable computer – Home computer – Any other computers... 3
  • 4. How to test the idea • Look for some free storage services and test them: What are their features and what is the availability for storing, sharing and synchronizing information • After that, We have selected Dropbox 4
  • 5. TM Dropbox features • It is free up to a certain level of use (measured in traffic and usage) • It is popular, so many people use it, and we may found many volunteers for computation • It monitors the local filesystem and uploads information asynchronously • It looks like a local directory 5
  • 6. Putting in practice with Evolutionary Computation • What do we need to build Evolutionary Distributed Algorithms? – Exchange individuals among populations: Phenotype and Genotype • We can exchange this information using files. So the name of the file represents the phenotype and genotype and all connected PCs share it with Dropbox 6
  • 7. Let's go • File distribution via Dropbox • It synchronizes the file-individuals with other computers • Each computer evolves an island • Dropbox folder contains a pool of individuals and each computer adds and gets file-individuals from it 7
  • 8. Let's go (II) • Each computer connected or synchronized by Dropbox is part of a multi-computer • Each Island-computer evolves a population of individuals and exchanges with the pool file-individuals when the migration process must be done 8
  • 9. File-individuals • How to include phenotype and genotype into a file – As the contents of the file? It is not a good idea because we have to open and close files and Dropbox has to synchonize them. – Into the filesystem attributes? Dropbox is working on that and we will be testing in the future – Into the filename? It is our approach 9
  • 10. File-individuals (II) • The filename problem – How many gens can we include into the name? – We have to code the genotype into base 32 – Ex: 00000 → 0, 00001-> 1, 01010->A ... 111111->V • The filename includes: Fitness, genotypeBase32codification and the id of the computer which generates the individual 10
  • 11. Island Algorithm 1.Creates and evaluates the initial population 2.Until to reach a number of evaluations into the multi-computer • Breed the population • Evaluate • Generational replacement with 1-elitism • After a fixed number of generations, Immigrate (gets one file-individual from the pool and incorporates it to the population) • After a fixed number of generations, Migrate (adds the best or a random file-individual to the pool) 3.Adds the best individual to the pool 11
  • 12. Control of the number of evaluations • Each computer creates a file whose name is the number of evaluations performed and its identification (random initial seed) • Each computer looks for this kind of file within the Dropbox folder and adds the total of evaluations. • When the sum of this evaluations is greater than the fixed minimum, the evolution of this island ends. 12
  • 13. Goals • What do we want to test? – We want test if we save time when use the multi-computer for computing a fixed number of evaluations. • How can we test it? – Making a distributed evolutionary algorithm based on pool and testing that the time for reaching the fixed evaluations decreases when you add new nodes to our multi- computer linked by Dropbox. 13
  • 14. Problems: MMDP • Multimodal Deceptive Problem • It is composed of k (k=80) subproblems of 6 bits each one called si for i=0 to 79. • Depending of the number of ones si takes the values ones fitness 0 or 6 1 detailed into the table 5 or 1 0 k 2 or 4 0,360384 Fitnessindividual =∑i=1 fitness s  i 3 0,640576 14
  • 15. Problems: TRAP • It is defined for the unitation function (number of ones in a binary string) using the following function. { if u     z a  z−u  , x x trap u  = x b z l − z u − z  , x otherwise • For our problem, the trap is defined for l=4, a=3, b=4 and z = 3 • With 30 traps into the genome 15
  • 16. Parameters • We use as multi-computer one, two or four heterogeneous computers so we use one, two, three or four island • Population size: 1000 individuals • Selection: Tournament • Crossover: uniform • Mutation: bit-flit • Replacement: Generational with 1-elitism • Stop criteria: minimum number of evaluations for the multi-computer • WiFi with WPA/Enterprise encryption. 16
  • 19. Conclusions • The Dropbox File-storage and sharing system, can be used as a migration device for distributed evolutionary computation experiments without needing to acquire or set up complicated cloud or grid infrastructure. • With this approach everyone can use a multicomputer running an evolutionary algorithm with a good scaling behavior. 19
  • 20. Others results for MMDP Time to find the solution Success Rate MMDP Problem 120 300000 100 250000 80 200000 100 1 200 2 400 Time(miliseconds) 4 60 Percentage 150000 40 100000 20 50000 0 0 100 200 400 1 2 4 Migration frecuency Islands 20
  • 21. Questions 21