SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
Evaluating Simulation Software
                             Components with Player Rating
                             Systems
                             6. 3. 2013, SIMUTools 2013


                              Jonathan Wienß                     Michael Stein     Roland Ewald




                                                                  Sponsored by:




6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                  1
Component-Based Simulation Systems

     •       Simulator: combination of components

     •       Typical components:

                •     Event management
                •     Collision detection
                •     State saving
                •     Result storage
                •     Random number generation
                •     etc.

     •       Example: JAMES II
                                                                                   http://flickr.com/photos/jdhancock/7239958506, cc-by

6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                                                         2
Problem: Evaluating Individual Components




                        https://commons.wikimedia.org/wiki/File:Rowing_-_USA_Lwt_4_@_World_Champs_2003.jpg


     •       Only component combinations are comparable
     •       Dedicated performance studies are expensive & difficult
6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                             3
Solution: Player Rating Systems
                       Performance Comparison                                       Multiplayer Team Results

             E.g., Event Queues
                                     {      A                  B                      1.   SC
                                                                                      2.   SE      B
Simulators
                {       SC                 SD                 SE                      3.   SD      A
                       15 s                25 s              17 s

    1. Component Combination = Team of Players
    2. Record results (of multiple combinations)
    3. Update global component rating
⇒ Component Rating Systems, e.g. to find good default components.

6. 3. 2013    c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                              4
Component Rating Systems



     • What is required?

     • How does it work?

     • How well does it work?



6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP   5
Component Rating Systems: Requirements


     •       Re-usable (system-independent)

     •       Inexpensive (memory, execution time)

     •       Scalable (w.r.t. components / component combinations)

     •       Robust (w.r.t. ‘outlier problems’)

     •       Adaptive (component updates)




6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP   6
Microsoft’s TrueSkillTM Approach1 (used for XBox LiveTM )
 • Input:
                • Team defined by player indices, e.g., Ai = {4, 8, 125}
                • Team assignment A = {A1 , . . . , Ak } (pairwise disjoint)
                • Team ranking r (game result)

     •       Output: player skill ratings µ                 i



     •       Assumptions:
                • Player skill si ∼ N (µi , σi2 )
                • Player performance pi ∼ N (si , β 2 )
                • Team performance tj =                          i ∈ A pi
                                                                       j



1: Herbrich, Minka, and Graepel: TrueSkill(tm): A Bayesian Skill Rating System, Adv. in Neural Information Processing Systems 19, 2007
6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                                                         7
Bayesian Inference in TrueSkill



                                                   P (r |s , A)p(s )
                           p(s |r , A) =
                                                       P (r |A)
                                                       ∞               ∞
                                             =              ...             p(s , p , t |r , A)d p d t
                                                     −∞              −∞




r : ranking                                      t : team performances                    s : player skills
A: team assignment                               p : player performances


   6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                           8
Factor Graphs & Message Passing
parison          Multi-Player Team Results

  B                    1.        SC                                                     Skills    sSC        sSE       sB        sSD       sA


                       2.        SE             B
 SE
                       3.        SD             A
 17 s                                                                               Performance   pSC        pSE       pB        pSD       pA


        1. Pass messages downwards:
           s→p→t
        2. Expectation propagation (≈):                                                Team

           t ↔ d (r )                                                               Performance   tSC          tSE+B               tSD+A


        3. Pass messages upwards:
           t→p→s                                                                       Team
                                                                                    Performance
                                                                                     Difference
                                                                                                        d1                  d2




          6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                                                      9
Limitations & Adaptations


     •       Strong assumptions that may not hold:
                • Player performance independence
                • Normally distributed performance


     •       No additive team performance → average


     •       Player may play in more than one team




6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP   10
Ranking Event Queues in JAMES II: Reference Data
  Event Queues / Models            1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20            Sum
  MList                             4 4 5 2 3 5 7 3 5 4 4 4 4 4 4 5 1 1 1 6                       76
  LinkedList                        1 1 1 5 1 3 3 5 3 7 6 6 6 6 6 2 5 5 5 3                       80
  TwoList                           2 2 2 6 2 2 2 7 2 3 7 7 7 7 7 4 6 6 6 4                       91
  CalendarQueue                     7 7 8 3 6 8 8 1 6 5 2 2 2 3 2 9 2 2 2 7                       92
  BucketsThreshold                 10 9 9 1 8 1 4 6 10 8 1 1 1 1 1 8 4 4 4 2                      93
  MPLinkedList                      3 3 3 7 4 6 5 4 4 1 9 9 9 9 9 3 7 7 7 5                      114
  CalendarReQueue                   9 10 10 4 7 9 6 10 9 9 3 3 3 2 3 7 3 3 3 8                   121
  Heap                              5 5 4 8 5 4 1 9 8 6 8 8 8 8 8 6 9 9 9 1                      129
  Simple                            8 6 6 9 9 7 9 2 1 10 5 5 5 5 5 1 10 10 10 9                  132
  DynamicCalendarQueue              6 8 7 10 10 9 10 8 7 2 10 10 10 10 10 10 8 8 8 10            171

                                   55 55 55 55 55 54 55 55 55 55 55 55 55 55 55 55 55 55 55 55
     •       Five models for each formalism: SRS, stoch-π, PDEVS, SR
     •       Per formalism: (1 + 3 + 1 + 3 = 8 simulators) × 10 event queues
     •       80 comp. combinations × 20 replications × 5 models = 8.000 runs

6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                       11
Experiment Setup


                                                                                       1.               A, r
Simulation Problems                                                                    2.


                                   A                       B
                                                                                   Component Rating System
Eligible Component
Combinations
                                  SD                     SE                        Current Event Queue Ranking:
                                                                                    1. ...
Execution Times
                                                                                        ...   ? Count Inversions
                                                                                   10. ...




6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                                   12
Evaluation: Ranking Event Queues
                          25
                                                                                                                      Default Setup
                                                                                                                      β = 833.3
                          20
      Average Number of Inversions




                          15



                          10



                                     5



                                     0
                                          0         1000      2000       3000  4000   5000     6000   7000     8000      9000     10000
                                                                         Component Combination Comparisons

6. 3. 2013                               c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                              13
Summary

Problem: How to evaluate individual components of a simulation system?
Solution: A scalable and robust component rating system.
Method: Bayesian inference (MS TrueSkill algorithm).

Outlook:
     •       Global component rankings
     •       Consider ‘margin of victory’
     •       Improve usage for experiment steering


6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP   14
http://bitbucket.org/alesia




6. 3. 2013   c 2013                             (License: Apache 2.0)
                      UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP   15
Thank you.
                                                         Questions?




6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP   16
Operation Modes

                         Passive Mode                                        Active Mode
                                                                                   Simulation
                                   Users
                                                                                    Software
                       Problem                Results             Performance            Problem &
                                                                        Results          Component
                             Simulation                                                  Combinations
                              Software                                        Match Selection &
                                                                              Experiment Control
                Component                     Component            Component             Component
               Comparisons                    Ranks               Comparisons            Ranks etc.

                          Component                                          Component
                         Rating System                                      Rating System


6. 3. 2013   c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                        17
Evaluation: Ranking Event Queues
                               30
                                                                                                          Passive Mode      Active Mode
                               25
Average Number of Inversions




                               20


                               15


                               10


                                5


                                0
                                    0        1000       2000       3000 4000 5000 6000 7000                              8000   9000   10000
                                                                   Component Combination Comparisons
6. 3. 2013                          c 2013   UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP                                        18

Contenu connexe

Similaire à Evaluating Simulation Software Components with Player Rating Systems (SIMUTools 2013)

Pricipal Component Analysis Using R
Pricipal Component Analysis Using RPricipal Component Analysis Using R
Pricipal Component Analysis Using R
Karthi Keyan
 

Similaire à Evaluating Simulation Software Components with Player Rating Systems (SIMUTools 2013) (20)

Apache Sparkを用いたスケーラブルな時系列データの異常検知モデル学習ソフトウェアの開発
Apache Sparkを用いたスケーラブルな時系列データの異常検知モデル学習ソフトウェアの開発Apache Sparkを用いたスケーラブルな時系列データの異常検知モデル学習ソフトウェアの開発
Apache Sparkを用いたスケーラブルな時系列データの異常検知モデル学習ソフトウェアの開発
 
Pricipal Component Analysis Using R
Pricipal Component Analysis Using RPricipal Component Analysis Using R
Pricipal Component Analysis Using R
 
RDataMining slides-regression-classification
RDataMining slides-regression-classificationRDataMining slides-regression-classification
RDataMining slides-regression-classification
 
Machine Learning: Classification Concepts (Part 1)
Machine Learning: Classification Concepts (Part 1)Machine Learning: Classification Concepts (Part 1)
Machine Learning: Classification Concepts (Part 1)
 
Efficient aggregation for graph summarization
Efficient aggregation for graph summarizationEfficient aggregation for graph summarization
Efficient aggregation for graph summarization
 
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
 
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
 
Evolution of Spark APIs
Evolution of Spark APIsEvolution of Spark APIs
Evolution of Spark APIs
 
Data profiling in Apache Calcite
Data profiling in Apache CalciteData profiling in Apache Calcite
Data profiling in Apache Calcite
 
Data profiling with Apache Calcite
Data profiling with Apache CalciteData profiling with Apache Calcite
Data profiling with Apache Calcite
 
Data Profiling in Apache Calcite
Data Profiling in Apache CalciteData Profiling in Apache Calcite
Data Profiling in Apache Calcite
 
Data miningpresentation
Data miningpresentationData miningpresentation
Data miningpresentation
 
Optimum failure truncated testing strategies
Optimum failure truncated testing strategies Optimum failure truncated testing strategies
Optimum failure truncated testing strategies
 
2016 QDB VLDB Workshop - Towards Rigorous Evaluation of Data Integration Syst...
2016 QDB VLDB Workshop - Towards Rigorous Evaluation of Data Integration Syst...2016 QDB VLDB Workshop - Towards Rigorous Evaluation of Data Integration Syst...
2016 QDB VLDB Workshop - Towards Rigorous Evaluation of Data Integration Syst...
 
R studio
R studio R studio
R studio
 
presentationIDC - 14MAY2015
presentationIDC - 14MAY2015presentationIDC - 14MAY2015
presentationIDC - 14MAY2015
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
 
IIBMP2019 講演資料「オープンソースで始める深層学習」
IIBMP2019 講演資料「オープンソースで始める深層学習」IIBMP2019 講演資料「オープンソースで始める深層学習」
IIBMP2019 講演資料「オープンソースで始める深層学習」
 
Human Activity Recognition Using AccelerometerData
Human Activity Recognition Using AccelerometerDataHuman Activity Recognition Using AccelerometerData
Human Activity Recognition Using AccelerometerData
 
Fundamentals of Engineering Probability Visualization Techniques & MatLab Cas...
Fundamentals of Engineering Probability Visualization Techniques & MatLab Cas...Fundamentals of Engineering Probability Visualization Techniques & MatLab Cas...
Fundamentals of Engineering Probability Visualization Techniques & MatLab Cas...
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

Evaluating Simulation Software Components with Player Rating Systems (SIMUTools 2013)

  • 1. Evaluating Simulation Software Components with Player Rating Systems 6. 3. 2013, SIMUTools 2013 Jonathan Wienß Michael Stein Roland Ewald Sponsored by: 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 1
  • 2. Component-Based Simulation Systems • Simulator: combination of components • Typical components: • Event management • Collision detection • State saving • Result storage • Random number generation • etc. • Example: JAMES II http://flickr.com/photos/jdhancock/7239958506, cc-by 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 2
  • 3. Problem: Evaluating Individual Components https://commons.wikimedia.org/wiki/File:Rowing_-_USA_Lwt_4_@_World_Champs_2003.jpg • Only component combinations are comparable • Dedicated performance studies are expensive & difficult 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 3
  • 4. Solution: Player Rating Systems Performance Comparison Multiplayer Team Results E.g., Event Queues { A B 1. SC 2. SE B Simulators { SC SD SE 3. SD A 15 s 25 s 17 s 1. Component Combination = Team of Players 2. Record results (of multiple combinations) 3. Update global component rating ⇒ Component Rating Systems, e.g. to find good default components. 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 4
  • 5. Component Rating Systems • What is required? • How does it work? • How well does it work? 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 5
  • 6. Component Rating Systems: Requirements • Re-usable (system-independent) • Inexpensive (memory, execution time) • Scalable (w.r.t. components / component combinations) • Robust (w.r.t. ‘outlier problems’) • Adaptive (component updates) 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 6
  • 7. Microsoft’s TrueSkillTM Approach1 (used for XBox LiveTM ) • Input: • Team defined by player indices, e.g., Ai = {4, 8, 125} • Team assignment A = {A1 , . . . , Ak } (pairwise disjoint) • Team ranking r (game result) • Output: player skill ratings µ i • Assumptions: • Player skill si ∼ N (µi , σi2 ) • Player performance pi ∼ N (si , β 2 ) • Team performance tj = i ∈ A pi j 1: Herbrich, Minka, and Graepel: TrueSkill(tm): A Bayesian Skill Rating System, Adv. in Neural Information Processing Systems 19, 2007 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 7
  • 8. Bayesian Inference in TrueSkill P (r |s , A)p(s ) p(s |r , A) = P (r |A) ∞ ∞ = ... p(s , p , t |r , A)d p d t −∞ −∞ r : ranking t : team performances s : player skills A: team assignment p : player performances 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 8
  • 9. Factor Graphs & Message Passing parison Multi-Player Team Results B 1. SC Skills sSC sSE sB sSD sA 2. SE B SE 3. SD A 17 s Performance pSC pSE pB pSD pA 1. Pass messages downwards: s→p→t 2. Expectation propagation (≈): Team t ↔ d (r ) Performance tSC tSE+B tSD+A 3. Pass messages upwards: t→p→s Team Performance Difference d1 d2 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 9
  • 10. Limitations & Adaptations • Strong assumptions that may not hold: • Player performance independence • Normally distributed performance • No additive team performance → average • Player may play in more than one team 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 10
  • 11. Ranking Event Queues in JAMES II: Reference Data Event Queues / Models 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Sum MList 4 4 5 2 3 5 7 3 5 4 4 4 4 4 4 5 1 1 1 6 76 LinkedList 1 1 1 5 1 3 3 5 3 7 6 6 6 6 6 2 5 5 5 3 80 TwoList 2 2 2 6 2 2 2 7 2 3 7 7 7 7 7 4 6 6 6 4 91 CalendarQueue 7 7 8 3 6 8 8 1 6 5 2 2 2 3 2 9 2 2 2 7 92 BucketsThreshold 10 9 9 1 8 1 4 6 10 8 1 1 1 1 1 8 4 4 4 2 93 MPLinkedList 3 3 3 7 4 6 5 4 4 1 9 9 9 9 9 3 7 7 7 5 114 CalendarReQueue 9 10 10 4 7 9 6 10 9 9 3 3 3 2 3 7 3 3 3 8 121 Heap 5 5 4 8 5 4 1 9 8 6 8 8 8 8 8 6 9 9 9 1 129 Simple 8 6 6 9 9 7 9 2 1 10 5 5 5 5 5 1 10 10 10 9 132 DynamicCalendarQueue 6 8 7 10 10 9 10 8 7 2 10 10 10 10 10 10 8 8 8 10 171 55 55 55 55 55 54 55 55 55 55 55 55 55 55 55 55 55 55 55 55 • Five models for each formalism: SRS, stoch-π, PDEVS, SR • Per formalism: (1 + 3 + 1 + 3 = 8 simulators) × 10 event queues • 80 comp. combinations × 20 replications × 5 models = 8.000 runs 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 11
  • 12. Experiment Setup 1. A, r Simulation Problems 2. A B Component Rating System Eligible Component Combinations SD SE Current Event Queue Ranking: 1. ... Execution Times ... ? Count Inversions 10. ... 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 12
  • 13. Evaluation: Ranking Event Queues 25 Default Setup β = 833.3 20 Average Number of Inversions 15 10 5 0 0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 Component Combination Comparisons 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 13
  • 14. Summary Problem: How to evaluate individual components of a simulation system? Solution: A scalable and robust component rating system. Method: Bayesian inference (MS TrueSkill algorithm). Outlook: • Global component rankings • Consider ‘margin of victory’ • Improve usage for experiment steering 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 14
  • 15. http://bitbucket.org/alesia 6. 3. 2013 c 2013 (License: Apache 2.0) UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 15
  • 16. Thank you. Questions? 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 16
  • 17. Operation Modes Passive Mode Active Mode Simulation Users Software Problem Results Performance Problem & Results Component Simulation Combinations Software Match Selection & Experiment Control Component Component Component Component Comparisons Ranks Comparisons Ranks etc. Component Component Rating System Rating System 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 17
  • 18. Evaluation: Ranking Event Queues 30 Passive Mode Active Mode 25 Average Number of Inversions 20 15 10 5 0 0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 Component Combination Comparisons 6. 3. 2013 c 2013 UNIVERSITÄT ROSTOCK | MODELING & SIMULATION RESEARCH GROUP 18