Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Towards the Characterization of Realistic Models: Evaluation of Multidisciplinary Graph Metrics
1. Budapest University of Technology and Economics
Department of Measurement and Information Systems
MTA-BME Lendület Research Group on Cyber-Physical Systems
Budapest University of Technology and Economics
Fault Tolerant Systems Research Group
Towards the Characterization of Realistic Models:
Evaluation of Multidisciplinary Graph Metrics
Gábor Szárnyas, Zsolt Kővári,
Ágnes Salánki, Dániel Varró
2. Motivation
Research Community
Problems of
experimental
evaluation of
MDE papers
Difficult to find real
industrial model
Tool Providers
Test generation for
modeling tools
Scalability evaluation
and stress testing of
MDE tools
Smart CPS
Synthesis of
prototypical test
context/environment
Testing of
autonomous robots
(R3COP project)
3. Motivation
Research Community
Problems of
experimental
evaluation of
MDE papers
Difficult to find real
industrial model
Tool Providers
Test generation for
modeling tools
Scalability evaluation
and stress testing of
MDE tools
Smart CPS
Synthesis of
prototypical test
context/environment
Testing of
autonomous robots
(R3COP project)
How to automatically synthesize graph models…?
4. Research Question and Objectives
• All well-formedness constraints satisfied
• Designated seed fragments includedConsistent
• How to characterize realistic models?
• How to distinguish real and generated models?Realistic
• Guaranteed test coverage
• Required for tool qualificationDiverse
• Performance benchmarks
• Stress testing of tools and control algorithmsScalable
How to automatically synthesize graph models which are...
5. Research Question and Objectives
• All well-formedness constraints satisfied
• Designated seed fragments includedConsistent
• How to characterize realistic models?
• How to distinguish real and generated models?Realistic
• Guaranteed test coverage
• Required for tool qualificationDiverse
• Performance benchmarks
• Stress testing of tools and control algorithmsScalable
How to automatically synthesize graph models which are...
8. Performance Experiments
„I would like to benchmark my tool on real
models”
o Industrial models are difficult to obtain.
9. Performance Experiments
„I would like to benchmark my tool on real
models”
o Industrial models are difficult to obtain.
Workaround #1: „Never mind, my tool has very
good performance for the TTC 2038 case.”
10. Performance Experiments
„I would like to benchmark my tool on real
models”
o Industrial models are difficult to obtain.
Workaround #1: „Never mind, my tool has very
good performance for the TTC 2038 case.”
o Great, but what does that imply for real use cases?
11. Performance Experiments
„I would like to benchmark my tool on real
models”
o Industrial models are difficult to obtain.
Workaround #1: „Never mind, my tool has very
good performance for the TTC 2038 case.”
o Great, but what does that imply for real use cases?
Workaround #2: Implement a custom benchmark
12. Performance Experiments
„I would like to benchmark my tool on real
models”
o Industrial models are difficult to obtain.
Workaround #1: „Never mind, my tool has very
good performance for the TTC 2038 case.”
o Great, but what does that imply for real use cases?
Workaround #2: Implement a custom benchmark
o Again, what does that imply for real use cases?
13. Performance Experiments
„I would like to benchmark my tool on real
models”
o Industrial models are difficult to obtain.
Workaround #1: „Never mind, my tool has very
good performance for the TTC 2038 case.”
o Great, but what does that imply for real use cases?
Workaround #2: Implement a custom benchmark
o Again, what does that imply for real use cases?
Qualitative description of models is required
14. How to Obtain Models for Benchmarking?
• Difficult to obtain
• Obfuscated models
Industrial
• Quality of models?Student work
• Good quality models
• Small in size
Tutorial
• How realistic are these models?Generated
15. What Makes a Model Realistic?
How to decide if a model is realistic
without domain-specific knowledge?
35. Graph Metrics
S1 S2 S3 S4
T1 T2 T3 T4 T5
E S4 S1 S2 S3
T1 T2 T3 T4 T5
E
Which is the graph
of a real model?
36. Graph Metrics
S1 S2 S3 S4
T1 T2 T3 T4 T5
E S4 S1 S2 S3
T1 T2 T3 T4 T5
E
Which is the graph
of a real model?
37. Graph Metrics
S1 S2 S3 S4
T1 T2 T3 T4 T5
E S4 S1 S2 S3
T1 T2 T3 T4 T5
E
They are isomorphic.
Which is the graph
of a real model?
38. Graph Metrics
S1 S2 S3 S4
T1 T2 T3 T4 T5
E S4 S1 S2 S3
T1 T2 T3 T4 T5
E
They are isomorphic.
Which is the graph
of a real model?
Related finding: simple
graph metrics are unable to
predict query performance
39. Network Theory
Mid ‘90s, László Albert-Barabási et al.
o Preferential attachment: „the rich gets richer”
Scale-free networks (web, power grid, etc.)
Most approaches only consider untyped graphs.
40. Network Theory
Mid ‘90s, László Albert-Barabási et al.
o Preferential attachment: „the rich gets richer”
Scale-free networks (web, power grid, etc.)
Most approaches only consider untyped graphs.
S
1
S
2
S
3
S
4
T
1
T
2
T
3
T
4
T
5
E
41. Network Theory
Mid ‘90s, László Albert-Barabási et al.
o Preferential attachment: „the rich gets richer”
Scale-free networks (web, power grid, etc.)
Most approaches only consider untyped graphs.
S
1
S
2
S
3
S
4
T
1
T
2
T
3
T
4
T
5
E S4 S1 S2 S3
T1 T2 T3 T4 T5
E
43. Multidimensional Metrics
Dimensional degree distributions
Node dimension connectivity
o ratio of nodes in the that belong to a dimension
Multiplex participation coefficient
o the connections of v are uniformly distributed among D
Node activity & pairwise multiplexity
o the ratio of nodes, which are active in both d1 and d2
48. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
49. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
50. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
o object types
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
51. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
o object types
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
52. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
o object types
o small models
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
53. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
o object types
o small models
o derived references
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
54. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
o object types
o small models
o derived references
3. Calculate graph metrics
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
55. Methodology
1. Collect models
2. Data Cleansing: remove
o layout information
o attributes
o object types
o small models
o derived references
3. Calculate graph metrics
4. Analyze results
o Statistical + exploratory
Red
Red-
Orange
Green Orange
T1 T2 T3 T4 T5
Entry
71. Findings
1. Metamodel-level information is insufficient
1. The ratio of containment edge types in the Capella
metamodels: 75%
2. The ratio of containment edges in the Capella
models: 42–50 %
74. Future Directions
Use metrics for
o Instance model generators
o Query optimization
Improve performance of calculating metrics:
incremental calculation
o https://github.com/ftsrg/model-analyzer
o Works for both EMF and RDF models
All analysis results & code are available online:
o http://docs.inf.mit.bme.hu/model-metrics/
75. The Train Benchmark
SOSYM paper – The Train Benchmark: Cross-Technology
Performance Evaluation of Continuous Model Validation
o 6 queries, 12 transformations
o EMF, property graphs, RDF, SQL
o 12+ tools
o Automated visualization & reporting
http://github.com/ftsrg/trainbenchmark