Synthesis of Attributed Feature Models From Product Descriptions

University of Rennes, INSA Rennes, Inria/IRISA, CNRS
University of Rennes, INSA Rennes, Inria/IRISA, CNRSProfessor à University of Rennes, INSA Rennes, Inria/IRISA, CNRS
root F0 F1 F2 F3 A1 A2
Yes Yes Yes No Yes 3 0
Yes Yes No Yes Yes 2 2
Yes Yes Yes No No 2 0
Yes Yes No Yes No 0 8
Synthesis
root
F1
F3
F2F0
A1 A2
A2 < 8 => A1 >= 2
A1 > 0 => A2 <= 2
Guillaume Bécan, Razieh Behjati, Arnaud Gotlieb, Mathieu Acher
Synthesis of Attributed Feature Models
From Product Descriptions
Notes
● Slides have been presented at SPLC'15
(Nashville, USA)
● Guillaume Bécan has made the vast
majority of the slides
● Some slides were previously presented at
FOSD'15 meeting
● Paper here:
https://hal.inria.fr/hal-01178454
Synthesis of Attributed Feature Models: Foundations 3
Product Lines/Highly configurable systems
config NR_CPUS
int "Maximum number of CPUs" if SMP && !MAXSMP
range 2 8 if SMP && X86_32 && !X86_BIGSMP
range 2 512 if SMP && !MAXSMP && !CPUMASK_OFFSTACK
range 2 8192 if SMP && !MAXSMP && CPUMASK_OFFSTACK && X86_64
default "1" if !SMP
default "8192" if MAXSMP
default "32" if SMP && X86_BIGSMP
default "8" if SMP
---help---
This allows you to specify the maximum number of CPUs which this
kernel will support. If CPUMASK_OFFSTACK is enabled, the maximum
supported value is 4096, otherwise the maximum value is 512. The
minimum value which makes sense is 2.
This is purely to save memory - each supported CPU adds
approximately eight kilobytes to the kernel image.
Linux kernel
Synthesis of Attributed Feature Models: Foundations 4
● 2000+ options in about:config
● 3 types: boolean, integers, string
Firefox
Product Lines/Highly configurable systems
Synthesis of Attributed Feature Models: Foundations 5
Product Lines
Product comparison matrices
Comparison of digital SLRs
Synthesis of Attributed Feature Models: Foundations 6
Modeling and Synthesis
● Boolean feature models (FMs) are nice but options are not only
boolean. Languages/operations have been defined on top of
attributed FMs; so where are attributed feature models?
● Numerous works address the synthesis of Boolean feature models,
eg [Czarnecki et al. SPLC'07, Andersen et al. SPLC'12, Davril et al.
FSE'13, Becan et al. ESE'15] but none of them consider attributes
●
We introduce for the first time an algorithmic and
parameterizable approach for synthesizing attributed
FMs from product descriptions
Synthesis of Attributed Feature Models: Foundations 7
From Configuration Matrix to AFM
#2 synthesis procedure
#3 scalability evaluation
#1
semantics
Synthesis of Attributed Feature Models: Foundations 8
Attributed Feature Model (AFM)
● AFM = attributed feature diagram + an arbitrary constraint
● Attributed feature diagram =
– Hierarchy of features
– Attributes: placed in features, they take a value in a domain
– Domain = set of values, null value and a partial order
– A set of human readable constraints
Synthesis of Attributed Feature Models: Foundations 9
Formalizing the AFM Synthesis Problem
● Configuration matrix = intermediate representation of a
set of configurations
● Given a configuration matrix, synthesize an AFM which is
– Maximal = the feature diagram contains as much as
possible information
– Sound and Complete = represent exactly the set of
configurations
Synthesis of Attributed Feature Models: Foundations 10
Formalizing the AFM Synthesis Problem
● Theoretical contributions (more details in the paper)
– Over-approximation of the attributed feature diagram
– Several AFMs are possible for one configuration matrix
root F0 F1 F2 F3 A1 A2
Yes Yes Yes No Yes 3 0
Yes Yes No Yes Yes 2 2
Yes Yes Yes No No 2 0
Yes Yes No Yes No 0 8
root
F1
F3
F2F0
A1 A2
root
F1
F3F2F0
A1
A2
excludes
Synthesis
Synthesis of Attributed Feature Models: Foundations 11
Synthesis algorithm
● Input: Configuration Matrix + Domain Knowledge ( )
● Output: AFM which is maximal, sound and complete
= user input + default heuristics
Tool name : FOReverSE
Synthesis of Attributed Feature Models: Foundations 12
Synthesis algorithm
● Extract feature, attributes and their domains
Features
● Root
● F0
● F1
● F2
● F3
Attributes
● A1 : {0,2,3}, 0
● A2 : {0,2,8}, 0
root F0 F1 F2 F3 A1 A2
Yes Yes Yes No Yes 3 0
Yes Yes No Yes Yes 2 2
Yes Yes Yes No No 2 0
Yes Yes No Yes No 0 8
● Compute binary implications
root F0 F1 F2 F3 A1 A2
Yes Yes Yes No Yes 3 0
Yes Yes No Yes Yes 2 2
Yes Yes Yes No No 2 0
Yes Yes No Yes No 0 8
...
F2 => ¬ F1
¬ F2 => F1
F2 => A1 ∈ {0,2}
¬ F2 => A1 ∈ {2,3}
F2 => A2 ∈ {2,8}
¬ F2 => A2 ∈ {0}
…
A1 = 2 => A2 ∈ {0,2}
Synthesis of Attributed Feature Models: Foundations 13
Synthesis algorithm
● Define hierarchy (feature tree + place of attributes)
Binary implications
...
F2 => ¬ F1
¬ F2 => F1
F2 => A1 ∈ {0,2}
¬ F2 => A1 ∈ {2,3}
F2 => A2 ∈ {2,8}
¬ F2 => A2 ∈ {0}
...
root
F1
F3
F2F0
Binary implication graph
(all possible hierarchies)
root
F1
F3
F2F0
Hierarchy
Possible places for attributes
¬ f => ( a = null value of a )
A1 : root, F0
A2 : root, F0, F2
A1 A2
Synthesis of Attributed Feature Models: Foundations 14
Synthesis algorithm
● Compute variability information (mandatory features, features groups)
– Only based on features
– Reuse existing algorithms (She et al., Inform Software Tech, 2014)
Xor: {F1, F2}
root
F1
F3
F2 F0
Mutex graph
Solver
Binary implications
Possible
feature groups
(Mutex, Or, Xor)
root
F1
F3
F2F0
A1 A2
+
+
Binary implication graph Hierarchy+ Mandatory
features
Synthesis of Attributed Feature Models: Foundations 15
Synthesis algorithm
● Compute readable constraints
Readable constraints
A1 > 0 => A2 <= 2
A2 < 8 => A1 >= 2
Interesting values
for attributes
A1 → 0
A2 → 8
Binary implications
...
A1 = 0 => A2 ∈ {8}
A1 = 2 => A2 ∈ {0,2}
A1 = 3 => A2 ∈ {0}
A2 = 0 => A1 ∈ {2,3}
A2 = 2 => A1 ∈ {2}
A2 = 8 => A1 ∈ {0}
…
Merge
constraints
Grammar of readable constraints
Synthesis of Attributed Feature Models: Foundations 16
User Effort
Domain knowledge =
● user input
● default heuristics
No user effort
Arbitrary choices
Fully automated algorithm
More user effort => better AFM
● Feature or attribute?
● Interpretation of the cells (“Yes”
= true)
● Hierarchy (F3 below F0)
● Overlapping feature groups
● Bounds in constraints (“0” is an
interesting value for A1)
Synthesis of Attributed Feature Models: Foundations 17
Scalability
Random dataset
● Generator of configuration matrices
– Number of variables (features + attributes)
– Number of configurations
– Maximum domain size (number of distinct values in a column)
● Execution time of or-group computation
● 1000 configurations
● max domain size of 10
Timeout always reached with
more than 60 variables
Or groups do not scale !
= default heuristics only
Synthesis of Attributed Feature Models: Foundations 18
Scalability
Random dataset
● Execution time (no or-groups)
● Up to 2,000 variables
● 1,000 configurations
● Max domain size of 10
● 100 variables
● Up to 200,000 configurations
● Max domain size of 10
● 10 variables
● 10,000 configurations
● Up to 6000 distinct values
= default heuristics only
On all experiments:
Average: 2.6 min
Max: 62 min
Synthesis of Attributed Feature Models: Foundations 19
Scalability
Best Buy dataset
Execution time of 2.1s for the most challenging matrix:
● 77 variables
● 185 configurations
● Maximum domain size of 185
Execution time is similar to the random dataset
● 242 matrices
● < 25% of empty cells
● Interpretation of empty cells = default heuristics only
Synthesis of Attributed Feature Models: Foundations 20
Conclusion
We introduce for the first time an algorithmic and
parameterizable approach for synthesizing attributed FMs
from product descriptions
Three key contributions:
● Semantics of attributed feature models/configuration
matrix (over- approximation, equivalence)
● We designed and implemented a tool-supported synthesis
algorithm
● We empirically evaluated the scalability of the synthesis
algorithm on random and real-world matrices.
Synthesis of Attributed Feature Models: Foundations 21
Future Work
● Empirical experiments provide evidence that the number
of constraints can be huge
– Random dataset: 237 constraints in average, 8906 max
– Best Buy dataset: 6821 constraints in average, 28300 max
● How to address the problem?
– Minimization, prioritization, user-specified
– Combination thereof
● Opencompare.org
– More empirical studies on realistic matrices (beyond Best
Buy)
Synthesis of Attributed Feature Models: Foundations 22
Future Work
● Take a set of constraints as input (rather
than a set of configurations)
– arbitrary propositional formula
– require the use of SMT/CP solvers
● Develop other operations on attributed
feature models: aggregate, merge, slice...
– Instance of synthesis problem
Synthesis of Attributed Feature Models: Foundations 23
Questions?
root F0 F1 F2 F3 A1 A2
Yes Yes Yes No Yes 3 0
Yes Yes No Yes Yes 2 2
Yes Yes Yes No No 2 0
Yes Yes No Yes No 0 8
Synthesis
root
F1
F3
F2F0
A1 A2
A2 < 8 => A1 >= 2
A1 > 0 => A2 <= 2
Synthesis of Attributed Feature Models: Foundations 24
Product Lines
{
"name": "Luke Skywalker",
"height": "1.72 m",
"mass": "77 Kg",
"hair_color": "Blond",
"skin_color": "Caucasian",
"eye_color": "Blue",
"birth_year": "19 BBY",
"gender": "Male",
"homeworld":
"http://swapi.co/api/planets/1/",
"films": [
"http://swapi.co/api/films/1/",
"http://swapi.co/api/films/2/",
"http://swapi.co/api/films/3/"
],
...
}
The Star Wars API (http://swapi.co/)
{
"name": "Darth Vader",
"height": "202",
"mass": "136",
"hair_color": "none",
"skin_color": "white",
"eye_color": "yellow",
"birth_year": "41.9BBY",
"gender": "male",
"homeworld":
"http://swapi.co/api/planets/1/",
"films": [
"http://swapi.co/api/films/6/",
"http://swapi.co/api/films/3/",
"http://swapi.co/api/films/2/",
"http://swapi.co/api/films/1/"
],
...
}
1 sur 24

Recommandé

Automating the Formalization of Product Comparison Matrices par
Automating the Formalization of Product Comparison MatricesAutomating the Formalization of Product Comparison Matrices
Automating the Formalization of Product Comparison MatricesGuillaume Bécan
1.5K vues26 diapositives
Product Comparison Matrix (PCM), Variability Modeling: The Wikipedia Case Study par
Product Comparison Matrix (PCM), Variability Modeling: The Wikipedia Case StudyProduct Comparison Matrix (PCM), Variability Modeling: The Wikipedia Case Study
Product Comparison Matrix (PCM), Variability Modeling: The Wikipedia Case StudyUniversity of Rennes, INSA Rennes, Inria/IRISA, CNRS
3K vues18 diapositives
CIS110 Computer Programming Design Chapter (5) par
CIS110 Computer Programming Design Chapter  (5)CIS110 Computer Programming Design Chapter  (5)
CIS110 Computer Programming Design Chapter (5)Dr. Ahmed Al Zaidy
1.6K vues52 diapositives
CIS110 Computer Programming Design Chapter (4) par
CIS110 Computer Programming Design Chapter  (4)CIS110 Computer Programming Design Chapter  (4)
CIS110 Computer Programming Design Chapter (4)Dr. Ahmed Al Zaidy
1.6K vues46 diapositives
CIS110 Computer Programming Design Chapter (2) par
CIS110 Computer Programming Design Chapter  (2)CIS110 Computer Programming Design Chapter  (2)
CIS110 Computer Programming Design Chapter (2)Dr. Ahmed Al Zaidy
2K vues47 diapositives
Types of device models(part 2) par
Types of device models(part 2)Types of device models(part 2)
Types of device models(part 2)SudharshaG
25 vues21 diapositives

Contenu connexe

Tendances

CIS110 Computer Programming Design Chapter (7) par
CIS110 Computer Programming Design Chapter  (7)CIS110 Computer Programming Design Chapter  (7)
CIS110 Computer Programming Design Chapter (7)Dr. Ahmed Al Zaidy
862 vues43 diapositives
PUMConf: A Tool to Configure Product Specific Use Case and Domain Models in a... par
PUMConf: A Tool to Configure Product Specific Use Case and Domain Models in a...PUMConf: A Tool to Configure Product Specific Use Case and Domain Models in a...
PUMConf: A Tool to Configure Product Specific Use Case and Domain Models in a...Lionel Briand
300 vues25 diapositives
Programming Logic and Design: Working with Data par
Programming Logic and Design: Working with DataProgramming Logic and Design: Working with Data
Programming Logic and Design: Working with DataNicole Ryan
1.2K vues45 diapositives
Thai par
ThaiThai
ThaiMr. Oliver Cromwell Holley Sr.
53 vues4 diapositives
CIS110 Computer Programming Design Chapter (13) par
CIS110 Computer Programming Design Chapter  (13)CIS110 Computer Programming Design Chapter  (13)
CIS110 Computer Programming Design Chapter (13)Dr. Ahmed Al Zaidy
309 vues42 diapositives
spss Help par
spss Helpspss Help
spss HelpSaroj Suwal
3.3K vues38 diapositives

Tendances(9)

Similaire à Synthesis of Attributed Feature Models From Product Descriptions

Designing Architecture-aware Library using Boost.Proto par
Designing Architecture-aware Library using Boost.ProtoDesigning Architecture-aware Library using Boost.Proto
Designing Architecture-aware Library using Boost.ProtoJoel Falcou
1.8K vues58 diapositives
Matlab basic and image par
Matlab basic and imageMatlab basic and image
Matlab basic and imageDivyanshu Rasauria
1.2K vues56 diapositives
A Brief Conceptual Introduction to Functional Java 8 and its API par
A Brief Conceptual Introduction to Functional Java 8 and its APIA Brief Conceptual Introduction to Functional Java 8 and its API
A Brief Conceptual Introduction to Functional Java 8 and its APIJörn Guy Süß JGS
180 vues20 diapositives
Mat lab workshop par
Mat lab workshopMat lab workshop
Mat lab workshopVinay Kumar
516 vues64 diapositives
Ehsan parallel accelerator-dec2015 par
Ehsan parallel accelerator-dec2015Ehsan parallel accelerator-dec2015
Ehsan parallel accelerator-dec2015Christian Peel
2.2K vues31 diapositives
Hierarchical free monads and software design in fp par
Hierarchical free monads and software design in fpHierarchical free monads and software design in fp
Hierarchical free monads and software design in fpAlexander Granin
161 vues59 diapositives

Similaire à Synthesis of Attributed Feature Models From Product Descriptions(20)

Designing Architecture-aware Library using Boost.Proto par Joel Falcou
Designing Architecture-aware Library using Boost.ProtoDesigning Architecture-aware Library using Boost.Proto
Designing Architecture-aware Library using Boost.Proto
Joel Falcou1.8K vues
A Brief Conceptual Introduction to Functional Java 8 and its API par Jörn Guy Süß JGS
A Brief Conceptual Introduction to Functional Java 8 and its APIA Brief Conceptual Introduction to Functional Java 8 and its API
A Brief Conceptual Introduction to Functional Java 8 and its API
Ehsan parallel accelerator-dec2015 par Christian Peel
Ehsan parallel accelerator-dec2015Ehsan parallel accelerator-dec2015
Ehsan parallel accelerator-dec2015
Christian Peel2.2K vues
Hierarchical free monads and software design in fp par Alexander Granin
Hierarchical free monads and software design in fpHierarchical free monads and software design in fp
Hierarchical free monads and software design in fp
Alexander Granin161 vues
How to create a high performance excel engine in java script par Viktor Turskyi
How to create a high performance excel engine in java scriptHow to create a high performance excel engine in java script
How to create a high performance excel engine in java script
Viktor Turskyi601 vues
Introduction to matlab lecture 1 of 4 par Randa Elanwar
Introduction to matlab lecture 1 of 4Introduction to matlab lecture 1 of 4
Introduction to matlab lecture 1 of 4
Randa Elanwar2.5K vues
C++ CoreHard Autumn 2018. Debug C++ Without Running - Anastasia Kazakova par corehard_by
C++ CoreHard Autumn 2018. Debug C++ Without Running - Anastasia KazakovaC++ CoreHard Autumn 2018. Debug C++ Without Running - Anastasia Kazakova
C++ CoreHard Autumn 2018. Debug C++ Without Running - Anastasia Kazakova
corehard_by35 vues
MADlib Architecture and Functional Demo on How to Use MADlib/PivotalR par PivotalOpenSourceHub
MADlib Architecture and Functional Demo on How to Use MADlib/PivotalRMADlib Architecture and Functional Demo on How to Use MADlib/PivotalR
MADlib Architecture and Functional Demo on How to Use MADlib/PivotalR
Automatic Task-based Code Generation for High Performance DSEL par Joel Falcou
Automatic Task-based Code Generation for High Performance DSELAutomatic Task-based Code Generation for High Performance DSEL
Automatic Task-based Code Generation for High Performance DSEL
Joel Falcou1.4K vues
Madeo - a CAD Tool for reconfigurable Hardware par ESUG
Madeo - a CAD Tool for reconfigurable HardwareMadeo - a CAD Tool for reconfigurable Hardware
Madeo - a CAD Tool for reconfigurable Hardware
ESUG383 vues
Archi Modelling par dilane007
Archi ModellingArchi Modelling
Archi Modelling
dilane007308 vues
Accelerating the Development of Efficient CP Optimizer Models par Philippe Laborie
Accelerating the Development of Efficient CP Optimizer ModelsAccelerating the Development of Efficient CP Optimizer Models
Accelerating the Development of Efficient CP Optimizer Models
Philippe Laborie2.5K vues
[ppt] par butest
[ppt][ppt]
[ppt]
butest257 vues

Plus de University of Rennes, INSA Rennes, Inria/IRISA, CNRS

Generative AI for Reengineering Variants into Software Product Lines: An Expe... par
Generative AI for Reengineering Variants into Software Product Lines: An Expe...Generative AI for Reengineering Variants into Software Product Lines: An Expe...
Generative AI for Reengineering Variants into Software Product Lines: An Expe...University of Rennes, INSA Rennes, Inria/IRISA, CNRS
57 vues28 diapositives
Tackling Deep Software Variability Together par
Tackling Deep Software Variability TogetherTackling Deep Software Variability Together
Tackling Deep Software Variability TogetherUniversity of Rennes, INSA Rennes, Inria/IRISA, CNRS
37 vues46 diapositives
On anti-cheating in chess, science, reproducibility, and variability par
On anti-cheating in chess, science, reproducibility, and variabilityOn anti-cheating in chess, science, reproducibility, and variability
On anti-cheating in chess, science, reproducibility, and variabilityUniversity of Rennes, INSA Rennes, Inria/IRISA, CNRS
110 vues30 diapositives
Feature Subset Selection for Learning Huge Configuration Spaces: The case of ... par
Feature Subset Selection for Learning Huge Configuration Spaces: The case of ...Feature Subset Selection for Learning Huge Configuration Spaces: The case of ...
Feature Subset Selection for Learning Huge Configuration Spaces: The case of ...University of Rennes, INSA Rennes, Inria/IRISA, CNRS
44 vues68 diapositives
Machine Learning and Deep Software Variability par
Machine Learning and Deep Software VariabilityMachine Learning and Deep Software Variability
Machine Learning and Deep Software VariabilityUniversity of Rennes, INSA Rennes, Inria/IRISA, CNRS
40 vues75 diapositives
Mastering Software Variability for Innovation and Science par
Mastering Software Variability for Innovation and ScienceMastering Software Variability for Innovation and Science
Mastering Software Variability for Innovation and ScienceUniversity of Rennes, INSA Rennes, Inria/IRISA, CNRS
45 vues73 diapositives

Plus de University of Rennes, INSA Rennes, Inria/IRISA, CNRS(20)

Dernier

How to be(come) a successful PhD student par
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD studentTom Mens
491 vues62 diapositives
application of genetic engineering 2.pptx par
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptxSankSurezz
10 vues12 diapositives
Distinct distributions of elliptical and disk galaxies across the Local Super... par
Distinct distributions of elliptical and disk galaxies across the Local Super...Distinct distributions of elliptical and disk galaxies across the Local Super...
Distinct distributions of elliptical and disk galaxies across the Local Super...Sérgio Sacani
31 vues12 diapositives
SANJAY HPLC.pptx par
SANJAY HPLC.pptxSANJAY HPLC.pptx
SANJAY HPLC.pptxsanjayudps2016
185 vues38 diapositives
DATABASE MANAGEMENT SYSTEM par
DATABASE MANAGEMENT SYSTEMDATABASE MANAGEMENT SYSTEM
DATABASE MANAGEMENT SYSTEMDr. GOPINATH D
7 vues50 diapositives
TF-FAIR.pdf par
TF-FAIR.pdfTF-FAIR.pdf
TF-FAIR.pdfDirk Roorda
6 vues120 diapositives

Dernier(20)

How to be(come) a successful PhD student par Tom Mens
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD student
Tom Mens491 vues
application of genetic engineering 2.pptx par SankSurezz
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptx
SankSurezz10 vues
Distinct distributions of elliptical and disk galaxies across the Local Super... par Sérgio Sacani
Distinct distributions of elliptical and disk galaxies across the Local Super...Distinct distributions of elliptical and disk galaxies across the Local Super...
Distinct distributions of elliptical and disk galaxies across the Local Super...
Sérgio Sacani31 vues
"How can I develop my learning path in bioinformatics? par Bioinformy
"How can I develop my learning path in bioinformatics?"How can I develop my learning path in bioinformatics?
"How can I develop my learning path in bioinformatics?
Bioinformy24 vues
Open Access Publishing in Astrophysics par Peter Coles
Open Access Publishing in AstrophysicsOpen Access Publishing in Astrophysics
Open Access Publishing in Astrophysics
Peter Coles906 vues
Metatheoretical Panda-Samaneh Borji.pdf par samanehborji
Metatheoretical Panda-Samaneh Borji.pdfMetatheoretical Panda-Samaneh Borji.pdf
Metatheoretical Panda-Samaneh Borji.pdf
samanehborji16 vues
himalay baruah acid fast staining.pptx par HimalayBaruah
himalay baruah acid fast staining.pptxhimalay baruah acid fast staining.pptx
himalay baruah acid fast staining.pptx
HimalayBaruah7 vues
Experimental animal Guinea pigs.pptx par Mansee Arya
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptx
Mansee Arya17 vues
Light Pollution for LVIS students par CWBarthlmew
Light Pollution for LVIS studentsLight Pollution for LVIS students
Light Pollution for LVIS students
CWBarthlmew7 vues
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl... par GIFT KIISI NKIN
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...
GIFT KIISI NKIN26 vues
Artificial Intelligence Helps in Drug Designing and Discovery.pptx par abhinashsahoo2001
Artificial Intelligence Helps in Drug Designing and Discovery.pptxArtificial Intelligence Helps in Drug Designing and Discovery.pptx
Artificial Intelligence Helps in Drug Designing and Discovery.pptx

Synthesis of Attributed Feature Models From Product Descriptions

  • 1. root F0 F1 F2 F3 A1 A2 Yes Yes Yes No Yes 3 0 Yes Yes No Yes Yes 2 2 Yes Yes Yes No No 2 0 Yes Yes No Yes No 0 8 Synthesis root F1 F3 F2F0 A1 A2 A2 < 8 => A1 >= 2 A1 > 0 => A2 <= 2 Guillaume Bécan, Razieh Behjati, Arnaud Gotlieb, Mathieu Acher Synthesis of Attributed Feature Models From Product Descriptions
  • 2. Notes ● Slides have been presented at SPLC'15 (Nashville, USA) ● Guillaume Bécan has made the vast majority of the slides ● Some slides were previously presented at FOSD'15 meeting ● Paper here: https://hal.inria.fr/hal-01178454
  • 3. Synthesis of Attributed Feature Models: Foundations 3 Product Lines/Highly configurable systems config NR_CPUS int "Maximum number of CPUs" if SMP && !MAXSMP range 2 8 if SMP && X86_32 && !X86_BIGSMP range 2 512 if SMP && !MAXSMP && !CPUMASK_OFFSTACK range 2 8192 if SMP && !MAXSMP && CPUMASK_OFFSTACK && X86_64 default "1" if !SMP default "8192" if MAXSMP default "32" if SMP && X86_BIGSMP default "8" if SMP ---help--- This allows you to specify the maximum number of CPUs which this kernel will support. If CPUMASK_OFFSTACK is enabled, the maximum supported value is 4096, otherwise the maximum value is 512. The minimum value which makes sense is 2. This is purely to save memory - each supported CPU adds approximately eight kilobytes to the kernel image. Linux kernel
  • 4. Synthesis of Attributed Feature Models: Foundations 4 ● 2000+ options in about:config ● 3 types: boolean, integers, string Firefox Product Lines/Highly configurable systems
  • 5. Synthesis of Attributed Feature Models: Foundations 5 Product Lines Product comparison matrices Comparison of digital SLRs
  • 6. Synthesis of Attributed Feature Models: Foundations 6 Modeling and Synthesis ● Boolean feature models (FMs) are nice but options are not only boolean. Languages/operations have been defined on top of attributed FMs; so where are attributed feature models? ● Numerous works address the synthesis of Boolean feature models, eg [Czarnecki et al. SPLC'07, Andersen et al. SPLC'12, Davril et al. FSE'13, Becan et al. ESE'15] but none of them consider attributes ● We introduce for the first time an algorithmic and parameterizable approach for synthesizing attributed FMs from product descriptions
  • 7. Synthesis of Attributed Feature Models: Foundations 7 From Configuration Matrix to AFM #2 synthesis procedure #3 scalability evaluation #1 semantics
  • 8. Synthesis of Attributed Feature Models: Foundations 8 Attributed Feature Model (AFM) ● AFM = attributed feature diagram + an arbitrary constraint ● Attributed feature diagram = – Hierarchy of features – Attributes: placed in features, they take a value in a domain – Domain = set of values, null value and a partial order – A set of human readable constraints
  • 9. Synthesis of Attributed Feature Models: Foundations 9 Formalizing the AFM Synthesis Problem ● Configuration matrix = intermediate representation of a set of configurations ● Given a configuration matrix, synthesize an AFM which is – Maximal = the feature diagram contains as much as possible information – Sound and Complete = represent exactly the set of configurations
  • 10. Synthesis of Attributed Feature Models: Foundations 10 Formalizing the AFM Synthesis Problem ● Theoretical contributions (more details in the paper) – Over-approximation of the attributed feature diagram – Several AFMs are possible for one configuration matrix root F0 F1 F2 F3 A1 A2 Yes Yes Yes No Yes 3 0 Yes Yes No Yes Yes 2 2 Yes Yes Yes No No 2 0 Yes Yes No Yes No 0 8 root F1 F3 F2F0 A1 A2 root F1 F3F2F0 A1 A2 excludes Synthesis
  • 11. Synthesis of Attributed Feature Models: Foundations 11 Synthesis algorithm ● Input: Configuration Matrix + Domain Knowledge ( ) ● Output: AFM which is maximal, sound and complete = user input + default heuristics Tool name : FOReverSE
  • 12. Synthesis of Attributed Feature Models: Foundations 12 Synthesis algorithm ● Extract feature, attributes and their domains Features ● Root ● F0 ● F1 ● F2 ● F3 Attributes ● A1 : {0,2,3}, 0 ● A2 : {0,2,8}, 0 root F0 F1 F2 F3 A1 A2 Yes Yes Yes No Yes 3 0 Yes Yes No Yes Yes 2 2 Yes Yes Yes No No 2 0 Yes Yes No Yes No 0 8 ● Compute binary implications root F0 F1 F2 F3 A1 A2 Yes Yes Yes No Yes 3 0 Yes Yes No Yes Yes 2 2 Yes Yes Yes No No 2 0 Yes Yes No Yes No 0 8 ... F2 => ¬ F1 ¬ F2 => F1 F2 => A1 ∈ {0,2} ¬ F2 => A1 ∈ {2,3} F2 => A2 ∈ {2,8} ¬ F2 => A2 ∈ {0} … A1 = 2 => A2 ∈ {0,2}
  • 13. Synthesis of Attributed Feature Models: Foundations 13 Synthesis algorithm ● Define hierarchy (feature tree + place of attributes) Binary implications ... F2 => ¬ F1 ¬ F2 => F1 F2 => A1 ∈ {0,2} ¬ F2 => A1 ∈ {2,3} F2 => A2 ∈ {2,8} ¬ F2 => A2 ∈ {0} ... root F1 F3 F2F0 Binary implication graph (all possible hierarchies) root F1 F3 F2F0 Hierarchy Possible places for attributes ¬ f => ( a = null value of a ) A1 : root, F0 A2 : root, F0, F2 A1 A2
  • 14. Synthesis of Attributed Feature Models: Foundations 14 Synthesis algorithm ● Compute variability information (mandatory features, features groups) – Only based on features – Reuse existing algorithms (She et al., Inform Software Tech, 2014) Xor: {F1, F2} root F1 F3 F2 F0 Mutex graph Solver Binary implications Possible feature groups (Mutex, Or, Xor) root F1 F3 F2F0 A1 A2 + + Binary implication graph Hierarchy+ Mandatory features
  • 15. Synthesis of Attributed Feature Models: Foundations 15 Synthesis algorithm ● Compute readable constraints Readable constraints A1 > 0 => A2 <= 2 A2 < 8 => A1 >= 2 Interesting values for attributes A1 → 0 A2 → 8 Binary implications ... A1 = 0 => A2 ∈ {8} A1 = 2 => A2 ∈ {0,2} A1 = 3 => A2 ∈ {0} A2 = 0 => A1 ∈ {2,3} A2 = 2 => A1 ∈ {2} A2 = 8 => A1 ∈ {0} … Merge constraints Grammar of readable constraints
  • 16. Synthesis of Attributed Feature Models: Foundations 16 User Effort Domain knowledge = ● user input ● default heuristics No user effort Arbitrary choices Fully automated algorithm More user effort => better AFM ● Feature or attribute? ● Interpretation of the cells (“Yes” = true) ● Hierarchy (F3 below F0) ● Overlapping feature groups ● Bounds in constraints (“0” is an interesting value for A1)
  • 17. Synthesis of Attributed Feature Models: Foundations 17 Scalability Random dataset ● Generator of configuration matrices – Number of variables (features + attributes) – Number of configurations – Maximum domain size (number of distinct values in a column) ● Execution time of or-group computation ● 1000 configurations ● max domain size of 10 Timeout always reached with more than 60 variables Or groups do not scale ! = default heuristics only
  • 18. Synthesis of Attributed Feature Models: Foundations 18 Scalability Random dataset ● Execution time (no or-groups) ● Up to 2,000 variables ● 1,000 configurations ● Max domain size of 10 ● 100 variables ● Up to 200,000 configurations ● Max domain size of 10 ● 10 variables ● 10,000 configurations ● Up to 6000 distinct values = default heuristics only On all experiments: Average: 2.6 min Max: 62 min
  • 19. Synthesis of Attributed Feature Models: Foundations 19 Scalability Best Buy dataset Execution time of 2.1s for the most challenging matrix: ● 77 variables ● 185 configurations ● Maximum domain size of 185 Execution time is similar to the random dataset ● 242 matrices ● < 25% of empty cells ● Interpretation of empty cells = default heuristics only
  • 20. Synthesis of Attributed Feature Models: Foundations 20 Conclusion We introduce for the first time an algorithmic and parameterizable approach for synthesizing attributed FMs from product descriptions Three key contributions: ● Semantics of attributed feature models/configuration matrix (over- approximation, equivalence) ● We designed and implemented a tool-supported synthesis algorithm ● We empirically evaluated the scalability of the synthesis algorithm on random and real-world matrices.
  • 21. Synthesis of Attributed Feature Models: Foundations 21 Future Work ● Empirical experiments provide evidence that the number of constraints can be huge – Random dataset: 237 constraints in average, 8906 max – Best Buy dataset: 6821 constraints in average, 28300 max ● How to address the problem? – Minimization, prioritization, user-specified – Combination thereof ● Opencompare.org – More empirical studies on realistic matrices (beyond Best Buy)
  • 22. Synthesis of Attributed Feature Models: Foundations 22 Future Work ● Take a set of constraints as input (rather than a set of configurations) – arbitrary propositional formula – require the use of SMT/CP solvers ● Develop other operations on attributed feature models: aggregate, merge, slice... – Instance of synthesis problem
  • 23. Synthesis of Attributed Feature Models: Foundations 23 Questions? root F0 F1 F2 F3 A1 A2 Yes Yes Yes No Yes 3 0 Yes Yes No Yes Yes 2 2 Yes Yes Yes No No 2 0 Yes Yes No Yes No 0 8 Synthesis root F1 F3 F2F0 A1 A2 A2 < 8 => A1 >= 2 A1 > 0 => A2 <= 2
  • 24. Synthesis of Attributed Feature Models: Foundations 24 Product Lines { "name": "Luke Skywalker", "height": "1.72 m", "mass": "77 Kg", "hair_color": "Blond", "skin_color": "Caucasian", "eye_color": "Blue", "birth_year": "19 BBY", "gender": "Male", "homeworld": "http://swapi.co/api/planets/1/", "films": [ "http://swapi.co/api/films/1/", "http://swapi.co/api/films/2/", "http://swapi.co/api/films/3/" ], ... } The Star Wars API (http://swapi.co/) { "name": "Darth Vader", "height": "202", "mass": "136", "hair_color": "none", "skin_color": "white", "eye_color": "yellow", "birth_year": "41.9BBY", "gender": "male", "homeworld": "http://swapi.co/api/planets/1/", "films": [ "http://swapi.co/api/films/6/", "http://swapi.co/api/films/3/", "http://swapi.co/api/films/2/", "http://swapi.co/api/films/1/" ], ... }