SlideShare une entreprise Scribd logo
1  sur  37
Analyzing Networks of Issue Reports
Markus Borg
Dietmar Pfahl
Per Runeson
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Markus Borg
Dietmar Pfahl Per Runeson
University of Tartu
Estonia
Lund University
Sweden
• Third year PhD student
• MSc CS and engineering
• Software developer (2007-2010)
• Empirical research group
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Agenda
• Background and Context
– Information management
– Safety-critical development
– Impact analysis
• Goal and method of this study
• Results
• Future work
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Background and Context
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Information management
• Large projects, much information
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Challenges
• A state of information overload
– Engineers cannot process all
information
– Causes stress
– Obstructs decision making
• Poor findability
– More effort to navigate information
landscape
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Intensified in safety development
• Safety standards mandate documentation
Railroad Nuclear Process Machinery Automotive
Industry
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Mandated documents in IEC 26262
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Work task: Impact Analysis (IA)
• Required by IEC 61508 before changes to production code
• Studied an industrial case
– Documented
– Reviewed during safety audits
Requirements
Tests
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Work task: Impact Analysis (2)
• Formal template
• Impact on code and non-code
specified as traceability links
• Manual work
IMS
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Supporting the impact analysis?
Work task
?
Reqs. DB
Code
Repo
Test DB
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Reuse knowledge from previous IAs
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Information in networks, so what?
• Search in hyperlinked structures well researched
– Also applied in software engineering (Karabatis et al. (2009))
HITS algorithm
Page et al. (1999)
Kleinberg (1999)
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Networks of issue reports
• What type of networks can we find in issue databases?
?
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Method
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Issue databases under study
• Safety IMS (2000-2012)
– Industrial control system
– Mandated by strict processes
– Issues submitted by engineers
• Android IMS (2007-2012)
– OS for handheld devices
– Open source software
– Issues submitted to public database
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Link mining in the issue databases
• Safety IMS
– ”Related cases” field in database
• Android IMS
– No separate field for linking issues
– Communication using comments
(100,000+)
– Developers refer to other issues,
stored as HTML hyperlinks
• Extracted using regular expressions
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Results
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Extracted network - Overview
• Safety IMS
– 26,120 issue reports
– 18,000 links
– 15,000 components
– 13,000 isolated issue reports
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Extracted network – Close-up
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Extracted network - Overview
• Android IMS
– 20,176 issue reports
– 3,500 links
– 18,000 components
– 17,000 isolated issue reports
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Example of sub-network
Bug star
One central issue
report pointing at
several others
Caused by duplicates
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Example of sub-network
Dense ring
Most issue reports are
connected.
Caused by copy-
paste comments
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Extracted networks
What do developers signal by
creating HTML hyperlinks?
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Link semantics
• Indicate relationships with different certainty
– Related issue report (possibly  probably  definetely)
– Duplicate issue report (possibly  probably  definetely)
– Cloned issue report
• Misc. links
– Raising awareness of issue reports
– Release planning
– Links with the wrong target
• Links appear to carry meaning
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
More recent results
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Contents of IA reports in the Safety IMS
Code
HW description
Misc. documents
Test case
User manual
Test case
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Mining IA reports in the Safety IMS
• ~ 5,000 impact analysis reports
Node types
• Issue reports
• Requirements
• Test specifications
• Hardware descriptions
Link types
• Related issue
• Specified by
• Verified by
• Needs update
• Impacted HW
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Extracted semantic network
• 27,958 nodes
– ~26,000 issue reports
– ~3,000 other artifacts
• 28,230 links
– ~18,000 related issue
– ~4,000 specified by
– ~2,300 verified by
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Extracted semantic network –
Circle layout
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Future work
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
How can the networks be exploited?
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Neighbourhood search
Application 1:
Search for
connected
artifacts
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Centrality measures
Application 2: Identification of key artifacts (ranking)
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Goal: Impact Recommender
1. Identify similar issues
2. Identify neighbours
3. Rank candidates
Far
awayTextual
sim.
High
cent.
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Summary
• Link mining in IMSs can discover complex issue networks
– The process-heavy IMS contains more links
– Links among issue reports, created in comments by Android
developers, typically signal relations
• Networks of issue reports can be extended by other artifacts
• Networked information enables better navigation
- Broaden search (following links)
- Sharpen search (better ranking)
Analyzing networks of issue reports| Borg, Pfahl, and Runeson
Thanks!
?

Contenu connexe

Similaire à Analyzing networks of issue reports

Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Jonas Traub
 

Similaire à Analyzing networks of issue reports (20)

Submit Your Articles- International Journal of Advanced Smart Sensor Network ...
Submit Your Articles- International Journal of Advanced Smart Sensor Network ...Submit Your Articles- International Journal of Advanced Smart Sensor Network ...
Submit Your Articles- International Journal of Advanced Smart Sensor Network ...
 
Active Nets Technology Transfer through High-Performance Network Devices
Active Nets Technology Transfer through High-Performance Network DevicesActive Nets Technology Transfer through High-Performance Network Devices
Active Nets Technology Transfer through High-Performance Network Devices
 
International Journal of Advanced Smart Sensor Network Systems (IJASSN)
International Journal of Advanced Smart Sensor Network Systems (IJASSN)International Journal of Advanced Smart Sensor Network Systems (IJASSN)
International Journal of Advanced Smart Sensor Network Systems (IJASSN)
 
2017 dagstuhl-nfv-rothenberg
2017 dagstuhl-nfv-rothenberg2017 dagstuhl-nfv-rothenberg
2017 dagstuhl-nfv-rothenberg
 
International Journal of Advanced Smart Sensor Network Systems ( IJASSN )
International Journal of Advanced Smart Sensor Network Systems ( IJASSN )International Journal of Advanced Smart Sensor Network Systems ( IJASSN )
International Journal of Advanced Smart Sensor Network Systems ( IJASSN )
 
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
 
SoftQL - Telecom Solutions
SoftQL - Telecom SolutionsSoftQL - Telecom Solutions
SoftQL - Telecom Solutions
 
Splunk for Security: Background & Customer Case Study
Splunk for Security: Background & Customer Case StudySplunk for Security: Background & Customer Case Study
Splunk for Security: Background & Customer Case Study
 
Rich feeds for rescue, palms cyberinfrastructure integration stories
Rich feeds for rescue, palms cyberinfrastructure   integration storiesRich feeds for rescue, palms cyberinfrastructure   integration stories
Rich feeds for rescue, palms cyberinfrastructure integration stories
 
Overview of policies for security and data sharing
Overview of policies for security and data sharingOverview of policies for security and data sharing
Overview of policies for security and data sharing
 
Rich feeds for rescue an integration story
Rich feeds for rescue   an integration storyRich feeds for rescue   an integration story
Rich feeds for rescue an integration story
 
Slides of Chapter 3 network design and management book
Slides of Chapter 3 network design and management bookSlides of Chapter 3 network design and management book
Slides of Chapter 3 network design and management book
 
International Journal of Advanced Smart Sensor Network Systems ( IJASSN )
International Journal of Advanced Smart Sensor Network Systems ( IJASSN )International Journal of Advanced Smart Sensor Network Systems ( IJASSN )
International Journal of Advanced Smart Sensor Network Systems ( IJASSN )
 
Biological networks - building and visualizing
Biological networks - building and visualizingBiological networks - building and visualizing
Biological networks - building and visualizing
 
NWCRG-IAB-Review-IETF91.pdf
NWCRG-IAB-Review-IETF91.pdfNWCRG-IAB-Review-IETF91.pdf
NWCRG-IAB-Review-IETF91.pdf
 
Data Communication & Computer Networks
Data Communication & Computer NetworksData Communication & Computer Networks
Data Communication & Computer Networks
 
Strategy briefing: network technologies 7 March 2013
Strategy briefing: network technologies 7 March 2013Strategy briefing: network technologies 7 March 2013
Strategy briefing: network technologies 7 March 2013
 
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
 
The Internet of Things: an academic perspective
The Internet of Things: an academic perspectiveThe Internet of Things: an academic perspective
The Internet of Things: an academic perspective
 
International Journal of Advanced Smart Sensor Network Systems (IJASSN)free p...
International Journal of Advanced Smart Sensor Network Systems (IJASSN)free p...International Journal of Advanced Smart Sensor Network Systems (IJASSN)free p...
International Journal of Advanced Smart Sensor Network Systems (IJASSN)free p...
 

Plus de Markus Borg

Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Markus Borg
 

Plus de Markus Borg (19)

Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
 
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...
Quality Assurance  Of  Generative Dialog Models in an evolving  Conversationa...Quality Assurance  Of  Generative Dialog Models in an evolving  Conversationa...
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...
 
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
 
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
 
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
 
Trained, Not Coded - Still Safe?
Trained, Not Coded - Still Safe?Trained, Not Coded - Still Safe?
Trained, Not Coded - Still Safe?
 
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
SZZ Unleashed:  An Open Implementation of the SZZ AlgorithmSZZ Unleashed:  An Open Implementation of the SZZ Algorithm
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
 
Explainability First! Cousteauing the Depths of Neural Networks
Explainability First! Cousteauing the Depths of Neural NetworksExplainability First! Cousteauing the Depths of Neural Networks
Explainability First! Cousteauing the Depths of Neural Networks
 
Test Automation Research... Is That Really Needed in 2018?
Test Automation Research... Is That Really Needed in 2018?Test Automation Research... Is That Really Needed in 2018?
Test Automation Research... Is That Really Needed in 2018?
 
Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Supporting Change Impact Analysis Using a Recommendation System - An Industri...Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Supporting Change Impact Analysis Using a Recommendation System - An Industri...
 
Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Component Source Origin Decisions in Practice - A Survey of Decision Making i...Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Component Source Origin Decisions in Practice - A Survey of Decision Making i...
 
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
 
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
 
From Bugs to Decision Support - Selected Research Highlights
From Bugs to Decision Support - Selected Research HighlightsFrom Bugs to Decision Support - Selected Research Highlights
From Bugs to Decision Support - Selected Research Highlights
 
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
 
Automation in the Bug Flow - Machine Learning for Triaging and Tracing
Automation in the Bug Flow - Machine Learning for Triaging and TracingAutomation in the Bug Flow - Machine Learning for Triaging and Tracing
Automation in the Bug Flow - Machine Learning for Triaging and Tracing
 
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
 
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
 
Recommendation Systems for Issue Management
Recommendation Systems for Issue ManagementRecommendation Systems for Issue Management
Recommendation Systems for Issue Management
 

Dernier

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
Sérgio Sacani
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
RohitNehra6
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
University of Hertfordshire
 

Dernier (20)

Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 

Analyzing networks of issue reports

  • 1. Analyzing Networks of Issue Reports Markus Borg Dietmar Pfahl Per Runeson
  • 2. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Markus Borg Dietmar Pfahl Per Runeson University of Tartu Estonia Lund University Sweden • Third year PhD student • MSc CS and engineering • Software developer (2007-2010) • Empirical research group
  • 3. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Agenda • Background and Context – Information management – Safety-critical development – Impact analysis • Goal and method of this study • Results • Future work
  • 4. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Background and Context
  • 5. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Information management • Large projects, much information
  • 6. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Challenges • A state of information overload – Engineers cannot process all information – Causes stress – Obstructs decision making • Poor findability – More effort to navigate information landscape
  • 7. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Intensified in safety development • Safety standards mandate documentation Railroad Nuclear Process Machinery Automotive Industry
  • 8. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Mandated documents in IEC 26262
  • 9. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Work task: Impact Analysis (IA) • Required by IEC 61508 before changes to production code • Studied an industrial case – Documented – Reviewed during safety audits Requirements Tests
  • 10. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Work task: Impact Analysis (2) • Formal template • Impact on code and non-code specified as traceability links • Manual work IMS
  • 11. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Supporting the impact analysis? Work task ? Reqs. DB Code Repo Test DB
  • 12. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Reuse knowledge from previous IAs
  • 13. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Information in networks, so what? • Search in hyperlinked structures well researched – Also applied in software engineering (Karabatis et al. (2009)) HITS algorithm Page et al. (1999) Kleinberg (1999)
  • 14. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Networks of issue reports • What type of networks can we find in issue databases? ?
  • 15. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Method
  • 16. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Issue databases under study • Safety IMS (2000-2012) – Industrial control system – Mandated by strict processes – Issues submitted by engineers • Android IMS (2007-2012) – OS for handheld devices – Open source software – Issues submitted to public database
  • 17. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Link mining in the issue databases • Safety IMS – ”Related cases” field in database • Android IMS – No separate field for linking issues – Communication using comments (100,000+) – Developers refer to other issues, stored as HTML hyperlinks • Extracted using regular expressions
  • 18. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Results
  • 19. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Extracted network - Overview • Safety IMS – 26,120 issue reports – 18,000 links – 15,000 components – 13,000 isolated issue reports
  • 20. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Extracted network – Close-up
  • 21. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Extracted network - Overview • Android IMS – 20,176 issue reports – 3,500 links – 18,000 components – 17,000 isolated issue reports
  • 22. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Example of sub-network Bug star One central issue report pointing at several others Caused by duplicates
  • 23. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Example of sub-network Dense ring Most issue reports are connected. Caused by copy- paste comments
  • 24. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Extracted networks What do developers signal by creating HTML hyperlinks?
  • 25. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Link semantics • Indicate relationships with different certainty – Related issue report (possibly  probably  definetely) – Duplicate issue report (possibly  probably  definetely) – Cloned issue report • Misc. links – Raising awareness of issue reports – Release planning – Links with the wrong target • Links appear to carry meaning
  • 26. Analyzing networks of issue reports| Borg, Pfahl, and Runeson More recent results
  • 27. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Contents of IA reports in the Safety IMS Code HW description Misc. documents Test case User manual Test case
  • 28. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Mining IA reports in the Safety IMS • ~ 5,000 impact analysis reports Node types • Issue reports • Requirements • Test specifications • Hardware descriptions Link types • Related issue • Specified by • Verified by • Needs update • Impacted HW
  • 29. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Extracted semantic network • 27,958 nodes – ~26,000 issue reports – ~3,000 other artifacts • 28,230 links – ~18,000 related issue – ~4,000 specified by – ~2,300 verified by
  • 30. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Extracted semantic network – Circle layout
  • 31. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Future work
  • 32. Analyzing networks of issue reports| Borg, Pfahl, and Runeson How can the networks be exploited?
  • 33. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Neighbourhood search Application 1: Search for connected artifacts
  • 34. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Centrality measures Application 2: Identification of key artifacts (ranking)
  • 35. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Goal: Impact Recommender 1. Identify similar issues 2. Identify neighbours 3. Rank candidates Far awayTextual sim. High cent.
  • 36. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Summary • Link mining in IMSs can discover complex issue networks – The process-heavy IMS contains more links – Links among issue reports, created in comments by Android developers, typically signal relations • Networks of issue reports can be extended by other artifacts • Networked information enables better navigation - Broaden search (following links) - Sharpen search (better ranking)
  • 37. Analyzing networks of issue reports| Borg, Pfahl, and Runeson Thanks! ?

Notes de l'éditeur

  1. Research agenda inspired by the my experiences as a developer.
  2. Knowledge workers spend too much time finding infoFindability definition: “the degree to which a system or environment supports navigation and retrieval”
  3. Document driven environment, strict process requirements
  4. Just as an example…
  5. In a specificindustrialcaseBefore changes to production codeIn the safety audits to strengthen the safety case
  6. IMS = Issue Management System
  7. Multiple targets possible, not a mutual link by default.
  8. All links are directed. Sometimes two issue reports are connected in both ways.
  9. The links are not weighted, a link is established when a comment in an issue report targets another report. Additional hyperlinks do not generate new links.
  10. Duplicates
  11. You even see self linksThe dense link structure, and the self-links, were created when a developer posted the comment “Do we have one ground for many problems?” followed by links to 7 reports. This comment was copy-pasted to all seven reports.
  12. Qualitative analysisDefine related/duplicate/cloneHowever, our findings indicate that a majority of the links express a meaningful relation. One reason might be that it requires some effort by a developer to create a link, thus they appear to be correct.
  13. Information in IA reports is semi-structured. Free text, but answers provided in the context of questions.
  14. Could also extract semantic information
  15. Discovered a significant network structureSemantic information
  16. All artifacts on the perimeter of the circle. Few areas are not intertwined in complex ways.
  17. Note that there is semantic information here as well, represented by the colors here.
  18. As made popular by a big American company in the search business.
  19. When working on a new impact analysis report
  20. To conretize...