NL-based Software Engineering (NLBSE) '23
May 20th, 2023, 09:00 - 17:30 UTC+10
https://nlbse2023.github.io/
@NLBSE_workshop
Sebastiano Panichella Andrea Di Sorbo
Welcome to Melbourne!
Twitter → @NLBSE_workshop
On Zoom all day!
Sebastiano
General format: Hybrid Mode
Participants: Live + Remote participants
Remote participants:
- Check your email for your Zoom invitation
- Please remain muted until it is your turn to speak!
Presentations:
Live presentations
In-person: Room 103
Questions:
After each presentation we have allocated time for Q & A.
Remote participants:
- We encourage you to use the “Raise your hand” option to ask questions.
- Organizers will invite you to turn on your mic to ask (voice) questions.
Everything on Twitch will be recorded and uploaded to YouTube!
Participation (Paper Authors)
Participation (PC Members)
(50% Women)
Thanks to our sponsors!
IEEE Technical Community on Software Engineering
(TCSE)
ACM Special Interest Group on Software Engineering
(SIGSOFT)
Schedule (UTC+10)
09:10 → Opening & Awards
09:30 → Live Keynote: Automated Bug Management: Reflections and the Road Ahead by David Lo
10:30 → Break
11:00 → Live Keynote: Trends and Opportunities in the Application of Large Language Models: the Quest for Maximum Effect by Albert Ziegler
12:00 → Session: Position Papers
12:30 → Lunch
13:45 → Tool Competition (overview + 5 live presentations + closing) → Tool Award session
15:15 → Break
15:45 → Session: Research Papers
17:15 → Closing and Award session
https://conf.researchr.org/program/icse-2023/program-icse-2023/ https://nlbse2023.github.io/
Keynote (09:30)
Automated Bug Management: Reflections and the
Road Ahead
For many projects, bug reports, predominantly written in natural
language, are submitted daily to issue tracking systems. The
number of such reports is often too large for busy software
engineers to manually handle and eventually resolve in a timely
fashion. Also, the resolution of each report often requires many
steps, e.g., detecting invalid reports, assigning the reports to
engineers with the right expertise, finding the buggy files requiring
changes, fixing the buggy files, etc. Incorrect decisions made for
any of these steps can slow down the resolution of the bug report.
To help reduce engineers’ workload and improve the reliability of
systems, in the last decade, many automated solutions have been
proposed for various steps in the bug management and resolution
process. This talk will first reflect on the hundreds of studies
done in this popular area of Natural-Language Based Software
Engineering (NLBSE), highlighting success cases and the explored
directions. It will then highlight interesting future work in the road
ahead, describing important unsolved problems and untapped
opportunities.
David Lo
Bio:
Professor of Computer Science at the School of
Computing and Information Systems, Singapore
Management University. He leads the SOftware
Analytics Research (SOAR) group. His research
interest is in the intersection of software
engineering, cybersecurity, and data science,
encompassing socio-technical aspects and
analysis of different kinds of software artifacts.
He has won more than 15 international research
and service awards, including 2 Most Influential
Paper Awards and 6 ACM SIGSOFT
Distinguished Paper Awards. He is currently
serving on the SIGSOFT Executive Committee,
Editorial Boards of TSE, TRel, and EMSE, and as a
PC Co-Chair of ESEC/FSE 2024 and ICSE 2025.
Break (10:30 – 11:00)
Keynote (11:00)
Trends and Opportunities in the Application of Large
Language Models: the Quest for Maximum Effect
As large language models become more and more sophisticated,
the machine learning problem "How to train a great new model so it
best solves my task" increasingly pivots to "How to run a great
existing model so it best solves my task". This is easier said than
done and requires reconciliation of four goals:
1. How to communicate the problem and the format in which
you expect your answer;
2. How to communicate all background information the
model might need to arrive at that answer;
3. How to communicate with the model robustly, in particular
in a way that it is used to from its training set;
4. How to keep the question short in order to adhere to the
context window and save computing time and cost.
The talk discusses strategies for addressing each of these goals in
the code domain, as well as methods for balancing them against
each other. The keynote will in particular focus on the example of
GitHub Copilot and related AI for software development projects.
Albert Ziegler
Bio:
Principal machine learning engineer with a
background in Mathematics and a home at
GitHub Next, GitHub's innovation and
future group. His main interests are
combinations of deductive and intuitive
reasoning to improve the software
development experience. He's previously
worked on developer productivity, ML-guided
CodeQL, and he was part of the trio
that conceived and then implemented the
GitHub Copilot project. His most recent
projects include Copilot Radar and AI for
Pull Requests.
Research papers (2 sessions)
Full papers (20 minutes):
- 15 minutes for talk
- 5 minutes for questions
Short papers (15 minutes):
- 10 minutes for talk
- 5 minutes for questions
Position papers (15 minutes):
- 8 minutes for talk
- 7 minutes for questions
Position papers: schedule (12:00)
The (Ab)use of Open Source Code to Train Language Models (position)
Ali Al-Kaswan and Maliheh Izadi Delft University of Technology
Exploring Generalizability of NLP-based Models for Modern Software Development Cross-Domain Environments (position - online)
Rrezarta Krasniqi and Hyunsook Do University of North Texas
Lunch (12:30)
Tool competition (13:45)
Competition reports:
- 10 minutes per paper (inclusive)
Tool Competition Co-chairs
Tool Chairs
Tool Competition schedule
Opening & Issue Report Classification Competition
Rafael Kallis [1], Maliheh Izadi [2], Pooja Rani [3], Luca Pascarella [4], Oscar Chaparro [5]
[1] Rafael Kallis Consulting, [2] Delft University of Technology, [3] University of Zurich, [4] ETH Zurich, [5] College of William and Mary
An Intelligent Tool for Classifying Issue Reports
Muhammad Laiq Blekinge Institute of Technology
Few-Shot Learning for Issue Report Classification
Giuseppe Colavito, Filippo Lanubile, Nicole Novielli University of Bari
Code Comment Classification Competition
Pooja Rani [1], Luca Pascarella [2], Oscar Chaparro [3]
[1] University of Zurich, [2] ETH Zurich, [3] College of William and Mary
Performance Comparison of Binary Machine Learning Classifiers in Identifying Code Comment Types: An Exploratory Study
Amila Indika, Peter Y. Washington and Anthony Peruma University of Hawaiʻi at Mānoa
Classifying Code Comments via Pre-trained Programming Language Model
Ying Li, Haibo Wang, Huaien Zhang and Shin Hwei Tan Southern University of Science and Technology
STACC: Code Comment Classification using Sentence Transformers
Ali Al-Kaswan, Maliheh Izadi and Arie van Deursen Delft University of Technology
Closing
Rafael Kallis [1], Maliheh Izadi [2], Pooja Rani [3], Luca Pascarella [4], Oscar Chaparro [5]
[1] Rafael Kallis Consulting, [2] Delft University of Technology, [3] University of Zurich, [4] ETH Zurich, [5] College of William and Mary
Tool Chairs
Break (15:15 – 15:45)
Research papers: schedule (15:45)
An Exploratory Study on the Usage and Readability of Messages within Assertion Methods of Test Cases (full - online)
Taryn Takebayashi [1], Anthony Peruma [1], Mohamed Weim Mkaouer [2] and Christian Newman [2]
[1] University of Hawai‘i at Mānoa, [2] Rochester Institute of Technology
Stop Words for Processing Software Engineering Documents: Do they Matter? (full)
Yaohou Fan [1], Chetan Arora [2] and Christoph Treude [1]
[1] University of Melbourne, [2] Monash University
Applying Information Theory to Software Evolution (full)
Adriano Torres [1], Sebastian Baltes [1], Christoph Treude [2] and Markus Wagner [3]
[1] University of Adelaide, [2] University of Melbourne, [3] Monash University
Zero-shot Prompting for Code Complexity Prediction Using GitHub Copilot (short - online)
Mohammed Latif Siddiq [1], Abdus Samee [2], Sk Ruhul Azgor [2], Md. Asif Haider [2], Shehabul Islam Sawraz [2] and Joanna Cecilia da Silva Santos [1]
[1] University of Notre Dame, [2] Bangladesh University of Engineering and Technology
Evaluating Code Comment Generation with Summarized API Docs (short - online)
Bilel Matmti and Fatemeh Fard University of British Columbia
Closing (17:15)
25 registered participants specifically for NLBSE
12 (unique) participants attending virtually
+
40 (unique) attending physically at NLBSE
Thanks to: All of you
for joining!
Thanks to: Our Keynote Speakers
for giving stimulating, enlightening, and instructive talks!
Thanks to: the Tool Competition Co-chairs
Rafael Kallis Maliheh Izadi Pooja Rani Luca Pascarella Oscar Chaparro
for organizing two exciting and relevant tool competitions!
Thanks to: our Web Chair
Arnaldo Sgueglia
for his support with the website and virtualization!
Thanks to: the Program Committee members
for their support in reviewing papers!
Thanks to: Student Volunteers
Christian Birchler Sajad Khatiri
for their help with technical duties and virtualization!
What’s Next?
Special issue at Science of Computer Programming 2023:
“NLBSE’23: Natural Language-based Software to
Support Software Engineering Processes”
Open Call!
Short papers with a strong focus on software and replication packages
Submission date: November 1st, 2023
Recordings of our workshop will be made available on the webpage.
What’s Next?
• Coordinate with similar workshops (e.g., NLP-SEA, NLP4RE) in other SE venues
to continuously foster research in the field.
• Involve more industrial subjects and practitioners.
• Promote discussion around current and relevant themes (e.g., AI language
models) and new competitions in other relevant NLBSE areas.
• Encourage the design, implementation, and public availability of usable and
high-quality tools to deal with NLBSE-related challenges.
• We are generally open to ideas or new NLBSE tool competitions/challenges
(contact us)!
Thank you all for participating!
See you next year in Lisbon
at NLBSE 2024!

The 2nd Intl. Workshop on NL-based Software Engineering
