SlideShare une entreprise Scribd logo
1  sur  18
Merging Computer Log Files for Process Mining:An Artificial Immune System Technique Jan Claes and Geert Poels http://processmining.ugent.be
Process Mining Processes are supported by IT systems IT systems record actual process data Process data can be used to Discover process model Check conformance with existing process info Improve or extend existing process model Attention Only As-Is Only (correctly) recorded information Process Mining
Keynote BPI 2010, Michael Zur Muehlen Process Controlling Business Activity Monitoring Process Intelligence Event Detection & Correlation Decision Making Main focus point of current BPI research Deserves more focus in BPI research BPI 2010, Keynote Michael Zur Muehlen http://www.slideshare.net/mzurmuehlen/bu-5236080
Preparation Collect data: find event information Merge data: from different sources Structure data: group per instance Convert data: to tool specific format Process mining Make decisions, take action Process Mining steps M M M A M A A M M Manual task	Analysts needed in most cases A Automated task	Less human involvement needed
Merging log files My research:Merging log files
Merging log files 2. Merge chronologically 1. Find links 3. Add unlinked traces 4. Put in new log file
Find links Required properties of solution Finds traces in both log files that belong to the same process execution Without prior knowledge about the provided log files (as generic as possible) But with maximal possibilities for the (expert) user to include his knowledge about the log files
Find links Proposed solution Take the best possible guess based on assumptions Include multiple indicator factors in analysis Calculate factor scores for each analysed solution Combine factor scores into global score per solution ‘Best guess’ is solution with highest combined score,because based on assumed indicators, most indicator value points to this solution Provide user interaction possibilities
Decisions to make Which indicator factors? How to calculate a score for each factor? How to combine factor scores to global score? Which solutions to analyse?(analyse = calculate & compare scores) Which user interactions to include (expert) user knowledge? See paper for more details
Indicator factors Same trace identifier Assumption: If both logs contain a trace with the same id, there is a very high chance they match Not always though (e.g. customer id vs. order id) 16 10 17 12 18 14 19 16 20 18 21 20
Indicator factors Equal attribute values Assumption: The more attributes of a trace and its events from both logs are equal, the higher the chance they match 16 JAN 12:00 JC 14 14:00 17 17 JAN 12:10 JC 15 14:10 18 18 JAN 12:20 JC 16 14:20 19 19 JAN 12:30 JC 17 14:30 1A 20 JAN 12:40 JC 18 14:40 1B 21 JAN 12:50 JC 19 14:50 1C
Indicator factors Extra trace & Missing trace Assumption: A trace from one log has more chance to match with only one trace from the other log Extra trace: Negative if trace is linked with multiple traces in other log Missing trace: Negative if trace is not linked
Indicator factors Time difference Assumption: For a certain trace t in one log the trace in the other log that starts sooner after t has a higher chance to match More difficult when traces overlap 16 17 17 JAN 12:00 JC 10 11:45 18 18 JAN 12:10 JC 11 11:55 19 19 JAN 12:20 JC 12 12:05 1A 20 JAN 12:30 JC 13 12:15 1B JAN 12:40 JC 14 12:25 21 1C JAN 12:50 JC 15 12:35
User interaction ,[object Object]
Step 2	give feedback on individual scores:	user can change weights and restart? Step 3	present best solution per factor:	let user choose which factor dominates	based on factor score feedback ? Step 4	provide other ways for user to feed 	algorithm with his insights
Test results Simulated data (300-400 msec on standard laptop) Benefit of controllable parameters, known solution Correct number of linked traces in all tests Perfect results for same trace id and up to 50% noise, worse results for higher overlap of traces ,[object Object],Correct number of linked traces in all tests Almost perfect results for same trace id and up to 50% noise, worse results for higher overlap
Further research plans Refining merging technique Quest for optimal indicators and weights is continuous effort (based on experiences from case studies) Implementation optimisation (speed, memory usage, scalability) is continuous effort Validation (case studies)
Questions Do you agree that combined set of logical assumptions can be strong indicator (stronger than individual assumptions)? Any feedback on the used factors? Any other factors that should be included? Any concerns about performance and scalability?

Contenu connexe

En vedette

TAProViz@BPM2012
TAProViz@BPM2012TAProViz@BPM2012
TAProViz@BPM2012Jan Claes
 
COGNISE@CAiSE2013
COGNISE@CAiSE2013COGNISE@CAiSE2013
COGNISE@CAiSE2013Jan Claes
 
INISET@CAiSE 2011
INISET@CAiSE 2011INISET@CAiSE 2011
INISET@CAiSE 2011Jan Claes
 
BPMS2@BPM2016
BPMS2@BPM2016BPMS2@BPM2016
BPMS2@BPM2016Jan Claes
 

En vedette (7)

EIS 2012
EIS 2012EIS 2012
EIS 2012
 
TAProViz@BPM2012
TAProViz@BPM2012TAProViz@BPM2012
TAProViz@BPM2012
 
EIS 2011
EIS 2011EIS 2011
EIS 2011
 
COGNISE@CAiSE2013
COGNISE@CAiSE2013COGNISE@CAiSE2013
COGNISE@CAiSE2013
 
INISET@CAiSE 2011
INISET@CAiSE 2011INISET@CAiSE 2011
INISET@CAiSE 2011
 
BPM2012
BPM2012BPM2012
BPM2012
 
BPMS2@BPM2016
BPMS2@BPM2016BPMS2@BPM2016
BPMS2@BPM2016
 

Similaire à BPI@BPM2011

Data and Processes: Can we Marry Them . . . and Make the Marriage Last?
Data and Processes: Can we Marry Them . . . and Make the Marriage Last?Data and Processes: Can we Marry Them . . . and Make the Marriage Last?
Data and Processes: Can we Marry Them . . . and Make the Marriage Last?INRIA-CEDAR
 
Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
Summary of: "Automating Data Preparation: Can We? Should We? Must We?"Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
Summary of: "Automating Data Preparation: Can We? Should We? Must We?"SamueleBertollo1
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?BalaBit
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptxamitparashar42
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptxamitparashar42
 
Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...
Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...
Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...Universitas Bhayangkara Jakarta Raya
 
Examples of working with streaming data
Examples of working with streaming dataExamples of working with streaming data
Examples of working with streaming dataYi-Shin Chen
 
Machine learning at b.e.s.t. summer university
Machine learning  at b.e.s.t. summer universityMachine learning  at b.e.s.t. summer university
Machine learning at b.e.s.t. summer universityLászló Kovács
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentationnirvdrum
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...ijdpsjournal
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...ijdpsjournal
 
Machine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AEMachine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AEbutest
 
information retrival evaluation.ppt
information retrival evaluation.pptinformation retrival evaluation.ppt
information retrival evaluation.pptBonnieKabiru
 
Data preparation and processing chapter 2
Data preparation and processing chapter  2Data preparation and processing chapter  2
Data preparation and processing chapter 2Mahmoud Alfarra
 
Delivering BAM & BPM With Run-Time Integration
Delivering BAM & BPM With Run-Time IntegrationDelivering BAM & BPM With Run-Time Integration
Delivering BAM & BPM With Run-Time IntegrationNathaniel Palmer
 
ICT741 Digital Forensics.docx
ICT741 Digital Forensics.docxICT741 Digital Forensics.docx
ICT741 Digital Forensics.docxwrite4
 
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERFASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERiQHub
 
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...ISCRAM Events
 

Similaire à BPI@BPM2011 (20)

Data and Processes: Can we Marry Them . . . and Make the Marriage Last?
Data and Processes: Can we Marry Them . . . and Make the Marriage Last?Data and Processes: Can we Marry Them . . . and Make the Marriage Last?
Data and Processes: Can we Marry Them . . . and Make the Marriage Last?
 
Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
Summary of: "Automating Data Preparation: Can We? Should We? Must We?"Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 
Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...
Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...
Comparative Analysis of Naive Bayes and Decision Tree Algorithms in Data Mini...
 
Examples of working with streaming data
Examples of working with streaming dataExamples of working with streaming data
Examples of working with streaming data
 
Machine learning at b.e.s.t. summer university
Machine learning  at b.e.s.t. summer universityMachine learning  at b.e.s.t. summer university
Machine learning at b.e.s.t. summer university
 
-linkedin
-linkedin-linkedin
-linkedin
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
 
Machine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AEMachine Learning for automated diagnosis of distributed ...AE
Machine Learning for automated diagnosis of distributed ...AE
 
information retrival evaluation.ppt
information retrival evaluation.pptinformation retrival evaluation.ppt
information retrival evaluation.ppt
 
Data preparation and processing chapter 2
Data preparation and processing chapter  2Data preparation and processing chapter  2
Data preparation and processing chapter 2
 
Delivering BAM & BPM With Run-Time Integration
Delivering BAM & BPM With Run-Time IntegrationDelivering BAM & BPM With Run-Time Integration
Delivering BAM & BPM With Run-Time Integration
 
Apps World 2015 Berlin
Apps World 2015 BerlinApps World 2015 Berlin
Apps World 2015 Berlin
 
ICT741 Digital Forensics.docx
ICT741 Digital Forensics.docxICT741 Digital Forensics.docx
ICT741 Digital Forensics.docx
 
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFERFASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
FASTER PROCESS DEVELOPMENT WITH HYBRID MODELING AND KNOWLEDGE TRANSFER
 
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
 

Plus de Jan Claes

COGNISE@CAiSE 2019
COGNISE@CAiSE 2019COGNISE@CAiSE 2019
COGNISE@CAiSE 2019Jan Claes
 
BPMS2@BPM2018
BPMS2@BPM2018BPMS2@BPM2018
BPMS2@BPM2018Jan Claes
 
EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018Jan Claes
 
BPM Cluster Meeting 2018
BPM Cluster Meeting 2018BPM Cluster Meeting 2018
BPM Cluster Meeting 2018Jan Claes
 
Research: Why? What? How?
Research: Why? What? How?Research: Why? What? How?
Research: Why? What? How?Jan Claes
 
TEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD ContestTEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD ContestJan Claes
 
PhD defense November 2015
PhD defense November 2015PhD defense November 2015
PhD defense November 2015Jan Claes
 
PhD pre-defense September 2015
PhD pre-defense September 2015PhD pre-defense September 2015
PhD pre-defense September 2015Jan Claes
 
UGent MIS research seminar June 2015
UGent MIS research seminar June 2015UGent MIS research seminar June 2015
UGent MIS research seminar June 2015Jan Claes
 
UGent MIS research seminar December 2014
UGent MIS research seminar December 2014UGent MIS research seminar December 2014
UGent MIS research seminar December 2014Jan Claes
 
BPM Cluster Meeting 2014
BPM Cluster Meeting 2014BPM Cluster Meeting 2014
BPM Cluster Meeting 2014Jan Claes
 
PhD Day 2014
PhD Day 2014PhD Day 2014
PhD Day 2014Jan Claes
 
Colloquium@TUe
Colloquium@TUeColloquium@TUe
Colloquium@TUeJan Claes
 
PhD Day 2013
PhD Day 2013PhD Day 2013
PhD Day 2013Jan Claes
 
Stad Gent 2012
Stad Gent 2012Stad Gent 2012
Stad Gent 2012Jan Claes
 
Confenis 2012
Confenis 2012Confenis 2012
Confenis 2012Jan Claes
 
Confenis2012DC
Confenis2012DCConfenis2012DC
Confenis2012DCJan Claes
 

Plus de Jan Claes (20)

COGNISE@CAiSE 2019
COGNISE@CAiSE 2019COGNISE@CAiSE 2019
COGNISE@CAiSE 2019
 
BPMS2@BPM2018
BPMS2@BPM2018BPMS2@BPM2018
BPMS2@BPM2018
 
ICLTC 2018
ICLTC 2018ICLTC 2018
ICLTC 2018
 
EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018
 
BPM Cluster Meeting 2018
BPM Cluster Meeting 2018BPM Cluster Meeting 2018
BPM Cluster Meeting 2018
 
Research: Why? What? How?
Research: Why? What? How?Research: Why? What? How?
Research: Why? What? How?
 
TEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD ContestTEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD Contest
 
PhD defense November 2015
PhD defense November 2015PhD defense November 2015
PhD defense November 2015
 
PhD pre-defense September 2015
PhD pre-defense September 2015PhD pre-defense September 2015
PhD pre-defense September 2015
 
UGent MIS research seminar June 2015
UGent MIS research seminar June 2015UGent MIS research seminar June 2015
UGent MIS research seminar June 2015
 
UGent MIS research seminar December 2014
UGent MIS research seminar December 2014UGent MIS research seminar December 2014
UGent MIS research seminar December 2014
 
BPM Cluster Meeting 2014
BPM Cluster Meeting 2014BPM Cluster Meeting 2014
BPM Cluster Meeting 2014
 
PhD Day 2014
PhD Day 2014PhD Day 2014
PhD Day 2014
 
Colloquium@TUe
Colloquium@TUeColloquium@TUe
Colloquium@TUe
 
ECIS2013DC
ECIS2013DCECIS2013DC
ECIS2013DC
 
PhD Day 2013
PhD Day 2013PhD Day 2013
PhD Day 2013
 
Stad Gent 2012
Stad Gent 2012Stad Gent 2012
Stad Gent 2012
 
Confenis 2012
Confenis 2012Confenis 2012
Confenis 2012
 
Confenis2012DC
Confenis2012DCConfenis2012DC
Confenis2012DC
 
BPI@BPM2012
BPI@BPM2012BPI@BPM2012
BPI@BPM2012
 

Dernier

Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Delhi Call girls
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876dlhescort
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...Paul Menig
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdfRenandantas16
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageMatteo Carbone
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsMichael W. Hawkins
 
Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Roland Driesen
 
It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayNZSG
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒anilsa9823
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMANIlamathiKannappan
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Neil Kimberley
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Serviceritikaroy0888
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Monthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptxMonthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptxAndy Lambert
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 DelhiCall Girls in Delhi
 

Dernier (20)

Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
Best VIP Call Girls Noida Sector 40 Call Me: 8448380779
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usage
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael Hawkins
 
Forklift Operations: Safety through Cartoons
Forklift Operations: Safety through CartoonsForklift Operations: Safety through Cartoons
Forklift Operations: Safety through Cartoons
 
Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...
 
It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 May
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMAN
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
 
Monthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptxMonthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptx
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
 

BPI@BPM2011

  • 1. Merging Computer Log Files for Process Mining:An Artificial Immune System Technique Jan Claes and Geert Poels http://processmining.ugent.be
  • 2. Process Mining Processes are supported by IT systems IT systems record actual process data Process data can be used to Discover process model Check conformance with existing process info Improve or extend existing process model Attention Only As-Is Only (correctly) recorded information Process Mining
  • 3. Keynote BPI 2010, Michael Zur Muehlen Process Controlling Business Activity Monitoring Process Intelligence Event Detection & Correlation Decision Making Main focus point of current BPI research Deserves more focus in BPI research BPI 2010, Keynote Michael Zur Muehlen http://www.slideshare.net/mzurmuehlen/bu-5236080
  • 4. Preparation Collect data: find event information Merge data: from different sources Structure data: group per instance Convert data: to tool specific format Process mining Make decisions, take action Process Mining steps M M M A M A A M M Manual task Analysts needed in most cases A Automated task Less human involvement needed
  • 5. Merging log files My research:Merging log files
  • 6. Merging log files 2. Merge chronologically 1. Find links 3. Add unlinked traces 4. Put in new log file
  • 7. Find links Required properties of solution Finds traces in both log files that belong to the same process execution Without prior knowledge about the provided log files (as generic as possible) But with maximal possibilities for the (expert) user to include his knowledge about the log files
  • 8. Find links Proposed solution Take the best possible guess based on assumptions Include multiple indicator factors in analysis Calculate factor scores for each analysed solution Combine factor scores into global score per solution ‘Best guess’ is solution with highest combined score,because based on assumed indicators, most indicator value points to this solution Provide user interaction possibilities
  • 9. Decisions to make Which indicator factors? How to calculate a score for each factor? How to combine factor scores to global score? Which solutions to analyse?(analyse = calculate & compare scores) Which user interactions to include (expert) user knowledge? See paper for more details
  • 10. Indicator factors Same trace identifier Assumption: If both logs contain a trace with the same id, there is a very high chance they match Not always though (e.g. customer id vs. order id) 16 10 17 12 18 14 19 16 20 18 21 20
  • 11. Indicator factors Equal attribute values Assumption: The more attributes of a trace and its events from both logs are equal, the higher the chance they match 16 JAN 12:00 JC 14 14:00 17 17 JAN 12:10 JC 15 14:10 18 18 JAN 12:20 JC 16 14:20 19 19 JAN 12:30 JC 17 14:30 1A 20 JAN 12:40 JC 18 14:40 1B 21 JAN 12:50 JC 19 14:50 1C
  • 12. Indicator factors Extra trace & Missing trace Assumption: A trace from one log has more chance to match with only one trace from the other log Extra trace: Negative if trace is linked with multiple traces in other log Missing trace: Negative if trace is not linked
  • 13. Indicator factors Time difference Assumption: For a certain trace t in one log the trace in the other log that starts sooner after t has a higher chance to match More difficult when traces overlap 16 17 17 JAN 12:00 JC 10 11:45 18 18 JAN 12:10 JC 11 11:55 19 19 JAN 12:20 JC 12 12:05 1A 20 JAN 12:30 JC 13 12:15 1B JAN 12:40 JC 14 12:25 21 1C JAN 12:50 JC 15 12:35
  • 14.
  • 15. Step 2 give feedback on individual scores: user can change weights and restart? Step 3 present best solution per factor: let user choose which factor dominates based on factor score feedback ? Step 4 provide other ways for user to feed algorithm with his insights
  • 16.
  • 17. Further research plans Refining merging technique Quest for optimal indicators and weights is continuous effort (based on experiences from case studies) Implementation optimisation (speed, memory usage, scalability) is continuous effort Validation (case studies)
  • 18. Questions Do you agree that combined set of logical assumptions can be strong indicator (stronger than individual assumptions)? Any feedback on the used factors? Any other factors that should be included? Any concerns about performance and scalability?
  • 19. Contact information Jan Claes jan.claes@ugent.be http://processmining.ugent.be Twitter: @janclaesbelgium