Skip to content

Commit 4d77ca2

Browse files
Updating guide for v1.7
1 parent c9be334 commit 4d77ca2

15 files changed

Lines changed: 328 additions & 27 deletions

jbpt-pm/entropia/guide/bib.bib

Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -60,3 +60,30 @@ @inproceedings{KalenkovaP2020
6060
year = {2020},
6161
note = {{In Press}}
6262
}
63+
64+
@inproceedings{Polyvyanyy2022Bootstrapping,
65+
author = {Polyvyanyy, Artem and Moffat, Alistair and Garc{\'{\i}}a{-}Ba{\~{n}}uelos, Luciano},
66+
title = {Bootstrapping Generalization of Process Models Discovered From Event Data},
67+
booktitle = caise,
68+
pages = {36--54},
69+
hidepublisher = {Springer},
70+
year = {2022}
71+
}
72+
73+
@inproceedings{Kabierski2023Addressing,
74+
title = {Addressing the Log Representativeness Problem using Species Discovery},
75+
author = {Kabierski, Martin and Richter, Markus and Weidlich, Matthias},
76+
booktitle = icpm,
77+
pages = {65--72},
78+
year = {2023},
79+
hideorganization = {IEEE},
80+
}
81+
82+
@InProceedings{Karunaratne2024Role,
83+
author = {Anandi Karunaratne and Artem Polyvyanyy and Alistair Moffat},
84+
booktitle = icpm,
85+
title = {The Role of Log Representativeness in Estimating Generalization in Process Mining},
86+
year = {2024},
87+
publisher = {{IEEE}},
88+
note = {{to appear}},
89+
}
22.7 KB
Binary file not shown.
21.2 KB
Binary file not shown.
21.2 KB
Binary file not shown.

jbpt-pm/entropia/guide/guide.tex

Lines changed: 19 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -2,22 +2,23 @@
22
\input{include}
33
\input{style}
44

5-
\title{\textbf{Entropia 1.5 \\ \large User's Guide}}
6-
\author[1]{Artem Polyvyanyy}
7-
\author[1]{Hanan Alkhammash}
8-
\author[2]{Claudio Di Ciccio}
9-
\author[3]{Luciano García-Bañuelos}
10-
\author[1]{Anna Kalenkova}
11-
\author[4]{Sander J. J. Leemans}
12-
\author[5]{Jan Mendling}
13-
\author[1]{Alistair Moffat}
14-
\author[6]{Matthias Weidlich}
15-
\affil[1]{The University of Melbourne \authorcr {\tt {\{artem.polyvyanyy,anna.kalenkova,ammoffat\}@unimelb.edu.au}}\\ {\tt halkhammash@student.unimelb.edu.au}\vspace{1.3ex}}
16-
\affil[2]{Sapienza University of Rome, \authorcr {\tt {diciccio@di.uniroma1.it}\vspace{1.3ex}}}
17-
\affil[3]{Tecnológico de Monterrey, \authorcr {\tt {luciano.garcia@tec.mx}\vspace{1.3ex}}}
18-
\affil[4]{Queensland University of Technology, \authorcr {\tt {s.leemans@qut.edu.au}\vspace{1.3ex}}}
19-
\affil[5]{Vienna University of Economics and Business, \authorcr {\tt {jan.mendling@wu.ac.at}\vspace{1.3ex}}}
20-
\affil[6]{Humboldt-Universität zu Berlin, \authorcr {\tt {matthias.weidlich@hu-berlin.de}}}
5+
\title{\textbf{Entropia 1.7 \\ \large User's Guide}}
6+
%\author[1]{Artem Polyvyanyy}
7+
\author[1]{Hanan Alkhammash\thanks{Documentation completed for version 1.5 on August 22, 2020.}}
8+
%\author[2]{Claudio Di Ciccio}
9+
%\author[3]{Luciano García-Bañuelos}
10+
%\author[1]{Anna Kalenkova}
11+
%\author[4]{Sander J. J. Leemans}
12+
%\author[5]{Jan Mendling}
13+
%\author[1]{Alistair Moffat}
14+
%\author[6]{Matthias Weidlich}
15+
\author[1]{Anandi Karunaratne\thanks{Documentation updated for version 1.7 on \today.}}
16+
\affil[1]{The University of Melbourne \authorcr {\tt {\{halkhammash, anandik\}@student.unimelb.edu.au}}}
17+
%\affil[2]{Sapienza University of Rome, \authorcr {\tt {diciccio@di.uniroma1.it}\vspace{1.3ex}}}
18+
%\affil[3]{Tecnológico de Monterrey, \authorcr {\tt {luciano.garcia@tec.mx}\vspace{1.3ex}}}
19+
%\affil[4]{Queensland University of Technology, \authorcr {\tt {s.leemans@qut.edu.au}\vspace{1.3ex}}}
20+
%\affil[5]{Vienna University of Economics and Business, \authorcr {\tt {jan.mendling@wu.ac.at}\vspace{1.3ex}}}
21+
%\affil[6]{Humboldt-Universität zu Berlin, \authorcr {\tt {matthias.weidlich@hu-berlin.de}}}
2122
\date{}
2223

2324

@@ -31,6 +32,8 @@
3132
\input{sec_02}
3233
\input{sec_03}
3334
\input{sec_04}
35+
\input{sec_05}
36+
\input{sec_06}
3437

3538
\newpage
3639
\bibliographystyle{plain}

jbpt-pm/entropia/guide/intro.tex

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,4 +15,6 @@
1515
\item[1.] Stochastic Precision and Recall \cite{Leemans2020}; and
1616
\item[2.] Entropic Relevance \cite{abs-2007-09310}.
1717
\end{enumerate}
18+
\item Bootstrap Generalization: A section detailing the \textit{Entropia} commands and processes for applying the Bootstrap Generalization approach to assess the generalization of a discovered process model described as a directly-follows-graph (DFG).
19+
\item Log Representativeness Measures: Learn about the measures used to evaluate log representativeness and how to apply them using \textit{Entropia}, ensuring that your logs adequately reflect the underlying process.
1820
\end{enumerate}
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
Computing bootstrap generalization.
2+
The technique is described in:
3+
Artem Polyvyanyy, Alistair Moffat, Luciano Garcia-Bonuelos. Bootstrapping
4+
Generalization of Process Models Discovered from Event Data. CAiSE 2022
5+
6+
===========================Calculating generalization===========================
7+
8+
Sample Size = 264
9+
Number of Log Generations = 16
10+
Crossover Subtrace Length = 2
11+
Breeding Probability = 1.0
12+
Threshold for confidence interval of bootstrap samples = 0.01
13+
14+
Model-log precision and recall calculated for bootstrap sample 1: 0.8640223021976667, 0.9481225021326511
15+
Model-log precision and recall calculated for bootstrap sample 2: 0.846835226895293, 0.9298562086663995
16+
Model-log precision and recall calculated for bootstrap sample 3: 0.8789360974064723, 0.9703714487638393
17+
Model-log precision and recall calculated for bootstrap sample 4: 0.8640223021901551, 0.9481512297480506
18+
Model-log precision and recall calculated for bootstrap sample 5: 0.8399359447535413, 0.9273141266860363
19+
Model-log precision and recall calculated for bootstrap sample 6: 0.8468352269011715, 0.9418416732973515
20+
Model-log precision and recall calculated for bootstrap sample 7: 0.8789360974064715, 0.9377155545915118
21+
Model-log precision and recall calculated for bootstrap sample 8: 0.8789360974064718, 0.9426623519201563
22+
Model-log precision and recall calculated for bootstrap sample 9: 0.8789360974064719, 0.9377155545915125
23+
Model-log precision and recall calculated for bootstrap sample 10: 0.8640223021081593, 0.9368237287963496
24+
Model-log precision and recall calculated for bootstrap sample 11: 0.8789360974064686, 0.9528753340039785
25+
26+
===========================Calculated generalization============================
27+
28+
Generalization calculated in 6296 ms with 11 samples.
29+
Model-system precision: 0.8654867083707584 +/- 0.009628352340204586
30+
Model-system recall: 0.9430408830179853 +/- 0.007616113385628391
Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
> java -jar jbpt-pm-entropia-1.7.jar -bgen -rel=examples/log3.xes -ret=examples/model3.json -s -m=1000 -ep=0.005
2+
0.8646947781195624, 0.9443517823270182
Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,39 @@
1+
Computing bootstrap generalization.
2+
The technique is described in:
3+
Artem Polyvyanyy, Alistair Moffat, Luciano Garcia-Bonuelos. Bootstrapping
4+
Generalization of Process Models Discovered from Event Data. CAiSE 2022
5+
6+
===========================Calculating generalization===========================
7+
8+
Sample Size = 1000
9+
Number of Samples = 20
10+
Number of Log Generations = 16
11+
Crossover Subtrace Length = 2
12+
Breeding Probability = 0.5
13+
14+
Model-log precision and recall calculated for bootstrap sample 1: 0.8640223023960756, 0.9316078514432434
15+
Model-log precision and recall calculated for bootstrap sample 2: 0.8882953280705439, 0.9416055428031388
16+
Model-log precision and recall calculated for bootstrap sample 3: 0.8640223022539603, 0.921805445155556
17+
Model-log precision and recall calculated for bootstrap sample 4: 0.8640223022681677, 0.9481512298336585
18+
Model-log precision and recall calculated for bootstrap sample 5: 0.8640223022654226, 0.9368237289668605
19+
Model-log precision and recall calculated for bootstrap sample 6: 0.864022302362209, 0.9481512299368573
20+
Model-log precision and recall calculated for bootstrap sample 7: 0.8640223022423997, 0.9667939526581532
21+
Model-log precision and recall calculated for bootstrap sample 8: 0.864022302265727, 0.9481512298309965
22+
Model-log precision and recall calculated for bootstrap sample 9: 0.8640223022506948, 0.9316078512864893
23+
Model-log precision and recall calculated for bootstrap sample 10: 0.86402230226115, 0.9481512298259573
24+
Model-log precision and recall calculated for bootstrap sample 11: 0.8789360974064712, 0.9703714487638323
25+
Model-log precision and recall calculated for bootstrap sample 12: 0.8640223022574599, 0.9481512298219085
26+
Model-log precision and recall calculated for bootstrap sample 13: 0.8789360974064726, 0.9376442101432018
27+
Model-log precision and recall calculated for bootstrap sample 14: 0.8789360974064709, 0.942662351920154
28+
Model-log precision and recall calculated for bootstrap sample 15: 0.8640223022271509, 0.9539061777646949
29+
Model-log precision and recall calculated for bootstrap sample 16: 0.864022302276446, 0.9218042849623546
30+
Model-log precision and recall calculated for bootstrap sample 17: 0.878936097406473, 0.9528750949556504
31+
Model-log precision and recall calculated for bootstrap sample 18: 0.8640223022513908, 0.9481512298152467
32+
Model-log precision and recall calculated for bootstrap sample 19: 0.8789360974064728, 0.9377155545497414
33+
Model-log precision and recall calculated for bootstrap sample 20: 0.8640223022648623, 0.9539061778063306
34+
35+
===========================Calculated generalization============================
36+
37+
Generalization calculated in 12226 ms with 20 samples.
38+
Model-system precision: 0.8689644023473011 +/- 0.0036445059613284285
39+
Model-system recall: 0.9445018526122011 +/- 0.005750771071582731
Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,22 @@
1+
Computing event log statistics
2+
Completeness and Coverage are described in:
3+
M. Kabierski, M. Richter, and M. Weidlich, "Addressing the
4+
log representativeness problem using species discovery," in ICPM 2023
5+
6+
============================= Event Log Statistics =============================
7+
8+
--------------------------- Activity-based Analysis ----------------------------
9+
Activities: [A, B, C]
10+
Number of distinct activities: 3
11+
Number of total activities: 121
12+
Completeness: 1.0
13+
14+
------------------- Directly-follows-relation-based Analysis -------------------
15+
Number of distinct DF-relations: 5
16+
Number of total DF-relations: 88
17+
Completeness: 1.0
18+
19+
----------------------------- Trace-based Analysis -----------------------------
20+
Number of distinct traces: 5
21+
Number of total traces: 33
22+
Completeness: 1.0

0 commit comments

Comments
 (0)