Data-driven process discovery: revealing conditional infrequent behavior from event logs
Conference Contribution
Mannhardt, F., de Leoni, M., Reijers, H.A. & van der Aalst, W.M.P. (2017). Data-driven process discovery: revealing conditional infrequent behavior from event logs. In Eric Dubois & Klaus Pohl (Eds.), Advanced Information Systems Engineering: 29th International Conference, CAiSE 2017, Essen, Germany, June 12-16, 2017, Proceedings (pp. 545-560). (Lecture Notes in Computer Science, No. 10253). Cham: Springer.
Process discovery methods automatically infer process models from event logs. Event logs often contain so-called noise, e.g., infrequent outliers or recording errors, which obscures the main behavior of the process. Existing methods filter this noise based on the frequency of event labels: infrequent paths and activities are excluded. However, infrequent behavior may reveal important insights into the process, so not all infrequent behavior should be considered noise. This paper proposes the Data-aware Heuristic Miner (DHM), a process discovery method that applies classification techniques to the data attributes of events to distinguish infrequent paths from random noise. The data flow and control flow of the process are discovered together. We show that the DHM is, to some degree, robust against random noise and reveals data-driven decisions that other discovery methods filter out. The DHM has been successfully tested on several real-life event logs, two of which we present in this paper.
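The idea of separating conditional infrequent behavior from random noise can be illustrated with a toy sketch. This is not the DHM itself: the abstract describes classification techniques, whereas the sketch below swaps in a plain per-value frequency table as a stand-in. The log layout, the `priority` attribute, the activity names, and both helper functions are hypothetical and chosen only for illustration.

```python
from collections import Counter


def directly_follows(log):
    """Count how often activity a is directly followed by activity b."""
    counts = Counter()
    for trace in log:
        events = trace["events"]
        for a, b in zip(events, events[1:]):
            counts[(a, b)] += 1
    return counts


def conditional_support(log, source, target, attribute):
    """Per attribute value: share of `source` occurrences directly followed by `target`.

    Stand-in for a classifier (assumption): a path that is rare overall but
    near-certain for one attribute value is explained by the data.
    """
    seen = Counter()
    followed = Counter()
    for trace in log:
        value = trace["attrs"].get(attribute)
        events = trace["events"]
        for i, activity in enumerate(events):
            if activity == source:
                seen[value] += 1
                if i + 1 < len(events) and events[i + 1] == target:
                    followed[value] += 1
    return {value: followed[value] / seen[value] for value in seen}


# Hypothetical log of 100 traces: the usual path, a rare path taken only by
# urgent cases, and a rare path that no attribute value explains.
log = (
    [{"attrs": {"priority": "normal"}, "events": ["register", "check", "pay"]}] * 93
    + [{"attrs": {"priority": "urgent"}, "events": ["register", "pay"]}] * 5
    + [{"attrs": {"priority": "normal"}, "events": ["register", "reject"]}] * 2
)

counts = directly_follows(log)
pay_support = conditional_support(log, "register", "pay", "priority")
reject_support = conditional_support(log, "register", "reject", "priority")
```

In this toy log both `register -> pay` (5 of 100 cases) and `register -> reject` (2 of 100 cases) would fall below a 10% frequency threshold. Only the first is explained by the data: every urgent case takes it, while no value of `priority` makes `register -> reject` likely, so a frequency-only filter would discard a path that a data-aware one could keep.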