Parallel Distributed Patterns Mining Using Hadoop MapReduce Framework

Ishak H. A. Meddah; Khaled Belkadi

doi:10.4018/ijghpc.2017040105

Parallel Distributed Patterns Mining Using Hadoop MapReduce Framework

International Journal of Grid and High Performance Computing ◽

10.4018/ijghpc.2017040105 ◽

2017 ◽

Vol 9 (2) ◽

pp. 70-85 ◽

Cited By ~ 6

Author(s):

Ishak H. A. Meddah ◽

Khaled Belkadi

Keyword(s):

Business Process ◽

Process Mining ◽

Process Analysis ◽

Large Data ◽

Finite State Automaton ◽

Large Set ◽

Mapreduce Framework ◽

Business Process Analysis ◽

Finite State ◽

Log File

The treatment of large data is proving more difficult in different axes, but the arrival of the framework MapReduce is a solution of this problem. With it we can analyze and process vast amounts of data. It does this by distributing the computational work across a cluster of virtual servers running in a cloud or large set of machines while process mining provides an important bridge between data mining and business process analysis. The process mining techniques allow for extracting information from event logs. In general, there are two steps in process mining: correlation definition or discovery and process inference or composition. Firstly, the authors' work consists to mine small patterns from a log traces. Those patterns are the representation of the traces execution from a log file of a business process. In this step, they use existing techniques. The patterns are represented by finite state automaton or their regular expression. The final model is the combination of only two types of small patterns whom are represented by the regular expressions (ab)* and (ab*c)*. Secondly, the authors compute these patterns in parallel, and then combine those small patterns using the MapReduce framework. They have two parties: the first is the Map Step in which they mine patterns from execution traces; the second is the combination of these small patterns as reduce step. The authors' results are promising in that they show that their approach is scalable, general, and precise. It minimizes the execution time by the use of the MapReduce framework.

Download Full-text

Parallel and Distributed Pattern Mining

International Journal of Rough Sets and Data Analysis ◽

10.4018/ijrsda.2019070101 ◽

2019 ◽

Vol 6 (3) ◽

pp. 1-17

Author(s):

Ishak H.A Meddah ◽

Nour El Houda REMIL

Keyword(s):

Business Process ◽

Pattern Mining ◽

Process Mining ◽

Process Analysis ◽

Large Data ◽

Finite State Automaton ◽

Large Set ◽

Business Process Analysis ◽

Finite State ◽

Hadoop Framework

The treatment of large data is difficult and it looks like the arrival of the framework MapReduce is a solution of this problem. This framework can be used to analyze and process vast amounts of data. This happens by distributing the computational work across a cluster of virtual servers running in a cloud or a large set of machines. Process mining provides an important bridge between data mining and business process analysis. Its techniques allow for extracting information from event logs. Generally, there are two steps in process mining, correlation definition or discovery and the inference or composition. First of all, their work mines small patterns from log traces. Those patterns are the representation of the traces execution from a log file of a business process. In this step, the authors use existing techniques. The patterns are represented by finite state automaton or their regular expression; and the final model is the combination of only two types of different patterns whom are represented by the regular expressions (ab)* and (ab*c)*. Second, they compute these patterns in parallel, and then combine those small patterns using the Hadoop framework. They have two steps; the first is the Map Step through which they mine patterns from execution traces, and the second one is the combination of these small patterns as a reduce step. The results show that their approach is scalable, general and precise. It minimizes the execution time by the use of the Hadoop framework.

Download Full-text

Efficient Implementation of Hadoop MapReduce-Based Dataflow

Handbook of Research on Biomimicry in Information Retrieval and Knowledge Management - Advances in Web Technologies and Engineering ◽

10.4018/978-1-5225-3004-6.ch020 ◽

2018 ◽

pp. 372-385 ◽

Cited By ~ 1

Author(s):

Ishak H. A. Meddah ◽

Khaled Belkadi

Keyword(s):

Business Process ◽

Process Mining ◽

Process Analysis ◽

Large Data ◽

Finite State Automaton ◽

Hadoop Mapreduce ◽

Event Logs ◽

Execution Traces ◽

Business Process Analysis ◽

Finite State

MapReduce is a solution for the treatment of large data. With it we can analyze and process data. It does this by distributing the computation in a large set of machines. Process mining provides an important bridge between data mining and business process analysis. This technique allows for the extraction of information from event logs. Firstly, the chapter mines small patterns from log traces. Those patterns are the representation of the traces execution from a business process. The authors use existing techniques; the patterns are represented by finite state automaton; the final model is the combination of only two types of patterns that are represented by the regular expressions. Secondly, the authors compute these patterns in parallel, and then combine those patterns using MapReduce. They have two parties. The first is the Map Step. The authors mine patterns from execution traces. The second is the combination of these small patterns as reduce step. The results are promising; they show that the approach is scalable, general, and precise. It minimizes the execution time by the use of MapReduce.

Download Full-text

Parallel Mining Small Patterns from Business Process Traces

International Journal of Software Science and Computational Intelligence ◽

10.4018/ijssci.2016010103 ◽

2016 ◽

Vol 8 (1) ◽

pp. 32-45 ◽

Cited By ~ 1

Author(s):

Ishak H.A. Meddah ◽

Khaled Belkadi ◽

Mohamed Amine Boudia

Keyword(s):

Business Process ◽

Web Applications ◽

Process Mining ◽

Process Analysis ◽

Finite State Automaton ◽

Mapreduce Framework ◽

Hadoop Mapreduce ◽

Business Process Analysis ◽

Finite State ◽

Small Models

Hadoop MapReduce has arrived to solve the problem of treatment of big data, also the parallel treatment, with this framework the authors analyze, process a large size of data. It based for distributing the work in two big steps, the map and the reduce steps in a cluster or big set of machines. They apply the MapReduce framework to solve some problems in the domain of process mining how provides a bridge between data mining and business process analysis, this technique consists to mine lot of information from the process traces; In process mining, there are two steps, correlation definition and the process inference. The work consists in first time of mining patterns whom are the work flow of the process from execution traces, those patterns present the work or the history of each party of the process, the authors' small patterns are represented in this work by finite state automaton or their regular expression, the authors have only two patterns to facilitate the process, the general presentation of the process is the combination of the small mining patterns. The patterns are represented by the regular expressions (ab)* and (ab*c)*. Secondly, they compute the patterns, and combine them using the Hadoop MapReduce framework, in this work they have two general steps, first the Map step, they mine small patterns or small models from business process, and the second is the combination of models as reduce step. The authors use the business process of two web applications, the SKYPE, and VIBER applications. The general result shown that the parallel distributed process by using the Hadoop MapReduce framework is scalable, and minimizes the execution time.

Download Full-text

Discovering Patterns using Process Mining

International Journal of Rough Sets and Data Analysis ◽

10.4018/ijrsda.2016100102 ◽

2016 ◽

Vol 3 (4) ◽

pp. 21-31 ◽

Cited By ~ 2

Author(s):

Ishak Meddah ◽

Belkadi Khaled

Keyword(s):

Business Process ◽

Process Mining ◽

Process Analysis ◽

Finite State Automaton ◽

Regular Expressions ◽

Final Model ◽

Event Logs ◽

Execution Traces ◽

Business Process Analysis ◽

Finite State

Process mining provides an important bridge between data mining and business process analysis, his techniques allow for extracting information from event logs. In general, there are two steps in process mining, correlation definition or discovery and then process inference or composition. Firstly, the authors' work consists to mine small patterns from a log traces of two applications; SKYPE, and VIBER, those patterns are the representation of the execution traces of a business process. In this step, the authors use existing techniques; The patterns are represented by finite state automaton or their regular expression; The final model is the combination of only two types of small patterns whom are represented by the regular expressions (ab)* and (ab*c)*. Secondly, the authors compute these patterns in parallel, and then combine those small patterns using the composition rules, they have two parties the first is the mine, they discover patterns from execution traces and the second is the combination of these small patterns. The patterns mining and the composition is illustrated by the automaton existing techniques. The Execution traces are the different actions effected by users in the SKYPE and VIBER. The results are general and precise. It minimizes the execution time and the loss of information.

Download Full-text

Efficient Implementation of Hadoop MapReduce based Business Process Dataflow

International Journal of Decision Support System Technology ◽

10.4018/ijdsst.2017010104 ◽

2017 ◽

Vol 9 (1) ◽

pp. 49-60

Author(s):

Ishak H.A. Meddah ◽

Khaled Belkadi ◽

Mohamed Amine Boudia

Keyword(s):

Business Process ◽

Web Applications ◽

Process Mining ◽

Process Analysis ◽

Finite State Automaton ◽

Large Set ◽

Process Data ◽

Hadoop Mapreduce ◽

Execution Traces ◽

Finite State

Hadoop MapReduce is one of the solutions for the process of large and big data, with-it the authors can analyze and process data, it does this by distributing the computational in a large set of machines. Process mining provides an important bridge between data mining and business process analysis, his techniques allow for mining data information from event logs. Firstly, the work consists to mine small patterns from a log traces, those patterns are the workflow of the execution traces of business process. The authors' work is an amelioration of the existing techniques who mine only one general workflow, the workflow present the general traces of two web applications; they use existing techniques; the patterns are represented by finite state automaton; the final model is the combination of only two types of patterns whom are represented by the regular expressions. Secondly, the authors compute these patterns in parallel, and then combine those patterns using MapReduce, they have two parts the first is the Map Step, they mine patterns from execution traces and the second is the combination of these small patterns as reduce step. The results are promising; they show that the approach is scalable, general and precise. It reduces the execution time by the use of Hadoop MapReduce Framework.

Download Full-text

Mining Patterns Using Business Process Management

Handbook of Research on Biomimicry in Information Retrieval and Knowledge Management - Advances in Web Technologies and Engineering ◽

10.4018/978-1-5225-3004-6.ch005 ◽

2018 ◽

pp. 78-89 ◽

Cited By ~ 1

Author(s):

Ishak H. A. Meddah ◽

Khaled Belkadi

Keyword(s):

Business Process ◽

Process Management ◽

Pattern Mining ◽

Process Mining ◽

Process Analysis ◽

Finite State Automaton ◽

Event Logs ◽

Execution Traces ◽

Business Process Analysis ◽

Finite State

Process mining provides an important bridge between data mining and business process analysis. This technique allows for the extraction of information from event logs. In general, there are two steps in process mining: correlation definition or discovery and then process inference or composition. Firstly, the authors mine small patterns from log traces of two applications; those patterns are the representation of the execution traces of a business process. In this step, the authors use existing techniques. The patterns are represented by finite state automaton or their regular expression. The final model is the combination of only two types of small patterns that are represented by the regular expressions (ab)* and (ab*c)*. Secondly, the authors compute these patterns in parallel and then combine those small patterns using the composition rules. They have two parties. The first is the mine, where the authors discover patterns from execution traces, and the second is the combination of these small patterns. The pattern mining and the composition is illustrated by the automaton existing techniques.

Download Full-text

Novel Approach for Mining Patterns

International Journal of Applied Evolutionary Computation ◽

10.4018/ijaec.2021010103 ◽

2021 ◽

Vol 12 (1) ◽

pp. 27-42

Author(s):

Ishak H. A. Meddah ◽

Nour Elhouda Remil ◽

Hadja Nebia Meddah

Keyword(s):

Process Mining ◽

Finite State Automaton ◽

Regular Expressions ◽

Mapreduce Framework ◽

Final Model ◽

Event Logs ◽

Execution Traces ◽

Novel Approach ◽

Finite State ◽

Log File

Process mining techniques allow for extracting information from event logs. In general, there are two steps in process mining, correlation definition or discovery and then process inference or composition. Firstly, the work consists to mine small patterns from a log traces; those patterns are the representation of the traces execution from a log file of a business process. In this step, the authors use existing techniques. The patterns are represented by finite state automaton or their regular expression. The final model is the combination of only two types of small patterns that are represented by the regular expressions. Secondly, they compute these patterns in parallel and then combine those small patterns using the MapReduce framework. They have two parties the first is the map step. They mine patterns from execution traces, and the second is the combination of these small patterns as reduce step. The results are promising; they show that the approach is scalable, general, and precise. It minimizes the execution time by the use of the MapReduce framework.

Download Full-text

No Longer Out of Sight, No Longer Out of Mind? How Organizations Engage with Process Mining-Induced Transparency to Achieve Increased Process Awareness

Business & Information Systems Engineering ◽

10.1007/s12599-021-00715-x ◽

2021 ◽

Cited By ~ 1

Author(s):

Julia Eggers ◽

Andreas Hein ◽

Markus Böhm ◽

Helmut Krcmar

Keyword(s):

Business Process ◽

Process Management ◽

Business Processes ◽

Process Mining ◽

Process Analysis ◽

Structured Interviews ◽

Business Process Analysis ◽

Starting Point ◽

Shared Awareness ◽

Induced Transparency

AbstractIn recent years, process mining has emerged as the leading big data technology for business process analysis. By extracting knowledge from event logs in information systems, process mining provides unprecedented transparency of business processes while being independent of the source system. However, despite its practical relevance, there is still a limited understanding of how organizations act upon the pervasive transparency created by process mining and how they leverage it to benefit from increased process awareness. Addressing this gap, this study conducts a multiple case study to explore how four organizations achieved increased process awareness by using process mining. Drawing on data from 24 semi-structured interviews and archival sources, this study reveals seven sociotechnical mechanisms based on process mining that enable organizations to create either standardized or shared awareness of sub-processes, end-to-end processes, and the firm’s process landscape. Thereby, this study contributes to research on business process management by revealing how process mining facilitates mechanisms that serve as a new, data-driven way of creating process awareness. In addition, the findings indicate that these mechanisms are influenced by the governance approach chosen to conduct process mining, i.e., a top-down or bottom-up driven implementation approach. Last, this study also points to the importance of balancing the social complications of increased process transparency and awareness. These results serve as a valuable starting point for practitioners to reflect on measures to increase organizational process awareness through process mining.

Download Full-text

A Business Process Analysis Methodology Based on Process Mining for Complaint Handling Service Processes

Applied Sciences ◽

10.3390/app9163313 ◽

2019 ◽

Vol 9 (16) ◽

pp. 3313 ◽

Cited By ~ 2

Author(s):

Wu ◽

He ◽

Wang ◽

Wen ◽

Keyword(s):

Business Process ◽

Process Management ◽

Process Mining ◽

Process Analysis ◽

System Level ◽

Service Process ◽

Complaint Handling ◽

Business Process Analysis ◽

Manufacturing Company

To improve the service quality of complaint handling service in a manufacturing company, it is key to analyze the business processes. Process mining is quite a useful approach to diagnose complaint handling service process problems, such as bottlenecks and deviations. However, the current business process analysis methodologies based on process mining mainly focus on operational process analysis and neglect other system level analysis. In this study, we introduce the method of Accimap from the discipline of accident analysis to analyze the diagnosis results of process mining. By creating a complaint handling service process management Accimap model, the process mining results analysis can be carried out across different system levels. A case study in a big manufacturing company in China is implemented to verify our approach. In the case study, 42 complaint handling process management factors are identified and the complaint handling process management Accimap model is created. The testing results by Rasmussen’s seven predictions in his risk management framework show that Accimap method presents a systematic approach to analyze the process diagnosis results based on process mining.

Download Full-text

Business process analysis in advertising: An extension to a methodology based on process mining projects

2016 35th International Conference of the Chilean Computer Science Society (SCCC) ◽

10.1109/sccc.2016.7836000 ◽

2016 ◽

Cited By ~ 2

Author(s):

Anibal Silva Osses ◽

Luiz Quelves Da Silva ◽

Bernardita Fernandez Cobo ◽

Michael Arias ◽

Eric Rojas ◽

...

Keyword(s):

Business Process ◽

Process Mining ◽

Process Analysis ◽

Business Process Analysis ◽

Mining Projects

Download Full-text