condensed representations Latest Research Papers

Condensed representations of changes in dynamic graphs through emerging subgraph mining

Engineering Applications of Artificial Intelligence ◽

10.1016/j.engappai.2020.103830 ◽

2020 ◽

Vol 94 ◽

pp. 103830

Author(s):

Angelo Impedovo ◽

Corrado Loglisci ◽

Michelangelo Ceci ◽

Donato Malerba

Keyword(s):

Dynamic Graphs ◽

Subgraph Mining ◽

Condensed Representations

Download Full-text

Hybrid ASP-based Approach to Pattern Mining

Theory and Practice of Logic Programming ◽

10.1017/s1471068418000467 ◽

2019 ◽

Vol 19 (04) ◽

pp. 505-535

Author(s):

SERGEY PARAMONOV ◽

DARIA STEPANOVA ◽

PAULI MIETTINEN

Keyword(s):

Graph Mining ◽

Pattern Mining ◽

Hybrid Approach ◽

Theory And Practice ◽

Data Sets ◽

Real World Data ◽

Data Set ◽

Condensed Representations ◽

The One ◽

Relevant Rule

AbstractDetecting small sets of relevant patterns from a given data set is a central challenge in data mining. The relevance of a pattern is based on user-provided criteria; typically, all patterns that satisfy certain criteria are considered relevant. Rule-based languages like answer set programming (ASP) seem well suited for specifying such criteria in a form of constraints. Although progress has been made, on the one hand, on solving individual mining problems and, on the other hand, developing generic mining systems, the existing methods focus either on scalability or on generality. In this paper, we make steps toward combining local (frequency, size, and cost) and global (various condensed representations like maximal, closed, and skyline) constraints in a generic and efficient way. We present a hybrid approach for itemset, sequence, and graph mining which exploits dedicated highly optimized mining systems to detect frequent patterns and then filters the results using declarative ASP. To further demonstrate the generic nature of our hybrid framework, we apply it to a problem of approximately tiling a database. Experiments on real-world data sets show the effectiveness of the proposed method and computational gains for itemset, sequence, and graph mining, as well as approximate tiling.Under consideration in Theory and Practice of Logic Programming.

Download Full-text

Skypattern mining: From pattern condensed representations to dynamic constraint satisfaction problems

Artificial Intelligence ◽

10.1016/j.artint.2015.04.003 ◽

2017 ◽

Vol 244 ◽

pp. 48-69 ◽

Cited By ~ 11

Author(s):

Willy Ugarte ◽

Patrice Boizumault ◽

Bruno Crémilleux ◽

Alban Lepailleur ◽

Samir Loudni ◽

...

Keyword(s):

Constraint Satisfaction ◽

Constraint Satisfaction Problems ◽

Dynamic Constraint ◽

Condensed Representations

Download Full-text

Towards Faster Mining of Disjunction-Based Concise Representations of Frequent Patterns

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213014500018 ◽

2014 ◽

Vol 23 (02) ◽

pp. 1450001

Author(s):

T. Hamrouni ◽

S. Ben Yahia ◽

E. Mephu Nguifo

Keyword(s):

Empirical Study ◽

Real Life ◽

Search Space ◽

Frequent Patterns ◽

Memory Consumption ◽

Efficient Tool ◽

Condensed Representation ◽

Benchmark Datasets ◽

Condensed Representations ◽

Amount Of Knowledge

In many real-life datasets, the number of extracted frequent patterns was shown to be huge, hampering the effective exploitation of such amount of knowledge by human experts. To overcome this limitation, exact condensed representations were introduced in order to offer a small-sized set of elements from which the faithful retrieval of all frequent patterns is possible. In this paper, we introduce a new exact condensed representation only based on particular elements from the disjunctive search space. In this space, a pattern is characterized by its disjunctive support, i.e., the frequency of complementary occurrences – instead of the ubiquitous co-occurrence link – of its items. For several benchmark datasets, this representation has been shown interesting in compactness terms compared to the pioneering approaches of the literature. In this respect, we mainly focus here on proposing an efficient tool for mining this representation. For this purpose, we introduce an algorithm, called DSSRM, dedicated to this task. We also propose several techniques to optimize its mining time as well as its memory consumption. The carried out empirical study on benchmark datasets shows that DSSRM is faster by several orders of magnitude than the MEP algorithm.

Download Full-text

Objectively evaluating condensed representations and interestingness measures for frequent itemset mining

Journal of Intelligent Information Systems ◽

10.1007/s10844-013-0297-9 ◽

2013 ◽

Vol 45 (3) ◽

pp. 299-317 ◽

Cited By ~ 1

Author(s):

Albrecht Zimmermann

Keyword(s):

Frequent Itemset ◽

Frequent Itemset Mining ◽

Interestingness Measures ◽

Itemset Mining ◽

Condensed Representations

Download Full-text

Constrained Cube Lattices for Multidimensional Database Mining

Exploring Advances in Interdisciplinary Data Mining and Analytics ◽

10.4018/978-1-61350-474-1.ch012 ◽

2011 ◽

pp. 189-218

Author(s):

Alain Casali ◽

Sébastien Nedjar ◽

Rosine Cicchetti ◽

Lotfi Lakhal

Keyword(s):

Search Space ◽

Database Mining ◽

Multidimensional Database ◽

Common Structure ◽

Power Set ◽

Condensed Representations ◽

Concise Representation ◽

Monotone Constraints ◽

Existing Data ◽

Cube Lattice

In multidimensional database mining, constrained multidimensional patterns differ from the well-known frequent patterns from both conceptual and logical points of view because of a common structure and the ability to support various types of constraints. Classical data mining techniques are based on the power set lattice of binary attribute values and, even adapted, are not suitable when addressing the discovery of constrained multidimensional patterns. In this chapter, the authors propose a foundation for various multidimensional database mining problems by introducing a new algebraic structure called cube lattice, which characterizes the search space to be explored. This chapter takes into consideration monotone and/or anti-monotone constraints enforced when mining multidimensional patterns. The authors propose condensed representations of the constrained cube lattice, which is a convex space, and present a generalized levelwise algorithm for computing them. Additionally, the authors consider the formalization of existing data cubes, and the discovery of frequent multidimensional patterns, while introducing a perfect concise representation from which any solution provided with its conjunction, disjunction and negation frequencies. Finally, emphasis on advantages of the cube lattice when compared to the power set lattice of binary attributes in multidimensional database mining are placed.

Download Full-text

A False Negative Maximal Frequent Itemsets Mining Algorithm over Stream

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.135-136.21 ◽

2011 ◽

Vol 135-136 ◽

pp. 21-25

Author(s):

Hai Feng Li ◽

Ning Zhang

Keyword(s):

Real World ◽

False Negative ◽

Frequent Itemsets ◽

Experimental Results ◽

Mining Algorithm ◽

Chernoff Bound ◽

Frequent Itemsets Mining ◽

Condensed Representations ◽

Maximal Frequent Itemsets ◽

Landmark Model

Maximal frequent itemsets are one of several condensed representations of frequent itemsets, which store most of the information contained in frequent itemsets using less space, thus being more suitable for stream mining. This paper focuses on mining maximal frequent itemsets approximately over a stream landmark model. A false negative method is proposed based on Chernoff Bound to save the computing and memory cost. Our experimental results on a real world dataset show that our algorithm is effective and efficient.

Download Full-text

Condensed Representations for Data Mining

Encyclopedia of Data Warehousing and Mining ◽

10.4018/978-1-59140-557-3.ch040 ◽

2011 ◽

pp. 207-211 ◽

Cited By ~ 2

Author(s):

Jean-Francois Boulicaut

Keyword(s):

Data Mining ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Future Trends ◽

Itemset Mining ◽

Typical Data ◽

Condensed Representations ◽

Transactional Data ◽

Research Domain

Condensed representations have been proposed in Mannila and Toivonen (1996) as a useful concept for the optimization of typical data-mining tasks. It appears as a key concept within the inductive database framework (Boulicaut et al., 1999; de Raedt, 2002; Imielinski & Mannila, 1996), and this article introduces this research domain, its achievements in the context of frequent itemset mining (FIM) from transactional data, and its future trends.

Download Full-text

Constrained Cube Lattices for Multidimensional Database Mining

International Journal of Data Warehousing and Mining ◽

10.4018/jdwm.2010070104 ◽

2010 ◽

Vol 6 (3) ◽

pp. 43-72 ◽

Cited By ~ 2

Author(s):

Alain Casali ◽

Sébastien Nedjar ◽

Rosine Cicchetti ◽

Lotfi Lakhal

Keyword(s):

Search Space ◽

Database Mining ◽

Multidimensional Database ◽

Common Structure ◽

Power Set ◽

Condensed Representations ◽

Concise Representation ◽

Points Of View ◽

Monotone Constraints ◽

Cube Lattice

In multidimensional database mining, constrained multidimensional patterns differ from the well-known frequent patterns from both conceptual and logical points of view because of a common structure and the ability to support various types of constraints. Classical data mining techniques are based on the power set lattice of binary attribute values and, even adapted, are not suitable when addressing the discovery of constrained multidimensional patterns. In this paper, the authors propose a foundation for various multidimensional database mining problems by introducing a new algebraic structure called cube lattice, which characterizes the search space to be explored. This paper takes into consideration monotone and/or anti-monotone constraints enforced when mining multidimensional patterns. The authors propose condensed representations of the constrained cube lattice, which is a convex space, and present a generalized levelwise algorithm for computing them. Additionally, the authors consider the formalization of existing data cubes, and the discovery of frequent multidimensional patterns, while introducing a perfect concise representation from which any solution provided with its conjunction, disjunction and negation frequencies. Finally, emphasis on advantages of the cube lattice when compared to the power set lattice of binary attributes in multidimensional database mining are placed.

Download Full-text

Frequent Closed Itemsets Based Condensed Representations for Association Rules

Post-Mining of Association Rules ◽

10.4018/978-1-60566-404-0.ch013 ◽

2009 ◽

pp. 246-271 ◽

Cited By ~ 3

Author(s):

Nicolas Pasquier

Keyword(s):

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Rule Mining ◽

Theoretical Frameworks ◽

Interestingness Measures ◽

Rule Structure ◽

Closed Itemsets ◽

Condensed Representations ◽

Frequent Closed Itemset

After more than one decade of researches on association rule mining, efficient and scalable techniques for the discovery of relevant association rules from large high-dimensional datasets are now available. Most initial studies have focused on the development of theoretical frameworks and efficient algorithms and data structures for association rule mining. However, many applications of association rules to data from different domains have shown that techniques for filtering irrelevant and useless association rules are required to simplify their interpretation by the end-user. Solutions proposed to address this problem can be classified in four main trends: constraint-based mining, interestingness measures, association rule structure analysis, and condensed representations. This chapter focuses on condensed representations that are characterized in the frequent closed itemset framework to expose their advantages and drawbacks.

Download Full-text

condensed representations
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Condensed representations of changes in dynamic graphs through emerging subgraph mining

Hybrid ASP-based Approach to Pattern Mining

Skypattern mining: From pattern condensed representations to dynamic constraint satisfaction problems

Towards Faster Mining of Disjunction-Based Concise Representations of Frequent Patterns

Objectively evaluating condensed representations and interestingness measures for frequent itemset mining

Constrained Cube Lattices for Multidimensional Database Mining

A False Negative Maximal Frequent Itemsets Mining Algorithm over Stream

Condensed Representations for Data Mining

Constrained Cube Lattices for Multidimensional Database Mining

Frequent Closed Itemsets Based Condensed Representations for Association Rules

Export Citation Format

condensed representationsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Condensed representations of changes in dynamic graphs through emerging subgraph mining

Hybrid ASP-based Approach to Pattern Mining

Skypattern mining: From pattern condensed representations to dynamic constraint satisfaction problems

Towards Faster Mining of Disjunction-Based Concise Representations of Frequent Patterns

Objectively evaluating condensed representations and interestingness measures for frequent itemset mining

Constrained Cube Lattices for Multidimensional Database Mining

A False Negative Maximal Frequent Itemsets Mining Algorithm over Stream

Condensed Representations for Data Mining

Constrained Cube Lattices for Multidimensional Database Mining

Frequent Closed Itemsets Based Condensed Representations for Association Rules

condensed representations
Recently Published Documents