Emerging Pattern Mining To Aid Toxicological Knowledge Discovery

Richard Sherhod; Philip N. Judson; Thierry Hanser; Jonathan D. Vessey; Samuel J. Webb; Valerie J. Gillet

doi:10.1021/ci5001828

Automating Knowledge Discovery for Toxicity Prediction Using Jumping Emerging Pattern Mining

Journal of Chemical Information and Modeling ◽

10.1021/ci300254w ◽

2012 ◽

Vol 52 (11) ◽

pp. 3074-3087 ◽

Cited By ~ 21

Author(s):

Richard Sherhod ◽

Valerie J. Gillet ◽

Philip N. Judson ◽

Jonathan D. Vessey

Keyword(s):

Knowledge Discovery ◽

Pattern Mining ◽

Toxicity Prediction ◽

Emerging Pattern

Download Full-text

Improving the understanding of cancer in a descriptive way: An emerging pattern mining‐based approach

International Journal of Intelligent Systems ◽

10.1002/int.22503 ◽

2021 ◽

Author(s):

Antonio Manuel Trasierras ◽

José María Luna ◽

Sebastián Ventura

Keyword(s):

Pattern Mining ◽

Emerging Pattern

Download Full-text

Summarization in Pattern Mining

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch287 ◽

2011 ◽

pp. 1877-1883 ◽

Cited By ~ 2

Author(s):

Mohammad Al Hasan

Keyword(s):

Knowledge Discovery ◽

Pattern Mining ◽

Real Life ◽

Frequent Pattern Mining ◽

Search Space ◽

Frequent Pattern ◽

Combinatorial Search ◽

Main Concept ◽

Set Size ◽

Memory Complexity

The research on mining interesting patterns from transactions or scientific datasets has matured over the last two decades. At present, numerous algorithms exist to mine patterns of variable complexities, such as set, sequence, tree, graph, etc. Collectively, they are referred as Frequent Pattern Mining (FPM) algorithms. FPM is useful in most of the prominent knowledge discovery tasks, like classification, clustering, outlier detection, etc. They can be further used, in database tasks, like indexing and hashing while storing a large collection of patterns. But, the usage of FPM in real-life knowledge discovery systems is considerably low in comparison to their potential. The prime reason is the lack of interpretability caused from the enormity of the output-set size. For instance, a moderate size graph dataset with merely thousand graphs can produce millions of frequent graph patterns with a reasonable support value. This is expected due to the combinatorial search space of pattern mining. However, classification, clustering, and other similar Knowledge discovery tasks should not use that many patterns as their knowledge nuggets (features), as it would increase the time and memory complexity of the system. Moreover, it can cause a deterioration of the task quality because of the popular “curse of dimensionality” effect. So, in recent years, researchers felt the need to summarize the output set of FPM algorithms, so that the summary-set is small, non-redundant and discriminative. There are different summarization techniques: lossless, profile-based, cluster-based, statistical, etc. In this article, we like to overview the main concept of these summarization techniques, with a comparative discussion of their strength, weakness, applicability and computation cost.

Download Full-text

Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments

Big Data Analytics ◽

10.1186/s41044-018-0038-8 ◽

2019 ◽

Vol 4 (1) ◽

Cited By ~ 1

Author(s):

Ángel Miguel García-Vico ◽

Pedro González ◽

Cristóbal José Carmona ◽

María José del Jesus

Keyword(s):

Big Data ◽

Evolutionary Algorithm ◽

Pattern Mining ◽

Quality Measures ◽

Multi Objective ◽

Emerging Pattern

Download Full-text

Enhancing the Process of Knowledge Discovery in Geographic Databases Using Geo-Ontologies

Database Technologies ◽

10.4018/978-1-60566-058-5.ch147 ◽

2009 ◽

pp. 2405-2426 ◽

Cited By ~ 1

Author(s):

Vania Bogorny ◽

Paulo Martins Engel ◽

Luis Otavio Alavares

Keyword(s):

Knowledge Discovery ◽

Prior Knowledge ◽

Association Rules ◽

Pattern Mining ◽

Spatial Association ◽

Geographic Pattern ◽

Geographic Patterns ◽

Geographic Databases ◽

Novel Approach ◽

Amount Of Knowledge

This chapter introduces the problem of mining frequent geographic patterns and spatial association rules from geographic databases. In the geographic domain most discovered patterns are trivial, non-novel, and noninteresting, which simply represent natural geographic associations intrinsic to geographic data. A large amount of natural geographic associations are explicitly represented in geographic database schemas and geo-ontologies, which have not been used so far in frequent geographic pattern mining. Therefore, this chapter presents a novel approach to extract patterns from geographic databases using geoontologies as prior knowledge. The main goal of this chapter is to show how the large amount of knowledge represented in geo-ontologies can be used to avoid the extraction of patterns that are previously known as noninteresting.

Download Full-text

New structural alerts for Ames mutagenicity discovered using emerging pattern mining techniques

Toxicology Research ◽

10.1039/c4tx00071d ◽

2015 ◽

Vol 4 (1) ◽

pp. 46-56 ◽

Cited By ~ 2

Author(s):

Laurence Coquin ◽

Steven J. Canipa ◽

William C. Drewe ◽

Lilia Fisk ◽

Valerie J. Gillet ◽

...

Keyword(s):

Expert System ◽

Pattern Mining ◽

System P ◽

Structural Alerts ◽

Emerging Pattern

The discovered patterns are used to develop new structural alerts for mutagenicity in the Derek Nexus expert system.

Download Full-text

Identifying emerging hotel preferences using Emerging Pattern Mining technique

Tourism Management ◽

10.1016/j.tourman.2014.06.015 ◽

2015 ◽

Vol 46 ◽

pp. 311-321 ◽

Cited By ~ 80

Author(s):

Gang Li ◽

Rob Law ◽

Huy Quan Vu ◽

Jia Rong ◽

Xinyuan (Roy) Zhao

Keyword(s):

Pattern Mining ◽

Mining Technique ◽

Emerging Pattern

Download Full-text

Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey

10.1063/1.3526223 ◽

2010 ◽

Cited By ~ 1

Author(s):

G. Shivaprasad ◽

N. V. Subbareddy ◽

U. Dinesh Acharya ◽

R. B. Patel ◽

B. P. Singh

Keyword(s):

Research And Development ◽

Knowledge Discovery ◽

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Access Pattern ◽

Web Usage ◽

Web Access ◽

Web Access Pattern ◽

Usage Data

Download Full-text

Knowledge Discovery from Healthcare Electronic Records for Sustainable Environment

Sustainability ◽

10.3390/su13168900 ◽

2021 ◽

Vol 13 (16) ◽

pp. 8900

Author(s):

Naeem Ahmed Mahoto ◽

Asadullah Shaikh ◽

Mana Saleh Al Reshan ◽

Muhammad Ali Memon ◽

Adel Sulaiman

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Association Analysis ◽

Pattern Mining ◽

Large Data ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Electronic Records ◽

Data Mining Techniques ◽

Healthcare Data

The medical history of a patient is an essential piece of information in healthcare agencies, which keep records of patients. Due to the fact that each person may have different medical complications, healthcare data remain sparse, high-dimensional and possibly inconsistent. The knowledge discovery from such data is not easily manageable for patient behaviors. It becomes a challenge for both physicians and healthcare agencies to discover knowledge from many healthcare electronic records. Data mining, as evidenced from the existing published literature, has proven its effectiveness in transforming large data collections into meaningful information and knowledge. This paper proposes an overview of the data mining techniques used for knowledge discovery in medical records. Furthermore, based on real healthcare data, this paper also demonstrates a case study of discovering knowledge with the help of three data mining techniques: (1) association analysis; (2) sequential pattern mining; (3) clustering. Particularly, association analysis is used to extract frequent correlations among examinations done by patients with a specific disease, sequential pattern mining allows extracting frequent patterns of medical events and clustering is used to find groups of similar patients. The discovered knowledge may enrich healthcare guidelines, improve their processes and detect anomalous patients’ behavior with respect to the medical guidelines.

Download Full-text

Pattern mining for knowledge discovery

Proceedings of the 23rd International Database Applications & Engineering Symposium on - IDEAS '19 ◽

10.1145/3331076.3331099 ◽

2019 ◽

Cited By ~ 2

Author(s):

Carson K. Leung

Keyword(s):

Knowledge Discovery ◽

Pattern Mining

Download Full-text