scholarly journals A Variable Precision Attribute Reduction Approach in Multilabel Decision Tables

2014 ◽  
Vol 2014 ◽  
pp. 1-7 ◽  
Author(s):  
Hua Li ◽  
Deyu Li ◽  
Yanhui Zhai ◽  
Suge Wang ◽  
Jing Zhang

Owing to the high dimensionality of multilabel data, feature selection in multilabel learning will be necessary in order to reduce the redundant features and improve the performance of multilabel classification. Rough set theory, as a valid mathematical tool for data analysis, has been widely applied to feature selection (also called attribute reduction). In this study, we propose a variable precision attribute reduct for multilabel data based on rough set theory, calledδ-confidence reduct, which can correctly capture the uncertainty implied among labels. Furthermore, judgement theory and discernibility matrix associated withδ-confidence reduct are also introduced, from which we can obtain the approach to knowledge reduction in multilabel decision tables.

Author(s):  
Yasuo Kudo ◽  
◽  
Tetsuya Murai ◽  

In this paper, we propose a parallel computation framework for a heuristic attribute reduction method. Attribute reduction is a key technique to use rough set theory as a tool in data mining. The authors have previously proposed a heuristic attribute reduction method to compute as many relative reducts as possible from a given dataset with numerous attributes. We parallelize our method by using open multiprocessing. We also evaluate the performance of a parallelized attribute reduction method by experiments.


2019 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Yang Li ◽  
Xuhua Hu

Purpose The purpose of this paper is to solve the problem of information privacy and security of social users. Mobile internet and social network are more and more deeply integrated into people’s daily life, especially under the interaction of the fierce development momentum of the Internet of Things and diversified personalized services, more and more private information of social users is exposed to the network environment actively or unintentionally. In addition, a large amount of social network data not only brings more benefits to network application providers, but also provides motivation for malicious attackers. Therefore, under the social network environment, the research on the privacy protection of user information has great theoretical and practical significance. Design/methodology/approach In this study, based on the social network analysis, combined with the attribute reduction idea of rough set theory, the generalized reduction concept based on multi-level rough set from the perspectives of positive region, information entropy and knowledge granularity of rough set theory were proposed. Furthermore, it was traversed on the basis of the hierarchical compatible granularity space of the original information system and the corresponding attribute values are coarsened. The selected test data sets were tested, and the experimental results were analyzed. Findings The results showed that the algorithm can guarantee the anonymity requirement of data publishing and improve the effect of classification modeling on anonymous data in social network environment. Research limitations/implications In the test and verification of privacy protection algorithm and privacy protection scheme, the efficiency of algorithm and scheme needs to be tested on a larger data scale. However, the data in this study are not enough. In the following research, more data will be used for testing and verification. Practical implications In the context of social network, the hierarchical structure of data is introduced into rough set theory as domain knowledge by referring to human granulation cognitive mechanism, and rough set modeling for complex hierarchical data is studied for hierarchical data of decision table. The theoretical research results are applied to hierarchical decision rule mining and k-anonymous privacy protection data mining research, which enriches the connotation of rough set theory and has important theoretical and practical significance for further promoting the application of this theory. In addition, combined the theory of secure multi-party computing and the theory of attribute reduction in rough set, a privacy protection feature selection algorithm for multi-source decision table is proposed, which solves the privacy protection problem of feature selection in distributed environment. It provides a set of effective rough set feature selection method for privacy protection classification mining in distributed environment, which has practical application value for promoting the development of privacy protection data mining. Originality/value In this study, the proposed algorithm and scheme can effectively protect the privacy of social network data, ensure the availability of social network graph structure and realize the need of both protection and sharing of user attributes and relational data.


2012 ◽  
Vol 2012 ◽  
pp. 1-24 ◽  
Author(s):  
Feng Hu ◽  
Guoyin Wang

The divide and conquer method is a typical granular computing method using multiple levels of abstraction and granulations. So far, although some achievements based on divided and conquer method in the rough set theory have been acquired, the systematic methods for knowledge reduction based on divide and conquer method are still absent. In this paper, the knowledge reduction approaches based on divide and conquer method, under equivalence relation and under tolerance relation, are presented, respectively. After that, a systematic approach, named as the abstract process for knowledge reduction based on divide and conquer method in rough set theory, is proposed. Based on the presented approach, two algorithms for knowledge reduction, including an algorithm for attribute reduction and an algorithm for attribute value reduction, are presented. Some experimental evaluations are done to test the methods on uci data sets and KDDCUP99 data sets. The experimental results illustrate that the proposed approaches are efficient to process large data sets with good recognition rate, compared with KNN, SVM, C4.5, Naive Bayes, and CART.


2011 ◽  
Vol 121-126 ◽  
pp. 1579-1584
Author(s):  
Hai Zhong Tan

The rule set which is acquired based on rough set theory can be classified into two categories: deterministic rules and probabilistic rules. Traditional attribute reduction definitions in variable precision rough set model cannot guarantee the rule properties, namely deterministic or probabilistic. In this paper, a new criterion for attribute reduction is put forward based on variable precision rough set model. The rule properties can be preserved during the process of attribute reduction. The relationships between the new reduct definition and available definitions, including Ziarko’s reduct definition and β lower distribution reduct definition are also discussed.


2013 ◽  
Vol 333-335 ◽  
pp. 693-697
Author(s):  
Yong Chao Liang ◽  
Xi Jia Zhang ◽  
Peng Zhou

With the increasing of fault information transmission capacity in power grid, the volume of information which needs to be concerned by dispatchers has greatly increased, consequently making it difficult to identify the fault signal and analyze the cause of the accident quickly for dispatchers in massive fault information. To settle this problem, this paper uses a novel approach that combines rough set theory with association rule for mining fault rules in a large number of historical fault data of power grid. Firstly, it builds distributed original decision tables according to regions. And then it uses the information entropy algorithm in condition attribute reduction. Lastly, it applies the improved Apriori algorithm of association rule to fault rules mining based on the reduction decision table. In this way the problems of redundancy of massive fault information can be solved and complexity of rules extraction can be simplified effectively. It also improves the efficiency of fault rules mining.


2019 ◽  
Vol 154 ◽  
pp. 194-198
Author(s):  
Shi Qiang Wang ◽  
Cai Yun Gao ◽  
Chang Luo ◽  
Gui Mei Zheng ◽  
Yan Nian Zhou

Author(s):  
JIYE LIANG ◽  
ZHONGZHI SHI

Rough set theory is a relatively new mathematical tool for use in computer applications in circumstances which are characterized by vagueness and uncertainty. In this paper, we introduce the concepts of information entropy, rough entropy and knowledge granulation in rough set theory, and establish the relationships among those concepts. These results will be very helpful for understanding the essence of concept approximation and establishing granular computing in rough set theory.


Sign in / Sign up

Export Citation Format

Share Document