scholarly journals Failure Analysis in University and Computer Science Contexts With Data Mining

2020 ◽  
Author(s):  
Daniela De Souza Gomes ◽  
Marcos Henrique Fonseca Ribeiro ◽  
Giovanni Ventorim Comarela ◽  
Gabriel Philippe Pereira

High failure rates are a worrying and relevant problem in Brazilian universities. From a data set of student transcripts, we performed a study case for both general and Computer Science contexts, in which Data Mining Techniques were used to find patterns concerning failures. The knowledge acquired can be used for better educational administration and also build intelligent systems to support students’ decision making.

2013 ◽  
Vol 5 (1) ◽  
pp. 66-83 ◽  
Author(s):  
Iman Rahimi ◽  
Reza Behmanesh ◽  
Rosnah Mohd. Yusuff

The objective of this article is an evaluation and assessment efficiency of the poultry meat farm as a case study with the new method. As it is clear poultry farm industry is one of the most important sub- sectors in comparison to other ones. The purpose of this study is the prediction and assessment efficiency of poultry farms as decision making units (DMUs). Although, several methods have been proposed for solving this problem, the authors strongly need a methodology to discriminate performance powerfully. Their methodology is comprised of data envelopment analysis and some data mining techniques same as artificial neural network (ANN), decision tree (DT), and cluster analysis (CA). As a case study, data for the analysis were collected from 22 poultry companies in Iran. Moreover, due to a small data set and because of the fact that the authors must use large data set for applying data mining techniques, they employed k-fold cross validation method to validate the authors’ model. After assessing efficiency for each DMU and clustering them, followed by applied model and after presenting decision rules, results in precise and accurate optimizing technique.


Author(s):  
Pragati Sharma ◽  
Dr. Sanjiv Sharma

Recently, data mining is gaining more popularity among researcher. Data mining provides various techniques and methods for analysing data produced by various applications of different domain. Similarly, Educational mining is providing a way for analyzing educational data set. Educational mining concerns with developing methods for discovering knowledge from data that come from educational field and it helps to extract the hidden patterns and to discover new knowledge from large educational databases with the use of data mining techniques and tools. Extracted knowledge from educational mining can be used for decision making in higher educational institutions. This paper is based on literature review of different data mining techniques along with certain algorithms like classification, clustering etc. This paper represents the effectiveness of mining techniques with educational data set for higher education institutions.


Author(s):  
Onur Doğan ◽  
Hakan  Aşan ◽  
Ejder Ayç

In today’s competitive world, organizations need to make the right decisions to prolong their existence. Using non-scientific methods and making emotional decisions gave way to the use of scientific methods in the decision making process in this competitive area. Within this scope, many decision support models are still being developed in order to assist the decision makers and owners of organizations. It is easy to collect massive amount of data for organizations, but generally the problem is using this data to achieve economic advances. There is a critical need for specialization and automation to transform the data into the knowledge in big data sets. Data mining techniques are capable of providing description, estimation, prediction, classification, clustering, and association. Recently, many data mining techniques have been developed in order to find hidden patterns and relations in big data sets. It is important to obtain new correlations, patterns, and trends, which are understandable and useful to the decision makers. There have been many researches and applications focusing on different data mining techniques and methodologies.In this study, we aim to obtain understandable and applicable results from a large volume of record set that belong to a firm, which is active in the meat processing industry, by using data mining techniques. In the application part, firstly, data cleaning and data integration, which are the first steps of data mining process, are performed on the data in the database. With the aid of data cleaning and data integration, the data set was obtained, which is suitable for data mining. Then, various association rule algorithms were applied to this data set. This analysis revealed that finding unexplored patterns in the set of data would be beneficial for the decision makers of the firm. Finally, many association rules are obtained, which are useful for decision makers of the local firm. 


The improvement of an information processing and Memory capacity, the vast amount of data is collected for various data analyses purposes. Data mining techniques are used to get knowledgeable information. The process of extraction of data by using data mining techniques the data get discovered publically and this leads to breaches of specific privacy data. Privacypreserving data mining is used to provide to protection of sensitive information from unwanted or unsanctioned disclosure. In this paper, we analysis the problem of discovering similarity checks for functional dependencies from a given dataset such that application of algorithm (l, d) inference with generalization can anonymised the micro data without loss in utility. [8] This work has presented Functional dependency based perturbation approach which hides sensitive information from the user, by applying (l, d) inference model on the dependency attributes based on Information Gain. This approach works on both categorical and numerical attributes. The perturbed data set does not affects the original dataset it maintains the same or very comparable patterns as the original data set. Hence the utility of the application is always high, when compared to other data mining techniques. The accuracy of the original and perturbed datasets is compared and analysed using tools, data mining classification algorithm.


Author(s):  
Robab Saadatdoost ◽  
Alex Tze Hiang Sim ◽  
Hosein Jafarkarimi ◽  
Jee Mei Hee

This project presents the patterns and relations between attributes of Iran Higher Education data gained from the use of data mining techniques to discover knowledge and use them in decision making system of IHE. Large dataset of IHE is difficult to analysis and display, since they are significant for decision making in IHE. This study utilized the famous data mining software, Weka and SOM to mine and visualize IHE data. In order to discover worthwhile patterns, we used clustering techniques and visualized the results. The selected dataset includes data of five medical university of Tehran as a small data set and Ministry of Science - Research and Technology's universities as a larger data set. Knowledge discovery and visualization are necessary for analyzing of these datasets. Our analysis reveals some knowledge in higher education aspect related to program of study, degree in each program, learning style, study mode and other IHE attributes. This study helps to IHE to discover knowledge in a visualize way; our results can be focused more by experts in higher education field to assess and evaluate more.


Author(s):  
Tyler Swanger ◽  
Kaitlyn Whitlock ◽  
Anthony Scime ◽  
Brendan P. Post

This chapter data mines the usage patterns of the ANGEL Learning Management System (LMS) at a comprehensive college. The data includes counts of all the features ANGEL offers its users for the Fall and Spring semesters of the academic years beginning in 2007 and 2008. Data mining techniques are applied to evaluate which LMS features are used most commonly and most effectively by instructors and students. Classification produces a decision tree which predicts the courses that will use the ANGEL system based on course specific attributes. The dataset undergoes association mining to discover the usage of one feature’s effect on the usage of another set of features. Finally, clustering the data identifies messages and files as the features most commonly used. These results can be used by this institution, as well as similar institutions, for decision making concerning feature selection and overall usefulness of LMS design, selection and implementation.


Author(s):  
K. Abumani ◽  
R. Nedunchezhian

Data mining techniques have been widely used for extracting non-trivial information from massive amounts of data. They help in strategic decision-making as well as many more applications. However, data mining also has a few demerits apart from its usefulness. Sensitive information contained in the database may be brought out by the data mining tools. Different approaches are being utilized to hide the sensitive information. The proposed work in this article applies a novel method to access the generating transactions with minimum effort from the transactional database. It helps in reducing the time complexity of any hiding algorithm. The theoretical and empirical analysis of the algorithm shows that hiding of data using this proposed work performs association rule hiding quicker than other algorithms.


Author(s):  
Dominique Haughton ◽  
Guangying Hua ◽  
Danny Jin ◽  
John Lin ◽  
Qizhi Wei ◽  
...  

Purpose – The purpose of this paper is to propose data mining techniques to model the return on investment from various types of promotional spending to market a drug and then use the model to draw conclusions on how the pharmaceutical industry might go about allocating promotion expenditures in a more efficient manner, potentially reducing costs to the consumer. The main contributions of the paper are two-fold. First, it demonstrates how to undertake a promotion mix optimization process in the pharmaceutical context and carry it through from the beginning to the end. Second, the paper proposes using directed acyclic graphs (DAGs) to help unravel the direct and indirect effects of various promotional media on sales volume. Design/methodology/approach – A synthetic data set was constructed to prototype proposed data mining techniques and two analyses approaches were investigated. Findings – The two methods were found to yield insights into the problem of the promotion mix in the context of the healthcare industry. First, a factor analysis followed by a regression analysis and an optimization algorithm applied to the resulting equation were used. Second, DAG was used to unravel direct and indirect effects of promotional expenditures on new prescriptions. Research limitations/implications – The data are synthetic and do not incorporate any time autocorrelations. Practical implications – The promotion mix optimization process is demonstrated from the beginning to the end, and the issue of negative coefficient in promotion mix models are addressed. In addition, a method is proposed to identify direct and indirect effects on new prescriptions. Social implications – A better allocation of promotional expenditures has the potential for reducing the cost of healthcare to consumers. Originality/value – The contributions of the paper are two-fold: for the first time in the literature (to the best of the authors’ knowledge), the authors have undertaken a promotion mix optimization process and have carried it through from the beginning to the end Second, the authors propose the use of DAGs to help unravel the effects of various promotion media on sales volume, notably direct and indirect effects.


Sign in / Sign up

Export Citation Format

Share Document