A Combination Method of the Tanimoto Coefficient and Proximity Measure of Random Forest for Compound Activity Prediction

Imbalanced data can cause issues at the problem-definition level, the algorithm level, and the data level. Several methods have been developed to overcome these issues; one state-of-the-art method is Easy Ensemble. Easy Ensemble is claimed to improve a model's ability to classify the minority class and to overcome the deficiencies of random under-sampling. In this paper we discuss the implementation of Easy Ensemble with Random Forest classifiers to handle the class-imbalance problem in credit scoring. The combined method is applied to two datasets taken from data science competition websites, finhacks.id and kaggle.com, with majority-to-minority class proportions of 70:30 and 94:6, respectively. The results show that resampling with Easy Ensemble can improve the Random Forest classifier's performance on the minority class. Recall on the minority class increased significantly after resampling. For the first dataset (finhacks.id), minority-class recall was 0.49 before resampling and rose to 0.82 afterwards. Similar results were obtained for the second dataset (kaggle.com), where minority-class recall increased from just 0.14 to 0.73.
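The resampling idea behind Easy Ensemble can be illustrated with a minimal sketch. It covers only the under-sampling step (each subset keeps every minority sample plus an equally sized random draw from the majority) and the minority-class recall metric quoted in the abstract. It is not the authors' implementation: the function names `balanced_subsets` and `minority_recall`, the seed, and the toy labels are assumptions made for illustration, and the classifier training (one model per subset, with predictions combined afterwards) is omitted.

```python
import random
from typing import List, Sequence


def balanced_subsets(labels: Sequence[int], n_subsets: int,
                     minority_label: int = 1, seed: int = 0) -> List[List[int]]:
    """Return n_subsets index lists, each holding every minority sample
    plus an equally sized random draw (without replacement) from the majority."""
    rng = random.Random(seed)  # hypothetical fixed seed, for reproducibility only
    minority = [i for i, y in enumerate(labels) if y == minority_label]
    majority = [i for i, y in enumerate(labels) if y != minority_label]
    if not minority or len(majority) < len(minority):
        raise ValueError("expected a non-empty minority class no larger than the majority")
    subsets = []
    for _ in range(n_subsets):
        sampled = rng.sample(majority, len(minority))
        subsets.append(sorted(minority + sampled))
    return subsets


def minority_recall(y_true: Sequence[int], y_pred: Sequence[int],
                    minority_label: int = 1) -> float:
    """Recall on the minority class: true positives / all actual minority samples."""
    preds_for_minority = [p for t, p in zip(y_true, y_pred) if t == minority_label]
    if not preds_for_minority:
        raise ValueError("no minority samples in y_true")
    hits = sum(1 for p in preds_for_minority if p == minority_label)
    return hits / len(preds_for_minority)


# Toy labels with a 94:6 class ratio, mirroring the second dataset's proportion.
labels = [0] * 94 + [1] * 6
subsets = balanced_subsets(labels, n_subsets=5)
```

In a full pipeline, each index list in `subsets` would select the training rows for one classifier (a Random Forest in the paper's setting), so that every model sees a balanced sample while the ensemble as a whole still covers much of the majority class.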

An Improved Combination Method of Real-time Multi-system Orbit and Clock Corrections

Proceedings of the 31st International Technical Meeting of The Satellite Division of the Institute of Navigation (ION GNSS+ 2018) ◽

10.33012/2018.16037 ◽

2018 ◽

Author(s):

Gong Xiaopeng ◽

Gu Shengfeng ◽

Yang Xinhao ◽

Zheng Fu ◽

Lou Yidong

Keyword(s):

Real Time ◽

Combination Method

Implementation of data mining as a support of business application strategy

Journal of Applied Information, Communication and Technology ◽

10.33555/ejaict.v5i1.49 ◽

2018 ◽

Vol 5 (1) ◽

pp. 47-55

Author(s):

Florensia Unggul Damayanti

Keyword(s):

Data Mining ◽

Random Forest ◽

Business Strategy ◽

Input Parameter ◽

Data Mining Algorithm ◽

Complex Data ◽

Business Decision ◽

Marketing Department ◽

Business Application ◽

Complex Data Sets

Data mining helps industries make intelligent decisions on complex problems. Data mining algorithms can be applied to data in order to forecast, identify patterns, make rules and recommendations, analyze sequences in complex data sets, and retrieve fresh insights. Advances in technology and the growing variety of available data mining techniques give industries the opportunity to explore their data, gain valuable information from it, and use that information to support business decision making. This paper implements classification-based data mining to retrieve knowledge from customer databases in order to support the marketing department in planning a strategy for predicting plan premium. The dataset is decomposed through conceptual analysis to identify data characteristics that can be used as input parameters of the data mining model. Business decisions and applications are characterized by processing step, processing characteristic, and processing outcome (Seng, J.L., Chen T.C. 2010). This paper sets up data mining experiments based on the J48 and Random Forest classifiers and sheds light on the performance of J48 versus Random Forest on a dataset from the insurance industry. The experimental results cover the classification accuracy and efficiency of J48 and Random Forest, and also identify the attributes most useful for predicting plan premium in the context of strategic planning to support business strategy.

Database-Driven Modeling based on Variable Selection using Random Forest and Its Application for Linear Air Fuel Ratio Sensor Output Prediction

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.139.850 ◽

2019 ◽

Vol 139 (8) ◽

pp. 850-857

Author(s):

Hiromu Imaji ◽

Takuya Kinoshita ◽

Toru Yamamoto ◽

Keisuke Ito ◽

Masahiro Yoshida ◽

...

Keyword(s):

Random Forest ◽

Variable Selection ◽

Sensor Output ◽

Fuel Ratio

Sentinel Node Navigation Surgery with Combination Method with Dye and Radioisotopes for Malignant Melanoma

Nishi Nihon Hifuka ◽

10.2336/nishinihonhifu.68.274 ◽

2006 ◽

Vol 68 (3) ◽

pp. 274-279 ◽

Cited By ~ 1

Author(s):

Akira TAKAHASHI ◽

Naoya YAMAZAKI ◽

Akifumi YAMAMOTO ◽

Kouji YOSHINO ◽

Kenjiro NAMIKAWA ◽

...

Keyword(s):

Malignant Melanoma ◽

Sentinel Node ◽

Combination Method ◽

Navigation Surgery ◽

Sentinel Node Navigation Surgery

Multiple fault diagnosis for hydraulic systems using Nearest-centroid-with-DBA and Random-Forest-based-time-series-classification

2020 39th Chinese Control Conference (CCC) ◽

10.23919/ccc50068.2020.9189401 ◽

2020 ◽

Author(s):

Zhijie Peng ◽

Ke Zhang ◽

Yi Chai

Keyword(s):

Time Series ◽

Fault Diagnosis ◽

Random Forest ◽

Time Series Classification ◽

Hydraulic Systems ◽

Multiple Fault ◽

Multiple Fault Diagnosis

Research on Prediction Method of Finish Rolling Power Consumption of Multi-Specific Strip Steel Based on Random Forest Optimization Model

2020 39th Chinese Control Conference (CCC) ◽

10.23919/ccc50068.2020.9188937 ◽

2020 ◽

Author(s):

XIAO Xiong ◽

DENG Daoming ◽

XIAO Yuxiong ◽

GUO Qiang ◽

ZHANG Yongjun

Keyword(s):

Random Forest ◽

Power Consumption ◽

Optimization Model ◽

Prediction Method ◽

Strip Steel ◽

Finish Rolling

Random Forest: A Review

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse/v7i1/01113 ◽

2017 ◽

Vol 7 (1) ◽

pp. 251-257 ◽

Cited By ~ 28

Author(s):

Eesha Goel ◽

Er. Abhilasha

Keyword(s):

Random Forest
