Identification of Breast Malignancy by Marker-Controlled Watershed Transformation and Hybrid Feature Set for Healthcare

Tariq Sadad; Ayyaz Hussain; Asim Munir; Muhammad Habib; Sajid Ali Khan; Shariq Hussain; Shunkun Yang; Mohammed Alawairdhi

doi:10.3390/app10061900

Identification of Breast Malignancy by Marker-Controlled Watershed Transformation and Hybrid Feature Set for Healthcare

Applied Sciences ◽

10.3390/app10061900 ◽

2020 ◽

Vol 10 (6) ◽

pp. 1900 ◽

Cited By ~ 1

Author(s):

Tariq Sadad ◽

Ayyaz Hussain ◽

Asim Munir ◽

Muhammad Habib ◽

Sajid Ali Khan ◽

...

Keyword(s):

Breast Cancer ◽

Decision Tree ◽

Nearest Neighbor ◽

Early Stage ◽

Texture Features ◽

Speckle Noise ◽

K Nearest Neighbor ◽

Tree Model ◽

Watershed Transformation ◽

Breast Malignancy

Breast cancer is a highly prevalent disease in females that may lead to mortality in severe cases. The mortality can be subsided if breast cancer is diagnosed at an early stage. The focus of this study is to detect breast malignancy through computer-aided diagnosis (CADx). In the first phase of this work, Hilbert transform is employed to reconstruct B-mode images from the raw data followed by the marker-controlled watershed transformation to segment the lesion. The methods based only on texture analysis are quite sensitive to speckle noise and other artifacts. Therefore, a hybrid feature set is developed after the extraction of shape-based and texture features from the breast lesion. Decision tree, k-nearest neighbor (KNN), and ensemble decision tree model via random under-sampling with Boost (RUSBoost) are utilized to segregate the cancerous lesions from the benign ones. The proposed technique is tested on OASBUD (Open Access Series of Breast Ultrasonic Data) and breast ultrasound (BUS) images collected at Baheya Hospital Egypt (BHE). The OASBUD dataset contains raw ultrasound data obtained from 100 patients containing 52 malignant and 48 benign lesions. The dataset collected at BHE contains 210 malignant and 437 benign images. The proposed system achieved promising accuracy of 97% with confidence interval (CI) of 91.48% to 99.38% for OASBUD and 96.6% accuracy with CI of 94.90% to 97.86% for the BHE dataset using ensemble method.

Download Full-text

Novel Mathematical Model of Breast Cancer Diagnostics Using an Associative Pattern Classification

Diagnostics ◽

10.3390/diagnostics10030136 ◽

2020 ◽

Vol 10 (3) ◽

pp. 136 ◽

Cited By ~ 2

Author(s):

Raúl Santiago-Montero ◽

Humberto Sossa ◽

David A. Gutiérrez-Hernández ◽

Víctor Zamudio ◽

Ignacio Hernández-Bautista ◽

...

Keyword(s):

Breast Cancer ◽

Nearest Neighbor ◽

Early Stage ◽

Back Propagation ◽

Cancer Diagnostics ◽

Support Vector ◽

K Nearest Neighbor ◽

Breast Cancer Death ◽

Positron Emission ◽

The Government

Breast cancer is a disease that has emerged as the second leading cause of cancer deaths in women worldwide. The annual mortality rate is estimated to continue growing. Cancer detection at an early stage could significantly reduce breast cancer death rates long-term. Many investigators have studied different breast diagnostic approaches, such as mammography, magnetic resonance imaging, ultrasound, computerized tomography, positron emission tomography and biopsy. However, these techniques have limitations, such as being expensive, time consuming and not suitable for women of all ages. Proposing techniques that support the effective medical diagnosis of this disease has undoubtedly become a priority for the government, for health institutions and for civil society in general. In this paper, an associative pattern classifier (APC) was used for the diagnosis of breast cancer. The rate of efficiency obtained on the Wisconsin breast cancer database was 97.31%. The APC’s performance was compared with the performance of a support vector machine (SVM) model, back-propagation neural networks, C4.5, naive Bayes, k-nearest neighbor (k-NN) and minimum distance classifiers. According to our results, the APC performed best. The algorithm of the APC was written and executed in a JAVA platform, as well as the experimental and comparativeness between algorithms.

Download Full-text

Early Stage Breast Cancer Detection System using GLCM feature extraction and K-Nearest Neighbor (k-NN) on Mammography image

2018 18th International Symposium on Communications and Information Technologies (ISCIT) ◽

10.1109/iscit.2018.8587920 ◽

2018 ◽

Cited By ~ 5

Author(s):

Than Than Htay ◽

Su Su Maung

Keyword(s):

Breast Cancer ◽

Feature Extraction ◽

Cancer Detection ◽

Nearest Neighbor ◽

Detection System ◽

Early Stage ◽

Breast Cancer Detection ◽

Early Stage Breast Cancer ◽

K Nearest Neighbor ◽

Mammography Image

Download Full-text

Analisis Perbandingan Kinerja Algoritma Naïve Bayes, Decision Tree-J48 dan Lazy-IBK

JURNAL MEDIA INFORMATIKA BUDIDARMA ◽

10.30865/mib.v5i3.3055 ◽

2021 ◽

Vol 5 (3) ◽

pp. 1038

Author(s):

Indra Rukmana ◽

Arvin Rasheda ◽

Faiz Fathulhuda ◽

Muh Rizky Cahyadi ◽

Fitriyani Fitriyani

Keyword(s):

Breast Cancer ◽

Decision Tree ◽

Thoracic Surgery ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Breast Cancer Dataset ◽

Decision Tree Algorithm ◽

K Nearest Neighbor ◽

Cancer Dataset

This research is focused on knowing the performance of the classification algorithms, namely Naïve Bayes, Decision Tree-J48 and K-Nearest Neighbor. The speed and the percentage of accuracy in this study are the benchmarks for the performance of the algorithm. This study uses the Breast Cancer and Thoracic Surgery dataset, which is downloaded on the UCI Machine Learning Repository website. Using the help of Weka software Version 3.8.5 to find out the classification algorithm testing. The results show that the J-48 Decision Tree algorithm has the best accuracy, namely 75.6% in the cross-validation test mode for the Breast Cancer dataset and 84.5% for the Thoracic Surgery dataset.

Download Full-text

Analysis of Indian Food Based on Machine learning Classification Models

Journal of Scientific Research and Reports ◽

10.9734/jsrr/2021/v27i730407 ◽

2021 ◽

pp. 1-7

Author(s):

Sasmita Kumari Nayak ◽

Mamata Beura ◽

Mohammed Siddique ◽

Siba Prasad Mishra

Keyword(s):

Random Forest ◽

Decision Tree ◽

Nearest Neighbor ◽

Human Life ◽

Support Vector ◽

Classification Models ◽

K Nearest Neighbor ◽

Tree Model ◽

Machine Learning Classification ◽

Indian Food

For human life, Food is highly necessary and essential for human to live the life. The objective of the current study is to characterise, classify and compare the food consumption patterns of many Indian food diets such as non-vegetarian and vegetarian. Given data about different Indian dishes, we try to predict here the dish is vegetarian or not. To get the best predictive model, this study is conducted with the comparison of Decision Tree, K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Random Forest algorithms. In this study, the concept and implementation of all these four models be made for prediction of Indian food. For training and testing the models, Indian food dataset is used that contains, in total 255 records to fit with all these four models. In short, the classification and prediction of Decision tree and KNN model provides less performance than the other models used here. However, the Random Forest model was generally more accurate than SVM, KNN and Decision Tree model, which have got from the simulation.

Download Full-text

Analysis of Decision Tree and K-Nearest Neighbor Algorithm in the Classification of Breast Cancer

Asian Pacific Journal of Cancer Prevention ◽

10.31557/apjcp.2019.20.12.3777 ◽

2019 ◽

Vol 20 (12) ◽

pp. 3777-3781 ◽

Cited By ~ 2

Author(s):

Harikumar Rajaguru ◽

Sannasi Chakravarthy S R

Keyword(s):

Breast Cancer ◽

Decision Tree ◽

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm

Download Full-text

Supervised Classifier Approach for Intrusion Detection on KDD with Optimal MapReduce Framework Model in Cloud Computing

Recent Patents on Computer Science ◽

10.2174/1573401315666190619113510 ◽

2019 ◽

Vol 12 ◽

Author(s):

M. Ilayaraja ◽

S. Hemalatha ◽

P. Manickam ◽

K. Sathesh Kumar ◽

K. Shankar

Keyword(s):

Machine Learning ◽

Cloud Computing ◽

Intrusion Detection ◽

Decision Tree ◽

Learning Strategies ◽

Nearest Neighbor ◽

Detection System ◽

K Nearest Neighbor ◽

Mapreduce Model ◽

The Web

Cloud computing is characterized as the arrangement of assets or administrations accessible through the web to the clients on their request by cloud providers. It communicates everything as administrations over the web in view of the client request, for example operating system, organize equipment, storage, assets, and software. Nowadays, Intrusion Detection System (IDS) plays a powerful system, which deals with the influence of experts to get actions when the system is hacked under some intrusions. Most intrusion detection frameworks are created in light of machine learning strategies. Since the datasets, this utilized as a part of intrusion detection is Knowledge Discovery in Database (KDD). In this paper detect or classify the intruded data utilizing Machine Learning (ML) with the MapReduce model. The primary face considers Hadoop MapReduce model to reduce the extent of database ideal weight decided for reducer model and second stage utilizing Decision Tree (DT) classifier to detect the data. This DT classifier comprises utilizing an appropriate classifier to decide the class labels for the non-homogeneous leaf nodes. The decision tree fragment gives a coarse section profile while the leaf level classifier can give data about the qualities that influence the label inside a portion. From the proposed result accuracy for detection is 96.21% contrasted with existing classifiers, for example, Neural Network (NN), Naive Bayes (NB) and K Nearest Neighbor (KNN).

Download Full-text

Performance Comparison of Mushroom Types Classification Using K-Nearest Neighbor Method and Decision Tree Method

2020 3rd International Conference on Information and Communications Technology (ICOIACT) ◽

10.1109/icoiact50329.2020.9332148 ◽

2020 ◽

Author(s):

Nadya Chitayae ◽

Andi Sunyoto

Keyword(s):

Decision Tree ◽

Nearest Neighbor ◽

Performance Comparison ◽

K Nearest Neighbor ◽

Decision Tree Method ◽

Tree Method

Download Full-text

Reanalysis and External Validation of a Decision Tree Model for Detecting Unrecognized Diabetes in Rural Chinese Individuals

International Journal of Endocrinology ◽

10.1155/2017/3894870 ◽

2017 ◽

Vol 2017 ◽

pp. 1-6 ◽

Cited By ~ 2

Author(s):

Zhong Xin ◽

Lin Hua ◽

Xu-Hong Wang ◽

Dong Zhao ◽

Cai-Guo Yu ◽

...

Keyword(s):

Decision Tree ◽

Predictive Value ◽

Early Stage ◽

Current Model ◽

External Validation ◽

Area Under The Curve ◽

Decision Tree Model ◽

Tree Model ◽

Chinese Adult ◽

Significant Difference

We reanalyzed previous data to develop a more simplified decision tree model as a screening tool for unrecognized diabetes, using basic information in Beijing community health records. Then, the model was validated in another rural town. Only three non-laboratory-based risk factors (age, BMI, and presence of hypertension) with fewer branches were used in the new model. The sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve (AUC) for detecting diabetes were calculated. The AUC values in internal and external validation groups were 0.708 and 0.629, respectively. Subjects with high risk of diabetes had significantly higher HOMA-IR, but no significant difference in HOMA-B was observed. This simple tool will help general practitioners and residents assess the risk of diabetes quickly and easily. This study also validates the strong associations of insulin resistance and early stage of diabetes, suggesting that more attention should be paid to the current model in rural Chinese adult populations.

Download Full-text

Skin Cancer Detection and Classification using KNN Technique

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35299 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 3520-3527

Author(s):

Apeksha R Swamy

Keyword(s):

Skin Cancer ◽

Cancer Detection ◽

Nearest Neighbor ◽

Early Stage ◽

The Body ◽

K Nearest Neighbor ◽

Fast Evaluation ◽

Efficient Treatment ◽

Medical Computer ◽

Skin Cancer Detection

Skin cancer is a major health issue worldwide. Skin cancer detection at an early stage is key for an efficient treatment. Lately, it is popular that, deadly form of skin cancer among the other types of skin cancer is melanoma because it's much more likely to spread to other parts of the body if not identified and treated early. The advanced medical computer vision or medical image processing take part in increasingly significant role in clinical detection of different diseases. Such method provides an automatic image analysis device for an accurate and fast evaluation of the sore. The steps involved in this project are collecting skin cancer images from PH2 database, preprocessing, segmentation using thresholding, feature extraction and then classification using K-Nearest Neighbor technique (KNN). The results show that the achieved classification accuracy is 92.7%, Sensitivity 100% and 84.44% Specificity.

Download Full-text

Performance of Naïve Bayes, C4.5 and KNN using Breast Cancer, Iris and Hypothyroid Datasets

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8795.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 2193-2197

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Specific Pattern ◽

K Nearest Neighbor ◽

Data Mining Technique ◽

Digital Format ◽

Tree Classifier

Data mining usually specifies the discovery of specific pattern or analysis of data from a large dataset. Classification is one of an efficient data mining technique, in which class the data are classified are already predefined using the existing datasets. The classification of medical records in terms of its symptoms using computerized method and storing the predicted information in the digital format is of great importance in the diagnosis of various diseases in the medical field. In this paper, finding the algorithm with highest accuracy range is concentrated so that a cost-effective algorithm can be found. Here the data mining classification algorithms are compared with their accuracy of finding exact data according to the diagnosis report and their execution rate to identify how fast the records are classified. The classification technique based algorithms used in this study are the Naive Bayes Classifier, the C4.5 tree classifier and the K-Nearest Neighbor (KNN) to predict which algorithm is the best suited for classifying any kind of medical dataset. Here the datasets such as Breast Cancer, Iris and Hypothyroid are used to predict which of the three algorithms is suitable for classifying the datasets with highest accuracy of finding the records of patients with the particular health problems. The experimental results represented in the form of table and graph shows the performance and the importance of Naïve Bayes, C4.5 and K-Nearest Neighbor algorithms. From the performance outcome of the three algorithms the C4.5 algorithm is a lot better than the Naïve Bayes and the K-Nearest Neighbor algorithm.

Download Full-text