scholarly journals Identification of Breast Malignancy by Marker-Controlled Watershed Transformation and Hybrid Feature Set for Healthcare

2020 ◽  
Vol 10 (6) ◽  
pp. 1900 ◽  
Author(s):  
Tariq Sadad ◽  
Ayyaz Hussain ◽  
Asim Munir ◽  
Muhammad Habib ◽  
Sajid Ali Khan ◽  
...  

Breast cancer is a highly prevalent disease in females that may lead to mortality in severe cases. The mortality can be subsided if breast cancer is diagnosed at an early stage. The focus of this study is to detect breast malignancy through computer-aided diagnosis (CADx). In the first phase of this work, Hilbert transform is employed to reconstruct B-mode images from the raw data followed by the marker-controlled watershed transformation to segment the lesion. The methods based only on texture analysis are quite sensitive to speckle noise and other artifacts. Therefore, a hybrid feature set is developed after the extraction of shape-based and texture features from the breast lesion. Decision tree, k-nearest neighbor (KNN), and ensemble decision tree model via random under-sampling with Boost (RUSBoost) are utilized to segregate the cancerous lesions from the benign ones. The proposed technique is tested on OASBUD (Open Access Series of Breast Ultrasonic Data) and breast ultrasound (BUS) images collected at Baheya Hospital Egypt (BHE). The OASBUD dataset contains raw ultrasound data obtained from 100 patients containing 52 malignant and 48 benign lesions. The dataset collected at BHE contains 210 malignant and 437 benign images. The proposed system achieved promising accuracy of 97% with confidence interval (CI) of 91.48% to 99.38% for OASBUD and 96.6% accuracy with CI of 94.90% to 97.86% for the BHE dataset using ensemble method.

Diagnostics ◽  
2020 ◽  
Vol 10 (3) ◽  
pp. 136 ◽  
Author(s):  
Raúl Santiago-Montero ◽  
Humberto Sossa ◽  
David A. Gutiérrez-Hernández ◽  
Víctor Zamudio ◽  
Ignacio Hernández-Bautista ◽  
...  

Breast cancer is a disease that has emerged as the second leading cause of cancer deaths in women worldwide. The annual mortality rate is estimated to continue growing. Cancer detection at an early stage could significantly reduce breast cancer death rates long-term. Many investigators have studied different breast diagnostic approaches, such as mammography, magnetic resonance imaging, ultrasound, computerized tomography, positron emission tomography and biopsy. However, these techniques have limitations, such as being expensive, time consuming and not suitable for women of all ages. Proposing techniques that support the effective medical diagnosis of this disease has undoubtedly become a priority for the government, for health institutions and for civil society in general. In this paper, an associative pattern classifier (APC) was used for the diagnosis of breast cancer. The rate of efficiency obtained on the Wisconsin breast cancer database was 97.31%. The APC’s performance was compared with the performance of a support vector machine (SVM) model, back-propagation neural networks, C4.5, naive Bayes, k-nearest neighbor (k-NN) and minimum distance classifiers. According to our results, the APC performed best. The algorithm of the APC was written and executed in a JAVA platform, as well as the experimental and comparativeness between algorithms.


2021 ◽  
Vol 5 (3) ◽  
pp. 1038
Author(s):  
Indra Rukmana ◽  
Arvin Rasheda ◽  
Faiz Fathulhuda ◽  
Muh Rizky Cahyadi ◽  
Fitriyani Fitriyani

This research is focused on knowing the performance of the classification algorithms, namely Naïve Bayes, Decision Tree-J48 and K-Nearest Neighbor. The speed and the percentage of accuracy in this study are the benchmarks for the performance of the algorithm. This study uses the Breast Cancer and Thoracic Surgery dataset, which is downloaded on the UCI Machine Learning Repository website. Using the help of Weka software Version 3.8.5 to find out the classification algorithm testing. The results show that the J-48 Decision Tree algorithm has the best accuracy, namely 75.6% in the cross-validation test mode for the Breast Cancer dataset and 84.5% for the Thoracic Surgery dataset.


Author(s):  
Sasmita Kumari Nayak ◽  
Mamata Beura ◽  
Mohammed Siddique ◽  
Siba Prasad Mishra

For human life, Food is highly necessary and essential for human to live the life. The objective of the current study is to characterise, classify and compare the food consumption patterns of many Indian food diets such as non-vegetarian and vegetarian. Given data about different Indian dishes, we try to predict here the dish is vegetarian or not. To get the best predictive model, this study is conducted with the comparison of Decision Tree, K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Random Forest algorithms. In this study, the concept and implementation of all these four models be made for prediction of Indian food. For training and testing the models, Indian food dataset is used that contains, in total 255 records to fit with all these four models. In short, the classification and prediction of Decision tree and KNN model provides less performance than the other models used here. However, the Random Forest model was generally more accurate than SVM, KNN and Decision Tree model, which have got from the simulation. 


Author(s):  
M. Ilayaraja ◽  
S. Hemalatha ◽  
P. Manickam ◽  
K. Sathesh Kumar ◽  
K. Shankar

Cloud computing is characterized as the arrangement of assets or administrations accessible through the web to the clients on their request by cloud providers. It communicates everything as administrations over the web in view of the client request, for example operating system, organize equipment, storage, assets, and software. Nowadays, Intrusion Detection System (IDS) plays a powerful system, which deals with the influence of experts to get actions when the system is hacked under some intrusions. Most intrusion detection frameworks are created in light of machine learning strategies. Since the datasets, this utilized as a part of intrusion detection is Knowledge Discovery in Database (KDD). In this paper detect or classify the intruded data utilizing Machine Learning (ML) with the MapReduce model. The primary face considers Hadoop MapReduce model to reduce the extent of database ideal weight decided for reducer model and second stage utilizing Decision Tree (DT) classifier to detect the data. This DT classifier comprises utilizing an appropriate classifier to decide the class labels for the non-homogeneous leaf nodes. The decision tree fragment gives a coarse section profile while the leaf level classifier can give data about the qualities that influence the label inside a portion. From the proposed result accuracy for detection is 96.21% contrasted with existing classifiers, for example, Neural Network (NN), Naive Bayes (NB) and K Nearest Neighbor (KNN).


2017 ◽  
Vol 2017 ◽  
pp. 1-6 ◽  
Author(s):  
Zhong Xin ◽  
Lin Hua ◽  
Xu-Hong Wang ◽  
Dong Zhao ◽  
Cai-Guo Yu ◽  
...  

We reanalyzed previous data to develop a more simplified decision tree model as a screening tool for unrecognized diabetes, using basic information in Beijing community health records. Then, the model was validated in another rural town. Only three non-laboratory-based risk factors (age, BMI, and presence of hypertension) with fewer branches were used in the new model. The sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve (AUC) for detecting diabetes were calculated. The AUC values in internal and external validation groups were 0.708 and 0.629, respectively. Subjects with high risk of diabetes had significantly higher HOMA-IR, but no significant difference in HOMA-B was observed. This simple tool will help general practitioners and residents assess the risk of diabetes quickly and easily. This study also validates the strong associations of insulin resistance and early stage of diabetes, suggesting that more attention should be paid to the current model in rural Chinese adult populations.


Author(s):  
Apeksha R Swamy

Skin cancer is a major health issue worldwide. Skin cancer detection at an early stage is key for an efficient treatment. Lately, it is popular that, deadly form of skin cancer among the other types of skin cancer is melanoma because it's much more likely to spread to other parts of the body if not identified and treated early. The advanced medical computer vision or medical image processing take part in increasingly significant role in clinical detection of different diseases. Such method provides an automatic image analysis device for an accurate and fast evaluation of the sore. The steps involved in this project are collecting skin cancer images from PH2 database, preprocessing, segmentation using thresholding, feature extraction and then classification using K-Nearest Neighbor technique (KNN). The results show that the achieved classification accuracy is 92.7%, Sensitivity 100% and 84.44% Specificity.


Data mining usually specifies the discovery of specific pattern or analysis of data from a large dataset. Classification is one of an efficient data mining technique, in which class the data are classified are already predefined using the existing datasets. The classification of medical records in terms of its symptoms using computerized method and storing the predicted information in the digital format is of great importance in the diagnosis of various diseases in the medical field. In this paper, finding the algorithm with highest accuracy range is concentrated so that a cost-effective algorithm can be found. Here the data mining classification algorithms are compared with their accuracy of finding exact data according to the diagnosis report and their execution rate to identify how fast the records are classified. The classification technique based algorithms used in this study are the Naive Bayes Classifier, the C4.5 tree classifier and the K-Nearest Neighbor (KNN) to predict which algorithm is the best suited for classifying any kind of medical dataset. Here the datasets such as Breast Cancer, Iris and Hypothyroid are used to predict which of the three algorithms is suitable for classifying the datasets with highest accuracy of finding the records of patients with the particular health problems. The experimental results represented in the form of table and graph shows the performance and the importance of Naïve Bayes, C4.5 and K-Nearest Neighbor algorithms. From the performance outcome of the three algorithms the C4.5 algorithm is a lot better than the Naïve Bayes and the K-Nearest Neighbor algorithm.


Sign in / Sign up

Export Citation Format

Share Document