Automatic Detection and Staging of Lung Tumors using Locational Features and Double-Staged Classifications

May Phu Paing; Kazuhiko Hamamoto; Supan Tungjitkusolmun; Chuchart Pintavirooj

doi:10.3390/app9112329

Automatic Detection and Staging of Lung Tumors using Locational Features and Double-Staged Classifications

Applied Sciences ◽

10.3390/app9112329 ◽

2019 ◽

Vol 9 (11) ◽

pp. 2329 ◽

Cited By ~ 3

Author(s):

May Phu Paing ◽

Kazuhiko Hamamoto ◽

Supan Tungjitkusolmun ◽

Chuchart Pintavirooj

Keyword(s):

Lung Cancer ◽

Nearest Neighbor ◽

Treatment Options ◽

Back Propagation ◽

Back Propagation Neural Network ◽

Classification Performance ◽

Clinical Staging ◽

Support Vector ◽

K Nearest Neighbor ◽

Experimental Findings

Lung cancer is a life-threatening disease with the highest morbidity and mortality rates of any cancer worldwide. Clinical staging of lung cancer can significantly reduce the mortality rate, because effective treatment options strongly depend on the specific stage of cancer. Unfortunately, manual staging remains a challenge due to the intensive effort required. This paper presents a computer-aided diagnosis (CAD) method for detecting and staging lung cancer from computed tomography (CT) images. This CAD works in three fundamental phases: segmentation, detection, and staging. In the first phase, lung anatomical structures from the input tomography scans are segmented using gray-level thresholding. In the second, the tumor nodules inside the lungs are detected using some extracted features from the segmented tumor candidates. In the last phase, the clinical stages of the detected tumors are defined by extracting locational features. For accurate and robust predictions, our CAD applies a double-staged classification: the first is for the detection of tumors and the second is for staging. In both classification stages, five alternative classifiers, namely the Decision Tree (DT), K-nearest neighbor (KNN), Support Vector Machine (SVM), Ensemble Tree (ET), and Back Propagation Neural Network (BPNN), are applied and compared to ensure high classification performance. The average accuracy levels of 92.8% for detection and 90.6% for staging are achieved using BPNN. Experimental findings reveal that the proposed CAD method provides preferable results compared to previous methods; thus, it is applicable as a clinical diagnostic tool for lung cancer.

Download Full-text

Rapid and Nondestructive On-Site Classification Method for Consumer-Grade Plastics Based on Portable NIR Spectrometer and Machine Learning

Journal of Spectroscopy ◽

10.1155/2020/6631234 ◽

2020 ◽

Vol 2020 ◽

pp. 1-8

Author(s):

Yinglin Yang ◽

Xin Zhang ◽

Jianwei Yin ◽

Xiangyang Yu

Keyword(s):

Near Infrared ◽

Nearest Neighbor ◽

Diffuse Reflectance Spectrum ◽

Back Propagation ◽

Principal Component ◽

Back Propagation Neural Network ◽

Support Vector ◽

Site Classification ◽

Classification Models ◽

K Nearest Neighbor

The classification of plastic waste before recycling is of great significance to achieve effective recycling. In order to achieve rapid, nondestructive, and on-site detection, a portable near-infrared spectrometer was used in this study to obtain the diffuse reflectance spectrum for both standard and commercial plastics made by ABS, PC, PE, PET, PP, PS, and PVC. After applying a series of pretreatments, the principal component analysis (PCA) was used to analyze the cluster trend. K-nearest neighbor (KNN), support vector machine (SVM), and back propagation neural network (BPNN) classification models were developed and evaluated, respectively. The result showed that different plastics could be well separated in top three principal components space after pretreatment, and the classification models performed excellent classification results and high generalization capability. This study indicated that the portable NIR spectrometer, integrated with chemometrics, could achieve excellent performance and has great potential in the field of commercial plastic identification.

Download Full-text

A New Approach to Fall Detection Based on Improved Dual Parallel Channels Convolutional Neural Network

Sensors ◽

10.3390/s19122814 ◽

2019 ◽

Vol 19 (12) ◽

pp. 2814 ◽

Cited By ~ 2

Author(s):

Xiaoguang Liu ◽

Huanliang Li ◽

Cunguang Lou ◽

Tie Liang ◽

Xiuling Liu ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Nearest Neighbor ◽

Fall Detection ◽

Classification Performance ◽

Daily Activities ◽

Support Vector ◽

K Nearest Neighbor ◽

Linear Discriminant ◽

Parallel Channels

Falls are the major cause of fatal and non-fatal injury among people aged more than 65 years. Due to the grave consequences of the occurrence of falls, it is necessary to conduct thorough research on falls. This paper presents a method for the study of fall detection using surface electromyography (sEMG) based on an improved dual parallel channels convolutional neural network (IDPC-CNN). The proposed IDPC-CNN model is designed to identify falls from daily activities using the spectral features of sEMG. Firstly, the classification accuracy of time domain features and spectrograms are compared using linear discriminant analysis (LDA), k-nearest neighbor (KNN) and support vector machine (SVM). Results show that spectrograms provide a richer way to extract pattern information and better classification performance. Therefore, the spectrogram features of sEMG are selected as the input of IDPC-CNN to distinguish between daily activities and falls. Finally, The IDPC-CNN is compared with SVM and three different structure CNNs under the same conditions. Experimental results show that the proposed IDPC-CNN achieves 92.55% accuracy, 95.71% sensitivity and 91.7% specificity. Overall, The IDPC-CNN is more effective than the comparison in accuracy, efficiency, training and generalization.

Download Full-text

Comparative analysis of breast cancer detection in mammograms and thermograms

Biomedical Engineering / Biomedizinische Technik ◽

10.1515/bmt-2014-0047 ◽

2015 ◽

Vol 60 (1) ◽

Cited By ~ 7

Author(s):

Marina Milosevic ◽

Dragan Jankovic ◽

Aleksandar Peulic

Keyword(s):

Nearest Neighbor ◽

Region Of Interest ◽

Texture Features ◽

Classification Performance ◽

Support Vector ◽

K Nearest Neighbor ◽

Characteristic Analysis ◽

Analysis Society ◽

Fold Cross Validation ◽

Neighbor Classifier

AbstractIn this paper, we present a system based on feature extraction techniques for detecting abnormal patterns in digital mammograms and thermograms. A comparative study of texture-analysis methods is performed for three image groups: mammograms from the Mammographic Image Analysis Society mammographic database; digital mammograms from the local database; and thermography images of the breast. Also, we present a procedure for the automatic separation of the breast region from the mammograms. Computed features based on gray-level co-occurrence matrices are used to evaluate the effectiveness of textural information possessed by mass regions. A total of 20 texture features are extracted from the region of interest. The ability of feature set in differentiating abnormal from normal tissue is investigated using a support vector machine classifier, Naive Bayes classifier and K-Nearest Neighbor classifier. To evaluate the classification performance, five-fold cross-validation method and receiver operating characteristic analysis was performed.

Download Full-text

Novel Mathematical Model of Breast Cancer Diagnostics Using an Associative Pattern Classification

Diagnostics ◽

10.3390/diagnostics10030136 ◽

2020 ◽

Vol 10 (3) ◽

pp. 136 ◽

Cited By ~ 2

Author(s):

Raúl Santiago-Montero ◽

Humberto Sossa ◽

David A. Gutiérrez-Hernández ◽

Víctor Zamudio ◽

Ignacio Hernández-Bautista ◽

...

Keyword(s):

Breast Cancer ◽

Nearest Neighbor ◽

Early Stage ◽

Back Propagation ◽

Cancer Diagnostics ◽

Support Vector ◽

K Nearest Neighbor ◽

Breast Cancer Death ◽

Positron Emission ◽

The Government

Breast cancer is a disease that has emerged as the second leading cause of cancer deaths in women worldwide. The annual mortality rate is estimated to continue growing. Cancer detection at an early stage could significantly reduce breast cancer death rates long-term. Many investigators have studied different breast diagnostic approaches, such as mammography, magnetic resonance imaging, ultrasound, computerized tomography, positron emission tomography and biopsy. However, these techniques have limitations, such as being expensive, time consuming and not suitable for women of all ages. Proposing techniques that support the effective medical diagnosis of this disease has undoubtedly become a priority for the government, for health institutions and for civil society in general. In this paper, an associative pattern classifier (APC) was used for the diagnosis of breast cancer. The rate of efficiency obtained on the Wisconsin breast cancer database was 97.31%. The APC’s performance was compared with the performance of a support vector machine (SVM) model, back-propagation neural networks, C4.5, naive Bayes, k-nearest neighbor (k-NN) and minimum distance classifiers. According to our results, the APC performed best. The algorithm of the APC was written and executed in a JAVA platform, as well as the experimental and comparativeness between algorithms.

Download Full-text

Performance Research on Medical Data Classification using Traditional and Soft Computing Techniques

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1185.0782s319 ◽

2019 ◽

Vol 8 (2S3) ◽

pp. 990-995

Keyword(s):

Data Mining ◽

Soft Computing ◽

Nearest Neighbor ◽

Classification Performance ◽

Medical Data ◽

Support Vector ◽

Classification Algorithms ◽

K Nearest Neighbor ◽

Classification Techniques ◽

Soft Computing Techniques

The world today has made giant leaps in the field of Medicine. There is tremendous amount of researches being carried out in this field leading to new discoveries that is making a heavy impact on the mankind. Data being generated in this field is increasing enormously. A need has arisen to analyze these data in order to find out the meaningful and relevant hidden patterns. These patterns can be used for clinical diagnosis. Data mining is an efficient approach in discovering these patterns. Among the many data mining techniques that exists, this paper aims at analyzing the medical data using various Classification techniques. The classification techniques used in this study include k-Nearest neighbor (kNN), Decision Tree, Naive Bayes which are hard computing algorithms, whereas the soft computing algorithms used in this study include Support Vector Machine (SVM), Artificial Neural Networks (ANN) and Fuzzy k-Means clustering. We have applied these algorithms to three kinds of datasets that are Breast Cancer Wisconsin, Haberman Data and Contraceptive Method Choice dataset. Our results show that soft computing based classification algorithms better classifications than the traditional classification algorithms in terms of various classification performance measures

Download Full-text

A New Algorithm for Analysis of MiRNA Expression Profiles—SVM-RFE-FKNN

Journal of Imaging Science and Technology ◽

10.2352/j.imagingsci.technol.2021.65.3.030407 ◽

2021 ◽

Author(s):

Duan Mei ◽

Qiang Liu

Keyword(s):

Mirna Expression ◽

Nearest Neighbor ◽

Expression Profiles ◽

Binary Classification ◽

Characteristic Curve ◽

Classification Performance ◽

Recursive Feature Elimination ◽

Support Vector ◽

K Nearest Neighbor ◽

Mirna Expression Profiles

Based on MicroRNA (miRNA) expression profiles, this article proposes a new algorithm—SVM-RFE-FKNN, which combines the support vector machine-recursive feature elimination (SVM-RFE) algorithm and the fuzzy K -nearest neighbor (FKNN) algorithm, to realize binary classification of tumors. First, the SVM-RFE algorithm was used to select features from the miRNA expression profile dataset to constitute feature subsets and to determine the maximum number of support vectors. Next, this maximum number was regarded as the upper limit of the parameter K in the FKNN algorithm that was then used to classify the samples to be tested. Finally, the leave-one-out cross-validation method was adopted to assess the classification performance of the proposed algorithm. Through experiments, our proposed algorithm was compared with other twelve classification methods, and the result shows that our algorithm had better classification performance. Specifically, with only a few miRNA biomarkers, the proposed algorithm could reach an accuracy of 99.46% and an area under the receiver operating characteristic curve (AUC) of 0.9874.

Download Full-text

Comparison of the Hybrid Credit Scoring Models Based on Various Classifiers

Insights into Advancements in Intelligent Information Technologies ◽

10.4018/978-1-4666-0158-1.ch012 ◽

2012 ◽

pp. 220-239

Author(s):

Fei-Long Chen ◽

Feng-Chia Li

Keyword(s):

Nearest Neighbor ◽

Credit Scoring ◽

Back Propagation ◽

Support Vector ◽

Data Sets ◽

K Nearest Neighbor ◽

Hybrid Features ◽

Credit Data ◽

Learning Machine ◽

Propagation Network

Credit scoring is an important topic for businesses and socio-economic establishments collecting huge amounts of data, with the intention of making the wrong decision obsolete. In this paper, the authors propose four approaches that combine four well-known classifiers, such as K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Back-Propagation Network (BPN) and Extreme Learning Machine (ELM). These classifiers are used to find a suitable hybrid classifier combination featuring selection that retains sufficient information for classification purposes. In this regard, different credit scoring combinations are constructed by selecting features with four approaches and classifiers than would otherwise be chosen. Two credit data sets from the University of California, Irvine (UCI), are chosen to evaluate the accuracy of the various hybrid features selection models. In this paper, the procedures that are part of the proposed approaches are described and then evaluated for their performances.

Download Full-text

Study on XML Retrieval Results Classification

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.263-266.1773 ◽

2012 ◽

Vol 263-266 ◽

pp. 1773-1777

Author(s):

Hong Yu ◽

Xiao Lei Huang ◽

Zhi Ling Wei ◽

Chen Xia Yang

Keyword(s):

Nearest Neighbor ◽

Classification Performance ◽

Feedback Mechanism ◽

Support Vector ◽

Svm Classifier ◽

K Nearest Neighbor ◽

Xml Retrieval ◽

Text Documents ◽

Plain Text ◽

Xml Documents

Mining (classify or clustering) retrieval results to serve relevance feedback mechanism of search engine is an important solution to improve effectiveness of retrieval. Unlike plain text documents, since the XML documents are semi-structured data, for XML retrieval results classification, consider exploiting structure features of XML documents, such as tag paths and edges etc. We propose to use Support Vector Machine (SVM) classifier to classify XML retrieval results exploiting both their content and structure features. We implemented the classification method on XML retrieval results based on the IEEE SC corpus. Compared with k-nearest neighbor classification (KNN) on the same dataset in our application, SVM perform better. The experiment results have also shown that the use of structure features, especially tag paths and edges, can improve the classification performance significantly.

Download Full-text

Comparison of the Hybrid Credit Scoring Models Based on Various Classifiers

International Journal of Intelligent Information Technologies ◽

10.4018/jiit.2010070104 ◽

2010 ◽

Vol 6 (3) ◽

pp. 56-74 ◽

Cited By ~ 15

Author(s):

Fei-Long Chen ◽

Feng-Chia Li

Keyword(s):

Nearest Neighbor ◽

Credit Scoring ◽

Back Propagation ◽

Support Vector ◽

Sufficient Information ◽

Features Selection ◽

K Nearest Neighbor ◽

Hybrid Features ◽

Wrong Decision ◽

Propagation Network

Download Full-text

Cell morphology-based machine learning models for human cell state classification

npj Systems Biology and Applications ◽

10.1038/s41540-021-00180-y ◽

2021 ◽

Vol 7 (1) ◽

Author(s):

Yi Li ◽

Chance M. Nowak ◽

Uyen Pham ◽

Khai Nguyen ◽

Leonidas Bleris

Keyword(s):

Machine Learning ◽

Flow Cytometry ◽

Multilayer Perceptron ◽

Nearest Neighbor ◽

Annexin V ◽

Classification Performance ◽

Training Data ◽

Support Vector ◽

Apoptotic Cells ◽

K Nearest Neighbor

AbstractHerein, we implement and access machine learning architectures to ascertain models that differentiate healthy from apoptotic cells using exclusively forward (FSC) and side (SSC) scatter flow cytometry information. To generate training data, colorectal cancer HCT116 cells were subjected to miR-34a treatment and then classified using a conventional Annexin V/propidium iodide (PI)-staining assay. The apoptotic cells were defined as Annexin V-positive cells, which include early and late apoptotic cells, necrotic cells, as well as other dying or dead cells. In addition to fluorescent signal, we collected cell size and granularity information from the FSC and SSC parameters. Both parameters are subdivided into area, height, and width, thus providing a total of six numerical features that informed and trained our models. A collection of logistical regression, random forest, k-nearest neighbor, multilayer perceptron, and support vector machine was trained and tested for classification performance in predicting cell states using only the six aforementioned numerical features. Out of 1046 candidate models, a multilayer perceptron was chosen with 0.91 live precision, 0.93 live recall, 0.92 live f value and 0.97 live area under the ROC curve when applied on standardized data. We discuss and highlight differences in classifier performance and compare the results to the standard practice of forward and side scatter gating, typically performed to select cells based on size and/or complexity. We demonstrate that our model, a ready-to-use module for any flow cytometry-based analysis, can provide automated, reliable, and stain-free classification of healthy and apoptotic cells using exclusively size and granularity information.

Download Full-text