Investigating Critical Risk Factors of Liver Cancer with Deep Neural Networks

This chapter presents a comprehensive scheme for automated detection of colorectal polyps in computed tomography colonography (CTC) with particular emphasis on robust learning algorithms that differentiate polyps from non-polyp shapes. The authors’ automated CTC scheme introduces two orientation independent features which encode the shape characteristics that aid in classification of polyps and non-polyps with high accuracy, low false positive rate, and low computations making the scheme suitable for colorectal cancer screening initiatives. Experiments using state-of-the-art machine learning algorithms viz., lazy learning, support vector machines, and naïve Bayes classifiers reveal the robustness of the two features in detecting polyps at 100% sensitivity for polyps with diameter greater than 10 mm while attaining total low false positive rates, respectively, of 3.05, 3.47 and 0.71 per CTC dataset at specificities above 99% when tested on 58 CTC datasets. The results were validated using colonoscopy reports provided by expert radiologists.

Download Full-text

Early Weed Detection Using Image Processing and Machine Learning Techniques in an Australian Chilli Farm

Agriculture ◽

10.3390/agriculture11050387 ◽

2021 ◽

Vol 11 (5) ◽

pp. 387

Author(s):

Nahina Islam ◽

Md Mamunur Rashid ◽

Santoso Wibowo ◽

Cheng-Yuan Xu ◽

Ahsan Morshed ◽

...

Keyword(s):

Machine Learning ◽

False Positive Rate ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Support Vector ◽

Weed Detection ◽

Learning Techniques ◽

Positive Rate ◽

Uav Images

This paper explores the potential of machine learning algorithms for weed and crop classification from UAV images. The identification of weeds in crops is a challenging task that has been addressed through orthomosaicing of images, feature extraction and labelling of images to train machine learning algorithms. In this paper, the performances of several machine learning algorithms, random forest (RF), support vector machine (SVM) and k-nearest neighbours (KNN), are analysed to detect weeds using UAV images collected from a chilli crop field located in Australia. The evaluation metrics used in the comparison of performance were accuracy, precision, recall, false positive rate and kappa coefficient. MATLAB is used for simulating the machine learning algorithms; and the achieved weed detection accuracies are 96% using RF, 94% using SVM and 63% using KNN. Based on this study, RF and SVM algorithms are efficient and practical to use, and can be implemented easily for detecting weed from UAV images.

Download Full-text

Malicious Intrusion Detection Using Machine Learning Schemes

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f8839.088619 ◽

2019 ◽

Vol 8 (6) ◽

pp. 4194-4198

Keyword(s):

Machine Learning ◽

Wireless Networks ◽

Intrusion Detection ◽

False Positive Rate ◽

Feature Selection Method ◽

Training Model ◽

True Positive Rate ◽

Machine Learning Algorithms ◽

Detection Mechanism ◽

Positive Rate

Wireless networks are continuously facing challenges in the field of Information Security. This leads to major researches in the area of Intrusion detection. The working of Intrusion detection is performed mainly by signature based detection and anomaly based detection. Anomaly based detection is based on the behavior of the network. One of the major challenge in this domain is to identify and detect the malicious node in wireless networks. The intrusion detection mechanism has to analyse the behavior of the node in the network by means of the several features possessed by each node. Intelligent schemes are the need of the hour in such scenario. This paper has taken a standard dataset for studying the features of the wireless node and reduced the features by applying the most efficient Correlation Attribute feature selection method. The machine learning algorithms are applied to obtain an effective training model which is then applied on the testing dataset to validate the model. The accuracy of the model is determined by the performance parameters such as true positive rate, false positive rate and ROC area. Neural network, bagging and decision tree algorithm RepTree are giving promising results in comparison with other classification algorithms.

Download Full-text

Reducing Pseudo-error Rate of Industrial Machine Vision Systems with Machine Learning Methods

Acta Technica Jaurinensis ◽

10.14513/actatechjaur.v12.n4.511 ◽

2019 ◽

Vol 12 (4) ◽

pp. 294-305

Author(s):

Balázs Szűcs ◽

Áron Ballagi

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Machine Vision ◽

Error Rate ◽

Materials Science ◽

False Positive Rate ◽

Machine Learning Algorithms ◽

Positive Rate ◽

Industrial Use ◽

Industrial Machine

Nowadays machine learning and artificial neural networks are hot topic. These methods gains more and more ground in everyday life. In addition to everyday usage, an increasing emphasis placed on industrial use. In the field of research and development, materials science, robotics and thanks to the spread of Industry 4.0 and digitalization, more and more machine learning based systems introduced in production. This paper gives examples of possible ways of using machine learning algorithms in manufacturing, as well as reducing pseudo-error (false positive) rate of machine vision quality control systems. Even the simplest algorithms and models can be very effective on real-world problems. With the usage of convolutional neural networks, the pseudo-error rate of the examined system reducible.

Download Full-text

LC-MS Peak Assignment Based on Unanimous Selection by Six Machine Learning Algorithms

10.21203/rs.3.rs-845859/v1 ◽

2021 ◽

Author(s):

Hiroaki Ito ◽

Takashi Matsui ◽

Ryo Konno ◽

Makoto Itakura ◽

Yoshio Kodera

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

False Positive Rate ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Weak Signals ◽

Accuracy And Precision ◽

Peak Assignment ◽

Positive Rate ◽

Assignment Strategy

Abstract Recent Mass spectrometry (MS)-based techniques enable deep proteome coverage with relative quantitative analysis, resulting in increased identification of very weak signals accompanied by increased data size of liquid chromatography (LC)–MS/MS spectra. However, the identification of weak signals using an assignment strategy with poorer performance resulted in imperfect quantification with misidentification of peaks and ratio distortions. Manually annotating a large number of signals within a very large dataset is not a realistic approach. In this study, therefore, we utilized machine learning algorithms to successfully extract a higher number of peptide peaks with high accuracy and precision. Our strategy evaluated each peak identified using six different algorithms; peptide peaks identified by all six algorithms (i.e., unanimously selected) were subsequently assigned as true peaks, which resulted in a reduction in the false-positive rate. Hence, exact and highly quantitative peptide peaks were obtained, providing better performance than obtained applying the conventional criteria or using a single machine learning algorithm.

Download Full-text

Machine Learning for Automated Polyp Detection in Computed Tomography Colonography

Advances in Bioinformatics and Biomedical Engineering - Biomedical Image Analysis and Machine Learning Technologies ◽

10.4018/978-1-60566-956-4.ch003 ◽

2010 ◽

pp. 54-77

Author(s):

Abhilash Alexander Miranda ◽

Olivier Caelen ◽

Gianluca Bontempi

Keyword(s):

Machine Learning ◽

Computed Tomography ◽

False Positive ◽

False Positive Rate ◽

Learning Algorithms ◽

Colorectal Polyps ◽

Machine Learning Algorithms ◽

Computed Tomography Colonography ◽

Positive Rate ◽

Independent Features

This chapter presents a comprehensive scheme for automated detection of colorectal polyps in computed tomography colonography (CTC) with particular emphasis on robust learning algorithms that differentiate polyps from non-polyp shapes. The authors’ automated CTC scheme introduces two orientation independent features which encode the shape characteristics that aid in classification of polyps and non-polyps with high accuracy, low false positive rate, and low computations making the scheme suitable for colorectal cancer screening initiatives. Experiments using state-of-the-art machine learning algorithms viz., lazy learning, support vector machines, and naïve Bayes classifiers reveal the robustness of the two features in detecting polyps at 100% sensitivity for polyps with diameter greater than 10 mm while attaining total low false positive rates, respectively, of 3.05, 3.47 and 0.71 per CTC dataset at specificities above 99% when tested on 58 CTC datasets. The results were validated using colonoscopy reports provided by expert radiologists.

Download Full-text

Comparative Study of Various Machine Learning Algorithms for Prediction of Insomnia

Advances in Medical Technologies and Clinical Practice - Advanced Classification Techniques for Healthcare Analysis ◽

10.4018/978-1-5225-7796-6.ch011 ◽

2019 ◽

pp. 234-257 ◽

Cited By ~ 5

Author(s):

Ravinder Ahuja ◽

Vishal Vivek ◽

Manika Chandna ◽

Shivani Virmani ◽

Alisha Banga

Keyword(s):

Machine Learning ◽

Heart Diseases ◽

False Positive Rate ◽

Learning Algorithms ◽

True Positive Rate ◽

Machine Learning Algorithms ◽

Support Vector ◽

Mobility Problem ◽

Positive Rate ◽

F Measure

An early diagnosis of insomnia can prevent further medical aids such as anger issues, heart diseases, anxiety, depression, and hypertension. Fifteen machine learning algorithms have been applied and 14 leading factors have been taken into consideration for predicting insomnia. Seven performance parameters (accuracy, kappa, the true positive rate, false positive rate, precision, f-measure, and AUC) are used and for implementation. The authors have used python language. The support vector machine is giving higher performance out of all algorithms giving accuracy 91.6%, f-measure is 92.13, and kappa is 0.83. Further, SVM is applied on another dataset of 100 patients and giving accuracy 92%. In addition, an analysis of the variable importance of CART, C5.0, decision tree, random forest, adaptive boost, and XG boost is calculated. The analysis shows that insomnia primarily depends on the factors, which are the vision problem, mobility problem, and sleep disorder. This chapter mainly finds the usages and effectiveness of machine learning algorithms in Insomnia diseases prediction.

Download Full-text

Hadoop based Parallel Machine Learning Algorithms for Intrusion Detection System

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.a4443.119119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 1152-1156

Keyword(s):

Machine Learning ◽

False Positive ◽

Naive Bayes ◽

False Positive Rate ◽

True Positive Rate ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

True Positive ◽

Positive Rate ◽

Bayes Algorithm

Web use and digitized information are getting expanded each day. The measure of information created is likewise getting expanded. On the opposite side, the security assaults cause numerous security dangers in the system, sites and Internet. Interruption discovery in a fast system is extremely a hard undertaking. The Hadoop Implementation is utilized to address the previously mentioned test that is distinguishing interruption in a major information condition at constant. To characterize the strange bundle stream, AI methodologies are used. Innocent Bayes does grouping by a vector of highlight esteems produced using some limited set. Choice Tree is another Machine Learning classifier which is likewise an administered learning model. Choice tree is the stream diagram like tree structure. J48 and Naïve Bayes Algorithm are actualized in Hadoop MapReduce Framework for parallel preparing by utilizing the KDDCup Data Corrected Benchmark dataset records. The outcome acquired is 89.9% True Positive rate and 0.04% False Positive rate for Naive Bayes Algorithm and 98.06% True Positive rate and 0.001% False Positive rate for Decision Tree Algorithm.

Download Full-text

Hybrid rule-based botnet detection approach using machine learning for analysing DNS traffic

PeerJ Computer Science ◽

10.7717/peerj-cs.640 ◽

2021 ◽

Vol 7 ◽

pp. e640

Author(s):

Saif Al-mashhadi ◽

Mohammed Anbar ◽

Iznan Hasbullah ◽

Taief Alaa Alamiedy

Keyword(s):

Machine Learning ◽

False Positive ◽

False Positive Rate ◽

Communication Protocols ◽

Cyber Attacks ◽

Machine Learning Algorithms ◽

Detection Accuracy ◽

Botnet Detection ◽

Internet Service ◽

Positive Rate

Botnets can simultaneously control millions of Internet-connected devices to launch damaging cyber-attacks that pose significant threats to the Internet. In a botnet, bot-masters communicate with the command and control server using various communication protocols. One of the widely used communication protocols is the ‘Domain Name System’ (DNS) service, an essential Internet service. Bot-masters utilise Domain Generation Algorithms (DGA) and fast-flux techniques to avoid static blacklists and reverse engineering while remaining flexible. However, botnet’s DNS communication generates anomalous DNS traffic throughout the botnet life cycle, and such anomaly is considered an indicator of DNS-based botnets presence in the network. Despite several approaches proposed to detect botnets based on DNS traffic analysis; however, the problem still exists and is challenging due to several reasons, such as not considering significant features and rules that contribute to the detection of DNS-based botnet. Therefore, this paper examines the abnormality of DNS traffic during the botnet lifecycle to extract significant enriched features. These features are further analysed using two machine learning algorithms. The union of the output of two algorithms proposes a novel hybrid rule detection model approach. Two benchmark datasets are used to evaluate the performance of the proposed approach in terms of detection accuracy and false-positive rate. The experimental results show that the proposed approach has a 99.96% accuracy and a 1.6% false-positive rate, outperforming other state-of-the-art DNS-based botnet detection approaches.

Download Full-text