scholarly journals Classification of online toxic comments using the logistic regression and neural networks models

Author(s):  
Mujahed A. Saif ◽  
Alexander N. Medvedev ◽  
Maxim A. Medvedev ◽  
Todorka Atanasova
2000 ◽  
Vol 90 (2) ◽  
pp. 108-113 ◽  
Author(s):  
E. D. De Wolf ◽  
L. J. Francl

Tan spot and Stagonospora blotch of hard red spring wheat served as a model system for evaluating disease forecasts by artificial neural networks. Pathogen infection periods on susceptible wheat plants were measured in the field from 1993 to 1998, and incidence data were merged with 24-h summaries of accumulated growing degree days, temperature, relative humidity, precipitation, and leaf wetness duration. The resulting data set of 202 discrete periods was randomly assigned to 10 modeldevelopment or -validation (n = 50) data sets. Backpropagation neural networks, general regression neural networks, logistic regression, and parametric and nonparametric methods of discriminant analysis were chosen for comparison. Mean validation classification of tan spot incidence was between 71% for logistic regression and 76% for backpropagation models. No significant difference was found between methods of modeling tan spot infection periods. Mean validation prediction accuracy of Stagonospora blotch incidence was 86 and 81% for backpropagation and logistic regression, respectively. Prediction accuracies of other modeling methods were ≤78% and were significantly different (P = 0.01) from backpropagation, but not logistic regression, results. The best backpropagation models of tan spot and Stagonospora blotch incidences correctly classified 82 and 84% of validation cases, respectively. High classification accuracy and consistently good performance demonstrate the applicability of neural network technology to plant disease forecasting.


Author(s):  
Supun Nakandala ◽  
Marta M. Jankowska ◽  
Fatima Tuz-Zahra ◽  
John Bellettiere ◽  
Jordan A. Carlson ◽  
...  

Background: Machine learning has been used for classification of physical behavior bouts from hip-worn accelerometers; however, this research has been limited due to the challenges of directly observing and coding human behavior “in the wild.” Deep learning algorithms, such as convolutional neural networks (CNNs), may offer better representation of data than other machine learning algorithms without the need for engineered features and may be better suited to dealing with free-living data. The purpose of this study was to develop a modeling pipeline for evaluation of a CNN model on a free-living data set and compare CNN inputs and results with the commonly used machine learning random forest and logistic regression algorithms. Method: Twenty-eight free-living women wore an ActiGraph GT3X+ accelerometer on their right hip for 7 days. A concurrently worn thigh-mounted activPAL device captured ground truth activity labels. The authors evaluated logistic regression, random forest, and CNN models for classifying sitting, standing, and stepping bouts. The authors also assessed the benefit of performing feature engineering for this task. Results: The CNN classifier performed best (average balanced accuracy for bout classification of sitting, standing, and stepping was 84%) compared with the other methods (56% for logistic regression and 76% for random forest), even without performing any feature engineering. Conclusion: Using the recent advancements in deep neural networks, the authors showed that a CNN model can outperform other methods even without feature engineering. This has important implications for both the model’s ability to deal with the complexity of free-living data and its potential transferability to new populations.


2005 ◽  
Vol 4 (4) ◽  
pp. 291-305
Author(s):  
Jozef Zurada ◽  
Waldemar Karwowski ◽  
William Marras

Work related low back disorders (LBDs) continue to pose significant occupational health problem that affects the quality of life of the industrial population. The main objective of this study was to explore the application of various data mining techniques, including neural networks, logistic regression, decision trees, memory-based reasoning, and the ensemble model, for classification of industrial jobs with respect to the risk of work-related LBDs. The results from extensive computer simulations using a 10-fold cross validation showed that memory-based reasoning and ensemble models were the best in the overall classification accuracy. The decision tree and memory-based reasoning models were the most accurate in classifying jobs with high risk of LBDs, whereas neural networks and logistic regression were the best in classifying jobs with low risk of LBDs. The decision tree model delivered the most stable results across 10 generations of different data sets randomly chosen for training, validation, and testing. The classification results generated by the decision tree were the easiest to interpret because they were given in the form of simple 'if-then' rules. These results produced by the decision tree method showed that the peak moment had the highest predictive power of LBDs.


2019 ◽  
Vol 6 (1) ◽  
pp. 287-308 ◽  
Author(s):  
Robert A. Stine

Sentiment analysis labels a body of text as expressing either a positive or negative opinion, as in summarizing the content of an online product review. In this sense, sentiment analysis can be considered the challenge of building a classifier from text. Sentiment analysis can be done by counting the words from a dictionary of emotional terms, by fitting traditional classifiers such as logistic regression to word counts, or, most recently, by employing sophisticated neural networks. These methods progressively improve classification at the cost of increased computation and reduced transparency. A common sentiment analysis task, the classification of IMDb (Internet Movie Database) movie reviews, is used to illustrate the methods on a common task that appears frequently in the literature.


2020 ◽  
Vol 2020 (10) ◽  
pp. 28-1-28-7 ◽  
Author(s):  
Kazuki Endo ◽  
Masayuki Tanaka ◽  
Masatoshi Okutomi

Classification of degraded images is very important in practice because images are usually degraded by compression, noise, blurring, etc. Nevertheless, most of the research in image classification only focuses on clean images without any degradation. Some papers have already proposed deep convolutional neural networks composed of an image restoration network and a classification network to classify degraded images. This paper proposes an alternative approach in which we use a degraded image and an additional degradation parameter for classification. The proposed classification network has two inputs which are the degraded image and the degradation parameter. The estimation network of degradation parameters is also incorporated if degradation parameters of degraded images are unknown. The experimental results showed that the proposed method outperforms a straightforward approach where the classification network is trained with degraded images only.


Sign in / Sign up

Export Citation Format

Share Document