Using text mining techniques to identify healthcare providers at risk: an exploratory study (Preprint)

Mapping Intimacies ◽

10.2196/preprints.19064 ◽

2020 ◽

Author(s):

Iris Hendrickx ◽

Tim Voets ◽

Pieter van Dyk ◽

Rudolph B Kool

Keyword(s):

Machine Learning ◽

At Risk ◽

Text Mining ◽

Sentiment Analysis ◽

Exploratory Study ◽

Healthcare Providers ◽

Supervised Machine Learning ◽

Care Providers ◽

Severity Prediction ◽

Dutch Health

BACKGROUND Regulatory bodies such as healthcare inspectorates can identify risks of healthcare providers by analyzing patient complaints. Text mining techniques (automatic text analysis based on machine learning), might help by identifying specific patterns and signals for risks on quality and safety issues. OBJECTIVE The aim of this study was to explore whether text mining techniques might be used to identify healthcare providers at risk. METHODS We performed an exploratory study on a complaints database of the Dutch Health and Youth Care Inspectorate with more than 22000 written complaints. We studied a range of supervised machine learning techniques to automatically determine the severity of incoming complaints. We investigated several features based on the complaints’ content, including sentiment analysis, to decide which were helpful for severity prediction. Finally, we took the list of health care providers and their organization-specific complaints to determine the average severity of complaints per organization. We performed a keyword analysis in order to give the Inspectorate insight in the patterns and severity per organization. RESULTS The data preparation and preprocessing were time-consuming one-off costs, mainly because we had to create a safe and efficient digital research environment. A straightforward text classification approach using a bag-of-words feature representation worked best for severity prediction. The usage of sentiment analysis for severity prediction was not helpful. Finally, we produced a list of n-grams of healthcare providers with the most complaints to inform the Inspectorate about the specific combination of words for these organizations. CONCLUSIONS Text mining techniques can support inspectorates with fully automatic analysis of complaints. They can give insights in patterns, detect possible blind spots, or support prioritizing follow-up supervision activities by sorting complaints on severity per organization or per sector. An appropriate data science and ICT infrastructure is crucial and indispensable for applied text mining.

Download Full-text

Text Mining Based Approach to Customer Sentiment Analysis Using Machine Learning

Journal of Advances and Scholarly Researches in Allied Education ◽

10.29070/15/57680 ◽

2018 ◽

Vol 15 (6) ◽

pp. 58-65

Author(s):

Gurjeet Kaur

Keyword(s):

Machine Learning ◽

Text Mining ◽

Sentiment Analysis

Download Full-text

Financial Context News Sentiment Analysis for the Lithuanian Language

Applied Sciences ◽

10.3390/app11104443 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4443

Author(s):

Rokas Štrimaitis ◽

Pavel Stefanovič ◽

Simona Ramanauskaitė ◽

Asta Slotkienė

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Experimental Investigations ◽

Support Vector ◽

Applied Machine Learning ◽

Bayes Algorithm ◽

Website Content

Financial area analysis is not limited to enterprise performance analysis. It is worth analyzing as wide an area as possible to obtain the full impression of a specific enterprise. News website content is a datum source that expresses the public’s opinion on enterprise operations, status, etc. Therefore, it is worth analyzing the news portal article text. Sentiment analysis in English texts and financial area texts exist, and are accurate, the complexity of Lithuanian language is mostly concentrated on sentiment analysis of comment texts, and does not provide high accuracy. Therefore in this paper, the supervised machine learning model was implemented to assign sentiment analysis on financial context news, gathered from Lithuanian language websites. The analysis was made using three commonly used classification algorithms in the field of sentiment analysis. The hyperparameters optimization using the grid search was performed to discover the best parameters of each classifier. All experimental investigations were made using the newly collected datasets from four Lithuanian news websites. The results of the applied machine learning algorithms show that the highest accuracy is obtained using a non-balanced dataset, via the multinomial Naive Bayes algorithm (71.1%). The other algorithm accuracies were slightly lower: a long short-term memory (71%), and a support vector machine (70.4%).

Download Full-text

Multilingual Sentiment Analysis on Short Text Document Using Semi-Supervised Machine Learning

10.1145/3485768.3485775 ◽

2021 ◽

Author(s):

Joshua Lois Cruz Paulino ◽

Lexter Carl Antoja Almirol ◽

Jun Marco Cruz Favila ◽

Kent Alvin Gerald Loria Aquino ◽

Angelica Hernandez De La Cruz ◽

...

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Supervised Machine Learning ◽

Short Text ◽

Text Document

Download Full-text

An ensemble approach to stabilize the features for multi-domain sentiment analysis using supervised machine learning

Journal Of Big Data ◽

10.1186/s40537-018-0152-5 ◽

2018 ◽

Vol 5 (1) ◽

Cited By ~ 4

Author(s):

Monalisa Ghosh ◽

Goutam Sanyal

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Supervised Machine Learning ◽

Ensemble Approach

Download Full-text

Sentiment Analysis using various Machine Learning and Deep Learning Techniques

Journal of the Nigerian Society of Physical Sciences ◽

10.46481/jnsps.2021.308 ◽

2021 ◽

pp. 385-394

Author(s):

V Umarani ◽

A Julian ◽

J Deepa

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Analysis Process ◽

Learning Techniques

Sentiment analysis has gained a lot of attention from researchers in the last year because it has been widely applied to a variety of application domains such as business, government, education, sports, tourism, biomedicine, and telecommunication services. Sentiment analysis is an automated computational method for studying or evaluating sentiments, feelings, and emotions expressed as comments, feedbacks, or critiques. The sentiment analysis process can be automated using machine learning techniques, which analyses text patterns faster. The supervised machine learning technique is the most used mechanism for sentiment analysis. The proposed work discusses the flow of sentiment analysis process and investigates the common supervised machine learning techniques such as multinomial naive bayes, Bernoulli naive bayes, logistic regression, support vector machine, random forest, K-nearest neighbor, decision tree, and deep learning techniques such as Long Short-Term Memory and Convolution Neural Network. The work examines such learning methods using standard data set and the experimental results of sentiment analysis demonstrate the performance of various classifiers taken in terms of the precision, recall, F1-score, RoC-Curve, accuracy, running time and k fold cross validation and helps in appreciating the novelty of the several deep learning techniques and also giving the user an overview of choosing the right technique for their application.

Download Full-text

Predicting Obstetric Disease With Machine Learning Applied to Patient-Reported Data (Preprint)

10.2196/preprints.11766 ◽

2018 ◽

Cited By ~ 1

Author(s):

Danielle Bradley ◽

Erin Landau ◽

Adam Wolfberg ◽

Alex Baron

Keyword(s):

Machine Learning ◽

At Risk ◽

Mobile Apps ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Obstetric Outcomes ◽

Patient Reported ◽

Data Points ◽

Reported Data

BACKGROUND The rise of highly engaging digital health mobile apps over the past few years has created repositories containing billions of patient-reported data points that have the potential to inform clinical research and advance medicine. OBJECTIVE To determine if self-reported data could be leveraged to create machine learning algorithms to predict the presence of, or risk for, obstetric outcomes and related conditions. METHODS More than 10 million women have downloaded Ovia Health’s three mobile apps (Ovia Fertility, Ovia Pregnancy, and Ovia Parenting). Data points logged by app users can include information about menstrual cycle, health history, current health status, nutrition habits, exercise activity, symptoms, or moods. Machine learning algorithms were developed using supervised machine learning methodologies, specifically, Gradient Boosting Decision Tree algorithms. Each algorithm was developed and trained using anywhere from 385 to 5770 features and data from 77,621 to 121,740 app users. RESULTS Algorithms were created to detect the risk of developing preeclampsia, gestational diabetes, and preterm delivery, as well as to identify the presence of existing preeclampsia. The positive predictive value (PPV) was set to 0.75 for all of the models, as this was the threshold where the researchers felt a clinical response—additional screening or testing—would be reasonable, due to the likelihood of a positive outcome. Sensitivity ranged from 24% to 75% across all models. When PPV was adjusted from 0.75 to 0.52, the sensitivity of the preeclampsia prediction algorithm rose from 24% to 85%. When PPV was adjusted from 0.75 to 0.65, the sensitivity of the preeclampsia detection or diagnostic algorithm increased from 37% to 79%. CONCLUSIONS Algorithms based on patient-reported data can predict serious obstetric conditions with accuracy levels sufficient to guide clinical screening by health care providers and health plans. Further research is needed to determine whether such an approach can improve outcomes for at-risk patients and reduce the cost of screening those not at risk. Presenting the results of these models to patients themselves could also provide important insight into otherwise unknown health risks.

Download Full-text

Bangla Text Sentiment Analysis Using Supervised Machine Learning with Extended Lexicon Dictionary

Natural Language Processing Research ◽

10.2991/nlpr.d.210316.001 ◽

2021 ◽

Vol 1 (3-4) ◽

pp. 34

Author(s):

Nitish Ranjan Bhowmik ◽

Mohammad Arifuzzaman ◽

M. Rubaiyat Hossain Mondal ◽

M. S. Islam

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Supervised Machine Learning ◽

Text Sentiment Analysis

Download Full-text

Evaluating Annotated Dataset of Customer Reviews for Aspect Based Sentiment Analysis

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2122 ◽

2021 ◽

Author(s):

Dimple Chehal ◽

Parul Gupta ◽

Payal Gulati

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Supervised Machine Learning ◽

Support Vector ◽

Product Reviews ◽

K Nearest Neighbor ◽

Customer Reviews ◽

Percent Accuracy

Sentiment analysis of product reviews on e-commerce platforms aids in determining the preferences of customers. Aspect-based sentiment analysis (ABSA) assists in identifying the contributing aspects and their corresponding polarity, thereby allowing for a more detailed analysis of the customer’s inclination toward product aspects. This analysis helps in the transition from the traditional rating-based recommendation process to an improved aspect-based process. To automate ABSA, a labelled dataset is required to train a supervised machine learning model. As the availability of such dataset is limited due to the involvement of human efforts, an annotated dataset has been provided here for performing ABSA on customer reviews of mobile phones. The dataset comprising of product reviews of Apple-iPhone11 has been manually annotated with predefined aspect categories and aspect sentiments. The dataset’s accuracy has been validated using state-of-the-art machine learning techniques such as Naïve Bayes, Support Vector Machine, Logistic Regression, Random Forest, K-Nearest Neighbor and Multi Layer Perceptron, a sequential model built with Keras API. The MLP model built through Keras Sequential API for classifying review text into aspect categories produced the most accurate result with 67.45 percent accuracy. K- nearest neighbor performed the worst with only 49.92 percent accuracy. The Support Vector Machine had the highest accuracy for classifying review text into aspect sentiments with an accuracy of 79.46 percent. The model built with Keras API had the lowest 76.30 percent accuracy. The contribution is beneficial as a benchmark dataset for ABSA of mobile phone reviews.

Download Full-text

Sentiment Analysis on UAV-aided Product Comments Based on Machine Learning: From Sentence to Document Level

10.21203/rs.3.rs-104009/v1 ◽

2020 ◽

Author(s):

JINGYANG CAO ◽

Shirong Yin ◽

Guoxu Zhang

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Accurate Result ◽

Supervised Machine Learning ◽

Hotel Management ◽

Novel Approach ◽

Online Comments ◽

New Perspective ◽

The Relationship ◽

Document Level

Abstract This paper presents a novel approach to analyze the sentiment of the product comments from sentence to document level and apply to the customers sentiment analysis on UAV-aided product comments for hotel management. In order to realize the effiffifficient sentiment analysis, a cascaded sentence-to-document sentiment classifification method is investigated. Initially, a supervised machine learning method is applied to explore the sentiment polarity of the sentence (SPS). Afterward, the contribution of the sentence to document (CSD) is calculated by using various statistical algorithms. Lastly, the sentiment polarity of the document (SPD) is determined by the SPS as well as its contribution. Comparative experiments have been established on the basis of hotel online comments, and the outcomes indicate that the proposed method not only raises the effiffifficiency in attaining a more accurate result but also assists immensely in regards to the B5G wireless communication supported by the UAV. The fifindings provide a new perspective that sentence position and its sentiment similarity with document (sentiment condition) dramatically disclose the relationship between sentence and document.

Download Full-text

Comparative Analysis of Various Supervised Machine Learning Techniques Used for Sentiment Analysis on Tourism Reviews

Proceedings of International Conference on Recent Trends in Computing - Lecture Notes in Networks and Systems ◽

10.1007/978-981-16-7118-0_3 ◽

2022 ◽

pp. 19-49

Author(s):

Manoj Kumar Sahu ◽

Smita Selot

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Sentiment Analysis ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Learning Techniques

Download Full-text