A Framework for Applying Natural Language Processing in Digital Health Interventions

Burkhardt Funk; Shiri Sadeh-Sharvit; Ellen E Fitzsimmons-Craft; Mickey Todd Trockel; Grace E Monterubio; Neha J Goel; Katherine N Balantekin; Dawn M Eichen; Rachael E Flatt; Marie-Laure Firebaugh; Corinna Jacobi; Andrea K Graham; Mark Hoogendoorn; Denise E Wilfley; C Barr Taylor

doi:10.2196/13855

A Framework for Applying Natural Language Processing in Digital Health Interventions

Journal of Medical Internet Research ◽

10.2196/13855 ◽

2020 ◽

Vol 22 (2) ◽

pp. e13855 ◽

Cited By ~ 6

Author(s):

Burkhardt Funk ◽

Shiri Sadeh-Sharvit ◽

Ellen E Fitzsimmons-Craft ◽

Mickey Todd Trockel ◽

Grace E Monterubio ◽

...

Keyword(s):

Body Image ◽

Natural Language Processing ◽

Binge Eating ◽

Language Processing ◽

Digital Health ◽

Health Interventions ◽

Text Data ◽

Text Features ◽

Technical Framework

Background Digital health interventions (DHIs) are poised to reduce target symptoms in a scalable, affordable, and empirically supported way. DHIs that involve coaching or clinical support often collect text data from 2 sources: (1) open correspondence between users and the trained practitioners supporting them through a messaging system and (2) text data recorded during the intervention by users, such as diary entries. Natural language processing (NLP) offers methods for analyzing text, augmenting the understanding of intervention effects, and informing therapeutic decision making. Objective This study aimed to present a technical framework that supports the automated analysis of both types of text data often present in DHIs. This framework generates text features and helps to build statistical models to predict target variables, including user engagement, symptom change, and therapeutic outcomes. Methods We first discussed various NLP techniques and demonstrated how they are implemented in the presented framework. We then applied the framework in a case study of the Healthy Body Image Program, a Web-based intervention trial for eating disorders (EDs). A total of 372 participants who screened positive for an ED received a DHI aimed at reducing ED psychopathology (including binge eating and purging behaviors) and improving body image. These users generated 37,228 intervention text snippets and exchanged 4285 user-coach messages, which were analyzed using the proposed model. Results We applied the framework to predict binge eating behavior, resulting in an area under the curve between 0.57 (when applied to new users) and 0.72 (when applied to new symptom reports of known users). In addition, initial evidence indicated that specific text features predicted the therapeutic outcome of reducing ED symptoms. Conclusions The case study demonstrates the usefulness of a structured approach to text data analytics. NLP techniques improve the prediction of symptom changes in DHIs. We present a technical framework that can be easily applied in other clinical trials and clinical presentations and encourage other groups to apply the framework in similar contexts.

Download Full-text

Natural language processing (NLP) tools in extracting biomedical concepts from research articles: a case study on autism spectrum disorder

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-020-01352-2 ◽

2020 ◽

Vol 20 (S11) ◽

Cited By ~ 1

Author(s):

Jacqueline Peng ◽

Mengge Zhao ◽

James Havrilla ◽

Cong Liu ◽

Chunhua Weng ◽

...

Keyword(s):

Autism Spectrum Disorder ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Clinical Manifestations ◽

Predictive Performance ◽

Autism Spectrum ◽

Spectrum Disorder ◽

Research Articles

Abstract Background Natural language processing (NLP) tools can facilitate the extraction of biomedical concepts from unstructured free texts, such as research articles or clinical notes. The NLP software tools CLAMP, cTAKES, and MetaMap are among the most widely used tools to extract biomedical concept entities. However, their performance in extracting disease-specific terminology from literature has not been compared extensively, especially for complex neuropsychiatric disorders with a diverse set of phenotypic and clinical manifestations. Methods We comparatively evaluated these NLP tools using autism spectrum disorder (ASD) as a case study. We collected 827 ASD-related terms based on previous literature as the benchmark list for performance evaluation. Then, we applied CLAMP, cTAKES, and MetaMap on 544 full-text articles and 20,408 abstracts from PubMed to extract ASD-related terms. We evaluated the predictive performance using precision, recall, and F1 score. Results We found that CLAMP has the best performance in terms of F1 score followed by cTAKES and then MetaMap. Our results show that CLAMP has much higher precision than cTAKES and MetaMap, while cTAKES and MetaMap have higher recall than CLAMP. Conclusion The analysis protocols used in this study can be applied to other neuropsychiatric or neurodevelopmental disorders that lack well-defined terminology sets to describe their phenotypic presentations.

Download Full-text

Natural Language Processing (NLP) and Its Impact across Industries – Unlocking the True Potential of Digital Healthcare (A Case Study Approach)

Journal of Pharmaceutical Research International ◽

10.9734/jpri/2021/v33i35b31906 ◽

2021 ◽

pp. 86-98

Author(s):

Sourajit Roy ◽

Pankaj Pathak ◽

S. Nithya

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Computational Models ◽

M2m Communication ◽

Digital Revolution ◽

Language Complexity ◽

Study Approach ◽

Data Variability

During the advent of the 21st century, technical breakthroughs and developments took place. Natural Language Processing or NLP is one of their promising disciplines that has been increasingly dynamic via groundbreaking findings on most computer networks. Because of the digital revolution the amounts of data generated by M2M communication across devices and platforms such as Amazon Alexa, Apple Siri, Microsoft Cortana, etc. were significantly increased. This causes a great deal of unstructured data to be processed that does not fit in with standard computational models. In addition, the increasing problems of language complexity, data variability and voice ambiguity make implementing models increasingly harder. The current study provides an overview of the potential and breadth of the NLP market and its acceptance in industry-wide, in particular after Covid-19. It also gives a macroscopic picture of progress in natural language processing research, development and implementation.

Download Full-text

How Language Shapes Prejudice Against Women: An Examination Across 45 World Languages

10.31234/osf.io/mrbcf ◽

2020 ◽

Author(s):

David DeFranza ◽

Himanshu Mishra ◽

Arul Mishra

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Ongoing Debate ◽

Text Data ◽

Gender Prejudice ◽

World Languages ◽

The World ◽

Present Context ◽

The Common

Language provides an ever-present context for our cognitions and has the ability to shape them. Languages across the world can be gendered (language in which the form of noun, verb, or pronoun is presented as female or male) versus genderless. In an ongoing debate, one stream of research suggests that gendered languages are more likely to display gender prejudice than genderless languages. However, another stream of research suggests that language does not have the ability to shape gender prejudice. In this research, we contribute to the debate by using a Natural Language Processing (NLP) method which captures the meaning of a word from the context in which it occurs. Using text data from Wikipedia and the Common Crawl project (which contains text from billions of publicly facing websites) across 45 world languages, covering the majority of the world’s population, we test for gender prejudice in gendered and genderless languages. We find that gender prejudice occurs more in gendered rather than genderless languages. Moreover, we examine whether genderedness of language influences the stereotypic dimensions of warmth and competence utilizing the same NLP method.

Download Full-text

Sentiment of App with Word Vectors

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f1416.0986s319 ◽

2019 ◽

Vol 8 (6S3) ◽

pp. 2156-2159

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Sentiment Analysis ◽

Language Processing ◽

Text Data ◽

Vector Representations ◽

Text Sentiment Analysis

Vector representations for language have been shown to be useful in a number of Natural Language Processing tasks. In this paper, we aim to investigate the effectiveness of word vector representations for the problem of Sentiment Analysis. In particular, we target three sub-tasks namely sentiment words extraction, polarity of sentiment words detection, and text sentiment prediction. We investigate the effectiveness of vector representations over different text data and evaluate the quality of domain-dependent vectors. Vector representations has been used to compute various vector-based features and conduct systematically experiments to demonstrate their effectiveness. Using simple vector based features can achieve better results for text sentiment analysis of APP.

Download Full-text

Attention-based Unsupervised Keyphrase Extraction and Phrase Graph for COVID-19 Medical Literature Retrieval

ACM Transactions on Computing for Healthcare ◽

10.1145/3473939 ◽

2022 ◽

Vol 3 (1) ◽

pp. 1-16

Author(s):

Haoran Ding ◽

Xiao Luo

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Language Processing ◽

Medical Literature ◽

Graph Model ◽

The Self ◽

Keyphrase Extraction ◽

Text Data ◽

Text Collections ◽

Extraction Model

Searching, reading, and finding information from the massive medical text collections are challenging. A typical biomedical search engine is not feasible to navigate each article to find critical information or keyphrases. Moreover, few tools provide a visualization of the relevant phrases to the query. However, there is a need to extract the keyphrases from each document for indexing and efficient search. The transformer-based neural networks—BERT has been used for various natural language processing tasks. The built-in self-attention mechanism can capture the associations between words and phrases in a sentence. This research investigates whether the self-attentions can be utilized to extract keyphrases from a document in an unsupervised manner and identify relevancy between phrases to construct a query relevancy phrase graph to visualize the search corpus phrases on their relevancy and importance. The comparison with six baseline methods shows that the self-attention-based unsupervised keyphrase extraction works well on a medical literature dataset. This unsupervised keyphrase extraction model can also be applied to other text data. The query relevancy graph model is applied to the COVID-19 literature dataset and to demonstrate that the attention-based phrase graph can successfully identify the medical phrases relevant to the query terms.

Download Full-text

Identifying disaster-related tweets and their semantic, spatial and temporal context using deep learning, natural language processing and spatial analysis: a case study of Hurricane Irma

Social Sensing and Big Data Computing for Disaster Management ◽

10.4324/9781003106494-2 ◽

2020 ◽

pp. 8-32

Author(s):

Muhammed Ali Sit ◽

Caglar Koylu ◽

Ibrahim Demir

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Spatial Analysis ◽

Natural Language ◽

Language Processing ◽

Temporal Context ◽

Hurricane Irma

Download Full-text

Semi-Automatic De-identification of Hospital Discharge Summaries with Natural Language Processing: A Case-Study of Performance and Real-World Usability

2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData) ◽

10.1109/ithings-greencom-cpscom-smartdata.2017.169 ◽

2017 ◽

Author(s):

Ioan Calapodescu ◽

David Rozier ◽

Svetlana Artemova ◽

Jean-Luc Bosson

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Hospital Discharge ◽

Language Processing ◽

Real World ◽

Discharge Summaries

Download Full-text

Semantics-Based Design Knowledge Annotation and Retrieval

Volume 3: 25th Computers and Information in Engineering Conference, Parts A and B ◽

10.1115/detc2005-85107 ◽

2005 ◽

Cited By ~ 6

Author(s):

Zhanjun Li ◽

Min Liu ◽

David C. Anderson ◽

Karthik Ramani

Keyword(s):

Natural Language Processing ◽

Language Processing ◽

Ontology Engineering ◽

Design Knowledge ◽

Knowledge Based ◽

Design Engineers ◽

First Case ◽

Computer Aided ◽

Design Documents

Nowadays computer aided tools have enabled the creation of the electronic design documents on an unprecedented scale, while determining and finding what can be reused is like searching a “needle in a haystack.” One of the primary reasons for this is that the design knowledge behind the physical design is not properly represented and indexed. With the large amount of designs available, design engineers need to retrieve suitable ones, so that a knowledge-based unified reuse environment can be realized. In this paper, we describe our approach to intelligently annotating and retrieving designs by using ontology engineering and natural language processing. We use the design documents from an engineering design class as the first case study.

Download Full-text

IDENTIFYING BEST PRACTICES FOR USE OF TEXT DATA IN HEALTH ECONOMICS AND OUTCOMES RESEARCH USING NATURAL LANGUAGE PROCESSING

Value in Health ◽

10.1016/j.jval.2016.03.1776 ◽

2016 ◽

Vol 19 (3) ◽

pp. A82

Author(s):

B.A. Feinberg ◽

L. Lal ◽

D.F. Garofalo ◽

U. Mujumdar

Keyword(s):

Natural Language Processing ◽

Health Economics ◽

Best Practices ◽

Natural Language ◽

Language Processing ◽

Outcomes Research ◽

Text Data

Download Full-text

Opinion mining on newspaper headlines using SVM and NLP

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i3.pp2152-2163 ◽

2019 ◽

Vol 9 (3) ◽

pp. 2152 ◽

Cited By ~ 1

Author(s):

Chaudhary Jashubhai Rameshbhai ◽

Joy Paulose

Keyword(s):

Support Vector Machine ◽

Natural Language Processing ◽

Language Processing ◽

Opinion Mining ◽

Confusion Matrix ◽

Support Vector ◽

Text Data ◽

Mining Technique ◽

Svm Model ◽

Linear Svm

<p>Opinion Mining also known as Sentiment Analysis, is a technique or procedure which uses Natural Language processing (NLP) to classify the outcome from text. There are various NLP tools available which are used for processing text data. Multiple research have been done in opinion mining for online blogs, Twitter, Facebook etc. This paper proposes a new opinion mining technique using Support Vector Machine (SVM) and NLP tools on newspaper headlines. Relative words are generated using Stanford CoreNLP, which is passed to SVM using count vectorizer. On comparing three models using confusion matrix, results indicate that Tf-idf and Linear SVM provides better accuracy for smaller dataset. While for larger dataset, SGD and linear SVM model outperform other models.</p>

Download Full-text