Early Detection of Adverse Drug Reactions in Social Health Networks: A Natural Language Processing Pipeline for Signal Detection (Preprint)

Mapping Intimacies ◽

10.2196/preprints.11264 ◽

2018 ◽

Author(s):

Azadeh Nikfarjam ◽

Julia D Ransohoff ◽

Alison Callahan ◽

Erik Jones ◽

Brian Loew ◽

...

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Adverse Drug Reactions ◽

Language Processing ◽

Signal Generation ◽

Social Health ◽

Drug Reactions ◽

Health Networks ◽

Health Concerns ◽

Patient Reports

BACKGROUND Adverse drug reactions (ADRs) occur in nearly all patients on chemotherapy, causing morbidity and therapy disruptions. Detection of such ADRs is limited in clinical trials, which are underpowered to detect rare events. Early recognition of ADRs in the postmarketing phase could substantially reduce morbidity and decrease societal costs. Internet community health forums provide a mechanism for individuals to discuss real-time health concerns and can enable computational detection of ADRs. OBJECTIVE The goal of this study is to identify cutaneous ADR signals in social health networks and compare the frequency and timing of these ADRs to clinical reports in the literature. METHODS We present a natural language processing-based, ADR signal-generation pipeline based on patient posts on Internet social health networks. We identified user posts from the Inspire health forums related to two chemotherapy classes: erlotinib, an epidermal growth factor receptor inhibitor, and nivolumab and pembrolizumab, immune checkpoint inhibitors. We extracted mentions of ADRs from unstructured content of patient posts. We then performed population-level association analyses and time-to-detection analyses. RESULTS Our system detected cutaneous ADRs from patient reports with high precision (0.90) and at frequencies comparable to those documented in the literature but an average of 7 months ahead of their literature reporting. Known ADRs were associated with higher proportional reporting ratios compared to negative controls, demonstrating the robustness of our analyses. Our named entity recognition system achieved a 0.738 microaveraged F-measure in detecting ADR entities, not limited to cutaneous ADRs, in health forum posts. Additionally, we discovered the novel ADR of hypohidrosis reported by 23 patients in erlotinib-related posts; this ADR was absent from 15 years of literature on this medication and we recently reported the finding in a clinical oncology journal. CONCLUSIONS Several hundred million patients report health concerns in social health networks, yet this information is markedly underutilized for pharmacosurveillance. We demonstrated the ability of a natural language processing-based signal-generation pipeline to accurately detect patient reports of ADRs months in advance of literature reporting and the robustness of statistical analyses to validate system detections. Our findings suggest the important contributions that social health network data can play in contributing to more comprehensive and timely pharmacovigilance.

Download Full-text

Early Detection of Adverse Drug Reactions in Social Health Networks: A Natural Language Processing Pipeline for Signal Detection

JMIR Public Health and Surveillance ◽

10.2196/11264 ◽

2019 ◽

Vol 5 (2) ◽

pp. e11264 ◽

Cited By ~ 10

Author(s):

Azadeh Nikfarjam ◽

Julia D Ransohoff ◽

Alison Callahan ◽

Erik Jones ◽

Brian Loew ◽

...

Keyword(s):

Natural Language Processing ◽

Early Detection ◽

Natural Language ◽

Signal Detection ◽

Adverse Drug Reactions ◽

Language Processing ◽

Social Health ◽

Drug Reactions ◽

Health Networks ◽

Processing Pipeline

Download Full-text

Automatic Extraction of Adverse Drug Reactions from Summary of Product Characteristics

Applied Sciences ◽

10.3390/app11062663 ◽

2021 ◽

Vol 11 (6) ◽

pp. 2663

Author(s):

Zhengru Shen ◽

Marco Spruit

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Adverse Drug Reactions ◽

Language Processing ◽

European Medicines Agency ◽

Drug Reactions ◽

Clinical Practices ◽

Product Characteristics ◽

Textual Information ◽

Summary Of Product Characteristics

The summary of product characteristics from the European Medicines Agency is a reference document on medicines in the EU. It contains textual information for clinical experts on how to safely use medicines, including adverse drug reactions. Using natural language processing (NLP) techniques to automatically extract adverse drug reactions from such unstructured textual information helps clinical experts to effectively and efficiently use them in daily practices. Such techniques have been developed for Structured Product Labels from the Food and Drug Administration (FDA), but there is no research focusing on extracting from the Summary of Product Characteristics. In this work, we built a natural language processing pipeline that automatically scrapes the summary of product characteristics online and then extracts adverse drug reactions from them. Besides, we have made the method and its output publicly available so that it can be reused and further evaluated in clinical practices. In total, we extracted 32,797 common adverse drug reactions for 647 common medicines scraped from the Electronic Medicines Compendium. A manual review of 37 commonly used medicines has indicated a good performance, with a recall and precision of 0.99 and 0.934, respectively.

Download Full-text

Developing A Deep Learning Natural Language Processing Algorithm For Automated Reporting Of Adverse Drug Reactions

10.1101/2021.12.11.21267504 ◽

2021 ◽

Author(s):

Christopher McMaster ◽

Julia Chan ◽

David FL Liew ◽

Elizabeth Su ◽

Albert G Frauman ◽

...

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Adverse Events ◽

Natural Language ◽

Adverse Drug Reactions ◽

Language Processing ◽

Processing Algorithm ◽

Drug Reactions ◽

Discharge Summaries ◽

Natural Language Processing Algorithm

The detection of adverse drug reactions (ADRs) is critical to our understanding of the safety and risk-benefit profile of medications. With an incidence that has not changed over the last 30 years, ADRs are a significant source of patient morbidity, responsible for 5-10% of acute care hospital admissions worldwide. Spontaneous reporting of ADRs has long been the standard method of reporting, however this approach is known to have high rates of under-reporting, a problem that limits pharmacovigilance efforts. Automated ADR reporting presents an alternative pathway to increase reporting rates, although this may be limited by over-reporting of other drug-related adverse events. We developed a deep learning natural language processing algorithm to identify ADRs in discharge summaries at a single academic hospital centre. Our model was developed in two stages: first, a pre-trained model (DeBERTa) was further pre-trained on 150,000 unlabelled discharge summaries; secondly, this model was fine-tuned to detect ADR mentions in a corpus of 861 annotated discharge summaries. To ensure that our algorithm could differentiate ADRs from other drug-related adverse events, the annotated corpus was enriched for both validated ADR reports and confounding drug-related adverse events using. The final model demonstrated good performance with a ROC-AUC of 0.934 (95% CI 0.931 - 0.955) for the task of identifying discharge summaries containing ADR mentions.

Download Full-text

A vocabulary development and visualization tool based on natural language processing and the mining of textual patient reports

Journal of Biomedical Informatics ◽

10.1016/j.jbi.2003.08.005 ◽

2003 ◽

Vol 36 (3) ◽

pp. 189-201 ◽

Cited By ~ 9

Author(s):

Carol Friedman ◽

Hongfang Liu ◽

Lyudmila Shagina

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Vocabulary Development ◽

Visualization Tool ◽

Patient Reports

Download Full-text

Representing Information in Patient Reports Using Natural Language Processing and the Extensible Markup Language

Journal of the American Medical Informatics Association ◽

10.1136/jamia.1999.0060076 ◽

1999 ◽

Vol 6 (1) ◽

pp. 76-87 ◽

Cited By ~ 77

Author(s):

C. Friedman ◽

G. Hripcsak ◽

L. Shagina ◽

H. Liu

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Extensible Markup Language ◽

Markup Language ◽

Patient Reports ◽

Extensible Markup

Download Full-text

Natural Language Processing Combined with ICD-9-CM Codes as a Novel Method to Study the Epidemiology of Allergic Drug Reactions

The Journal of Allergy and Clinical Immunology In Practice ◽

10.1016/j.jaip.2019.12.007 ◽

2020 ◽

Vol 8 (3) ◽

pp. 1032-1038.e1 ◽

Cited By ~ 3

Author(s):

Aleena Banerji ◽

Kenneth H. Lai ◽

Yu Li ◽

Rebecca R. Saff ◽

Carlos A. Camargo ◽

...

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Drug Reactions ◽

Novel Method

Download Full-text

Natural Language Processing and Enhanced Clinical Decision Making Radiology and VINCI

PsycEXTRA Dataset ◽

10.1037/e615572012-015 ◽

2012 ◽

Author(s):

Eliot Siegel

Keyword(s):

Decision Making ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Clinical Decision Making ◽

Clinical Decision

Download Full-text

Natural Language Processing in the Clinical Setting

PsycEXTRA Dataset ◽

10.1037/e615572012-013 ◽

2012 ◽

Author(s):

Thomas H. Payne

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Clinical Setting

Download Full-text

A Review and evaluation of Machine Translation methods for Lumasaaba

Journal of Digital Science ◽

10.33847/2686-8296.2.1_1 ◽

2020 ◽

pp. 3-17

Author(s):

Peter Nabende

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Machine Translation ◽

Language Processing ◽

Research Area ◽

Data Driven ◽

East African ◽

Data Set ◽

African Languages ◽

Translation Methods

Natural Language Processing for under-resourced languages is now a mainstream research area. However, there are limited studies on Natural Language Processing applications for many indigenous East African languages. As a contribution to covering the current gap of knowledge, this paper focuses on evaluating the application of well-established machine translation methods for one heavily under-resourced indigenous East African language called Lumasaaba. Specifically, we review the most common machine translation methods in the context of Lumasaaba including both rule-based and data-driven methods. Then we apply a state of the art data-driven machine translation method to learn models for automating translation between Lumasaaba and English using a very limited data set of parallel sentences. Automatic evaluation results show that a transformer-based Neural Machine Translation model architecture leads to consistently better BLEU scores than the recurrent neural network-based models. Moreover, the automatically generated translations can be comprehended to a reasonable extent and are usually associated with the source language input.

Download Full-text