scholarly journals Projection Word Embedding Model With Hybrid Sampling Training for Classifying ICD-10-CM Codes: Longitudinal Observational Study

10.2196/14499 ◽  
2019 ◽  
Vol 7 (3) ◽  
pp. e14499 ◽  
Author(s):  
Chin Lin ◽  
Yu-Sheng Lou ◽  
Dung-Jang Tsai ◽  
Chia-Cheng Lee ◽  
Chia-Jung Hsu ◽  
...  
2019 ◽  
Author(s):  
Chin Lin ◽  
Yu-Sheng Lou ◽  
Dung-Jang Tsai ◽  
Chia-Cheng Lee ◽  
Chia-Jung Hsu ◽  
...  

BACKGROUND Most current state-of-the-art models for searching the International Classification of Diseases, Tenth Revision Clinical Modification (ICD-10-CM) codes use word embedding technology to capture useful semantic properties. However, they are limited by the quality of initial word embeddings. Word embedding trained by electronic health records (EHRs) is considered the best, but the vocabulary diversity is limited by previous medical records. Thus, we require a word embedding model that maintains the vocabulary diversity of open internet databases and the medical terminology understanding of EHRs. Moreover, we need to consider the particularity of the disease classification, wherein discharge notes present only positive disease descriptions. OBJECTIVE We aimed to propose a projection word2vec model and a hybrid sampling method. In addition, we aimed to conduct a series of experiments to validate the effectiveness of these methods. METHODS We compared the projection word2vec model and traditional word2vec model using two corpora sources: English Wikipedia and PubMed journal abstracts. We used seven published datasets to measure the medical semantic understanding of the word2vec models and used these embeddings to identify the three–character-level ICD-10-CM diagnostic codes in a set of discharge notes. On the basis of embedding technology improvement, we also tried to apply the hybrid sampling method to improve accuracy. The 94,483 labeled discharge notes from the Tri-Service General Hospital of Taipei, Taiwan, from June 1, 2015, to June 30, 2017, were used. To evaluate the model performance, 24,762 discharge notes from July 1, 2017, to December 31, 2017, from the same hospital were used. Moreover, 74,324 additional discharge notes collected from seven other hospitals were tested. The F-measure, which is the major global measure of effectiveness, was adopted. RESULTS In medical semantic understanding, the original EHR embeddings and PubMed embeddings exhibited superior performance to the original Wikipedia embeddings. After projection training technology was applied, the projection Wikipedia embeddings exhibited an obvious improvement but did not reach the level of original EHR embeddings or PubMed embeddings. In the subsequent ICD-10-CM coding experiment, the model that used both projection PubMed and Wikipedia embeddings had the highest testing mean F-measure (0.7362 and 0.6693 in Tri-Service General Hospital and the seven other hospitals, respectively). Moreover, the hybrid sampling method was found to improve the model performance (F-measure=0.7371/0.6698). CONCLUSIONS The word embeddings trained using EHR and PubMed could understand medical semantics better, and the proposed projection word2vec model improved the ability of medical semantics extraction in Wikipedia embeddings. Although the improvement from the projection word2vec model in the real ICD-10-CM coding task was not substantial, the models could effectively handle emerging diseases. The proposed hybrid sampling method enables the model to behave like a human expert.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Ingmar Schäfer ◽  
Heike Hansen ◽  
Agata Menzel ◽  
Marion Eisele ◽  
Daniel Tajdar ◽  
...  

Abstract Objectives The aims of our study were to describe the effect of the COVID-19 pandemic and lockdown on primary care in Germany regarding the number of consultations, the prevalence of specific reasons for consultation presented by the patients, and the frequency of specific services performed by the GP. Methods We conducted a longitudinal observational study based on standardised GP interviews in a quota sampling design comparing the time before the COVID-19 pandemic (12 June 2015 to 27 April 2017) with the time during lockdown (21 April to 14 July 2020). The sample included GPs in urban and rural areas 120 km around Hamburg, Germany, and was stratified by region type and administrative districts. Differences in the consultation numbers were analysed by multivariate linear regressions in mixed models adjusted for random effects on the levels of the administrative districts and GP practices. Results One hundred ten GPs participated in the follow-up, corresponding to 52.1% of the baseline. Primary care practices in 32 of the 37 selected administrative districts (86.5%) could be represented in both assessments. At baseline, GPs reported 199.6 ± 96.9 consultations per week, which was significantly reduced during COVID-19 lockdown by 49.0% to 101.8 ± 67.6 consultations per week (p < 0.001). During lockdown, the frequency of five reasons for consultation (-43.0% to -31.5%) and eleven services (-56.6% to -33.5%) had significantly decreased. The multilevel, multivariable analyses showed an average reduction of 94.6 consultations per week (p < 0.001). Conclusions We observed a dramatic reduction of the number of consultations in primary care. This effect was independent of age, sex and specialty of the GP and independent of the practice location in urban or rural areas. Consultations for complaints like low back pain, gastrointestinal complaints, vertigo or fatigue and services like house calls/calls at nursing homes, wound treatments, pain therapy or screening examinations for the early detection of chronic diseases were particularly affected.


2002 ◽  
Vol 96 (8) ◽  
pp. 576-584 ◽  
Author(s):  
Tana D'Allura

This longitudinal, observational study of 13 children in a preschool for children with visual impairments examined the effects of reverse mainstreaming, in combination with the cooperative learning strategy, on the social interaction patterns of preschoolers with and without visual impairments. It found that the type of environment provided and the learning strategies used affect both whether and how children relate to their environment.


2014 ◽  
Vol 27 ◽  
pp. 45-50 ◽  
Author(s):  
Tha Han ◽  
Myriam Alexander ◽  
Aphrodite Niggebrugge ◽  
Gareth J. Hollands ◽  
Theresa M. Marteau

2021 ◽  
Vol 254 ◽  
pp. 108999
Author(s):  
Paolo Capozza ◽  
Gianvito Lanave ◽  
Georgia Diakoudi ◽  
Fabio Stasi ◽  
Paola Ghergo ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document