A review of auditing techniques for the Unified Medical Language System

Ling Zheng; Zhe He; Duo Wei; Vipina Keloth; Jung-Wei Fan; Luke Lindemann; Xinxin Zhu; James J Cimino; Yehoshua Perl

doi:10.1093/jamia/ocaa108

A review of auditing techniques for the Unified Medical Language System

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocaa108 ◽

2020 ◽

Vol 27 (10) ◽

pp. 1625-1638 ◽

Cited By ~ 4

Author(s):

Ling Zheng ◽

Zhe He ◽

Duo Wei ◽

Vipina Keloth ◽

Jung-Wei Fan ◽

...

Keyword(s):

Error Detection ◽

Easy Access ◽

Ontology Alignment ◽

Inclusion Criteria ◽

Semantic Type ◽

Language System ◽

Unified Medical Language System ◽

Level Of Automation ◽

Medical Language ◽

Meta Analyses

Abstract Objective The study sought to describe the literature related to the development of methods for auditing the Unified Medical Language System (UMLS), with particular attention to identifying errors and inconsistencies of attributes of the concepts in the UMLS Metathesaurus. Materials and Methods We applied the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) approach by searching the MEDLINE database and Google Scholar for studies referencing the UMLS and any of several terms related to auditing, error detection, and quality assurance. A qualitative analysis and summarization of articles that met inclusion criteria were performed. Results Eighty-three studies were reviewed in detail. We first categorized techniques based on various aspects including concepts, concept names, and synonymy (n = 37), semantic type assignments (n = 36), hierarchical relationships (n = 24), lateral relationships (n = 12), ontology enrichment (n = 8), and ontology alignment (n = 18). We also categorized the methods according to their level of automation (ie, automated systematic, automated heuristic, or manual) and the type of knowledge used (ie, intrinsic or extrinsic knowledge). Conclusions This study is a comprehensive review of the published methods for auditing the various conceptual aspects of the UMLS. Categorizing the auditing techniques according to the various aspects will enable the curators of the UMLS as well as researchers comprehensive easy access to this wealth of knowledge (eg, for auditing lateral relationships in the UMLS). We also reviewed ontology enrichment and alignment techniques due to their critical use of and impact on the UMLS.

Download Full-text

An Interoperable UMLS Terminology Service Using FHIR

Future Internet ◽

10.3390/fi12110199 ◽

2020 ◽

Vol 12 (11) ◽

pp. 199

Author(s):

Rishi Saripalle ◽

Mehdi Sookhak ◽

Mahboobeh Haghparast

Keyword(s):

Knowledge Structure ◽

Technical Skills ◽

Easy Access ◽

Light Weight ◽

Language System ◽

Unified Medical Language System ◽

Internal Working ◽

Medical Language ◽

Medical Vocabulary ◽

Complex Knowledge

The Unified Medical Language System (UMLS) is an internationally recognized medical vocabulary that enables semantic interoperability across various biomedical terminologies. To use its knowledge, the users must understand its complex knowledge structure, a structure that is not interoperable or is not compliant with any known biomedical and healthcare standard. Further, the users also need to have good technical skills to understand its inner working and interact with UMLS in general. These barriers might cause UMLS usage concerns among inter-disciplinary users in biomedical and healthcare informatics. Currently, there exists no terminology service that normalizes UMLS’s complex knowledge structure to a widely accepted interoperable healthcare standard and allows easy access to its knowledge, thus hiding its workings. The objective of this research is to design and implement a light-weight terminology service that allows easy access to UMLS knowledge structured using the fast health interoperability resources (FHIR) standard, a widely accepted interoperability healthcare standard. The developed terminology service, named UMLS FHIR, leverages FHIR resources and features, and can easily be integrated into any application to consume UMLS knowledge in the FHIR format without the need to understand UMLS’s native knowledge structure and its internal working.

Download Full-text

Unified Medical Language System resources improve sieve-based generation and Bidirectional Encoder Representations from Transformers (BERT)–based ranking for concept normalization

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocaa080 ◽

2020 ◽

Vol 27 (10) ◽

pp. 1510-1519

Author(s):

Dongfang Xu ◽

Manoj Gopale ◽

Jiacheng Zhang ◽

Kris Brown ◽

Edmon Begoli ◽

...

Keyword(s):

Neural Network ◽

Relation Extraction ◽

Training Data ◽

Shared Task ◽

Semantic Type ◽

Language System ◽

Unified Medical Language System ◽

Medical Language ◽

Rank System ◽

Semantic Types

Abstract Objective Concept normalization, the task of linking phrases in text to concepts in an ontology, is useful for many downstream tasks including relation extraction, information retrieval, etc. We present a generate-and-rank concept normalization system based on our participation in the 2019 National NLP Clinical Challenges Shared Task Track 3 Concept Normalization. Materials and Methods The shared task provided 13 609 concept mentions drawn from 100 discharge summaries. We first design a sieve-based system that uses Lucene indices over the training data, Unified Medical Language System (UMLS) preferred terms, and UMLS synonyms to generate a list of possible concepts for each mention. We then design a listwise classifier based on the BERT (Bidirectional Encoder Representations from Transformers) neural network to rank the candidate concepts, integrating UMLS semantic types through a regularizer. Results Our generate-and-rank system was third of 33 in the competition, outperforming the candidate generator alone (81.66% vs 79.44%) and the previous state of the art (76.35%). During postevaluation, the model’s accuracy was increased to 83.56% via improvements to how training data are generated from UMLS and incorporation of our UMLS semantic type regularizer. Discussion Analysis of the model shows that prioritizing UMLS preferred terms yields better performance, that the UMLS semantic type regularizer results in qualitatively better concept predictions, and that the model performs well even on concepts not seen during training. Conclusions Our generate-and-rank framework for UMLS concept normalization integrates key UMLS features like preferred terms and semantic types with a neural network–based ranking model to accurately link phrases in text to UMLS concepts.

Download Full-text

Unified Medical Language System

10.32388/urur42 ◽

2020 ◽

Cited By ~ 2

Author(s):

Keyword(s):

Language System ◽

Unified Medical Language System ◽

Medical Language

Download Full-text

Using Semantic and Structural Properties of the Unified Medical Language System to Discover Potential Terminological Relationships

Journal of the American Medical Informatics Association ◽

10.1197/jamia.m2931 ◽

2009 ◽

Vol 16 (3) ◽

pp. 346-353 ◽

Cited By ~ 10

Author(s):

C. O. Patel ◽

J. J. Cimino

Keyword(s):

Structural Properties ◽

Language System ◽

Unified Medical Language System ◽

Medical Language

Download Full-text

Auditing the Unified Medical Language System with Semantic Methods

Journal of the American Medical Informatics Association ◽

10.1136/jamia.1998.0050041 ◽

1998 ◽

Vol 5 (1) ◽

pp. 41-51 ◽

Cited By ~ 48

Author(s):

J. J. Cimino

Keyword(s):

Language System ◽

Unified Medical Language System ◽

Medical Language

Download Full-text

The outline of Unified Medical Language System(UMLS) Knowledge Sources.

Journal of Information Processing and Management ◽

10.1241/johokanri.41.15 ◽

1998 ◽

Vol 41 (1) ◽

pp. 15-23

Author(s):

Koreni KAWANO

Keyword(s):

Knowledge Sources ◽

Language System ◽

Unified Medical Language System ◽

Medical Language

Download Full-text

Unified Medical Language System

Electronic Health Record ◽

10.1002/9781118479612.ch16 ◽

2012 ◽

pp. 145-152 ◽

Cited By ~ 1

Keyword(s):

Language System ◽

Unified Medical Language System ◽

Medical Language

Download Full-text

IAIMS and UMLS at Columbia-Presbyterian Medical Center

Medical Decision Making ◽

10.1177/0272989x9101104s17 ◽

1991 ◽

Vol 11 (4_suppl) ◽

pp. S89-S93 ◽

Cited By ~ 4

Author(s):

James J. Cimino ◽

Soumitra Sengupta

Keyword(s):

Information Management ◽

Management System ◽

Medical Center ◽

Information Management System ◽

Language System ◽

Unified Medical Language System ◽

Medical Language ◽

Academic Information

The authors use an example to illustrate combining Integrated Academic Information Management System (IAIMS) components (applications) into an integral whole, to facilitate using the components simultaneously or in sequence. They examine a model for classifying IAIMS systems, proposing ways in which the Unified Medical Language System (UMLS) can be exploited in them.

Download Full-text

Mining Biomedical Data Using MetaMap Transfer (MMTx) and the Unified Medical Language System (UMLS)

Gene Function Analysis - Methods in Molecular Biology™ ◽

10.1007/978-1-59745-547-3_9 ◽

2007 ◽

pp. 153-169 ◽

Cited By ~ 15

Author(s):

John D. Osborne ◽

Simon Lin ◽

Lihua Julie Zhu ◽

Warren A. Kibbe

Keyword(s):

Biomedical Data ◽

Language System ◽

Unified Medical Language System ◽

Medical Language

Download Full-text

Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocaa136 ◽

2020 ◽

Vol 27 (10) ◽

pp. 1538-1546 ◽

Cited By ~ 1

Author(s):

Yuqing Mao ◽

Kin Wah Fung

Keyword(s):

Word Sense Disambiguation ◽

Graph Embedding ◽

Semantic Relatedness ◽

Word Sense ◽

Medical Subject Headings ◽

Network Graph ◽

Convolutional Network ◽

Language System ◽

Unified Medical Language System ◽

Medical Language

Abstract Objective The study sought to explore the use of deep learning techniques to measure the semantic relatedness between Unified Medical Language System (UMLS) concepts. Materials and Methods Concept sentence embeddings were generated for UMLS concepts by applying the word embedding models BioWordVec and various flavors of BERT to concept sentences formed by concatenating UMLS terms. Graph embeddings were generated by the graph convolutional networks and 4 knowledge graph embedding models, using graphs built from UMLS hierarchical relations. Semantic relatedness was measured by the cosine between the concepts’ embedding vectors. Performance was compared with 2 traditional path-based (shortest path and Leacock-Chodorow) measurements and the publicly available concept embeddings, cui2vec, generated from large biomedical corpora. The concept sentence embeddings were also evaluated on a word sense disambiguation (WSD) task. Reference standards used included the semantic relatedness and semantic similarity datasets from the University of Minnesota, concept pairs generated from the Standardized MedDRA Queries and the MeSH (Medical Subject Headings) WSD corpus. Results Sentence embeddings generated by BioWordVec outperformed all other methods used individually in semantic relatedness measurements. Graph convolutional network graph embedding uniformly outperformed path-based measurements and was better than some word embeddings for the Standardized MedDRA Queries dataset. When used together, combined word and graph embedding achieved the best performance in all datasets. For WSD, the enhanced versions of BERT outperformed BioWordVec. Conclusions Word and graph embedding techniques can be used to harness terms and relations in the UMLS to measure semantic relatedness between concepts. Concept sentence embedding outperforms path-based measurements and cui2vec, and can be further enhanced by combining with graph embedding.

Download Full-text