scholarly journals Constructing high-fidelity phenotype knowledge graphs with a fine-grained semantic information model (Preprint)

2021 ◽  
Author(s):  
Lizong Deng ◽  
Luming Chen ◽  
Tao Yang ◽  
Mi Liu ◽  
Shicheng Li ◽  
...  

BACKGROUND Phenotypes characterize clinical manifestations of disease, which provide important information for diagnosis. Therefore, constructing phenotype knowledge graphs of disease is valuable to the development of artificial intelligence in medicine. However, phenotype knowledge graphs in current knowledge bases such as WikiData and DBpedia are coarse-grained knowledge graphs, because they only consider core concepts of phenotypes but neglects details (attributes) associated with phenotypes. OBJECTIVE To characterize details of disease phenotypes in clinical guidelines, we proposed a fine-grained semantic information model named PhenoSSU (Semantic Structured Unit of Phenotypes). METHODS PhenoSSU is an "entity-attribute-value" model by its very nature, which aims to capture full semantics underlying phenotype descriptions with a series of attributes and values. 193 clinical guidelines of infectious diseases from Wikipedia were selected as the study corpus, and 12 attributes from SNOMED-CT were introduced into the PhenoSSU model based on co-occurrences of phenotype concepts and attribute values. The expressive power of the PhenoSSU model was evaluated by analyzing whether a PhenoSSU instance could capture full semantic underlying the corresponding phenotype description. To automatically construct fine-grained phenotype knowledge graphs, A hybrid strategy that firstly recognized phenotype concepts with the MetaMap tool and then predicted attribute values of phenotypes with machine learning classifiers was developed. RESULTS Fine-grained phenotype knowledge graphs of 193 infectious diseases were manually constructed with the BRAT annotation tool. It was found that the PhenoSSU model could precisely represent 89.5% (3757/4020) of phenotype descriptions in clinical guidelines. By comparison, other information models such as the Clinical Element Model and the HL7 FHIR model could only capture full semantics underlying 48.4% and 21.8% of phenotype descriptions, respectively. The hybrid strategy achieved an F1-score of 0.732 for the subtask of phenotype concept recognition and an average weighted accuracy of 0.776 for the subtask of attribute value prediction. CONCLUSIONS PhenoSSU is an effective information model for the precise representation of phenotype knowledge in clinical guidelines, and machine learning can be used to improve efficiency for constructing PhenoSSU-based knowledge graphs. Our work will potentially benefit knowledge-based systems for diagnosis.

2021 ◽  
Author(s):  
Lizong Deng ◽  
Luming Chen ◽  
Tao Yang ◽  
Mi Liu ◽  
Shicheng Li ◽  
...  

UNSTRUCTURED In “Constructing High-Fidelity Phenotype Knowledge Graphs for Infectious Diseases With a Fine-Grained Semantic Information Model: Development and Usability Study” (J Med Internet Res 2021;23(6):e26892) the authors noted one error. The institution name of affiliation “Suzhou Institute of Systems Medicine” was not correct. It should be corrected from “Suzhou Institute of Systems Medicine” to “Center of Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College; Suzhou Institute of Systems Medicine”


2021 ◽  
Vol 12 ◽  
Author(s):  
Bram Vanroy ◽  
Moritz Schaeffer ◽  
Lieve Macken

Characteristics of the translation product are often used in translation process research as predictors for cognitive load, and by extension translation difficulty. In the last decade, user-activity information such as eye-tracking data has been increasingly employed as an experimental tool for that purpose. In this paper, we take a similar approach. We look for significant effects that different predictors may have on three different eye-tracking measures: First Fixation Duration (duration of first fixation on a token), Eye-Key Span (duration between first fixation on a token and the first keystroke contributing to its translation), and Total Reading Time on source tokens (sum of fixations on a token). As predictors we make use of a set of established metrics involving (lexico)semantics and word order, while also investigating the effect of more recent ones concerning syntax, semantics or both. Our results show a, particularly late, positive effect of many of the proposed predictors, suggesting that both fine-grained metrics of syntactic phenomena (such as word reordering) as well as coarse-grained ones (encapsulating both syntactic and semantic information) contribute to translation difficulties. The effect on especially late measures may indicate that the linguistic phenomena that our metrics capture (e.g., word reordering) are resolved in later stages during cognitive processing such as problem-solving and revision.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Zengyi Qin ◽  
Jiansheng Chen ◽  
Zhenyu Jiang ◽  
Xumin Yu ◽  
Chunhua Hu ◽  
...  

AbstractDue to its importance in clinical science, the estimation of physiological states (e.g., the severity of pathological tremor) has aroused growing interest in machine learning community. While the physiological state is a continuous variable, its continuity is lost when the physiological state is quantized into a few discrete classes during recording and labeling. The discreteness introduces misalignment between the true value and its label, meaning that these labels are unfortunately imprecise and coarse-grained. Most previous work did not consider the inaccuracy and directly utilized the coarse labels to train the machine learning algorithms, whose predictions are also coarse-grained. In this work, we propose to learn a precise, fine-grained estimation of physiological states using these coarse-grained ground truths. Established on mathematical rigorous proof, we utilize imprecise labels to restore the probabilistic distribution of precise labels in an approximate order-preserving fashion, then the deep neural network learns from this distribution and offers fine-grained estimation. We demonstrate the effectiveness of our approach in assessing the pathological tremor in Parkinson’s Disease and estimating the systolic blood pressure from bioelectrical signals.


Author(s):  
Wang Zheng-fang ◽  
Z.F. Wang

The main purpose of this study highlights on the evaluation of chloride SCC resistance of the material,duplex stainless steel,OOCr18Ni5Mo3Si2 (18-5Mo) and its welded coarse grained zone(CGZ).18-5Mo is a dual phases (A+F) stainless steel with yield strength:512N/mm2 .The proportion of secondary Phase(A phase) accounts for 30-35% of the total with fine grained and homogeneously distributed A and F phases(Fig.1).After being welded by a specific welding thermal cycle to the material,i.e. Tmax=1350°C and t8/5=20s,microstructure may change from fine grained morphology to coarse grained morphology and from homogeneously distributed of A phase to a concentration of A phase(Fig.2).Meanwhile,the proportion of A phase reduced from 35% to 5-10°o.For this reason it is known as welded coarse grained zone(CGZ).In association with difference of microstructure between base metal and welded CGZ,so chloride SCC resistance also differ from each other.Test procedures:Constant load tensile test(CLTT) were performed for recording Esce-t curve by which corrosion cracking growth can be described, tf,fractured time,can also be recorded by the test which is taken as a electrochemical behavior and mechanical property for SCC resistance evaluation. Test environment:143°C boiling 42%MgCl2 solution is used.Besides, micro analysis were conducted with light microscopy(LM),SEM,TEM,and Auger energy spectrum(AES) so as to reveal the correlation between the data generated by the CLTT results and micro analysis.


Author(s):  
Zhuliang Yao ◽  
Shijie Cao ◽  
Wencong Xiao ◽  
Chen Zhang ◽  
Lanshun Nie

In trained deep neural networks, unstructured pruning can reduce redundant weights to lower storage cost. However, it requires the customization of hardwares to speed up practical inference. Another trend accelerates sparse model inference on general-purpose hardwares by adopting coarse-grained sparsity to prune or regularize consecutive weights for efficient computation. But this method often sacrifices model accuracy. In this paper, we propose a novel fine-grained sparsity approach, Balanced Sparsity, to achieve high model accuracy with commercial hardwares efficiently. Our approach adapts to high parallelism property of GPU, showing incredible potential for sparsity in the widely deployment of deep learning services. Experiment results show that Balanced Sparsity achieves up to 3.1x practical speedup for model inference on GPU, while retains the same high model accuracy as finegrained sparsity.


Author(s):  
Jonas Austerjost ◽  
Robert Söldner ◽  
Christoffer Edlund ◽  
Johan Trygg ◽  
David Pollard ◽  
...  

Machine vision is a powerful technology that has become increasingly popular and accurate during the last decade due to rapid advances in the field of machine learning. The majority of machine vision applications are currently found in consumer electronics, automotive applications, and quality control, yet the potential for bioprocessing applications is tremendous. For instance, detecting and controlling foam emergence is important for all upstream bioprocesses, but the lack of robust foam sensing often leads to batch failures from foam-outs or overaddition of antifoam agents. Here, we report a new low-cost, flexible, and reliable foam sensor concept for bioreactor applications. The concept applies convolutional neural networks (CNNs), a state-of-the-art machine learning system for image processing. The implemented method shows high accuracy for both binary foam detection (foam/no foam) and fine-grained classification of foam levels.


2021 ◽  
Vol 83 (4) ◽  
Author(s):  
S. Adam Soule ◽  
Michael Zoeller ◽  
Carolyn Parcheta

AbstractHawaiian and other ocean island lava flows that reach the coastline can deposit significant volumes of lava in submarine deltas. The catastrophic collapse of these deltas represents one of the most significant, but least predictable, volcanic hazards at ocean islands. The volume of lava deposited below sea level in delta-forming eruptions and the mechanisms of delta construction and destruction are rarely documented. Here, we report on bathymetric surveys and ROV observations following the Kīlauea 2018 eruption that, along with a comparison to the deltas formed at Pu‘u ‘Ō‘ō over the past decade, provide new insight into delta formation. Bathymetric differencing reveals that the 2018 deltas contain more than half of the total volume of lava erupted. In addition, we find that the 2018 deltas are comprised largely of coarse-grained volcanic breccias and intact lava flows, which contrast with those at Pu‘u ‘Ō‘ō that contain a large fraction of fine-grained hyaloclastite. We attribute this difference to less efficient fragmentation of the 2018 ‘a‘ā flows leading to fragmentation by collapse rather than hydrovolcanic explosion. We suggest a mechanistic model where the characteristic grain size influences the form and stability of the delta with fine grain size deltas (Pu‘u ‘Ō‘ō) experiencing larger landslides with greater run-out supported by increased pore pressure and with coarse grain size deltas (Kīlauea 2018) experiencing smaller landslides that quickly stop as the pore pressure rapidly dissipates. This difference, if validated for other lava deltas, would provide a means to assess potential delta stability in future eruptions.


Sign in / Sign up

Export Citation Format

Share Document