scholarly journals Supporting semiconductor manufacturing simulation tools using a structured data model

Author(s):  
Susan S. Baum ◽  
Peter G. Glassey
2021 ◽  
Vol 2083 (4) ◽  
pp. 042001
Author(s):  
Nan Zhang ◽  
Wenqiang Zhang ◽  
Yingnan Shang

Abstract The emergence of computer big data related data provides a new method for the construction of knowledge links in the knowledge map. This realizes an objective knowledge network with practical significance that is easier to be understood by machines. The article combines the four principles of linked data publishing content objects and their semantic characteristics, and uses the RDF data model to convert unstructured data on the Internet and structured data that adopts different standards into unified standard structured data for association. The system forms a huge knowledge map with semantics, intelligence, and dynamics.


Author(s):  
Sijia Liu ◽  
Yanshan Wang ◽  
Andrew Wen ◽  
Liwei Wang ◽  
Na Hong ◽  
...  

BACKGROUND Widespread adoption of electronic health records has enabled the secondary use of electronic health record data for clinical research and health care delivery. Natural language processing techniques have shown promise in their capability to extract the information embedded in unstructured clinical data, and information retrieval techniques provide flexible and scalable solutions that can augment natural language processing systems for retrieving and ranking relevant records. OBJECTIVE In this paper, we present the implementation of a cohort retrieval system that can execute textual cohort selection queries on both structured data and unstructured text—Cohort Retrieval Enhanced by Analysis of Text from Electronic Health Records (CREATE). METHODS CREATE is a proof-of-concept system that leverages a combination of structured queries and information retrieval techniques on natural language processing results to improve cohort retrieval performance using the Observational Medical Outcomes Partnership Common Data Model to enhance model portability. The natural language processing component was used to extract common data model concepts from textual queries. We designed a hierarchical index to support the common data model concept search utilizing information retrieval techniques and frameworks. RESULTS Our case study on 5 cohort identification queries, evaluated using the precision at 5 information retrieval metric at both the patient-level and document-level, demonstrates that CREATE achieves a mean precision at 5 of 0.90, which outperforms systems using only structured data or only unstructured text with mean precision at 5 values of 0.54 and 0.74, respectively. CONCLUSIONS The implementation and evaluation of Mayo Clinic Biobank data demonstrated that CREATE outperforms cohort retrieval systems that only use one of either structured data or unstructured text in complex textual cohort queries.


Author(s):  
Cristóbal J. Carmona ◽  
María J. Jesus ◽  
Pablo Guerrero ◽  
Reyes Peña-Santiago ◽  
Víctor M. Rivas
Keyword(s):  

2011 ◽  
Vol 268-270 ◽  
pp. 1868-1873
Author(s):  
Li Jun Yang

The existence of heterogeneous data sources brings great inconvenience to realize the exchange visits to data between different information systems. Therefore, it becomes a meaningful research topic to solve the problem of realizing convenient and flexible exchange visits. This paper combines the data representation format of XML generally used in current network with an interaction technique of WebService, and constructs a UDM data model, which can implement structured data of relational type as well as describe unstructured data and self-describing semi-structured data. So UDM data model can be used as a common data model integrated by heterogeneous data to integrate these heterogeneous data.


Sign in / Sign up

Export Citation Format

Share Document