Structured Data-Driven Operating Policies for Commodity Storage

Data‐Driven simulation of inelastic materials using structured data sets and tangential transition rules

PAMM ◽

10.1002/pamm.202000241 ◽

2021 ◽

Vol 20 (1) ◽

Author(s):

Kerem Ciftci ◽

Klaus Hackl

Keyword(s):

Structured Data ◽

Data Driven ◽

Data Sets ◽

Transition Rules

Natural language processing and machine learning of electronic health records for prediction of first-time suicide attempts

JAMIA Open ◽

10.1093/jamiaopen/ooab011 ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Fuchiang R Tsui ◽

Lingyun Shi ◽

Victor Ruiz ◽

Neal D Ryan ◽

Candice Biernesser ◽

...

Keyword(s):

Machine Learning ◽

Natural Language Processing ◽

Suicide Attempt ◽

Language Processing ◽

Suicide Attempts ◽

Large Data ◽

Structured Data ◽

Data Driven ◽

Data Driven Approach ◽

First Time

Objective: Limited research exists on predicting first-time suicide attempts, which account for two-thirds of suicide decedents. We aimed to predict first-time suicide attempts using a large data-driven approach that applies natural language processing (NLP) and machine learning (ML) to unstructured (narrative) clinical notes and structured electronic health record (EHR) data.

Methods: This case-control study included patients aged 10–75 years who were seen between 2007 and 2016 in emergency departments and inpatient units. Cases were first-time suicide attempts identified from coded diagnoses; controls were randomly selected patients without suicide attempts, regardless of demographics, at a ratio of nine controls per case. Four data-driven ML models were evaluated using 2-year historical EHR data prior to suicide attempt or control index visits, with prediction windows from 7 to 730 days. Patients without any historical notes were excluded. Model accuracy and robustness were evaluated on a blind dataset (30% of the cohort).

Results: The study cohort included 45 238 patients (5099 cases, 40 139 controls) comprising 54 651 variables from 5.7 million structured records and 798 665 notes. Using both unstructured and structured data resulted in significantly greater accuracy than using structured data alone (area under the curve [AUC]: 0.932 vs. 0.901; P < .001). The best-predicting model used 1726 variables with AUC = 0.932 (95% CI, 0.922–0.941). The model was robust across multiple prediction windows and across subgroups defined by demographics, point of most recent historical clinical contact, and depression diagnosis history.

Conclusions: Our large data-driven approach using both structured and unstructured EHR data demonstrated accurate and robust first-time suicide attempt prediction, and has the potential to be deployed across various populations and clinical settings.
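For readers unfamiliar with the AUC figures quoted in this abstract, the following is a minimal sketch of how an area under the ROC curve can be computed from model scores, using the pairwise-ranking definition. The function name `auc` and the toy scores are assumptions made for illustration; they are not the study's code or data.

```python
def auc(scores_pos, scores_neg):
    """Fraction of (case, control) pairs in which the case gets the higher
    score; ties count as half. This equals the area under the ROC curve."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical risk scores (illustrative only, not from the study).
case_scores = [0.9, 0.8, 0.4]
control_scores = [0.7, 0.3, 0.2, 0.1]

toy_auc = auc(case_scores, control_scores)
print(f"toy AUC = {toy_auc:.3f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation of cases from controls, which is the scale on which the abstract's 0.932 and 0.901 are reported.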

Five Years of Argument Mining: a Data-driven Analysis

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/766 ◽

2018 ◽

Cited By ~ 11

Author(s):

Elena Cabrio ◽

Serena Villata

Keyword(s):

Artificial Intelligence ◽

Computational Models ◽

Research Area ◽

Research Topic ◽

Structured Data ◽

Data Driven ◽

Small Community ◽

Research Areas ◽

Open Discussion ◽

Community Of Researchers

Argument mining is the research area that aims to extract natural language arguments and their relations from text, with the final goal of providing machine-processable structured data for computational models of argument. This research topic started to attract the attention of a small community of researchers around 2014, and it is now counted as one of the most promising research areas in Artificial Intelligence in terms of community growth, funded projects, and involvement of companies. In this paper, we present the argument mining tasks and discuss the results obtained in the area from a data-driven perspective. An open discussion highlights the main weaknesses of the existing work in the literature and proposes open challenges to be faced in the future.

Integration of Data-Driven Process Re-Engineering and Process Interdependency for Manufacturing Optimization Supported by Smart Structured Data

Designs ◽

10.3390/designs3030044 ◽

2019 ◽

Vol 3 (3) ◽

pp. 44 ◽

Cited By ~ 2

Author(s):

Md Ashikul Alam Khan ◽

Javaid Butt ◽

Habtom Mebrahtu ◽

Hassan Shirvani ◽

Alireza Sanaei ◽

...

Keyword(s):

Decision Making ◽

Process Optimization ◽

Structured Data ◽

Production Optimization ◽

Data Driven ◽

Implementation Phase ◽

Cause And Effect ◽

Manufacturing Optimization ◽

Line Process

Process re-engineering and optimization in manufacturing industries is a big challenge because of process interdependencies characterized by a high failure rate. Research has shown that over 70% of approaches fail because of complexity as a result of process interdependencies during the implementation phase. This paper investigates data from a manufacturing operation and designs a filtration algorithm to analyze process interdependencies as a new approach for process optimization. The algorithm examines the data from a manufacturing process to identify limitations through cause and effect relationships and implements changes to achieve an optimized result. The proposed cause and effect approach of re-engineering is termed the Khan-Hassan-Butt (KHB) methodology, and it can filter the process interdependencies and use those as key decision-making tools. It provides an improved process optimization framework that incorporates data analysis along with a cause and effect algorithm to filter out the process interdependencies as an approach to increase output and reduce failure factors simultaneously. It also provides a framework for filtering the manufacturing data into smart structured data. Based on the proposed KHB methodology, the study investigated a production line process using the WITNESS Horizon 22 simulation package and analyzed the efficiency of the proposed approach for production optimization. A case study is provided that integrated the KHB methodology with data-driven process re-engineering to analyze the process interdependencies to use them as decision-making tools for production optimization.

Model‐free Data‐Driven simulation of inelastic materials using structured data sets, tangent space information and transition rules ‐ convergence test

PAMM ◽

10.1002/pamm.202100231 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Kerem Ciftci ◽

Klaus Hackl

Keyword(s):

Tangent Space ◽

Structured Data ◽

Data Driven ◽

Data Sets ◽

Convergence Test ◽

Model Free ◽

Free Data ◽

Transition Rules

Data-Based Fault Diagnosis Model Using a Bayesian Causal Analysis Framework

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622018500025 ◽

2018 ◽

Vol 17 (02) ◽

pp. 583-620 ◽

Cited By ~ 1

Author(s):

Thierno M. L. Diallo ◽

Sébastien Henry ◽

Yacine Ouzrout ◽

Abdelaziz Bouras

Keyword(s):

Data Model ◽

Causal Analysis ◽

Accurate Diagnosis ◽

Structured Data ◽

Manufacturing Industries ◽

Data Driven ◽

Analysis Framework ◽

Tennessee Eastman Process ◽

Diagnosis Model

This paper provides a comprehensive data-driven diagnosis approach applicable to complex manufacturing industries. The proposed approach is based on the Bayesian network paradigm. Both the implementation of the Bayesian model (the structure and parameters of the network) and the use of the resulting model for diagnosis are presented. The paper addresses the construction of the structure, taking into account the explosion in the number of variables, as well as the determination of the network’s parameters. A diagnosis procedure using the developed Bayesian framework is proposed. To provide the structured data required for the construction and use of the diagnosis model, a unitary traceability data model is proposed, and its use for forward and backward traceability is explained. Finally, an industrial benchmark, the Tennessee Eastman process, is used to show the ability of the developed framework to make an accurate diagnosis.
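As a rough illustration of the probabilistic reasoning behind Bayesian diagnosis, the sketch below applies Bayes' rule to a single observation over a small set of candidate faults. It is a one-step update, not a full Bayesian network, and it is not the paper's framework. The fault names and all probabilities are hypothetical.

```python
def posterior(prior, likelihood):
    """Bayes' rule over a discrete set of candidate faults.

    prior:      dict mapping fault -> P(fault)
    likelihood: dict mapping fault -> P(observation | fault)
    Returns a dict mapping fault -> P(fault | observation).
    """
    joint = {f: prior[f] * likelihood[f] for f in prior}
    total = sum(joint.values())  # P(observation)
    return {f: v / total for f, v in joint.items()}


# Hypothetical numbers for illustration only.
prior = {"valve_stuck": 0.1, "sensor_drift": 0.2, "normal": 0.7}
likelihood = {"valve_stuck": 0.9, "sensor_drift": 0.3, "normal": 0.05}

post = posterior(prior, likelihood)
most_likely = max(post, key=post.get)
print(most_likely, round(post[most_likely], 3))
```

A Bayesian network generalizes this idea by factoring the joint distribution over many variables along a graph structure, which is what makes structure construction and parameter estimation the central problems.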

AceMesh: a structured data driven programming language for high performance computing

CCF Transactions on High Performance Computing ◽

10.1007/s42514-020-00047-4 ◽

2020 ◽

Vol 2 (4) ◽

pp. 309-322

Author(s):

Li Chen ◽

Shenglin Tang ◽

You Fu ◽

Xiran Gao ◽

Jie Guo ◽

...

Keyword(s):

High Performance Computing ◽

Programming Language ◽

High Performance ◽

Structured Data ◽

Data Driven ◽

Performance Computing

A status report on a section-based stratigraphic and palaeontological database – the Geobiodiversity Database

Earth System Science Data ◽

10.5194/essd-12-3443-2020 ◽

2020 ◽

Vol 12 (4) ◽

pp. 3443-3452

Author(s):

Hong-He Xu ◽

Zhi-Bin Niu ◽

Yan-Sen Chen

Keyword(s):

Big Data ◽

Quantitative Analysis ◽

Scientific Research ◽

Structured Data ◽

Data Driven ◽

Status Report ◽

Statistical Results ◽

Existing Data

Abstract. Big data are significant for quantitative analysis and contribute to data-driven scientific research and discoveries. Here a brief introduction is given to the Geobiodiversity Database (GBDB), a comprehensive stratigraphic and palaeontological database, and its data. The GBDB includes abundant geological records from China and has supported a series of scientific studies on the Paleozoic palaeogeography and the tectonic and biodiversity evolution of China. The data held in the GBDB, including newly collected data, are described in detail, and the statistical results and structure of the data are given. A comparison is drawn between the GBDB; the largest palaeobiological database, the Paleobiology Database (PBDB); and the geological rock database Macrostrat. The GBDB and other databases are complementary in palaeontological and stratigraphic research. The GBDB will continue to provide users with access to detailed palaeontological and stratigraphic data based on publications. Non-structured palaeontological and stratigraphic data will also be included in the GBDB and correlated with its existing data, making the GBDB more widely useful to both researchers and anyone who is interested in fossils and strata. The GBDB fossil and stratum dataset (Xu, 2020) is freely downloadable from https://doi.org/10.5281/zenodo.4245604.

Supplemental Material for Toward the Data-Driven Dissemination of Findings From Psychological Science

American Psychologist ◽

10.1037/amp0000721.supp ◽

2020 ◽

Keyword(s):

Data Driven ◽

Psychological Science

SchoolMatters Supports Data-Driven Choices and Decisions

PsycEXTRA Dataset ◽

10.1037/e313982005-001 ◽

2005 ◽

Keyword(s):

Data Driven
