Missing Data Techniques
Recently Published Documents


TOTAL DOCUMENTS: 53 (FIVE YEARS: 3)

H-INDEX: 14 (FIVE YEARS: 0)

2021 ◽  
Author(s):  
Xijuan Zhang

Missing data are common in psychological and educational research. With the improvement in computing technology in recent decades, more researchers have begun developing missing data techniques. In their research, they often conduct Monte Carlo simulation studies to compare the performance of different missing data techniques. During such simulation studies, researchers must generate missing data in the simulated dataset by deciding which data values to delete. However, the current literature offers few guidelines on how to generate missing data for simulation studies. Our paper is one of the first to examine ways of generating missing data for simulation studies. We emphasize the importance of specifying missing data rules, which are statistical models for generating missing data. We begin the paper by reviewing the types of missing data mechanisms and missing data patterns. We then explain how to specify missing data rules to generate missing data with different mechanisms and patterns. We end the paper by presenting recommendations for generating missing data for simulation studies.
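The abstract describes missing data rules only in prose. As a minimal illustrative sketch (not the authors' actual procedure), a missing data rule for a MAR mechanism can be written as a logistic model in which the probability of deleting y depends only on the fully observed x; the `intercept` and `slope` parameters here are hypothetical knobs controlling the overall missing rate and the strength of the mechanism:

```python
import math
import random

random.seed(42)

# Simulated complete dataset: x is fully observed, y will be made missing.
n = 1000
data = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(n)]

def logistic(z):
    return 1.0 / (1.0 + math.exp(-z))

def impose_mar(rows, intercept=-1.0, slope=1.5):
    """Missing data rule for a MAR mechanism: the probability that y is
    deleted depends only on the observed x. With slope=0 this reduces to
    MCAR (the same constant missing probability for every case)."""
    out = []
    for x, y in rows:
        p_missing = logistic(intercept + slope * x)
        out.append((x, None if random.random() < p_missing else y))
    return out

mar_data = impose_mar(data)
missing_rate = sum(1 for _, y in mar_data if y is None) / n
```

A MNAR rule would instead let `p_missing` depend on the to-be-deleted y itself; the same logistic form works, which is why specifying the rule explicitly matters when reporting a simulation design.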


2021 ◽  
Vol 1 (1) ◽  
Author(s):  
Danielle M. Rodgers ◽  
Ross Jacobucci ◽  
Kevin J. Grimm

Decision trees (DTs) are a machine learning technique that searches the predictor space for the variable and observed value that lead to the best prediction when the data are split into two nodes based on that variable and splitting value. The algorithm repeats its search within each partition of the data until a stopping rule ends the search. Missing data can be problematic in DTs because an observation with a missing value on the chosen splitting variable cannot be placed into a node. Moreover, missing data can alter the variable selection process for the same reason. Simple missing data approaches (e.g., listwise deletion, majority rule, and surrogate splits) have been implemented in DT algorithms; however, more sophisticated missing data techniques have not been thoroughly examined. We propose a modified multiple imputation approach to handling missing data in DTs and compare this approach with the simple approaches, as well as with single imputation and multiple imputation with prediction averaging, via Monte Carlo simulation. The study evaluated the performance of each missing data approach when data were missing at random (MAR) or missing completely at random (MCAR). The proposed multiple imputation approach and surrogate splits had superior performance, with the proposed multiple imputation approach performing best in the more severe missing data conditions. We conclude with recommendations for handling missing data in DTs.
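One of the comparison conditions named above, multiple imputation with prediction averaging, can be sketched in a few lines. This is a hedged toy illustration, not the authors' implementation: the "tree" is a one-split regression stump and the imputation is a crude hot-deck draw from the observed values, both chosen purely for brevity. The idea it demonstrates is real, though: fit one tree per imputed dataset, then average their predictions.

```python
import random
from statistics import mean

random.seed(0)

# Toy data: y jumps at x = 0; about 30% of x values are missing (MCAR here).
n = 200
rows = []
for _ in range(n):
    x = random.gauss(0, 1)
    y = 2.0 * (x > 0) + random.gauss(0, 0.3)
    rows.append((None if random.random() < 0.3 else x, y))

observed_x = [x for x, _ in rows if x is not None]

def fit_stump(data):
    """One-split regression stump: pick the threshold that minimizes SSE."""
    best = None
    for t in sorted({x for x, _ in data}):
        left = [y for x, y in data if x <= t]
        right = [y for x, y in data if x > t]
        if not left or not right:
            continue
        ml, mr = mean(left), mean(right)
        sse = sum((y - ml) ** 2 for y in left) + sum((y - mr) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, t, ml, mr)
    _, t, ml, mr = best
    return lambda x: ml if x <= t else mr

def impute_once(data):
    """Crude stochastic imputation: draw a missing x from the observed values."""
    return [(random.choice(observed_x) if x is None else x, y) for x, y in data]

# Multiple imputation with prediction averaging: one stump per imputed
# dataset, predictions averaged across the M fitted stumps.
M = 5
stumps = [fit_stump(impute_once(rows)) for _ in range(M)]

def predict(x):
    return mean(s(x) for s in stumps)
```

The modified approach the paper proposes differs from this averaging scheme, but the sketch shows why imputation interacts with tree fitting: each imputed dataset can select a different split, and averaging smooths over that split variability.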


2020 ◽  
Vol 58 (11) ◽  
pp. 2863-2878
Author(s):  
Ali Idri ◽  
Ilham Kadi ◽  
Ibtissam Abnane ◽  
José Luis Fernandez-Aleman

Author(s):  
Hettie A. Richardson ◽  
Marcia J. Simmering

Nonresponse and the missing data that it produces are ubiquitous in survey research, but they are also present in archival and other forms of research. Nonresponse and missing data can be especially problematic in organizational contexts, where the risks of providing personal or organizational data might be perceived as (or actually be) greater than in public opinion contexts. Moreover, nonresponse and missing data are presenting new challenges with the advent of online and mobile survey technology. When observational units (e.g., individuals, teams, organizations) do not provide some or all of the information sought by a researcher and the reasons for nonresponse are systematically related to the survey topic, nonresponse bias can result and the research community may draw faulty conclusions. Due to concerns about nonresponse bias, scholars have spent several decades seeking to understand why participants choose not to respond to certain items and entire surveys, and how best to avoid nonresponse through actions such as improved study design, the use of incentives, and follow-up initiatives. At the same time, researchers recognize that it is virtually impossible to avoid nonresponse and missing data altogether, and as such, in any given study there will likely be a need to diagnose patterns of missingness and their potential for bias. There will likewise be a need to deal with missing data statistically by employing post hoc mechanisms that maximize the sample available for hypothesis testing and minimize the extent to which missing data obscure the underlying true characteristics of the dataset. In this connection, a large body of programmatic research supports maximum likelihood (ML) and multiple imputation (MI) as useful data replacement procedures, although in some situations it might be reasonable to use simpler procedures instead. Despite strong support for these statistical techniques, organizational scholars have yet to embrace them. Instead, they tend to rely on approaches such as listwise deletion that do not preserve underlying data characteristics, reduce the sample available for statistical analysis, and in some cases actually exacerbate the potential problems associated with missing data. Although there are certainly remaining questions that can be addressed about missing data techniques, these techniques are also well understood and validated. There remains, however, a strong need for exploration into the nature, causes, and extent of nonresponse in various organizational contexts, such as when using online and mobile surveys. Such research could play a useful role in helping researchers avoid nonresponse in organizational settings, as well as extend insight into how and when best to apply validated missing data techniques.
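The contrast this abstract draws between listwise deletion and model-based replacement can be made concrete with a small simulation. This is an illustrative sketch under assumed parameters, not anything from the article itself: y is more likely to be missing when x is large (a MAR pattern), so listwise deletion under-represents high-y cases and biases the mean, while even a simple single regression imputation using the fully observed x largely removes that bias (ML and MI, which the abstract recommends, are more principled versions of the same idea).

```python
import random
from statistics import mean

random.seed(1)

# MAR scenario: y tracks x, and y is far more likely to be missing
# when x > 0, so complete cases over-represent low-x (and low-y) units.
n = 5000
pairs = []
for _ in range(n):
    x = random.gauss(0, 1)
    y = x + random.gauss(0, 0.5)          # true mean of y is 0
    missing = random.random() < (0.8 if x > 0 else 0.1)
    pairs.append((x, None if missing else y))

complete = [(x, y) for x, y in pairs if y is not None]

# Listwise deletion: estimate the mean of y from complete cases only.
listwise_mean = mean(y for _, y in complete)

# Single regression imputation: fit y ~ a + b*x on complete cases
# (valid here because missingness depends only on the observed x),
# then fill in predictions for the missing y values.
mx = mean(x for x, _ in complete)
my = mean(y for _, y in complete)
b = (sum((x - mx) * (y - my) for x, y in complete)
     / sum((x - mx) ** 2 for x, _ in complete))
a = my - b * mx
imputed_mean = mean(y if y is not None else a + b * x for x, y in pairs)
```

Listwise deletion lands well below the true mean of 0, while the imputation-based estimate sits close to it; MI would additionally propagate the imputation uncertainty into standard errors, which this single-imputation sketch does not.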


2019 ◽  
Vol 55 (1) ◽  
pp. 87-101 ◽  
Author(s):  
Po-Yi Chen ◽  
Wei Wu ◽  
Mauricio Garnier-Villarreal ◽  
Benjamin Arthur Kite ◽  
Fan Jia
