DIF DETECTION SENSITIVITY OF LORD’S CHI-SQUARE, RAJU’S AREA, LOGISTIC REGRESSION, MANTEL-HAENSZEL, STANDARDIZATION, AND TRANSFORMED ITEM DIFFICULTIES METHODS, IN COMPARISON, USING R.

EPRA International Journal of Multidisciplinary Research (IJMR) ◽

10.36713/epra7953 ◽

2021 ◽

pp. 629-640

Author(s):

Dr. Wokoma T. Abbott

Keyword(s):

Logistic Regression ◽

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Reference Group ◽

Detection Sensitivity ◽

Detection Methods ◽

Chi Square ◽

Item Functioning ◽

Item Parameters

Differential item functioning (DIF) will always occur as a result of these differences in the person parameter of these individuals being examined even when item parameters remain constant during testing. This postulate of item response theory (IRT) was proven in this work. This study investigated if DIF detection methods will have the same DIF detection sensitivity. Comparative research design formed the framework of the study. Transformed item difficulties (TID), Mantel-Haenszel (MH), standardization, logistic regression, Ragu’s area, and Lord’s chi-square methods were compared. The study used 400 vocational one students (200 male as reference group and 200 female as focal group) in Rivers state, Nigeria. The multiple choice items of 2019 computer science for the junior school certificate examination (JSCE) was adapted as the instrument for data collection, which were administered to students and scored dichotomously. Difficulty and discrimination parameters of the items were analyzed using the 2PL model of IRT with the help of ltm package. Ogives of the items were plotted with ggplot2 package. Individual DIF methods and DichoDif in DifR were used to detect DIF and compare the methods. The results revealed that all the items of the test functioned differently between the reference group and the focal group as shown in the item characteristic curves (ICCs). In comparison of the DIF detection methods, standardization method detected most of the DIF items followed by logistic regression method, and then lord’s chi-square methods. Transformed item difficulties method detected more than mantel-Haenszel method. Raju’s area method could not detect any. In the light of the finding, it was recommended that the best DIF detection methods (possibly combination of them) should be used to identify DIF items in tests. KEYWORDS: Item response theory, differential item functioning, item characteristic curve, item parameters.

Download Full-text

lordif: AnRPackage for Detecting Differential Item Functioning Using Iterative Hybrid Ordinal Logistic Regression/Item Response Theory and Monte Carlo Simulations

Journal of Statistical Software ◽

10.18637/jss.v039.i08 ◽

2011 ◽

Vol 39 (8) ◽

Cited By ~ 198

Author(s):

Seung W. Choi ◽

Laura E. Gibbons ◽

Paul K. Crane

Keyword(s):

Monte Carlo ◽

Logistic Regression ◽

Item Response Theory ◽

Monte Carlo Simulations ◽

Differential Item Functioning ◽

Item Response ◽

Ordinal Logistic Regression ◽

Response Theory ◽

Item Functioning

Download Full-text

Detecting Multidimensional Differential Item Functioning with the Multiple Indicators Multiple Causes Model, the Item Response Theory Likelihood Ratio Test, and Logistic Regression

Frontiers in Education ◽

10.3389/feduc.2017.00051 ◽

2017 ◽

Vol 2 ◽

Cited By ~ 2

Author(s):

Okan Bulut ◽

Youngsuk Suh

Keyword(s):

Logistic Regression ◽

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Likelihood Ratio ◽

Likelihood Ratio Test ◽

Ratio Test ◽

Response Theory ◽

Multiple Indicators ◽

Item Functioning

Download Full-text

Using Item Response Theory and Differential Item Functioning to Further Examine Concerted Cultivation

PsycEXTRA Dataset ◽

10.1037/e500122015-117 ◽

2014 ◽

Author(s):

R. Crabbe ◽

Rachel Gordon ◽

K. Fujimoto ◽

M. Krysan

Keyword(s):

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Response Theory ◽

Concerted Cultivation ◽

Item Functioning

Download Full-text

Thin Versus Thick Matching in the Mantel-Haenszel Procedure for Detecting DIF

Journal of Educational Statistics ◽

10.3102/10769986018002131 ◽

1993 ◽

Vol 18 (2) ◽

pp. 131-154 ◽

Cited By ~ 19

Author(s):

John R. Donoghue ◽

Nancy L. Allen

Keyword(s):

Item Response ◽

Test Score ◽

Outcome Measure ◽

Reference Group ◽

Monte Carlo Study ◽

Chi Square ◽

Irt Model ◽

Item Functioning ◽

Log Odds ◽

Total Test

This Monte Carlo study examined strategies for forming the matching variable for the Mantel-Haenszel (MH) differential item functioning (DIF) procedure; thin matching on total test score was compared to forms of thick matching, pooling levels of the matching variable. Data were generated using a three-parameter logistic (3PL) item response theory (IRT) model with common guessing parameter. Number of subjects and test length were manipulated, as were the difficulty, discrimination, and presence/absence of DIF in the studied item. Outcome measures were the transformed log-odds &Deltacirc; MH, its standard error, and the MH chi-square statistic. For short tests (5 or 10 items), thin matching yielded very poor results, with a tendency to falsely identify items as possessing DIF against the reference group. The best methods of thick matching yielded outcome measure values closer to the expected value for non-DIF items, as well as a larger value than thin matching when the studied item possessed DIF. Intermediate length tests yielded similar results for thin matching and the best methods of thick matching. The method of thick matching that performed best depended on the measure used to detect DIF. Both difficulty and discrimination of the studied item were found to have a strong effect on the value of &Deltacirc; MH.

Download Full-text

Determining Differential Item Functioning with the Mixture Item Response Theory

Eurasian Journal of Educational Research ◽

10.14689/ejer.2018.74.10 ◽

2018 ◽

Vol 18 ◽

pp. 1-20

Author(s):

Seher YALCIN

Keyword(s):

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Response Theory ◽

Item Functioning

Download Full-text

A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

Applied Psychological Measurement ◽

10.1177/0146621611428447 ◽

2011 ◽

Vol 35 (8) ◽

pp. 604-622 ◽

Cited By ~ 16

Author(s):

Hirotaka Fukuhara ◽

Akihito Kamata

Keyword(s):

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Estimation Method ◽

Multidimensional Item Response Theory ◽

Multidimensional Item Response ◽

Response Theory ◽

Data Set ◽

Detection Rates ◽

Item Functioning

A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into account, thus estimating DIF magnitude appropriately when a test is composed of testlets. A fully Bayesian estimation method was adopted for parameter estimation. The recovery of parameters was evaluated for the proposed DIF model. Simulation results revealed that the proposed bifactor MIRT DIF model produced better estimates of DIF magnitude and higher DIF detection rates than the traditional IRT DIF model for all simulation conditions. A real data analysis was also conducted by applying the proposed DIF model to a statewide reading assessment data set.

Download Full-text

Item Response Theory and Differential Item Functioning

Quality of Life ◽

10.1002/0470846283.ch6 ◽

2000 ◽

pp. 117-134

Keyword(s):

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Response Theory ◽

Item Functioning

Download Full-text

Determining differential item functioning and its effect on the test scores of selected pib indexes, using item response theory techniques

SA Journal of Industrial Psychology ◽

10.4102/sajip.v27i2.783 ◽

2001 ◽

Vol 27 (2) ◽

Author(s):

Pieter Schaap

Keyword(s):

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Test Scores ◽

Response Theory ◽

South Africans ◽

Test Characteristics ◽

Potential Index ◽

Item Functioning

The objective of this article is to present the results of an investigation into the item and test characteristics of two tests of the Potential Index Batteries (PIB) in terms of differential item functioning (DIP) and the effect thereof on test scores of different race groups. The English Vocabulary (Index 12) and Spelling Tests (Index 22) of the PIB were analysed for white, black and coloured South Africans. Item response theory (IRT) methods were used to identify items which function differentially for white, black and coloured race groups. Opsomming Die doel van hierdie artikel is om die resultate van n ondersoek na die item- en toetseienskappe van twee PIB (Potential Index Batteries) toetse in terme van itemsydigheid en die invloed wat dit op die toetstellings van rassegroepe het, weer te gee. Die Potential Index Batteries (PIB) se Engelse Woordeskat (Index 12) en Spellingtoetse (Index 22) is ten opsigte van blanke, swart en gekleurde Suid-Afrikaners ontleed. Itemresponsteorie (IRT) is gebruik om items te identifiseer wat as sydig (DIP) vir die onderskeie rassegroepe beskou kan word.

Download Full-text

Differential item functioning of the Boston Naming Test in cognitively normal African American and Caucasian older adults

Journal of the International Neuropsychological Society ◽

10.1017/s1355617709990361 ◽

2009 ◽

Vol 15 (5) ◽

pp. 758-768 ◽

Cited By ~ 33

Author(s):

OTTO PEDRAZA ◽

NEILL R. GRAFF-RADFORD ◽

GLENN E. SMITH ◽

ROBERT J. IVNIK ◽

FLOYD B. WILLIS ◽

...

Keyword(s):

African American ◽

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Group Performance ◽

Response Theory ◽

Boston Naming Test ◽

Item Functioning ◽

The Impact ◽

Naming Test

AbstractScores on the Boston Naming Test (BNT) are frequently lower for African American when compared with Caucasian adults. Although demographically based norms can mitigate the impact of this discrepancy on the likelihood of erroneous diagnostic impressions, a growing consensus suggests that group norms do not sufficiently address or advance our understanding of the underlying psychometric and sociocultural factors that lead to between-group score discrepancies. Using item response theory and methods to detect differential item functioning (DIF), the current investigation moves beyond comparisons of the summed total score to examine whether the conditional probability of responding correctly to individual BNT items differs between African American and Caucasian adults. Participants included 670 adults age 52 and older who took part in Mayo’s Older Americans and Older African Americans Normative Studies. Under a two-parameter logistic item response theory framework and after correction for the false discovery rate, 12 items where shown to demonstrate DIF. Of these 12 items, 6 (“dominoes,” “escalator,” “muzzle,” “latch,” “tripod,” and “palette”) were also identified in additional analyses using hierarchical logistic regression models and represent the strongest evidence for race/ethnicity-based DIF. These findings afford a finer characterization of the psychometric properties of the BNT and expand our understanding of between-group performance. (JINS, 2009, 15, 758–768.)

Download Full-text

P1-221: Assessment of differential item functioning in the mini-mental state examination-korean version (MMSE-KC) between dementia and pseudo-dementia using item response theory

Alzheimer s & Dementia ◽

10.1016/j.jalz.2015.06.421 ◽

2015 ◽

Vol 11 (7S_Part_9) ◽

pp. P436-P436

Author(s):

Sung Man Chang ◽

Seong-Jin Cho

Keyword(s):

Item Response Theory ◽

Differential Item Functioning ◽

Item Response ◽

Mental State ◽

Mini Mental State Examination ◽

Response Theory ◽

Korean Version ◽

Item Functioning ◽

State Examination

Download Full-text