Post-hoc simulation study of computerized adaptive testing for the Korean Medical Licensing Examination

Author(s):  
Dong Gi Seo ◽  
Jeongwook Choi

Purpose: Computerized adaptive testing (CAT) has been adopted in licensing examinations because, as many studies have shown, it improves the efficiency and accuracy of testing. This simulation study investigated CAT scoring and item selection methods for the Korean Medical Licensing Examination (KMLE). Methods: This study used a post-hoc (real-data) simulation design. The item bank comprised all items from the January 2017 KMLE. All CAT algorithms were implemented using the ‘catR’ package in R. Results: In terms of accuracy, the Rasch and 2-parameter logistic (2PL) models performed better than the 3PL model. The modal a posteriori and expected a posteriori methods provided more accurate estimates than maximum likelihood estimation or weighted likelihood estimation. Furthermore, maximum posterior-weighted information and minimum expected posterior variance performed better than the other item selection methods. In terms of efficiency, the Rasch model is recommended for reducing test length. Conclusion: Before implementing live CAT, a simulation study should be performed under varied test conditions, and specific scoring and item selection methods should be predetermined based on its results.
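As an illustration of the scoring methods compared above, here is a minimal expected a posteriori (EAP) sketch under the 2PL model. The item parameters are hypothetical and chosen only for the example; the study itself used the 'catR' package in R, not this code.

```python
import numpy as np

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def eap_estimate(responses, a, b, n_quad=61):
    """EAP estimate of theta from 0/1 responses, standard-normal prior
    approximated on an equally spaced quadrature grid."""
    theta = np.linspace(-4.0, 4.0, n_quad)   # quadrature points
    prior = np.exp(-0.5 * theta**2)          # N(0, 1) kernel (unnormalized)
    like = np.ones_like(theta)
    for u, a_j, b_j in zip(responses, a, b):
        p = p_2pl(theta, a_j, b_j)
        like *= p**u * (1.0 - p)**(1 - u)
    post = prior * like
    post /= post.sum()                       # normalize the posterior
    return float((theta * post).sum())       # posterior mean = EAP

a = np.array([1.2, 0.8, 1.5, 1.0])   # hypothetical discriminations
b = np.array([-0.5, 0.0, 0.5, 1.0])  # hypothetical difficulties
print(round(eap_estimate([1, 1, 0, 0], a, b), 3))
```

The posterior mean shrinks estimates toward the prior, which is one reason Bayesian scoring (MAP/EAP) tends to be more stable than maximum likelihood on the short response strings typical of early CAT stages.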


Author(s):  
Mee Young Kim ◽  
Yoon Hwan Lee ◽  
Sun Huh

To evaluate the usefulness of computerized adaptive testing (CAT) in medical school, the General Examination for senior medical students was administered both as a paper-and-pencil test (P&P) and as a CAT. The General Examination is a graduation examination that also serves as a preliminary examination for the Korean Medical Licensing Examination (KMLE). The correlations among the CAT, P&P, and KMLE results were analyzed. The correlation between the CAT and P&P was 0.8013 (p<0.001); that between the P&P and KMLE was 0.7861 (p<0.001); and that between the CAT and KMLE was 0.6436 (p<0.001). Six out of 12 students with an ability estimate below -0.52 failed the KMLE. The results showed that CAT could replace the P&P in medical school. The ability of the CAT to predict whether students would pass the KMLE was 0.5 when the criterion theta value was set at -0.52, a cutoff chosen arbitrarily.


2021 ◽  
pp. 073428292110277
Author(s):  
Ioannis Tsaousis ◽  
Georgios D. Sideridis ◽  
Hannan M. AlGhamdi

This study evaluated the psychometric quality of a computerized adaptive testing (CAT) version of the general cognitive ability test (GCAT), using the simulation study protocol put forth by Han (2018a). For the analysis, three different sets of items were generated, providing an item pool of 165 items. Before evaluating the efficiency of the GCAT, all items in the final item pool were linked (equated) following a sequential approach. Data were generated from a standard normal distribution (M = 0, SD = 1) for 10,000 virtual individuals. Using the measure’s 165-item bank, the ability value (θ) for each participant was estimated. Maximum Fisher information (MFI) and maximum likelihood estimation with fences (MLEF) were used as the item selection and score estimation methods, respectively. For item exposure control, the fade-away method (FAM) was preferred. The termination criterion was a minimum SE ≤ 0.33. The study revealed that the average number of items administered to the 10,000 participants was 15. Moreover, the precision in estimating participants’ ability scores was very high, as demonstrated by the conditional bias (CBIAS), conditional mean absolute error (CMAE), and conditional root mean square error (CRMSE). It is concluded that the CAT version of the test is a promising alternative to administering the corresponding full-length measure, since it reduces the number of administered items, prevents high rates of item exposure, and provides accurate scores with minimal measurement error.
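The MFI selection rule and the SE ≤ 0.33 termination criterion described above can be sketched as follows. This is an illustration only: the item bank is randomly generated (not the GCAT pool), theta is held at its starting value rather than re-estimated after each response, and the MLEF scoring and FAM exposure-control steps are omitted.

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.uniform(0.8, 2.0, 165)    # hypothetical 2PL discriminations
b = rng.uniform(-2.5, 2.5, 165)   # hypothetical 2PL difficulties

def item_info(theta, a_j, b_j):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a_j * (theta - b_j)))
    return a_j**2 * p * (1.0 - p)

def run_mfi_cat(theta_hat=0.0, se_target=0.33):
    """Administer items by maximum Fisher information until the
    asymptotic SE of theta_hat drops to se_target (or the bank runs out)."""
    administered, total_info = [], 0.0
    while len(administered) < len(a):
        free = [j for j in range(len(a)) if j not in administered]
        # MFI: pick the unused item most informative at theta_hat
        j = max(free, key=lambda k: item_info(theta_hat, a[k], b[k]))
        administered.append(j)
        total_info += item_info(theta_hat, a[j], b[j])
        if 1.0 / np.sqrt(total_info) <= se_target:
            break
    return administered, 1.0 / np.sqrt(total_info)

items, se = run_mfi_cat()
print(len(items), round(se, 3))
```

Because SE = 1/sqrt(total information), the SE ≤ 0.33 criterion is equivalent to accumulating roughly 9.2 units of test information, which explains why a well-targeted pool can terminate after only about 15 items.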


SAGE Open ◽  
2020 ◽  
Vol 10 (1) ◽  
pp. 215824401989904
Author(s):  
Wenyi Wang ◽  
Lihong Song ◽  
Teng Wang ◽  
Peng Gao ◽  
Jian Xiong

The purpose of this study is to investigate the relationship between the Shannon entropy procedure and the Jensen–Shannon divergence (JSD), both of which are used as item selection criteria in cognitive diagnostic computerized adaptive testing (CD-CAT). Because the JSD is itself defined in terms of Shannon entropy, we apply the well-known relationship between the JSD and Shannon entropy to establish a relationship between the item selection criteria based on these two measures. To clarify the relationship between the two criteria, an alternative derivation is also provided. Theoretical derivations and empirical examples show that the Shannon entropy procedure and the JSD have a linear relation in CD-CAT under cognitive diagnostic models. Consistent with our theoretical conclusions, simulation results showed that the two item selection criteria behaved very similarly in terms of attribute-level and pattern recovery rates under all conditions, and that they selected the same set of items for each examinee from an item bank with item parameters drawn from a uniform distribution U(0.1, 0.3) in post-hoc simulations. We provide some suggestions for future studies and a discussion of the relationship between the modified posterior-weighted Kullback–Leibler index and the G-DINA (generalized deterministic inputs, noisy “and” gate) discrimination index.
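The entropy identity underlying this linear relation can be checked numerically. A minimal sketch with toy posterior distributions (not CD-CAT output), using the standard decomposition JSD = H(Σ wᵢPᵢ) − Σ wᵢH(Pᵢ) and verifying it against the weighted-KL definition:

```python
import numpy as np

def shannon_entropy(p):
    """Shannon entropy H(p) in nats, skipping zero-probability cells."""
    nz = p[p > 0]
    return float(-np.sum(nz * np.log(nz)))

def kl(p, m):
    """Kullback-Leibler divergence KL(p || m), skipping zeros in p."""
    nz = p > 0
    return float(np.sum(p[nz] * np.log(p[nz] / m[nz])))

# Toy posteriors over three latent classes, with mixture weights w
ps = np.array([[0.7, 0.2, 0.1],
               [0.1, 0.6, 0.3],
               [0.3, 0.3, 0.4]])
w = np.array([0.5, 0.3, 0.2])
m = w @ ps   # weighted mixture distribution

# Two routes to the same quantity:
jsd_entropy = shannon_entropy(m) - w @ [shannon_entropy(p) for p in ps]
jsd_kl = w @ [kl(p, m) for p in ps]
print(round(jsd_entropy, 6), round(jsd_kl, 6))
```

Expanding Σ wᵢ KL(Pᵢ‖M) term by term gives −Σ wᵢH(Pᵢ) + H(M), so the two computations agree exactly; it is this decomposition that ties the JSD index back to the Shannon entropy procedure.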


2017 ◽  
Vol 4 (1) ◽  
pp. e7 ◽  
Author(s):  
Tessa Magnée ◽  
Derek P de Beurs ◽  
Berend Terluin ◽  
Peter F Verhaak

Background: Efficient screening questionnaires are useful in general practice. Computerized adaptive testing (CAT) is a method to improve the efficiency of questionnaires, as only the items that are particularly informative for a given responder are dynamically selected. Objective: The objective of this study was to test whether CAT could improve the efficiency of the Four-Dimensional Symptom Questionnaire (4DSQ), a frequently used self-report questionnaire designed to assess common psychosocial problems in general practice. Methods: A simulation study was conducted using a sample of Dutch patients who visited a general practitioner (GP) with psychological problems (n=379). Responders completed a paper-and-pencil version of the 50-item 4DSQ, and a psychometric evaluation was performed to check whether the data met item response theory (IRT) assumptions. Next, a CAT simulation was performed for each of the four 4DSQ scales (distress, depression, anxiety, and somatization), based on the given responses as if they had been collected through CAT. Two stopping rules were applied for the administration of items: (1) stop once measurement precision reaches a predefined level, or (2) stop once more than half of the items of the subscale have been administered. Results: In general, the items of each of the four scales met IRT assumptions. Application of the first stopping rule reduced the length of the questionnaire by 38% (from 50 to 31 items on average). When the second stopping rule was also applied, the total number of items could be reduced by 56% (from 50 to 22 items on average). Conclusions: CAT seems useful for improving the efficiency of the 4DSQ, reducing its length by 56% without losing a considerable amount of measurement precision. The CAT version of the 4DSQ may be useful as part of an online assessment to investigate the severity of mental health problems in patients visiting a GP.
This simulation study is the first step needed for the development of a CAT version of the 4DSQ. A CAT version of the 4DSQ could be of great value for Dutch GPs, since increasing numbers of patients with mental health problems are visiting general practices. In further research, the results of a real-time CAT should be compared with the results of administering the full scale.
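The two stopping rules described above can be combined into a single decision function. A minimal sketch, assuming an SE-based precision criterion; the 0.4 default threshold and the 12-item subscale in the example are hypothetical, not taken from the 4DSQ study:

```python
def should_stop(se, n_administered, n_subscale, se_target=0.4):
    """Decide whether to stop administering items from a subscale.

    Rule 1: stop once measurement precision is sufficient, i.e. the
            standard error is at or below a predefined target
            (the 0.4 default here is hypothetical).
    Rule 2: stop once more than half of the subscale's items have
            been administered.
    """
    return se <= se_target or n_administered > n_subscale // 2

# Example with a hypothetical 12-item subscale: a precise estimate
# stops early; an imprecise one is cut off after more than half the
# items; otherwise administration continues.
print(should_stop(0.25, 3, 12), should_stop(0.55, 7, 12), should_stop(0.55, 4, 12))
# → True True False
```

Applying rule 2 on top of rule 1 caps the worst case (responders whose estimates never reach the precision target), which is what pushed the average reduction from 38% to 56% in the study.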

