Uniform Test Assembly: Concepts, Problems, Solvers, and Applications for Adaptive Testing

Summary: Item parameters for several hundreds of items were estimated based on empirical data from several thousands of subjects. The logistic one-parameter (1PL) and two-parameter (2PL) model estimates were evaluated. However, model fit showed that only a subset of items complied sufficiently, so that the remaining ones were assembled in well-fitting item banks. In several simulation studies 5000 simulated responses were generated in accordance with a computerized adaptive test procedure along with person parameters. A general reliability of .80 or a standard error of measurement of .44 was used as a stopping rule to end CAT testing. We also recorded how often each item was used by all simulees. Person-parameter estimates based on CAT correlated higher than .90 with true values simulated. For all 1PL fitting item banks most simulees used more than 20 items but less than 30 items to reach the pre-set level of measurement error. However, testing based on item banks that complied to the 2PL revealed that, on average, only 10 items were sufficient to end testing at the same measurement error level. Both clearly demonstrate the precision and economy of computerized adaptive testing. Empirical evaluations from everyday uses will show whether these trends will hold up in practice. If so, CAT will become possible and reasonable with some 150 well-calibrated 2PL items.

Download Full-text

Methods for Restricting Maximum Exposure Rate in Computerized Adaptative Testing

Methodology ◽

10.1027/1614-2241.3.1.14 ◽

2007 ◽

Vol 3 (1) ◽

pp. 14-23 ◽

Cited By ~ 9

Author(s):

Juan Ramon Barrada ◽

Julio Olea ◽

Vicente Ponsoda

Keyword(s):

Measurement Accuracy ◽

Computerized Adaptive Testing ◽

Computation Time ◽

Adaptive Testing ◽

Exposure Rate ◽

Control Parameters ◽

The Impact ◽

Two Alternatives ◽

Selection Of ◽

Maximum Exposure

Abstract. The Sympson-Hetter (1985) method provides a means of controlling maximum exposure rate of items in Computerized Adaptive Testing. Through a series of simulations, control parameters are set that mark the probability of administration of an item on being selected. This method presents two main problems: it requires a long computation time for calculating the parameters and the maximum exposure rate is slightly above the fixed limit. Van der Linden (2003) presented two alternatives which appear to solve both of the problems. The impact of these methods in the measurement accuracy has not been tested yet. We show how these methods over-restrict the exposure of some highly discriminating items and, thus, the accuracy is decreased. It also shown that, when the desired maximum exposure rate is near the minimum possible value, these methods offer an empirical maximum exposure rate clearly above the goal. A new method, based on the initial estimation of the probability of administration and the probability of selection of the items with the restricted method ( Revuelta & Ponsoda, 1998 ), is presented in this paper. It can be used with the Sympson-Hetter method and with the two van der Linden's methods. This option, when used with Sympson-Hetter, speeds the convergence of the control parameters without decreasing the accuracy.

Download Full-text

Optimal Test Assembly in Practice

Zeitschrift für Psychologie ◽

10.1027/2151-2604/a000146 ◽

2013 ◽

Vol 221 (3) ◽

pp. 190-200 ◽

Cited By ~ 9

Author(s):

Jörg-Tobias Kuhn ◽

Thomas Kiefer

Keyword(s):

Particle Physics ◽

Large Scale ◽

Block Design ◽

Linear Optimization ◽

Mixed Integer ◽

Optimal Test ◽

Automated Test Assembly ◽

Test Assembly ◽

Block Level ◽

Mixed Integer Linear Optimization

Several techniques have been developed in recent years to generate optimal large-scale assessments (LSAs) of student achievement. These techniques often represent a blend of procedures from such diverse fields as experimental design, combinatorial optimization, particle physics, or neural networks. However, despite the theoretical advances in the field, there still exists a surprising scarcity of well-documented test designs in which all factors that have guided design decisions are explicitly and clearly communicated. This paper therefore has two goals. First, a brief summary of relevant key terms, as well as experimental designs and automated test assembly routines in LSA, is given. Second, conceptual and methodological steps in designing the assessment of the Austrian educational standards in mathematics are described in detail. The test design was generated using a two-step procedure, starting at the item block level and continuing at the item level. Initially, a partially balanced incomplete item block design was generated using simulated annealing, whereas in a second step, items were assigned to the item blocks using mixed-integer linear optimization in combination with a shadow-test approach.

Download Full-text

Computerized adaptive testing: From inquiry to operation.

10.1037/10244-000 ◽

1997 ◽

Cited By ~ 48

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing

Download Full-text

Computer Adaptive Testing in Personality Assessment

PsycEXTRA Dataset ◽

10.1037/e494382008-001 ◽

2008 ◽

Author(s):

Paul Williams

Keyword(s):

Personality Assessment ◽

Adaptive Testing ◽

Computer Adaptive Testing

Download Full-text

MARKOV MATHEMATICAL MODEL OF DYNAMIC ADAPTIVE TESTING OF AN ACTIVE AGENT

Informatics and Education ◽

10.32517/0234-0453-2018-33-10-29-35 ◽

2018 ◽

pp. 29-35

Author(s):

N. V. Brovka ◽

P. P. Dyachuk ◽

M. V. Noskov ◽

I. P. Peregudova

Keyword(s):

Mathematical Model ◽

Finite State Machine ◽

Active Agent ◽

Adaptive Testing ◽

State Machine ◽

Learning Activities ◽

Independent Learning ◽

Complex Object ◽

Total Reward ◽

Finite State

The problem and the goal.The urgency of the problem of mathematical description of dynamic adaptive testing is due to the need to diagnose the cognitive abilities of students for independent learning activities. The goal of the article is to develop a Markov mathematical model of the interaction of an active agent (AA) with the Liquidator state machine, canceling incorrect actions, which will allow mathematically describe dynamic adaptive testing with an estimated feedback.The research methodologyconsists of an analysis of the results of research by domestic and foreign scientists on dynamic adaptive testing in education, namely: an activity approach that implements AA developmental problem-solving training; organizational and technological approach to managing the actions of AA in terms of evaluative feedback; Markow’s theory of cement and reinforcement learning.Results.On the basis of the theory of Markov processes, a Markov mathematical model of the interaction of an active agent with a finite state machine, canceling incorrect actions, was developed. This allows you to develop a model for diagnosing the procedural characteristics of students ‘learning activities, including: building axiograms of total reward for students’ actions; probability distribution of states of the solution of the problem of identifying elements of the structure of a complex object calculate the number of AA actions required to achieve the target state depending on the number of elements that need to be identified; construct a scatter plot of active agents by target states in space (R, k), where R is the total reward AA, k is the number of actions performed.Conclusion.Markov’s mathematical model of the interaction of an active agent with a finite state machine, canceling wrong actions allows you to design dynamic adaptive tests and diagnostics of changes in the procedural characteristics of educational activities. The results and conclusions allow to formulate the principles of dynamic adaptive testing based on the estimated feedback.

Download Full-text

A Comparison of Item Selection Methods for Controlling Exposure Rate in Cognitive Diagnostic Computerized Adaptive Testing

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2013.00694 ◽

2013 ◽

Vol 45 (6) ◽

pp. 694-703

Author(s):

Xiuzhen MAO ◽

Tao XIN

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Exposure Rate ◽

Selection Methods

Download Full-text

Dynamic and Comprehensive Item Selection Strategies for Computerized Adaptive Testing Based on Graded Response Model

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2012.00400 ◽

2013 ◽

Vol 44 (3) ◽

pp. 400-412 ◽

Cited By ~ 1

Author(s):

Fen LUO ◽

Shu-Liang DING ◽

Xiao-Qing WANG

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Item Selection ◽

Response Model ◽

Graded Response Model ◽

Selection Strategies ◽

Graded Response

Download Full-text

Application of Online Calibration Technique in Computerized Adaptive Testing

Advances in Psychological Science ◽

10.3724/sp.j.1042.2013.01883 ◽

2013 ◽

Vol 21 (10) ◽

pp. 1883-1892

Author(s):

Ping CHEN ◽

Jiahui ZHANG ◽

Tao XIN

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Calibration Technique ◽

Online Calibration

Download Full-text

a-Stratified Methods Combining Item Exposure Control and General Test Overlap in Computerized Adaptive Testing

Acta Psychologica Sinica ◽

10.3724/sp.j.1041.2014.00702 ◽

2014 ◽

Vol 46 (5) ◽

pp. 702

Author(s):

Lei GUO ◽

Zhuoran WANG ◽

Feng WANG ◽

Yufang BIAN

Keyword(s):

Computerized Adaptive Testing ◽

Adaptive Testing ◽

Exposure Control ◽

Item Exposure ◽

General Test ◽

Item Exposure Control ◽

Test Overlap

Download Full-text