test constructor Latest Research Papers

Implementation of a test constructor utilizing a calibrated item bank using 3PL-IRT model

Procedia Computer Science ◽

10.1016/j.procs.2021.12.166 ◽

2022 ◽

Vol 197 ◽

pp. 495-502

Author(s):

Julieto E. Perez ◽

Wenieva Padrones

Keyword(s):

Item Bank ◽

Irt Model ◽

Test Constructor

Download Full-text

Estimating reliability statistics and measurement error variances using instrumental variables with longitudinal data

Longitudinal and Life Course Studies ◽

10.1332/175795920x15844303873216 ◽

2020 ◽

Vol 11 (3) ◽

pp. 289-306

Author(s):

Harvey Goldstein ◽

Michele Haynes ◽

George Leckie ◽

Phuong Tran

Keyword(s):

Measurement Error ◽

Longitudinal Data ◽

Measurement Errors ◽

Data Set ◽

Explanatory Variables ◽

Data Analyst ◽

Mathematics Test ◽

Scale Scores ◽

Distributed Measurement ◽

Test Constructor

The presence of randomly distributed measurement errors in scale scores such as those used in educational and behavioural assessments implies that careful adjustments are required to statistical model estimation procedures if inferences are required for ‘true’ as opposed to ‘observed’ relationships. In many cases this requires the use of external values for ‘reliability’ statistics or ‘measurement error variances’ which may be provided by a test constructor or else inferred or estimated by the data analyst. Popular measures are those described as ‘internal consistency’ estimates and sometimes other measures based on data grouping. All such measures, however, make particular assumptions that may be questionable but are often not examined. In this paper we focus on scaled scores derived from aggregating a set of indicators, and set out a general methodological framework for exploring different ways of estimating reliability statistics and measurement error variances, critiquing certain approaches and suggesting more satisfactory methods in the presence of longitudinal data. In particular, we explore the assumption of local (conditional) item response independence and show how a failure of this assumption can lead to biased estimates in statistical models using scaled scores as explanatory variables. We illustrate our methods using a large longitudinal data set of mathematics test scores from Queensland, Australia.

Download Full-text

DESIGNING OF COMPUTERIZED ADAPTIVE TESTS IN THE ABSENCE OF TESTING STATISTICS

Information Technologies and Learning Tools ◽

10.33407/itlt.v73i5.2520 ◽

2019 ◽

Vol 73 (5) ◽

pp. 101-115

Author(s):

Viktor E. Bondarenko

Keyword(s):

Item Response Theory ◽

Item Response ◽

Testing Time ◽

Knowledge Level ◽

Response Theory ◽

Computerized Adaptive Test ◽

Adaptive Test ◽

Record Statistics ◽

Test Constructor ◽

Decision Tables

A Computerized Adaptive Test proposes items according to the student's knowledge level. Therefore, the number of items, which are given to students, is reduced. Besides, the ending of such test is determined by the student's knowledge level, which allows an instructor to reduce testing time. As usual, construction of such tests is based on the Item Response Theory (IRT). This theory gives models which use statistical data about the student's knowledge level and difficulty of items. We do not have such statistics for new tests. In such cases, this paper proposes to estimate the complexity of items on the basis of the experts' conclusions. These conclusions are based on the analytic hierarchy process (AHP) which was modified. The modification allows experts to estimate the complexity of items with the help of the collection of the items characteristics. This modification can remove the expert's inadequate estimates of items or their characteristics. This method allows experts to classify all items in clusters according to their complexity in the first stage of the testing when statistics of items use is absent. A test constructor, on the basis of a decision tables network, realizes the algorithm of the items' selection from different clusters. In the future, tutors will have tested a sufficient number of students' groups. They record statistics of the test using. A test constructor receives such statistics, which will allow them to use the models of the Item Response Theory for estimation of the test items' complexity. The assessment of the knowledge level of students is made with the help of an adaptive test, which is based on a network of decision tables. This network determines the algorithm of using items from different clusters for the testing. The adaptive test is built on the basis of the network of decision tables as a computer system. This system is constructed on the Java platform with the help of the programming environment Android Studio. It has the interface suitable for students as well as for a constructor, which allows the constructor to change the algorithm of using items if received statistics of items use shows such necessity.

Download Full-text

Testing intercultural competence in (International) English: Some basic questions and suggested answers

Language Learning in Higher Education ◽

10.1515/cercles-2014-0012 ◽

2014 ◽

Vol 4 (1) ◽

Author(s):

Rudi Camerer

Keyword(s):

Intercultural Competence ◽

Point Of View ◽

Intercultural Communicative Competence ◽

Test Procedures ◽

Language Competence ◽

Underlying Assumption ◽

Course Materials ◽

Test Specifications ◽

Intercultural Encounters ◽

Test Constructor

AbstractThe testing of intercultural competence has long been regarded as the field of psychometric test procedures, which claim to analyse an individual's personality by specifying and quantifying personality traits with the help of self-answer questionnaires and the statistical evaluation of these. The underlying assumption is that what is analysed and described as a candidate's personality can be treated as an indicator of that same person's practical performance in intercultural encounters. From the point of view of a test constructor for language competence, all intercultural tests of this type raise basic questions concerning their construct and predictive validity.Against this background, this article firstly examines the shortcomings of personality-based tests of intercultural competence. Secondly, based on relevant parts of the CEFR as well as on the work of numerous contributors to the international debate, a practicable construct of intercultural communicative competence is suggested. Special attention is paid to the concept of politeness in intercultural encounters and the role of English as a lingua franca (ELF). Thirdly, a basic outline of a criterion-based test of intercultural competence in English is provided. The test procedures on which this article draws have been extensively piloted and are part of a training package including test specifications, course materials and teacher-training material.

Download Full-text

Is the test constructor a facet?

Language Testing ◽

10.1191/0265532203lt244oa ◽

2003 ◽

Vol 20 (1) ◽

pp. 57-87 ◽

Cited By ~ 1

Author(s):

Abdoljavad Jafarpur

Keyword(s):

Test Constructor

Download Full-text

Paivio's "Dual Coding Theory" en Effect van Visuele Stimuli op het Verwerken en Onthouden van Informatie

Toegepaste Taalwetenschap in Artikelen ◽

10.1075/ttwia.66.08hoe ◽

2001 ◽

Vol 66 ◽

pp. 91-99

Author(s):

Arie Hoeflaak

Keyword(s):

Foreign Language ◽

University Students ◽

Coding Theory ◽

Main Idea ◽

First Year ◽

Teaching Methodology ◽

Foreign Language Learners ◽

Spoken Text ◽

Open Questions ◽

Test Constructor

The use of video in foreign language teaching is considered to be a powerful tool by many teachers and researchers. It seems, however, that a sound 'video teaching methodology' has not yet been fully developed. This article sets out to present some reflections on the advantages of the use of video. We will then briefly describe some elements from two more or less theoretical studies, Lang (1995) and particularly Paivio (1986), and discuss the results of other experiments that we found in the literature. Finally, we will put forward some tentative ideas about experiments that we will prepare on the basis of the most important findings of other experiments. The main idea is that information is best processed if it is presented in a redundant way, e.g., both by an audio and a video channel. Many experiments claim that reversed subtitling (subtitles not in L1, but in L2 or FL) is the most successful visual support for foreign language learners. Our experimental design will be organized as follows. Subjects (pre-university students and first-year university students of French) will be divided into four experimental groups to be tested under four different conditions: 1) Image, sound (French spoken text), no subtitles; 2) Image, no sound, French subtitles; 3) No image, sound, subtitles; 4) Image, sound, subtitles. We hypothesize that condition 4 will yield the best result, but before conducting the experiment, we will have to examine three aspects: 1) The assessment format: subjects might consider open questions unclear, whereas, in closed questioning (true-false, multiple choice, cloze), items might be biased by the test constructor. 2) Clarifying the distinction between high, medium, and low redundancy. 3) Bi- or multimodal information input may lead to cognitive overload.

Download Full-text

Algorithms for Computerized Test Construction Using Classical Item Parameters

Journal of Educational Statistics ◽

10.3102/10769986014003279 ◽

1989 ◽

Vol 14 (3) ◽

pp. 279-290 ◽

Cited By ~ 9

Author(s):

Jos J. Adema ◽

Wim J. van der Linden

Keyword(s):

Linear Programming ◽

Classical Test Theory ◽

Banking System ◽

Test Construction ◽

Test Theory ◽

Programming Models ◽

Information Function ◽

Item Parameters ◽

Test Parameters ◽

Test Constructor

Recently, linear programming models for test construction were developed. These models were based on the information function from item response theory. In this paper another approach is followed. Two 0-1 linear programming models for the construction of tests using classical item and test parameters are given. These models are useful, for instance, when classical test theory has to serve as an interface between an IRT-based item banking system and a test constructor not familiar with the underlying theory.

Download Full-text

test constructor
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Implementation of a test constructor utilizing a calibrated item bank using 3PL-IRT model

Estimating reliability statistics and measurement error variances using instrumental variables with longitudinal data

DESIGNING OF COMPUTERIZED ADAPTIVE TESTS IN THE ABSENCE OF TESTING STATISTICS

Testing intercultural competence in (International) English: Some basic questions and suggested answers

Is the test constructor a facet?

Paivio's "Dual Coding Theory" en Effect van Visuele Stimuli op het Verwerken en Onthouden van Informatie

Algorithms for Computerized Test Construction Using Classical Item Parameters

Export Citation Format

test constructorRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Implementation of a test constructor utilizing a calibrated item bank using 3PL-IRT model

Estimating reliability statistics and measurement error variances using instrumental variables with longitudinal data

DESIGNING OF COMPUTERIZED ADAPTIVE TESTS IN THE ABSENCE OF TESTING STATISTICS

Testing intercultural competence in (International) English: Some basic questions and suggested answers

Is the test constructor a facet?

Paivio's "Dual Coding Theory" en Effect van Visuele Stimuli op het Verwerken en Onthouden van Informatie

Algorithms for Computerized Test Construction Using Classical Item Parameters

test constructor
Recently Published Documents