The Importance of Sample Weights and Plausible Values in Large-Scale Assessments

Author(s): Serkan ARIKAN, Ferah ÖZER, Vuslat ŞEKER, Güneş ERTAŞ


Diagnostica, 2017, Vol 63 (3), pp. 193-205
Author(s): Oliver Lüdtke, Alexander Robitzsch

Abstract. Measurements used in psychological research to assess constructs are usually affected by measurement error. These measurement errors lead to biased estimates of population parameters and of their standard errors. In recent decades, the plausible-values technique has become established in large-scale assessments as a method for correcting error-prone relationships between latent variables and observed covariates. Using a simple example from classical test theory, this article introduces this complex statistical procedure. It is shown that alternative methods for estimating person scores generally yield biased estimates of population-level relationships. A simulation study extends these findings to an IRT model for dichotomous indicators. From a diagnostic perspective, it is emphasized that plausible values should not be used to estimate individual ability levels. Finally, methodological challenges in applying the plausible-values technique and its potential for psychological research are discussed.
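The attenuation effect described in the abstract above can be illustrated with a minimal simulation under classical test theory. This is a sketch with all parameters assumed known (an operational PV-generating model would estimate the latent regression from the data): the correlation between error-laden observed scores and a covariate is attenuated, whereas plausible values drawn from the correct posterior recover the latent correlation.

```python
# Sketch: plausible values recover a latent correlation that observed
# scores attenuate. Illustrative simulation; the latent-regression
# parameters are treated as known, which a real PV model would estimate.
import numpy as np

rng = np.random.default_rng(42)
n, M = 100_000, 10          # persons, number of plausible values

# Latent regression: theta = 0.5*z + residual, Var(theta) = 1
z = rng.standard_normal(n)
b, res_var = 0.5, 1 - 0.5**2
theta = b * z + np.sqrt(res_var) * rng.standard_normal(n)

# Observed score with measurement error (reliability = 0.5)
err_var = 1.0
x = theta + np.sqrt(err_var) * rng.standard_normal(n)

# Posterior of theta given x and z (normal-normal conjugacy)
post_var = 1 / (1 / err_var + 1 / res_var)
post_mean = post_var * (x / err_var + b * z / res_var)

# Draw M plausible values and average the per-PV correlations
pv_corrs = []
for _ in range(M):
    pv = post_mean + np.sqrt(post_var) * rng.standard_normal(n)
    pv_corrs.append(np.corrcoef(pv, z)[0, 1])

print(f"true corr(theta, z): {np.corrcoef(theta, z)[0, 1]:.3f}")
print(f"observed corr(x, z): {np.corrcoef(x, z)[0, 1]:.3f}")  # attenuated
print(f"PV-based corr:       {np.mean(pv_corrs):.3f}")        # recovered
```

Because each plausible value is a draw from the correct posterior, the joint distribution of (PV, z) matches that of (theta, z), which is why the PV-based correlation is unbiased while the observed-score correlation is shrunk toward zero.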


Mathematics, 2021, Vol 9 (13), pp. 1579
Author(s): Juan Aparicio, Jose M. Cordero, Lidia Ortiz

International large-scale assessments (ILSAs) report educational outcomes through several measures, the so-called plausible values, which are frequently interpreted as representing the range of abilities each student might plausibly have. In this paper, we focus on how this information should be incorporated into the estimation of efficiency measures of student or school performance using data envelopment analysis (DEA). Previous studies that adopted this approach with ILSA data have used only one of the available plausible values or the average of all of them. We propose an approach based on fuzzy DEA, which allows us to consider the whole distribution of results as a proxy for student abilities. To assess the extent to which our proposal yields results similar to those of previous studies, we provide an empirical example using PISA data from 2015. Our results suggest that the performance measures estimated with the fuzzy DEA approach are strongly correlated with measures calculated using just one plausible value or an average measure. We therefore conclude that studies opting for either of those simpler alternatives do not appear to incur a significant error in their estimates.
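The paper's finding that one plausible value and the PV average yield strongly correlated efficiency scores can be sketched in a toy setting. In the special case of a single input and a single output under constant returns to scale, CCR DEA efficiency reduces to the output/input ratio normalized by the best ratio, so no linear programming is needed. The data below are simulated stand-ins, not PISA data, and the fuzzy DEA extension is not implemented here.

```python
# Sketch: single-input/single-output CCR DEA (constant returns to
# scale), where efficiency is the output/input ratio divided by the
# frontier ratio. Compares scores from one plausible value against
# scores from the PV average; inputs and abilities are simulated.
import numpy as np

rng = np.random.default_rng(7)
n_schools, n_pv = 200, 10

inputs = rng.uniform(1.0, 2.0, n_schools)    # e.g. spending per student
ability = rng.normal(500, 100, n_schools)    # latent achievement
# Plausible values: ability plus simulated posterior spread
pvs = ability[:, None] + rng.normal(0, 30, (n_schools, n_pv))

def dea_ccr_1in_1out(x, y):
    """CCR efficiency with one input and one output: ratio to the best ratio."""
    ratio = y / x
    return ratio / ratio.max()

eff_one_pv = dea_ccr_1in_1out(inputs, pvs[:, 0])
eff_mean_pv = dea_ccr_1in_1out(inputs, pvs.mean(axis=1))

r = np.corrcoef(eff_one_pv, eff_mean_pv)[0, 1]
print(f"corr(one-PV, mean-PV efficiency) = {r:.3f}")
```

With PV spread small relative to between-school differences, the two sets of scores correlate highly, mirroring the paper's conclusion that the choice between one PV and the PV average matters little for the estimates.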


Author(s): Clemens M. Lechner, Nivedita Bhaktha, Katharina Groskurth, Matthias Bluemke

Abstract. Measures of cognitive or socio-emotional skills from large-scale assessment surveys (LSAS) are often based on advanced statistical models and scoring techniques unfamiliar to applied researchers. Consequently, applied researchers working with data from LSAS may be uncertain about the assumptions and computational details of these statistical models and scoring techniques and about how best to incorporate the resulting skill measures in secondary analyses. The present paper is intended as a primer for applied researchers. After a brief introduction to the key properties of skill assessments, we give an overview of the three principal methods with which secondary analysts can incorporate skill measures from LSAS in their analyses: (1) as test scores (i.e., point estimates of individual ability), (2) through structural equation modeling (SEM), and (3) in the form of plausible values (PVs). We discuss the advantages and disadvantages of each method based on three criteria: fallibility (i.e., control for measurement error and unbiasedness), usability (i.e., ease of use in secondary analyses), and immutability (i.e., consistency of test scores, PVs, or measurement model parameters across different analyses and analysts). We show that although none of the methods is optimal under all criteria, methods that yield a single point estimate of each respondent's ability (i.e., all types of "test scores") are rarely optimal for research purposes. Instead, approaches that avoid or correct for measurement error, especially PV methodology, stand out as the methods of choice. We conclude with practical recommendations for secondary analysts and data-producing organizations.
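When PVs are used in a secondary analysis, the analysis is typically run once per plausible value and the results combined with Rubin's pooling rules (point estimates averaged; total variance = within-imputation variance plus an inflated between-imputation component). A minimal sketch with simulated data; the PV columns and the true slope of 0.4 are illustrative assumptions, not values from any LSAS file.

```python
# Sketch: pooling a regression slope across M plausible values with
# Rubin's combining rules, as done in PV-based secondary analyses.
# Data are simulated; in practice each PV column comes from the survey file.
import numpy as np

rng = np.random.default_rng(1)
n, M = 5_000, 10
z = rng.standard_normal(n)                         # observed covariate
theta = 0.4 * z + rng.standard_normal(n)           # latent skill
pvs = theta[:, None] + rng.normal(0, 0.6, (n, M))  # simulated plausible values

def ols_slope(y, x):
    """Slope of y on x and its standard error (simple OLS with intercept)."""
    b = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
    resid = y - y.mean() - b * (x - x.mean())
    se = np.sqrt(resid.var(ddof=2) / (np.var(x, ddof=1) * (x.size - 1)))
    return b, se

slopes, ses = zip(*(ols_slope(pvs[:, m], z) for m in range(M)))
q_bar = np.mean(slopes)                        # pooled point estimate
w = np.mean(np.square(ses))                    # within-imputation variance
b_var = np.var(slopes, ddof=1)                 # between-imputation variance
total_se = np.sqrt(w + (1 + 1 / M) * b_var)    # Rubin's total variance
print(f"pooled slope = {q_bar:.3f} (SE {total_se:.3f})")
```

The between-imputation term is what propagates the measurement uncertainty carried by the PVs into the final standard error; fitting the model to a single PV (or to the PV average) would understate that uncertainty.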

