Assessment Design for International Large-Scale Assessments

Using process data to understand problem-solving strategies and processes for drag-and-drop items in a large-scale mathematics assessment

Large-scale Assessments in Education ◽

10.1186/s40536-021-00095-4 ◽

2021 ◽

Vol 9 (1) ◽

Author(s):

Yang Jiang ◽

Tao Gong ◽

Luis E. Saldivia ◽

Gabrielle Cayton-Hodges ◽

Christopher Agard

Keyword(s):

Problem Solving ◽

Time Allocation ◽

Large Scale ◽

Solution Process ◽

Process Data ◽

Mathematics Assessments ◽

Assessment Design ◽

Eighth Grade Students ◽

Problem Solving Strategies ◽

Drag And Drop

AbstractIn 2017, the mathematics assessments that are part of the National Assessment of Educational Progress (NAEP) program underwent a transformation shifting the administration from paper-and-pencil formats to digitally-based assessments (DBA). This shift introduced new interactive item types that bring rich process data and tremendous opportunities to study the cognitive and behavioral processes that underlie test-takers’ performances in ways that are not otherwise possible with the response data alone. In this exploratory study, we investigated the problem-solving processes and strategies applied by the nation’s fourth and eighth graders by analyzing the process data collected during their interactions with two technology-enhanced drag-and-drop items (one item for each grade) included in the first digital operational administration of the NAEP’s mathematics assessments. Results from this research revealed how test-takers who achieved different levels of accuracy on the items engaged in various cognitive and metacognitive processes (e.g., in terms of their time allocation, answer change behaviors, and problem-solving strategies), providing insights into the common mathematical misconceptions that fourth- and eighth-grade students held and the steps where they may have struggled during their solution process. Implications of the findings for educational assessment design and limitations of this research are also discussed.

Download Full-text

Why ability point estimates can be pointless: a primer on using skill measures from large-scale assessments in secondary analyses

Measurement Instruments for the Social Sciences ◽

10.1186/s42409-020-00020-5 ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

Clemens M. Lechner ◽

Nivedita Bhaktha ◽

Katharina Groskurth ◽

Matthias Bluemke

Keyword(s):

Measurement Error ◽

Statistical Models ◽

Test Scores ◽

Large Scale ◽

Equation Modeling ◽

Model Parameters ◽

Advantages And Disadvantages ◽

Point Estimates ◽

Secondary Analyses ◽

Large Scale Assessments

AbstractMeasures of cognitive or socio-emotional skills from large-scale assessments surveys (LSAS) are often based on advanced statistical models and scoring techniques unfamiliar to applied researchers. Consequently, applied researchers working with data from LSAS may be uncertain about the assumptions and computational details of these statistical models and scoring techniques and about how to best incorporate the resulting skill measures in secondary analyses. The present paper is intended as a primer for applied researchers. After a brief introduction to the key properties of skill assessments, we give an overview over the three principal methods with which secondary analysts can incorporate skill measures from LSAS in their analyses: (1) as test scores (i.e., point estimates of individual ability), (2) through structural equation modeling (SEM), and (3) in the form of plausible values (PVs). We discuss the advantages and disadvantages of each method based on three criteria: fallibility (i.e., control for measurement error and unbiasedness), usability (i.e., ease of use in secondary analyses), and immutability (i.e., consistency of test scores, PVs, or measurement model parameters across different analyses and analysts). We show that although none of the methods are optimal under all criteria, methods that result in a single point estimate of each respondent’s ability (i.e., all types of “test scores”) are rarely optimal for research purposes. Instead, approaches that avoid or correct for measurement error—especially PV methodology—stand out as the method of choice. We conclude with practical recommendations for secondary analysts and data-producing organizations.

Download Full-text

The interplay of g and mathematical abilities in large-scale assessments across grades

Intelligence ◽

10.1016/j.intell.2017.05.001 ◽

2017 ◽

Vol 63 ◽

pp. 33-44 ◽

Cited By ~ 8

Author(s):

Steffani Saß ◽

Nele Kampa ◽

Olaf Köller

Keyword(s):

Large Scale ◽

Mathematical Abilities ◽

Large Scale Assessments

Download Full-text

Math proficiency prediction in computer-based international large-scale assessments using a multi-class machine learning model

10.1109/sisy52375.2021.9582522 ◽

2021 ◽

Author(s):

Aleksandar Pejic ◽

Piroska Stanic Molcer ◽

Kristian Gulaci

Keyword(s):

Machine Learning ◽

Large Scale ◽

Learning Model ◽

Machine Learning Model ◽

Computer Based ◽

Large Scale Assessments ◽

Math Proficiency

Download Full-text

The Pitfalls and Potentials of Classroom and Large-Scale Assessments

International Trends in Educational Assessment ◽

10.1163/9789004393455_005 ◽

2018 ◽

pp. 51-61

Keyword(s):

Large Scale ◽

Large Scale Assessments

Download Full-text

Large-scale assessments of students’ learning and education policy: synthesising evidence across world regions

Research Papers in Education ◽

10.1080/02671522.2016.1225353 ◽

2016 ◽

Vol 31 (5) ◽

pp. 578-594 ◽

Cited By ~ 4

Author(s):

Mollie Tobin ◽

Dita Nugroho ◽

Petra Lietz

Keyword(s):

Education Policy ◽

Large Scale ◽

World Regions ◽

Large Scale Assessments

Download Full-text

FROM PISA TO EDUCATIONAL STANDARDS: THE IMPACT OF LARGE-SCALE ASSESSMENTS ON SCIENCE EDUCATION IN GERMANY

International Journal of Science and Mathematics Education ◽

10.1007/s10763-010-9206-7 ◽

2010 ◽

Vol 8 (3) ◽

pp. 545-563 ◽

Cited By ~ 53

Author(s):

Knut Neumann ◽

Hans E. Fischer ◽

Alexander Kauertz

Keyword(s):

Science Education ◽

Large Scale ◽

Educational Standards ◽

Large Scale Assessments ◽

The Impact

Download Full-text

Reading skills of students in different school tracks: Systematic (dis)advantages based on item formats in large scale assessments

Zeitschrift für Erziehungswissenschaft ◽

10.1007/s11618-015-0645-3 ◽

2015 ◽

Vol 18 (4) ◽

pp. 781-801

Author(s):

Franziska Schwabe ◽

Nele McElvany ◽

Matthias Trendtel

Keyword(s):

Large Scale ◽

Reading Skills ◽

Large Scale Assessments

Download Full-text

Policies and Practices of Assessment: A Showcase for the Use (and Misuse) of International Large Scale Assessments in Educational Effectiveness Research

International Perspectives in Educational Effectiveness Research ◽

10.1007/978-3-030-44810-3_7 ◽

2020 ◽

pp. 147-181

Author(s):

Eckhard Klieme

Keyword(s):

Large Scale ◽

Effectiveness Research ◽

Educational Effectiveness ◽

Large Scale Assessments ◽

Educational Effectiveness Research ◽

Policies And Practices

Download Full-text

Analytics in International Large-Scale Assessments: Item Response Theory and Population Models

Handbook of International Large-Scale Assessment ◽

10.1201/b16061-12 ◽

2013 ◽

pp. 169-188

Keyword(s):

Item Response Theory ◽

Item Response ◽

Large Scale ◽

Population Models ◽

Response Theory ◽

Large Scale Assessments

Download Full-text