Desarrollo de un algoritmo en Python para la simulación y análisis de fiabilidad de los test multirrespuesta = Development of a Python algorithm to simulate and analyze the reliability of multiple choice tests to evaluate the student knowledge

María José García Tárrago

doi:10.20868/abe.2020.2.4461

Advances in Building Education ◽

10.20868/abe.2020.2.4461 ◽

2020 ◽

Vol 4 (2) ◽

pp. 20

Author(s):

María José García Tárrago

Keyword(s):

Empirical Evaluation ◽

Multiple Choice ◽

Knowledge Level ◽

Student Knowledge ◽

Multiple Choice Tests ◽

Choice Tests ◽

High Knowledge ◽

Educational Innovation ◽

Final Score ◽

Answer Choice

A large number of publications address the reliability of multiple-choice tests for assessing students in higher education. The number of options per question, the scoring system (positive or negative marking), credit for partial knowledge, the total number of questions… The combination of all these parameters illustrates the variety of configurations that can be set up when designing a test. Is there an optimal model or configuration? For years, researchers in educational innovation have tried to answer this question using probability calculations and various empirical evaluations.

In this research, an algorithm written in Python was developed to generate a set of hypothetical students with specific characteristics and abilities (real knowledge, level of caution…). A high level of knowledge implies a high probability of knowing whether a given answer option is true or not. A student's level of caution is related to the probability that the student risks answering a question without being sure of the answer; it is therefore a measure of the student's willingness to take risks. The algorithm administers tests to a specified number of hypothetical students and analyses the deviation between the real knowledge (an intrinsic characteristic of each student) and the knowledge estimated by the test.

Once the algorithm had been developed, it was run with the different input parameters in order to validate it and to observe their influence on the final test score.

Abstract

There is an extensive literature on the reliability of true/false and multiple-choice tests and their application in higher education. Choices per question, positive or negative marking, rewards for partial knowledge, how long the test should be… The combination of all these parameters shows the wide range of test setups that an examiner can design. Is there an optimal configuration? Extensive educational research has tried to answer this question using probability calculations and empirical evaluations.

In this investigation, a novel algorithm was written in Python to generate hypothetical examinees with specific features (real knowledge, degree of over-cautiousness, fatigue limit…). A high knowledge level implies a high probability of knowing whether an answer choice is true or false in a multiple-choice question. Over-cautiousness is related to the probability of answering an unknown question, that is, to the examinee's willingness to take risks. Finally, fatigue is directly related to the number of questions in the test: beyond the examinee's fatigue limit, the knowledge level is reduced and the over-cautiousness is increased. The algorithm administers tests to the hypothetical examinees and analyses the deviation between the real knowledge (a feature of the examinee) and the estimated knowledge.

This algorithm was used to optimize the parameters of a test (length of test, choices per question, scoring system…) to reduce the influence of fatigue and over-cautiousness on the final score. An empirical evaluation comparing different test setups was performed to verify and validate the algorithm.
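The kind of simulation the abstract describes can be illustrated with a minimal Python sketch. This is not the author's algorithm, which the abstract does not specify in detail. Every name here (`Examinee`, `take_test`, `mean_deviation`, `fatigue_step`) is hypothetical, and the model rests on simplifying assumptions: knowledge is applied per question rather than per answer choice, a cautious examinee leaves an unknown question blank, fatigue shifts knowledge and caution by a fixed step, and the estimated knowledge is the normalized score clipped at zero.

```python
import random
from dataclasses import dataclass
from statistics import mean


@dataclass
class Examinee:
    knowledge: float      # real knowledge: chance of knowing a question's answer
    caution: float        # chance of leaving an unknown question blank
    fatigue_limit: int    # number of questions answered before fatigue sets in


def take_test(ex, n_questions, n_choices, penalty, rng, fatigue_step=0.1):
    """Return the estimated knowledge (normalized score) for one simulated test."""
    score = 0.0
    for q in range(n_questions):
        tired = q >= ex.fatigue_limit
        # Assumed fatigue effect: knowledge drops and caution rises by a fixed step.
        knowledge = max(0.0, ex.knowledge - fatigue_step) if tired else ex.knowledge
        caution = min(1.0, ex.caution + fatigue_step) if tired else ex.caution
        if rng.random() < knowledge:
            score += 1.0                      # knows the answer
        elif rng.random() < caution:
            continue                          # too cautious to guess: left blank
        elif rng.random() < 1.0 / n_choices:
            score += 1.0                      # lucky blind guess
        else:
            score -= penalty                  # wrong guess, negative marking
    # Assumption: a negative raw score maps to zero estimated knowledge.
    return max(0.0, score / n_questions)


def mean_deviation(examinees, n_questions, n_choices, penalty, seed=0):
    """Average of (estimated knowledge - real knowledge) over all examinees."""
    rng = random.Random(seed)
    return mean(
        take_test(ex, n_questions, n_choices, penalty, rng) - ex.knowledge
        for ex in examinees
    )


if __name__ == "__main__":
    pop_rng = random.Random(42)
    population = [
        Examinee(pop_rng.random(), pop_rng.random(), fatigue_limit=40)
        for _ in range(1000)
    ]
    # Compare no negative marking with a penalty of 1 / (n_choices - 1).
    for penalty in (0.0, 1 / 3):
        dev = mean_deviation(population, n_questions=50, n_choices=4, penalty=penalty)
        print(f"penalty={penalty:.2f}  mean deviation={dev:+.3f}")
```

In this sketch a penalty of 1 / (n_choices - 1) makes the expected gain of a single blind guess zero, which is why it is paired with the no-penalty case in the comparison. Varying `n_questions`, `n_choices`, and `penalty` shows how each test setup shifts the gap between real and estimated knowledge under these assumptions.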

Problems and potentialities of e-Learning for regular undergraduate courses in emergency medicine

Revista Brasileira de Educação Médica ◽

10.1590/s0100-55022010000300016 ◽

2010 ◽

Vol 34 (3) ◽

pp. 452-458 ◽

Cited By ~ 2

Author(s):

William Rafaelo Schlinkert ◽

Sandro Scarpelini ◽

Antonio Pazin-Filho

Keyword(s):

Medical Students ◽

Multiple Choice ◽

First Aid ◽

Student Knowledge ◽

Multiple Choice Tests ◽

Reading Material ◽

Learning Techniques ◽

Choice Tests ◽

E Learning ◽

The Impact

BACKGROUND: E-learning techniques are spreading rapidly in medicine, raising concerns about the impact of adopting them. Websites designed specifically to host courses are becoming more common, yet there is a lack of evidence that these systems enhance student knowledge acquisition. GOAL: To evaluate the impact of dedicated-website tools on the cognition of medical students taking a first-aid course. METHODS: Prospective study of 184 medical students taking a twenty-hour first-aid course. We built a dedicated website with several sections (lectures, additional reading material, video and multiple-choice exercises). We constructed variables expressing each student's access to each section. The evaluation was composed of fifty multiple-choice tests based on clinical problems. We used multiple linear regression to adjust for potential confounders. RESULTS: There was no association between intensity of exposure to the website and the outcome: beta coefficient 0.27 (95% CI -0.454 to 1.004). These findings were not altered after adjustment for potential confounders: 0.165 (95% CI -0.628 to 0.960). CONCLUSION: A dedicated website with passive and active capabilities for aiding in-person learning showed no association with a better outcome.

Large scale Rorschach techniques: A manual for the group Rorschach and multiple choice tests (2nd ed., 2nd printing).

10.1037/13988-000 ◽

1973 ◽

Cited By ~ 1

Author(s):

M. R. Harrower ◽

M. E. Steiner

Keyword(s):

Large Scale ◽

Multiple Choice ◽

Multiple Choice Tests ◽

Choice Tests

Two Models for Multiple Choice Tests

PsycEXTRA Dataset ◽

10.1037/e473742008-127 ◽

1968 ◽

Author(s):

J. Brown Grier ◽

Raymond Ditrichs

Keyword(s):

Multiple Choice ◽

Multiple Choice Tests ◽

Choice Tests

Validity and subgroup differences on three- and five-alternative multiple-choice tests

PsycEXTRA Dataset ◽

10.1037/e518422013-815 ◽

2009 ◽

Author(s):

Leonardis L. Bruce ◽

Bryan D. Edwards ◽

Winfred Arthur

Keyword(s):

Multiple Choice ◽

Multiple Choice Tests ◽

Subgroup Differences ◽

Choice Tests

Using multiple-choice tests as learning events

PsycEXTRA Dataset ◽

10.1037/e520562012-782 ◽

2009 ◽

Author(s):

Jeri L. Little ◽

Elizabeth Ligon Bjork ◽

Ashley Kees

Keyword(s):

Multiple Choice ◽

Multiple Choice Tests ◽

Choice Tests

Increasing Benefits in Learning With More Effortful Multiple Choice Tests

PsycEXTRA Dataset ◽

10.1037/e633262013-916 ◽

2013 ◽

Author(s):

Jessica M. Logan ◽

Alda G. Rivas

Keyword(s):

Multiple Choice ◽

Multiple Choice Tests ◽

Choice Tests

Guessing in Multiple-Choice Tests as a Judgment and Choice Problem

PsycEXTRA Dataset ◽

10.1037/e683312011-118 ◽

1999 ◽

Cited By ~ 1

Author(s):

Yigal Attali

Keyword(s):

Multiple Choice ◽

Choice Problem ◽

Multiple Choice Tests ◽

Choice Tests

Cheating Probabilities on Multiple Choice Tests

Journal of Chemical Education ◽

10.1021/ed074p1185 ◽

1997 ◽

Vol 74 (10) ◽

pp. 1185 ◽

Cited By ~ 1

Author(s):

Gaspard T. Rizzuto ◽

Fred Walters

Keyword(s):

Multiple Choice ◽

Multiple Choice Tests ◽

Choice Tests

Multiple-objective optimization applied in extracting multiple-choice tests

Engineering Applications of Artificial Intelligence ◽

10.1016/j.engappai.2021.104439 ◽

2021 ◽

Vol 105 ◽

pp. 104439

Author(s):

Tram Nguyen ◽

Toan Bui ◽

Hamido Fujita ◽

Tzung-Pei Hong ◽

Ho Dac Loc ◽

...

Keyword(s):

Multiple Choice ◽

Multiple Objective ◽

Multiple Objective Optimization ◽

Multiple Choice Tests ◽

Choice Tests

A GROUP APPROACH TO THE ANALYSIS OF INDIVIDUAL DIFFERENCES IN THE RANDOMNESS OF GUESSING BEHAVIOR ON MULTIPLE-CHOICE TESTS AND THE DEVELOPMENT OF SCORING METHODS TO TAKE SUCH DIFFERENCES INTO ACCOUNT

ETS Research Bulletin Series ◽

10.1002/j.2333-8504.1964.tb00697.x ◽

1964 ◽

Vol 1964 (2) ◽

pp. i-113 ◽

Cited By ~ 2

Author(s):

Dorothy Bird Price

Keyword(s):

Individual Differences ◽

Multiple Choice ◽

Scoring Methods ◽

Group Approach ◽

Multiple Choice Tests ◽

Choice Tests
