scholarly journals Game-Theoretic Question Selection for Tests

2017 ◽  
Vol 59 ◽  
pp. 437-462
Author(s):  
Yuqian Li ◽  
Vincent Conitzer

Conventionally, the questions on a test are assumed to be kept secret from test takers until the test. However, for tests that are taken on a large scale, particularly asynchronously, this is very hard to achieve. For example, TOEFL iBT and driver's license test questions are easily found online. This also appears likely to become an issue for Massive Open Online Courses (MOOCs, as offered for example by Coursera, Udacity, and edX). Specifically, the test result may not reflect the true ability of a test taker if questions are leaked beforehand. In this paper, we take the loss of confidentiality as a fact. Even so, not all hope is lost as the test taker can memorize only a limited set of questions' answers, and the tester can randomize which questions to let appear on the test. We model this as a Stackelberg game, where the tester commits to a mixed strategy and the follower responds. Informally, the goal of the tester is to best reveal the true ability of a test taker, while the test taker tries to maximize the test result (pass probability or score). We provide an exponential-size linear program formulation that computes the optimal test strategy, prove several NP-hardness results on computing optimal test strategies in general, and give efficient algorithms for special cases (scored tests and single-question tests). Experiments are also provided for those proposed algorithms to show their scalability and the increase of the tester's utility relative to that of the uniform-at-random strategy. The increase is quite significant when questions have some correlation---for example, when a test taker who can solve a harder question can always solve easier questions.

Forests ◽  
2021 ◽  
Vol 12 (6) ◽  
pp. 809
Author(s):  
Denys Yemshanov ◽  
Robert G. Haight ◽  
Ning Liu ◽  
Rob Rempel ◽  
Frank H. Koch ◽  
...  

When adopted, wildlife protection policies in Canadian forests typically cover large areas and affect multiple economic agents working in these landscapes. Such measures are likely to increase the costs of timber for forestry companies operating in the area, which may hinder their acceptance of the policies unless harvesting remains profitable. We propose a bi-level wildlife protection problem that accounts for the profit-maximizing behavior of forestry companies operating in an area subject to protection. We consider the regulator with a wildlife protection mandate and forestry companies licensed to harvest public forest lands. We depict the relationship between the regulator and forestry companies as a leader-follower Stackelberg game. The leader sets the protected area target for each license area and the followers adjust their strategies to maximize payoffs while meeting the protection target set by the leader. The leader’s objective is to maximize the area-wide protection of spatially contiguous habitat while accounting for the followers’ profit-maximizing behavior. We apply the approach to investigate habitat protection policies for woodland caribou in the Churchill range, Ontario, Canada. We compare the game-theoretic solutions with solutions that do not consider the forest companies’ objectives and also with solutions equalizing the revenue losses among the companies.


2013 ◽  
Vol 221 (3) ◽  
pp. 190-200 ◽  
Author(s):  
Jörg-Tobias Kuhn ◽  
Thomas Kiefer

Several techniques have been developed in recent years to generate optimal large-scale assessments (LSAs) of student achievement. These techniques often represent a blend of procedures from such diverse fields as experimental design, combinatorial optimization, particle physics, or neural networks. However, despite the theoretical advances in the field, there still exists a surprising scarcity of well-documented test designs in which all factors that have guided design decisions are explicitly and clearly communicated. This paper therefore has two goals. First, a brief summary of relevant key terms, as well as experimental designs and automated test assembly routines in LSA, is given. Second, conceptual and methodological steps in designing the assessment of the Austrian educational standards in mathematics are described in detail. The test design was generated using a two-step procedure, starting at the item block level and continuing at the item level. Initially, a partially balanced incomplete item block design was generated using simulated annealing, whereas in a second step, items were assigned to the item blocks using mixed-integer linear optimization in combination with a shadow-test approach.


2018 ◽  
Vol 16 (1) ◽  
pp. 67-76
Author(s):  
Disyacitta Neolia Firdana ◽  
Trimurtini Trimurtini

This research aimed to determine the properness and effectiveness of the big book media on learning equivalent fractions of fourth grade students. The method of research is Research and Development  (R&D). This study was conducted in fourth grade of SDN Karanganyar 02 Kota Semarang. Data sources from media validation, material validation, learning outcomes, and teacher and students responses on developed media. Pre-experimental research design with one group pretest-posttest design. Big book developed consist of equivalent fractions material, students learning activities sheets with rectangle and circle shape pictures, and questions about equivalent fractions. Big book was developed based on students and teacher needs. This big book fulfill the media validity of 3,75 with very good criteria and scored 3 by material experts with good criteria. In large-scale trial, the result of students posttest have learning outcomes completness 82,14%. The result of N-gain calculation with result 0,55 indicates the criterion “medium”. The t-test result 9,6320 > 2,0484 which means the average of posttest outcomes is better than the average of pretest outcomes. Based on that data, this study has produced big book media which proper and effective as a media of learning equivalent fractions of fourth grade elementary school.


2012 ◽  
Vol 424-425 ◽  
pp. 132-136
Author(s):  
Guo Jin Chen ◽  
Zhang Ming Peng ◽  
Jian Guo Yang ◽  
Qiao Ying Huang

On the diesel engine’s test bed, this paper has studied the parameters regarding the diesel engine’s rotational speed, the piston ring’s width and wearing capacity and so on, and their relation with the output signal of the magnetoresistive sensor under the reverse drawing of the diesel engine. The research discovered that the piston ring’s wear and the magnetoresistive sensor’s output have the corresponding relationship. And on the oil tanker with the 6RTA52U diesel engine, the influence of the diesel engine’s operating parameters and the load situations to the magnetoresistive sensor’s output is surveyed under four kinds of different operating modes. The test result and the research conclusion provide the technical foundation for the online Wear monitoring of the large-scale marine diesel engine’s piston ring.


2013 ◽  
Vol 2013 ◽  
pp. 1-10 ◽  
Author(s):  
Yanbing Liu ◽  
Tao Wu ◽  
Jun Huang ◽  
Shousheng Jia

Wireless mesh networks (WMNs) are a promising networking paradigm for next generation wireless networking system. Power control plays a vital role in WMNs and is realized to be a crucial step toward large-scale WMNs deployment. In this paper, we address the problem of how to allocate the power for both optimizing quality of service (QoS) and saving the power consumption in WMNs based on the game theory. We first formulate the problem as a noncooperative game, in which the QoS attributes and the power of each node are defined as a utility function, and all the nodes attempt to maximize their own utility. In such game, we correlate all the interfering nodes to be an interfering object and the receiving node to be the interfering object's virtual destination node. We then present an equilibrium solution for the noncooperative game using Stackelberg model, and we propose an iterative, distributed power control algorithm for WMNs. Also, we conduct numeric experiments to evaluate the system performance, our results show that the proposed algorithm can balance nodes to share the limited network resources and maximize total utility, and thus it is efficient and effective for solving the power control problem in WMNs.


Author(s):  
Alessandro Achille ◽  
Giovanni Paolini ◽  
Glen Mbeng ◽  
Stefano Soatto

Abstract We introduce an asymmetric distance in the space of learning tasks and a framework to compute their complexity. These concepts are foundational for the practice of transfer learning, whereby a parametric model is pre-trained for a task, and then fine tuned for another. The framework we develop is non-asymptotic, captures the finite nature of the training dataset and allows distinguishing learning from memorization. It encompasses, as special cases, classical notions from Kolmogorov complexity and Shannon and Fisher information. However, unlike some of those frameworks, it can be applied to large-scale models and real-world datasets. Our framework is the first to measure complexity in a way that accounts for the effect of the optimization scheme, which is critical in deep learning.


2020 ◽  
Vol 55 (5-6) ◽  
pp. 273-281
Author(s):  
S. Surkov

In this article, a theoretical analysis of the flows arising in the cross sections of fluid and gas flows is performed. Such flows are subdivided into secondary flows and coherent structures. From experimental studies it is known that both types of flows are long-lived large-scale movements (LSM) stretched along the flow. The relative stability of the vortices is traditionally explained by the fact that the viscous friction forces that inhibit the rotation are compensated by the intensification of the swirl when moving slowly rotating peripheral layers to the center of the vortex due to longitudinal tension. An analysis of this mechanism made it possible to develop a relatively simple model of vortex structures in which the viscous friction forces and axial expansion are considered to be infinitesimal. Under these assumptions, one can use the equations of motion of an ideal fluid in the variables “stream function - vorticity”. It is shown that under certain assumptions these equations take the form of a wave equation, and the boundary conditions are the condition that the stream function on the solid walls of the flow equals zero. The obtained solutions of the wave equation describe the following special cases: Goertler’s vortices between rotating cylinders, secondary flows in a pipe with a square cross section, swirling flow in a round pipe, paired vortex after bend of the pipe. The physical sense of more complex solutions of the wave equation has become clear relatively recently. Very similar structures were found in experimental studies using orthogonal decomposition (POD) of a turbulent pulsations field. This may mean that the eigenfunctions in the POD correspond to coherent structures that really arise in the flow. The results obtained confirm the hypothesis that secondary flows and coherent structures have a common nature. The solutions obtained in this paper can be used in processing the experiment as eigenfunctions for the orthogonal decomposition method. In addition, they can be used in direct numerical simulation (DNS) of turbulent flows


2020 ◽  
Author(s):  
Pia Jokela ◽  
Anu E Jääskeläinen ◽  
Hanna Jarva ◽  
Tanja Holma ◽  
Maarit J Ahava ◽  
...  

AbstractRapid sample-to-answer tests for detection of SARS-CoV-2 are emerging and data on their relative performance is urgently needed. We evaluated the analytical performance of two rapid nucleic acid tests, Cepheid Xpert® Xpress SARS-CoV-2 and Mobidiag Novodiag® Covid-19, in comparison to a combination reference of three large-scale PCR tests. Moreover, utility of the Novodiag® test in tertiary care emergency departments was assessed. In the preliminary evaluation, analysis of 90 respiratory samples resulted in 100% specificity and sensitivity for Xpert®, whereas analysis of 107 samples resulted in 93.4% sensitivity and 100% specificity for Novodiag®. Rapid SARS-CoV-2 testing with Novodiag® was made available for four tertiary care emergency departments in Helsinki, Finland between 18 and 31 May, coinciding with a rapidly declining epidemic phase. Altogether 361 respiratory specimens, together with relevant clinical data, were analyzed with Novodiag® and reference tests: 355/361 of the specimens were negative with both methods, and 1/361 was positive in Novodiag® and negative by the reference method. Of the 5 remaining specimens, two were negative with Novodiag®, but positive with the reference method with late Ct values. On average, a test result using Novodiag® was available nearly 8 hours earlier than that obtained with the large-scale PCR tests. While the performance of novel sample-to-answer PCR tests need to be carefully evaluated, they may provide timely and reliable results in detection of SARS-CoV-2 and thus facilitate patient management including effective cohorting.


Sign in / Sign up

Export Citation Format

Share Document