Recognizing Sample-Selection Bias in Historical Data

2020 ◽  
Vol 44 (3) ◽  
pp. 525-554 ◽  
Author(s):  
Ariell Zimran

ABSTRACTRecent research has ignited a debate in social science history over whether and how to draw conclusions for whole populations from sources that describe only select subsets of these populations. The idiosyncratic availability and survival of historical sources create a threat of sample-selection bias—an error that arises when there are systematic differences between the observed sample and the population of interest. This danger is common in studying trends in health as measured by average stature—scholars can often observe these trends only for soldiers and other similar groups; but whether these patterns are representative of those of the broader population is unclear. This article illustrates what simple patterns in a potentially selected sample can be used to recognize the presence of sample-selection bias in a source, and to understand how such bias might affect conclusions drawn from this source. Applying this intuition to the use of military data to describe stature in the antebellum United States, I present several simple empirical exercises based on these patterns. Finally, I use the results of these exercises to describe how sample-selection bias might affect the use of these data in testing for differences in average stature between the Northeast and the Midwest.

2020 ◽  
Vol 44 (3) ◽  
pp. 411-416 ◽  
Author(s):  
Kris Inwood ◽  
Hamish Maxwell-Stewart

AbstractHistorians and social scientists routinely, and inevitably, rely on sources that are unrepresentative of the past. The articles in this special issue of the journal illustrate the widespread prevalence of selection bias in historical sources, and the ways in which historians negotiate this challenge to reach useful conclusions from valuable, if imperfect sources.


1988 ◽  
Vol 12 (2) ◽  
pp. 197-215
Author(s):  
Aage R. Clausen ◽  
Jaushieh Joseph Wu

In a recent article in Social Science History, Professor Alan Bogue, former president of the Association and one of its founding fathers, has reviewed the first ten years of SSHA. In it he presents from the constitution the major purpose of the Association as “improving ‘the quality of historical explanation in every manner possible, but particularly by encouraging the selective use and adaptation in historical research and teaching of relevant social science’” (Bogue, 1987: 336). In this paper, we review the first ten years of the Social Science History journal in the context of an association formed to promote social science applications to the analysis of historical data. One indicator of the success of this enterprise is the extent to which historians are applying social science methods. Another indicator is the involvement of non-historians in social science history.


1985 ◽  
Vol 9 (2) ◽  
pp. 167-184
Author(s):  
Chad Gaffield ◽  
Peter Baskerville

The basis of most historical research including social science history is quite unsystematic. This characteristic results from the ways in which researchers find and choose historical sources for examination. Despite claims to be systematic, historians still tend to identify relevant evidence in impressionistic ways. Many social science histories involve the rigorous study of a source happily discovered by chance. Of course, access to the past has never been easy. Researchers have always lamented a presumed lack of “essential” records. Nonetheless, the actual ways we discover existing evidence have received little attention despite the fact that this process is fraught with difficulties and hidden dangers especially for researchers of a social scientific bent. Do not the presuppositions of social science history extend to the identification of sources? How do we know when we have all the “relevant data” for a particular project? Can systematic data analysis be justifiably built upon unsystematic identification of sources?


1991 ◽  
Vol 15 (3) ◽  
pp. 363-365
Author(s):  
Joy K. Lintelman

The articles on American domestic service that appear in this issue of Social Science History were part of a 1989 Social Science History Association Annual Meeting session. They reveal that, despite the investigations of Katzman (1978), Sutherland (1981), Dudden (1983), Glenn (1986), and others, there is still a great deal to know about domestic service in the United States. Each article offers a different perspective on transformations within domestic service in the United States in the nineteenth and twentieth centuries and provides new information about different demographic categories of domestic servants. Taken together, they suggest creative new ways of understanding the occupation and its relationship to race, ethnicity, gender, and the industrial labor market.


2013 ◽  
Vol 12 (2) ◽  
pp. 433-462 ◽  
Author(s):  
TOMER BROUDE ◽  
MICHAEL MOORE

AbstractThis unappealed Panel Report deals with now standard controversies involving US zeroing practices, but also involves a number of novel problems in administrative reviews of US anti-dumping orders that transcend zeroing issues. Most importantly, this dispute highlights the economic, legal, and statistical importance of sample-selection bias when calculating ‘all others’ rates for exporters that were not queried during dumping investigations. Sampling is particularly problematic in this dispute since US investigators found only zero and de minimis margins in the administrative reviews, a situation in which the relevant provision of the Anti-Dumping Agreement appears to provide no guidance (an apparent ‘lacuna’). The Panel did not directly deal with the key sample-selection issues in the case, and so we provide an alternative legal and statistical analysis. These issues are likely to become more important as the practice of zeroing is phased out in the United States. Indeed, sampling may well be the new zeroing.


2019 ◽  
Vol 79 (1) ◽  
pp. 99-138 ◽  
Author(s):  
Ariell Zimran

After adjusting for sample-selection bias, I find a net decline in average stature of 0.64 inches in the birth cohorts of 1832–1860 in the United States. This result supports the veracity of the Antebellum Puzzle—a deterioration of health during early modern economic growth in the United States. However, this adjustment alters the trend in average stature in the same cohort range, validating concerns over bias in the historical heights literature. The adjustment is based on census-linked military height data and uses a two-step semi-parametric sample-selection model to adjust for selection on observables and unobservables.


2015 ◽  
Vol 23 (1) ◽  
pp. 113-126 ◽  
Author(s):  
Leandro De Magalhaes

High rerunning rates among incumbents and among the two major parties allow studies of U.S. incumbency advantage to bypass the selection problem of who chooses to rerun. In countries where rerunning is not widespread among individuals or parties, estimation using methods developed for the United States may result in a sample selection bias. In countries with party switching, there may be a disconnect between party and individual estimates. This article proposes a definition of incumbency advantage that is valid for countries that present any of these characteristics and that is valid for cross-country comparison: the effect of incumbency for anindividualpolitician on theunconditionalprobability of winning. I illustrate the issues raised in this article with evidence from Brazilian mayoral elections.


2018 ◽  
pp. 43-51
Author(s):  
Osamu Saito

This personal reflection of more than 40 years' work on the supply of labour in a household context discusses the relationship between social science history (the application to historical phenomena of the tools developed by social scientists) and local population studies. The paper concludes that historians working on local source materials can give something new back to social scientists and social science historians, urging them to remake their tools.


Sign in / Sign up

Export Citation Format

Share Document