scholarly journals How Uncertainty about Privacy and Confidentiality is Hampering Efforts to More Effectively Use Administrative Records in Producing U.S. National Statistics

Author(s):  
Gerald W. Gates

U.S. federal statistical agencies continually face challenges in obtaining and using administrative records and in providing useful analytic products to support policy analysis and program planning. At each of three decision points—obtaining the administrative data, integrating the data into statistical programs, and releasing useful data products—concerns over privacy and confidentiality determine to a great extent how effectively these data are used. Although there is a long history of relevant research on privacy attitudes and methodologies to protect conconfidentiality in published data, agency decisions to share or publish data are not necessarily informed by known risks. Additional research is proposed to help identify and manage these risks. The paper also proposes government actions to ensure that U.S. federal statistical agencies are meeting the nation's data needs through the appropriate application of survey and administrative data.

Author(s):  
Margo J. Anderson ◽  
William Seltzer

The roots of the modern concept of statistical confidentiality in the US federal statistical system can be traced directly back to the late nineteenth century efforts of statisticians to ensure full and accurate responses by businesses to statistical inquiries. Officials argued that such confidentiality guarantees were needed to ensure that the providers of enterprise and establishment data could be confident that the statistical agencies could not be forced to share their responses with others, such as regulatory or tax authorities, congressional investigators, prying journalists, and competitors, who might use this information to the detriment of the data provider. Nevertheless, over the years, the principle of statistical confidentiality with respect to information provided by businesses in statistical inquiries has been repeatedly challenged by other executive branch departments, independent regulatory agencies, the courts, Congress, and members of the public, with quite varied results. The paper uses the published record and archival research to examine the history of challenges to statistical confidentiality, and the responses of the statistical agencies, the federal statistical system as a whole, including the office of the chief statistician in OMB (and its predecessors), executive department and independent non-statistical agencies, the courts, and Congress as well as representatives of the business community. Long-term trends and the implications for maintaining and strengthening the confidentiality protections for establishment- and enterprise-level business data provided to federal agencies for statistical purposes are discussed.


2008 ◽  
Vol 42 ◽  
pp. 36-54
Author(s):  
M. S. Kulikovskiy

Main trends of diatom studies in Sphagnum bogs of Russia and adjacent countries are shown. On the basis of published data, making use of modern taxonomical changes, the taxonomical list of diatoms from the considered Sphagnum bogs is presented.


Author(s):  
L.V. Vetchinnikova ◽  
◽  
A.F. Titov ◽  
◽  

The article reports on the application of the best known principles for mapping natural populations of curly (Karelian) birch Betula pendula Roth var. carelica (Mercklin) Hämet-Ahti – one of the most appealing representatives of the forest tree flora. Relying on the synthesis and analysis of the published data amassed over nearly 100 years and the data from own full-scale studies done in the past few decades almost throughout the area where curly birch has grown naturally, it is concluded that its range outlined in the middle of the 20th century and since then hardly revised is outdated. The key factors and reasons necessitating its revision are specified. Herewith it is suggested that the range is delineated using the population approach, and the key element will be the critical population size below which the population is no longer viable in the long term. This approach implies that the boundaries of the taxon range depend on the boundaries of local populations (rather than the locations of individual trees or small clumps of trees), the size of which should not be lower than the critical value, which is supposed to be around 100–500 trees for curly birch. A schematic map of the curly birch range delineated using this approach is provided. We specially address the problem of determining the minimum population size to secure genetic diversity maintenance. The advantages of the population approach to delineating the distribution range of curly birch with regard to its biological features are highlighted. The authors argue that it enables a more accurate delineation of the range; shows the natural evolutionary history of the taxon (although it is not yet officially recognized as a species) and its range; can be relatively easily updated (e.g. depending on the scope of reintroduction); should be taken into account when working on the strategy of conservation and other actions designed to maintain and regenerate this unique representative of the forest tree flora.


1998 ◽  
Vol 11 (1) ◽  
pp. 396-396
Author(s):  
I. Pustylnik

We study the short-time evolutionary history of the well-known contact binary VW Cep. Our analysis is based partly on the numerous UBV lightcurves obtained at Tartu Observatory, IUE spectra, and samples from the published data. Special attention is given to the effects of asymmetry of the light curves. A higher degree of asymmetry outside the eclipses along with the significant displacements of the brightness maxima in respect to the elongation phase is interpreted as evidence that a considerable portion of the flaring source is concentrated close to the neck connecting the components. We discuss the nature of asymmetry in terms of possible mass exchange and the flare activity and compare the results of our model computations with the record of orbital period variations over the last 60 years.


Author(s):  
Brian Foley ◽  
Tony Champion ◽  
Ian Shuttleworth

AbstractThe paper compares and contrasts internal migration measured by healthcard-based administrative data with census figures. This is useful because the collection of population data, its processing, and its dissemination by statistical agencies is becoming more reliant on administrative data. Statistical agencies already use healthcard data to make migration estimates and are increasingly confident about local population estimates from administrative sources. This analysis goes further than this work as it assesses how far healthcard data can produce reliable data products of the kind to which academics are accustomed. It does this by examining migration events versus transitions over a full intercensal period; population flows into and out of small areas; and the extent to which it produces microdata on migration equivalent to that in the census. It is shown that for most demographic groups and places healthcard data is an adequate substitute for census-based migration counts, the exceptions being for student households and younger people. However, census-like information is still needed to provide covariates for analysis and this will still be required whatever the future of the traditional census.


1974 ◽  
Vol 8 (2) ◽  
pp. 20-34 ◽  
Author(s):  
Virginia H. Gibbons

Dates in parentheses at the end of each statement represent the combined holdings of the Stanford University-Hoover Institution libraries and are meant to serve as a guide to the publication history of the documents.The bibliography is arranged by country and then by issuing agency. The Arabic form of the agency has been used when available.This bibliography is not a comprehensive listing, but rather serves as an introduction to the wealth of material buried in the confusing array of publications of statistical agencies in the Middle East.


Author(s):  
Gianluca Miglio ◽  
Lara Basso ◽  
Lucrezia G. Armando ◽  
Sara Traina ◽  
Elisa Benetti ◽  
...  

In a Drug Prescription Network (DPN), each drug is represented as a node and two drugs co-prescribed to the same patient are represented as an edge linking the nodes. The use of DPNs is a novel approach that has been proposed as a means to study the complexity of drug prescription. The aim of this study is to demonstrate the analytical power of the DPN-based approach when it is applied to the analysis of administrative data. Drug prescription data that were collected at a local health unit (ASL TO4, Regione Piemonte, Italy), over a 12-month period (July 2018–June 2019), were used to create several DPNs that correspond to the five levels of the Anatomical Therapeutic Chemical classification system. A total of 5,431,335 drugs prescribed to 361,574 patients (age 0–100 years; 54.7% females) were analysed. As indicated by our results, the DPNs were dense networks, with giant components that contain all nodes. The disassortative mixing of node degrees was observed, which implies that non-random connectivity exists in the networks. Network-based methods have proven to be a flexible and efficient approach to the analysis of administrative data on drug prescription.


Author(s):  
Amy O’Hara ◽  
Rachel M. Shattuck ◽  
Robert M. Goerge

Linkage of federal, state, and local administrative records to survey data holds great promise for research on families, in particular research on low-income families. Researchers can use administrative records in conjunction with survey data to better measure family relationships and to capture the experiences of individuals and family members across multiple points in time and social and economic domains. Administrative data can be used to evaluate program participation in government social welfare programs, as well as to evaluate the accuracy of reporting on receipt of such benefits. Administrative records can also be used to enhance collection and accuracy of survey and census data and to improve coverage of hard-to-reach populations. This article discusses potential uses of linked administrative and survey data, gives an overview of the linking methodology and infrastructure (including limitations), and reviews social science literature that has used this method to date.


2016 ◽  
Vol 39 (2) ◽  
pp. 73 ◽  
Author(s):  
Mohamad A Hussain ◽  
Muhammad Mamdani ◽  
Gustavo Saposnik ◽  
Jack V Tu ◽  
David Turkel-Parrella ◽  
...  

Purpose: The positive predictive value (PPV) of carotid endarterectomy (CEA) and carotid artery stenting (CAS) procedure and post-operative complication coding were assessed in Ontario health administrative databases. Methods: Between 1 April 2002 and 31 March 2014, a random sample of 428 patients were identified using Canadian Classification of Health Intervention (CCI) procedure codes and Ontario Health Insurance Plan (OHIP) billing codes from administrative data. A blinded chart review was conducted at two high-volume vascular centers to assess the level of agreement between the administrative records and the corresponding patients’ hospital charts. PPV was calculated with 95% confidence intervals (CIs) to estimate the validity of CEA and CAS coding, utilizing hospital charts as the gold standard. Sensitivity of CEA and CAS coding were also assessed by linking two independent databases of 540 CEA-treated patients (Ontario Stroke Registry) and 140 CAS-treated patients (single-center CAS database) to administrative records. Results: PPV for CEA ranged from 99% to 100% and sensitivity ranged from 81.5% to 89.6% using CCI and OHIP codes. A CCI code with a PPV of 87% (95% CI, 78.8-92.9) and sensitivity of 92.9% (95% CI, 87.4-96.1) in identifying CAS was also identified. PPV for post-admission complication diagnosis coding was 71.4% (95% CI, 53.7-85.4) for stroke/transient ischemic attack, and 82.4% (95% CI, 56.6-96.2) for myocardial infarction. Conclusions: Our analysis demonstrated that the codes used in administrative databases accurately identify CEA and CAS-treated patients. Researchers can confidently use administrative data to conduct population-based studies of CEA and CAS.


Author(s):  
Misty L Heggeness

The availability and excessiveness of alternative (non-survey) data sources, collected on a daily, hourly, and sometimes second-by-second basis, has challenged the federal statistical system to update existing protocol for developing official statistics. Federal statistical agencies collect data primarily through survey methodologies built on frames constructed from administrative records. They compute survey weights to adjust for non-response and unequal sampling probabilities, impute answers for nonresponse, and report official statistics via tabulations from these survey. The U.S. federal government has rigorously developed these methodologies since the advent of surveys -- an innovation produced by the urgent desire of Congress and the President to estimate annual unemployment rates of working age men during the Great Depression. In the 1930s, Twitter did not exist; high-scale computing facilities were not abundant let alone cheap, and the ease of the ether was just a storyline from the imagination of fiction writers. Today we do have the technology, and an abundance of data, record markers, and alternative sources, which, if curated and examined properly, can help enhance official statistics. Researchers at the Census Bureau have been experimenting with administrative records in an effort to understand how these alternative data sources can improve our understanding of official statistics. Innovative projects like these have advanced our knowledge of the limitations of survey data in estimating official statistics. This paper will discuss advances made in linking administrative records to survey data to-date and will summarize the research on the impact of administrative records on official statistics.


Sign in / Sign up

Export Citation Format

Share Document