scholarly journals The Types, Roles, and Practices of Documentation in Data Analytics Open Source Software Libraries: A Collaborative Ethnography of Documentation Work

2018 ◽  
Author(s):  
R. Stuart Geiger ◽  
Nelle Varoquaux ◽  
Charlotte Cabasse ◽  
Christopher Holdgraf

Computational research and data analytics increasingly relies on complex ecosystems of open source software (OSS) "libraries" -- curated collections of reusable code that programmers import to perform a specific task. Software documentation for these libraries is crucial in helping programmers/analysts know what libraries are available and how to use them. Yet documentation for open source software libraries is widely considered low-quality. This article is a collaboration between CSCW researchers and contributors to data analytics OSS libraries, based on ethnographic fieldwork and qualitative interviews. We examine several issues around the formats, practices, and challenges around documentation in these largely volunteer-based projects. There are many different kinds and formats of documentation that exist around such libraries, which play a variety of educational, promotional, and organizational roles. The work behind documentation is similarly multifaceted, including writing, reviewing, maintaining, and organizing documentation. Different aspects of documentation work require contributors to have different sets of skills and overcome various social and technical barriers. Finally, most of our interviewees do not report high levels of intrinsic enjoyment for doing documentation work (compared to writing code). Their motivation is affected by personal and project-specific factors, such as the perceived level of credit for doing documentation work versus more `technical' tasks like adding new features or fixing bugs. In studying documentation work for data analytics OSS libraries, we gain a new window into the changing practices of data-intensive research, as well as help practitioners better understand how to support this often invisible and infrastructural work in their projects.

2018 ◽  
Vol 27 (3-6) ◽  
pp. 767-802 ◽  
Author(s):  
R. Stuart Geiger ◽  
Nelle Varoquaux ◽  
Charlotte Mazel-Cabasse ◽  
Chris Holdgraf

2021 ◽  
Vol 64 (10) ◽  
pp. 85-93
Author(s):  
Jihoon Lee ◽  
Gyuhong Lee ◽  
Jinsung Lee ◽  
Youngbin Im ◽  
Max Hollingsworth ◽  
...  

Modern cell phones are required to receive and display alerts via the Wireless Emergency Alert (WEA) program, under the mandate of the Warning, Alert, and Response Act of 2006. These alerts include AMBER alerts, severe weather alerts, and (unblockable) Presidential Alerts, intended to inform the public of imminent threats. Recently, a test Presidential Alert was sent to all capable phones in the U.S., prompting concerns about how the underlying WEA protocol could be misused or attacked. In this paper, we investigate the details of this system and develop and demonstrate the first practical spoofing attack on Presidential Alerts, using commercially available hardware and modified open source software. Our attack can be performed using a commercially available software-defined radio, and our modifications to the open source software libraries. We find that with only four malicious portable base stations of a single Watt of transmit power each, almost all of a 50,000-seat stadium can be attacked with a 90% success rate. The real impact of such an attack would, of course, depend on the density of cellphones in range; fake alerts in crowded cities or stadiums could potentially result in cascades of panic. Fixing this problem will require a large collaborative effort between carriers, government stakeholders, and cellphone manufacturers. To seed this effort, we also propose three mitigation solutions to address this threat.


F1000Research ◽  
2019 ◽  
Vol 8 ◽  
pp. 2132
Author(s):  
Nomi L. Harris ◽  
Peter J.A. Cock ◽  
Christopher J. Fields ◽  
Bastian Greshake Tzovaras ◽  
Michael Heuer ◽  
...  

The Bioinformatics Open Source Conference is a volunteer-organized meeting that covers open source software development and open science in bioinformatics. Launched in 2000, BOSC has been held every year since. BOSC 2019, the 20th annual BOSC, took place as one of the Communities of Special Interest (COSIs) at the Intelligent Systems for Molecular Biology meeting (ISMB/ECCB 2019). The two-day meeting included a total of 46 talks and 55 posters, as well as eight Birds of a Feather interest groups. The keynote speaker was University of Cape Town professor Dr. Nicola Mulder, who spoke on “Building infrastructure for responsible open science in Africa”. Immediately after BOSC 2019, about 50 people participated in the two-day CollaborationFest (CoFest for short), an open and free community-driven event at which participants work together to contribute to bioinformatics software, documentation, training materials, and use cases.


2016 ◽  
Vol 29 (3) ◽  
pp. 344-360 ◽  
Author(s):  
Malgorzata Ciesielska ◽  
Ann Westenholz

Purpose – The purpose of this paper is to contribute to the literature about the commercial involvement in open source software, levels of this involvement and consequences of attempting to mix various logics of action. Design/methodology/approach – This paper uses the case study approach based on mixed methods: literature reviews and news searches, electronic surveys, qualitative interviews and observations. It combines discussions from several research projects as well as previous publications to present the scope of commercial choices within open source software and their consequences. Findings – The findings show that higher levels of involvement in open source software communities poses important questions about the balance between economic, technological, and social logics as well as the benefits of being autonomous, having access to collaborative networks and minimizing risks related to free-riding. There are six levels of commercial involvement in open source communities, and each of them is characterized by a different dilemma. Originality/value – The paper sheds light on the various level of involvement of business in open source movement and emphasize that the popularized “open innovation” concept is only the first step in real involvement and paradigm shift.


2021 ◽  
Author(s):  
William Billingsley

Typical university Learning Management Systems (LMSs) place an enrolment paywall between students and the content within a unit. This has the effect not only of preventing access from potential students, but also of locking past students out from accessing updated materials as the subject develops over subsequent years to their enrolment. In this and many regards, the mechanisms by which academics can produce and publish content face limitations that open source software documentation sites do not. This provocation paper describes some of these limitations and gives an overview of the JamStack – common techniques that have developed within the software development community that allow convenient self-publishing of sites and materials. The paper then gives a brief introduction to Doctacular: a course-oriented static site generator that is under development (but already used for two live sites) to bring JamStack-style publishing to academic course materials.


2012 ◽  
Vol 3 (4) ◽  
pp. 1-16
Author(s):  
Stefan Koch ◽  
Mürvet Ozan Özgür

This study focuses on the attitudes of venture capital companies with regard to investments into open source software-based start-ups in Turkey, with a special focus on their preference for different possible business models which these start-ups could be based on. A series of in-depth qualitative interviews has been done with managers of venture capital firms with an IT focus operating in Turkey. Currently, no investments proposal have been received, and an analysis of barriers has resulted in showing lack of support from institutions (especially universities) and insufficient community support, which blocks the development of related expertise. On the other hand, interest exists based on perceptions of open source software-based new ventures as more innovative with higher probability of returns when compared to their proprietary counterparts. This confirms prior theories. For future applications, venture capital firms would prefer dual or hosted business models when investing. Another major outcome is that researchers should step back and focus attention on reasons related with the immaturity of open source-based projects in Turkey.


2016 ◽  
Vol 13 (1) ◽  
pp. 121-140 ◽  
Author(s):  
David Y. Chan ◽  
Alexander Kogan

ABSTRACT This is a hands-on introductory practical data analytics teaching case that can be used in an auditing or related course. Students will learn about data attributes, data creation, structured query language (SQL), basic statistics, and performing basic audit procedures using analytics by utilizing the open source software R. Instructors can use this case for an in-class discussion or an independent out-of-class assignment. A solutions guide is available in the Teaching Notes. Multimedia files are available for download, see Appendix B.


2020 ◽  
Vol 39 (3) ◽  
Author(s):  
Denise Quintel ◽  
Robert Wilson

When selecting a web analytics tool, academic libraries have traditionally turned to Google Analytics for data collection to gain insights into the usage of their web properties. As the valuable field of data analytics continues to grow, concerns about user privacy rise as well, especially when discussing a technology giant like Google. In this article, the authors explore the feasibility of using Matomo, a free and open-source software application, for web analytics in their library’s discovery layer. Matomo is a web analytics platform designed around user-privacy assurances. This article details the installation process, makes comparisons between Matomo and Google Analytics, and describes how an open-source analytics platform works within a library-specific application, EBSCO’s Discovery Service.


Sign in / Sign up

Export Citation Format

Share Document