scholarly journals baRcodeR with PyTrackDat: Open-source labelling and tracking of biological samples for repeatable science

2018 ◽  
Author(s):  
Yihan Wu ◽  
David R. Lougheed ◽  
Stephen C. Lougheed ◽  
Kristy Moniz ◽  
Virginia K. Walker ◽  
...  

AbstractRepeatable experiments with accurate data collection and reproducible analyses are fundamental to the scientific method but may be difficult to achieve in practice. Several flexible, open-source tools developed for the R and Python coding environments aid the reproducibility of data wrangling and analysis in scientific research. In contrast, analogous tools are generally lacking for earlier stages, such as systematic labelling and processing of field samples with hierarchical structure (e.g. time points of individuals from multiple lines or populations) or curating heterogenous data collected by different researchers over several years. Such tools are critical for modern research given trends toward globally distributed collaborators using higher-throughput technologies. As a step toward improving repeatability of methods for the collection of biological samples, and curation of biological data, we introduce the R package baRcodeR and the PyTrackDat pipeline in Python. The baRcodeR package provides tools for generating biologically informative, hierarchical labels with digitally encoded 2D barcodes that can be printed and scanned using low-cost commercial hardware. The PyTrackDat pipeline integrates with baRcodeR output to build a web interface for sample management and tracking along with data collection and curation. We briefly describe the application of principles from baRcodeR and PyTrackDat in three large research projects, which demonstrate their value to (i) help document sampling methods, (ii) facilitate collaboration and (iii) reduce opportunities for human errors and omissions that could otherwise propagate through downstream data analysis to compromise biological inference.

2018 ◽  
Author(s):  
Alexander Williams

Typical buoyancy engine-based Underwater Gliders are highly-complex and cost-prohibitive, generally ranging in price-point from 50,000USD to 250,000USD. A low-cost, Open-Source Underwater Glider (OSUG) was thus developed as a low-cost data-collection and research tool. This glider, OSUG, is a sub-1000USD, 1.2m long, 12kg, and capable of 50-hours of continuous operation. Its efficiency, and use-case feasibility were evaluated. The buoyancy engine is constructed of medical grade syringes that pull in water from the environment to simplify the system and lower costs. Direction of locomotion is controlled by altering pitch and roll via changing the center-of-mass. The system was designed to be primarily three-dimensionally (3D) printed and fully-modular to limit cost and ensure reproducibility.


Author(s):  
Lars Yndal Sørensen ◽  
Lars Toft Jacobsen ◽  
John Paulin Hansen

This paper present a platform for airborne sensor applications using low-cost, open-source components carried by an easy-to-fly unmanned aircraft vehicle (UAV). The system, available in open-source [1], is designed for researchers, students and makers for a broad range of their exploration and data-collection needs. The main contribution is the extensible architecture for modularized airborne sensor deployment and real-time data visualisation. Our open-source Android application provides data collection, flight path definition and map tools. Total cost of the system is below 800 dollars. The flexibility of the system are illustrated by mapping the location of Bluetooth beacons (iBeacons) on a ground field and by measuring water temperatures in a lake.


2020 ◽  
Vol 134 ◽  
pp. 104832
Author(s):  
Brandon Feenstra ◽  
Ashley Collier-Oxandale ◽  
Vasileios Papapostolou ◽  
David Cocker ◽  
Andrea Polidori

HardwareX ◽  
2020 ◽  
Vol 8 ◽  
pp. e00138
Author(s):  
Audun D. Myers ◽  
Joshua R. Tempelman ◽  
David Petrushenko ◽  
Firas A. Khasawneh

2020 ◽  
Vol 52 ◽  
pp. 55-61
Author(s):  
Ettore Potente ◽  
Cosimo Cagnazzo ◽  
Alessandro Deodati ◽  
Giuseppe Mastronuzzi

2020 ◽  
Author(s):  
Andrew Fang ◽  
Jonathan Kia-Sheng Phua ◽  
Terrence Chiew ◽  
Daniel De-Liang Loh ◽  
Lincoln Ming Han Liow ◽  
...  

BACKGROUND During the Coronavirus Disease 2019 (COVID-19) outbreak, community care facilities (CCF) were set up as temporary out-of-hospital isolation facilities to contain the surge of cases in Singapore. Confined living spaces within CCFs posed an increased risk of communicable disease spread among residents. OBJECTIVE This inspired our healthcare team managing a CCF operation to design a low-cost communicable disease outbreak surveillance system (CDOSS). METHODS Our CDOSS was designed with the following considerations: (1) comprehensiveness, (2) efficiency through passive reconnoitering from electronic medical record (EMR) data, (3) ability to provide spatiotemporal insights, (4) low-cost and (5) ease of use. We used Python to develop a lightweight application – Python-based Communicable Disease Outbreak Surveillance System (PyDOSS) – that was able perform syndromic surveillance and fever monitoring. With minimal user actions, its data pipeline would generate daily control charts and geospatial heat maps of cases from raw EMR data and logged vital signs. PyDOSS was successfully implemented as part of our CCF workflow. We also simulated a gastroenteritis (GE) outbreak to test the effectiveness of the system. RESULTS PyDOSS was used throughout the entire duration of operation; the output was reviewed daily by senior management. No disease outbreaks were identified during our medical operation. In the simulated GE outbreak, PyDOSS was able to effectively detect an outbreak within 24 hours and provided information about cluster progression which could aid in contact tracing. The code for a stock version of PyDOSS has been made publicly available. CONCLUSIONS PyDOSS is an effective surveillance system which was successfully implemented in a real-life medical operation. With the system developed using open-source technology and the code made freely available, it significantly reduces the cost of developing and operating CDOSS and may be useful for similar temporary medical operations, or in resource-limited settings.


2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Yixin Kong ◽  
Ariangela Kozik ◽  
Cindy H. Nakatsu ◽  
Yava L. Jones-Hall ◽  
Hyonho Chun

Abstract A latent factor model for count data is popularly applied in deconvoluting mixed signals in biological data as exemplified by sequencing data for transcriptome or microbiome studies. Due to the availability of pure samples such as single-cell transcriptome data, the accuracy of the estimates could be much improved. However, the advantage quickly disappears in the presence of excessive zeros. To correctly account for this phenomenon in both mixed and pure samples, we propose a zero-inflated non-negative matrix factorization and derive an effective multiplicative parameter updating rule. In simulation studies, our method yielded the smallest bias. We applied our approach to brain gene expression as well as fecal microbiome datasets, illustrating the superior performance of the approach. Our method is implemented as a publicly available R-package, iNMF.


Sign in / Sign up

Export Citation Format

Share Document