Software-Defined Infrastructure for Decentralized Data Lifecycle Governance: Principled Design and Open Challenges

Author(s):  
Gang Huang ◽  
Chaoran Luo ◽  
Kaidong Wu ◽  
Yun Ma ◽  
Ying Zhang ◽  
...  
2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Syed Iftikhar Hussain Shah ◽  
Vassilios Peristeras ◽  
Ioannis Magnisalis

AbstractThe public sector, private firms, business community, and civil society are generating data that is high in volume, veracity, velocity and comes from a diversity of sources. This kind of data is known as big data. Public Administrations (PAs) pursue big data as “new oil” and implement data-centric policies to transform data into knowledge, to promote good governance, transparency, innovative digital services, and citizens’ engagement in public policy. From the above, the Government Big Data Ecosystem (GBDE) emerges. Managing big data throughout its lifecycle becomes a challenging task for governmental organizations. Despite the vast interest in this ecosystem, appropriate big data management is still a challenge. This study intends to fill the above-mentioned gap by proposing a data lifecycle framework for data-driven governments. Through a Systematic Literature Review, we identified and analysed 76 data lifecycles models to propose a data lifecycle framework for data-driven governments (DaliF). In this way, we contribute to the ongoing discussion around big data management, which attracts researchers’ and practitioners’ interest.


2006 ◽  
Vol 41 (8) ◽  
pp. 34-43 ◽  
Author(s):  
J. Ángel Velázquez-Iturbide
Keyword(s):  

2021 ◽  
pp. 80-94
Author(s):  
Khadija Ouazzani-Touhami ◽  
◽  
Mohammed EL Arass ◽  
Nissrine Souissi

This paper investigates the potential of discrete event simulation for the analysis and evaluation of public strategies and policies and discusses the opportunities offered by the use of a simulation project lifecycle. Following this cycle, we evaluate a public policy use case, the voluntary departure operation initiated in Morocco in 2005, and analyses the success rate of this operation, as well as its impact on the Moroccan pension fund, and this for the period from 2005 to 2025. The results of this simulation highlighted, as already indicated in the Court of Auditors' reports, the irrelevance of this operation, particularly from a financial point of view.


2021 ◽  
Author(s):  
Kerstin Lehnert ◽  
Daven Quinn ◽  
Basil Tikoff ◽  
Douglas Walker ◽  
Sarah Ramdeen ◽  
...  

<div> <p>Management of geochemical data needs to consider the sequence of phases in the lifecycle of these data from field to lab to publication to archive. It also needs to address the large variety of chemical properties measured; the wide range of materials that are analyzed; the different ways, in which these materials may be prepared for analysis; the diversity of analytical techniques and instrumentation used to obtain analytical results; and the many ways used to calibrate and correct raw data, normalize them to standard reference materials, and otherwise treat them to obtain meaningful and comparable results. In order to extract knowledge from the data, they are then integrated and compared with other measurements, formatted for visualization, statistical analysis, or model generation, and finally cleaned and organized for publication and deposition in a data repository. Each phase in the geochemical data lifecycle has its specific workflows and metadata that need to be recorded to fully document the provenance of the data so that others can reproduce the results.</p> </div><div> <p>An increasing number of software tools are developed to support the different phases of the geochemical data lifecycle. These include electronic field notebooks, digital lab books, and Jupyter notebooks for data analysis, as well as data submission forms and templates. These tools are mostly disconnected and often require manual transcription or copying and pasting of data and metadata from one tool to the other. In an ideal world, these tools would be connected so that field observations gathered in a digital field notebook, such as sample locations and sampling dates, can be seamlessly send to an IGSN Allocating Agent to obtain a unique sample identifier with a QR code with a single click. The sample metadata would be readily accessible for the lab data management system that allows the researchers to capture information about the sample preparation, and that connects to the instrumentation to capture instrument settings and the raw data. The data would then be seamlessly accessed by data reduction software, visualized, and further compared to data from global databases that can be directly accessed. Ultimately, a few clicks will allow the user to format the data for publication and archiving.</p> </div><div> <p>Several data systems that support different stages in the lifecycle of samples and sample-based geochemical data have now come together to explore the development of standardized interfaces and APIs and consistent data and metadata schemas to link their systems into an efficient pipeline for geochemical data from the field to the archive. These systems include StraboSpot (www.strabospot.org; data system for digital collection, storage, and sharing of both field and lab data), SESAR (<span>www.geosamples.org</span>; sample registry and allocating agent for IGSN), EarthChem (www.earthchem.org; publishers and repository for geochemical data), Sparrow (sparrow-data.org; data system to organize analytical data and track project- and sample-level metadata), IsoBank (isobank.org; repository for stable isotope data), and MacroStrat (macrostrat.org; collaborative platform for geological data exploration and integration).</p> </div>


Sign in / Sign up

Export Citation Format

Share Document