Challenges in NoSQL-Based Distributed Data Storage: A Systematic Literature Review

Ramzan;  Bajwa;  Kazmi;  Amna

doi:10.3390/electronics8050488

Electronics ◽

10.3390/electronics8050488 ◽

2019 ◽

Vol 8 (5) ◽

pp. 488 ◽

Cited By ~ 1

Author(s):

Ramzan ◽

Bajwa ◽

Kazmi ◽

Amna

Keyword(s):

Load Balancing ◽

Literature Review ◽

Query Processing ◽

Data Storage ◽

Systematic Literature Review ◽

Quality Attributes ◽

Distributed Data ◽

Distributed Data Storage ◽

Art Research ◽

Indexing Technique

Key-value stores (KVSs) are the simplest and most flexible model of NoSQL databases and have become highly popular over the last few years due to salient features such as availability, portability, reliability, and low operational cost. From a software engineering perspective, the chief obstacle for KVSs is achieving the software quality attributes (consistency, throughput, latency, security, performance, load balancing, and query processing). The presented research is a Systematic Literature Review (SLR) that surveys state-of-the-art research in the KVS domain and, in doing so, identifies the major challenges and solutions. This work reviews 45 papers published between 2010 and 2018 that were found to be closely relevant to our study area. The results show that performance is addressed in 31% of the studies, consistency in 20%, latency and throughput in 16%, query processing in 13%, security in 11%, and load balancing in 9%. Different models are used for execution. The indexing technique was used in 20% of the studies, the hashing technique in 13%, the caching and security techniques together in 9%, the batching technique in 5%, the encoding techniques and Paxos technique together in 4%, and other techniques in 36%. This systematic review will enable researchers to design key-value stores as efficient storage. Regarding future collaborations, trust and privacy are the quality attributes that can be addressed; KVS is an emerging facet due to its widespread popularity, opening the way to deploy it with proper protection.

A novel approach for energy- and memory-efficient data loss prevention to support Internet of Things networks

International Journal of Distributed Sensor Networks ◽

10.1177/1550147720929823 ◽

2020 ◽

Vol 16 (6) ◽

pp. 155014772092982

Author(s):

Pooya Hejazi ◽

Gianluigi Ferrari

Keyword(s):

Internet Of Things ◽

Load Balancing ◽

Data Storage ◽

Optimization Problem ◽

Mixed Integer ◽

Optimization Approach ◽

Distributed Data ◽

Data Loss ◽

Distributed Data Storage ◽

Wide Range

The Internet of Things (IoT) integrates various technologies, including wireless sensor networks, edge computing, and cloud computing, to support a wide range of applications such as environmental monitoring and disaster surveillance. In these types of applications, IoT devices operate with limited battery, communication bandwidth, processing, and memory capacities. In this context, load balancing, fault tolerance, and energy and memory efficiency are among the most important issues related to data dissemination in IoT networks. To cope with these issues, two main approaches, data-centric storage and distributed data storage, have been proposed in the literature. Both approaches suffer from data loss due to memory and/or energy depletion in the storage nodes. Although several techniques have been proposed to overcome these problems, the proposed solutions typically focus on one issue at a time. In this article, we propose a cross-layer optimization approach to increase memory and energy efficiency as well as support load balancing. The optimization problem is a mixed-integer nonlinear programming problem, and we solve it using a genetic algorithm. Moreover, we integrate the data-centric storage features into distributed data storage mechanisms and present a novel heuristic approach, denoted as Collaborative Memory and Energy Management, to solve the underlying optimization problem. We also propose analytical and simulation frameworks for performance evaluation. Our results show that the proposed method outperforms the existing approaches in various IoT scenarios.
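
As a rough illustration of how a genetic algorithm can search an integer assignment problem with a nonlinear objective, the sketch below places data items on storage nodes. It is not the authors' formulation: the objective `imbalance_cost`, the overflow penalty weight, and every parameter name are assumptions made for this example, and energy is not modelled at all.

```python
import random


def imbalance_cost(assignment, sizes, capacities):
    """Hypothetical objective: sum of squared fill ratios plus an overflow penalty."""
    load = [0.0] * len(capacities)
    for item, node in enumerate(assignment):
        load[node] += sizes[item]
    cost = 0.0
    for used, cap in zip(load, capacities):
        cost += (used / cap) ** 2          # nonlinear term, favors evenly filled nodes
        if used > cap:
            cost += 100.0 * (used - cap)   # assumed penalty for exceeding node memory
    return cost


def genetic_search(sizes, capacities, pop_size=30, generations=60,
                   mutation_rate=0.1, seed=0):
    """Return (best_assignment, best_cost_per_generation). Needs at least 2 items."""
    rng = random.Random(seed)
    n_items, n_nodes = len(sizes), len(capacities)

    def fitness(candidate):
        return imbalance_cost(candidate, sizes, capacities)

    # Each individual is a list of integers: the node index chosen for each item.
    population = [[rng.randrange(n_nodes) for _ in range(n_items)]
                  for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=fitness)
        history.append(fitness(population[0]))
        survivors = population[: pop_size // 2]   # elitism: best half kept unchanged
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            cut = rng.randrange(1, n_items)        # one-point crossover
            child = a[:cut] + b[cut:]
            # Mutation: occasionally move an item to a random node.
            child = [rng.randrange(n_nodes) if rng.random() < mutation_rate else gene
                     for gene in child]
            children.append(child)
        population = survivors + children
    best = min(population, key=fitness)
    return best, history + [fitness(best)]


if __name__ == "__main__":
    best, history = genetic_search([4, 3, 3, 2, 2, 1, 1], [8, 8, 8])
    print("assignment:", best)
    print("cost of first and last generation:", history[0], history[-1])
```

Because the best half of each generation survives unchanged, the best cost never gets worse from one generation to the next. A realistic model would replace `imbalance_cost` with the energy and memory terms of the actual problem.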

Activity of public control entities and development of distributed computing and distributed data storage systems

Journal of Law and Administration ◽

10.24833/2073-8420-2018-1-46-14-22 ◽

2018 ◽

pp. 14-22

Author(s):

D. V. Gribanov

Keyword(s):

Distributed Computing ◽

Data Storage ◽

Storage Systems ◽

Legal Regulation ◽

Distributed Data ◽

Distributed Data Storage ◽

Public Control ◽

Blockchain Technology ◽

Legal Method ◽

Digital Assets

Introduction. This article is devoted to the legal regulation of digital asset turnover and to the possibilities of using distributed computing and distributed data storage systems in the activities of public authorities and entities of public control. The author notes that some national and foreign scientists who study “blockchain” technology (distributed computing and distributed data storage systems) emphasize its usefulness in different activities. The data validation procedure for digital transactions and the legal regulation of the creation, issuance and turnover of digital assets need further attention.

Materials and methods. The research is based on general scientific methods (analysis, analogy, comparison) and on particular methods of cognition of legal phenomena and processes (interpretation of legal rules, the technical legal method, the formal legal method and the formal logical method).

Results of the study. The author’s analysis identified the following advantages of using “blockchain” technology in the sphere of public control: a particular validation system; data once entered in the distributed data storage system cannot be erased or forged; absolute transparency of the succession of actions while exercising governing powers; and automatic repetition of recurring actions. The need for fivefold validation of the exercise of governing powers is substantiated. The author stresses that fivefold validation shall ensure comprehensive control over the exercise of powers by civil society, the entities of public control and the Russian Federation as a federal state holding sovereignty over its territory. The author has also conducted a brief analysis of judicial decisions concerning digital transactions.

Discussion and conclusion. The use of a distributed data storage system makes it easier to exercise control by decreasing the risks of forgery, replacement or deletion of data. The author suggests defining a digital transaction not only as actions with digital assets, but also as actions that modify or add information about legal facts in order to record it in distributed data storage systems. The author suggests using distributed data storage systems for independent validation of information about the activities of the bodies of state authority. In the author’s opinion, application of “blockchain” technology may result not only in more efficient public control, but also in the creation of a new form of public control: automatic control. It is concluded that there is currently no legislative basis for regulating legal relations concerning distributed data storage.

An information model of interagency communication based on distributed data storage

Proceedings of the International Conference on Electronic Governance and Open Society Challenges in Eurasia - EGOSE '16 ◽

10.1145/3014087.3014097 ◽

2016 ◽

Cited By ~ 3

Author(s):

Yuri P. Lipuntsov

Keyword(s):

Data Storage ◽

Information Model ◽

Distributed Data ◽

Distributed Data Storage

A modified cryptographic approach for securing distributed data storage in cloud computing

2017 International Conference on Networks & Advances in Computational Technologies (NetACT) ◽

10.1109/netact.2017.8076755 ◽

2017 ◽

Cited By ~ 3

Author(s):

M. Rajasekhar Reddy ◽

R Akilandeswari ◽

S. Priyadarshini ◽

B. Karthikeyan ◽

E. Ponmani

Keyword(s):

Cloud Computing ◽

Data Storage ◽

Distributed Data ◽

Distributed Data Storage

The Use of Distributed Data Storage and Processing Systems in Bioinformatic Data Analysis

Beyond Databases, Architectures and Structures. Facing the Challenges of Data Proliferation and Growing Variety - Communications in Computer and Information Science ◽

10.1007/978-3-319-99987-6_2 ◽

2018 ◽

pp. 18-32

Author(s):

Michał Bochenek ◽

Kamil Folkert ◽

Roman Jaksik ◽

Michał Krzesiak ◽

Marcin Michalak ◽

...

Keyword(s):

Data Analysis ◽

Data Storage ◽

Distributed Data ◽

Distributed Data Storage

An Architecture for Distributed Electronic Documents Storage in Decentralized Blockchain B2B Applications

Computers ◽

10.3390/computers10110142 ◽

2021 ◽

Vol 10 (11) ◽

pp. 142

Author(s):

Obadah Hammoud ◽

Ivan Tarkhanov ◽

Artyom Kosmarski

Keyword(s):

Distributed Systems ◽

Data Storage ◽

Distributed Storage ◽

Distributed Data ◽

Erasure Coding ◽

Distributed Data Storage ◽

Electronic Documents ◽

File Storage ◽

Load Balancer ◽

The Cost

This paper investigates the problem of distributed storage of electronic documents (both metadata and files) in decentralized blockchain-based B2B systems (DApps). The need to reduce the cost of implementing such systems and the insufficient elaboration of the issue of storing big data in DLT are considered. An approach for building such systems is proposed, which optimizes the size of the required storage (by using erasure coding) while providing secure data storage in the geographically distributed systems of a company, or within a consortium of companies. The novelty of this solution is that we are the first to combine enterprise DLT with distributed file storage in which the availability of files is controlled. The results of our experiment demonstrate that the speed of the described DApp is comparable to known B2C torrent projects, and subsequently justify the choice of Hyperledger Fabric and Ethereum Enterprise for its use. The test results show that public blockchain networks are not suitable for creating such a B2B system. The proposed system solves the main challenges of distributed data storage by grouping data into clusters and managing them with a load balancer, while preventing data tampering using a blockchain network. The considered DApp storage methodology scales horizontally with ease in terms of distributed file storage and can be deployed on cloud computing technologies, while minimizing the required storage space. We compare this approach with known methods of file storage in distributed systems, including central storage, torrents, IPFS, and Storj. The reliability of this approach is calculated and the result is compared to traditional solutions based on full backup.
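
The trade-off between erasure coding and full backup mentioned above can be illustrated with a small calculation. The sketch below is not taken from the paper: it assumes a generic (k, m) code in which any k of the k + m stored fragments suffice to rebuild a file, and it assumes nodes fail independently with the same availability `p`. The function names are hypothetical.

```python
from math import comb


def storage_overhead(k, m):
    """Stored bytes per byte of original data for k data and m parity fragments."""
    return (k + m) / k


def availability(k, m, p):
    """Probability that at least k of the k + m fragments are reachable.

    Assumes every fragment sits on a different node and that each node is
    independently available with probability p (a simplifying assumption).
    """
    n = k + m
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


if __name__ == "__main__":
    p = 0.9  # assumed per-node availability, for illustration only
    # Full backup with 3 copies is the special case k = 1, m = 2.
    print("3 full copies:", storage_overhead(1, 2), availability(1, 2, p))
    # A (4, 2) erasure code stores 1.5 bytes per original byte.
    print("(4, 2) code:  ", storage_overhead(4, 2), availability(4, 2, p))
```

Under these assumptions full replication is simply the case k = 1, so both schemes can be compared with the same two functions. Correlated failures, which matter in geographically distributed deployments, are outside this simple model.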

SOME EXPERIMENTS ON THE CONSTRUCTION AND ANALYSIS OF THE DE BRUIJN GRAPH

System Informatics ◽

10.31144/si.2307-6410.2020.n16.p47-56 ◽

2020 ◽

Author(s):

Alexander G. Marchuk ◽

Sergey Nikolaevich Troshkov

Keyword(s):

Data Storage ◽

Parallel Computations ◽

De Bruijn Graph ◽

Distributed Data ◽

Distributed Data Storage ◽

De Bruijn

This paper describes the experience of solving the problem of finding chains in the De Bruijn graph using parallel computations and distributed data storage.
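
For readers unfamiliar with the structure, the following single-machine sketch shows what finding chains in a De Bruijn graph can look like. It is an assumption-laden illustration, not the authors' parallel or distributed implementation: nodes are (k-1)-mers, edge multiplicity is ignored, a chain is taken to mean a maximal non-branching path, and the function names are hypothetical.

```python
from collections import defaultdict


def de_bruijn(reads, k):
    """Build a De Bruijn graph: each k-mer adds an edge prefix -> suffix."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def non_branching_chains(graph):
    """Return maximal paths whose interior nodes have in-degree 1 and out-degree 1.

    Isolated cycles (every node is interior) are not reported by this sketch.
    """
    indegree = defaultdict(int)
    for successors in graph.values():
        for node in successors:
            indegree[node] += 1
    nodes = set(graph) | set(indegree)

    def is_interior(node):
        return indegree[node] == 1 and len(graph.get(node, ())) == 1

    chains = []
    for start in nodes:
        if is_interior(start):
            continue                      # chains begin only at branching or end nodes
        for nxt in graph.get(start, ()):
            path = [start, nxt]
            while is_interior(nxt):       # extend through unambiguous nodes
                nxt = next(iter(graph[nxt]))
                path.append(nxt)
            chains.append(path)
    return chains


if __name__ == "__main__":
    g = de_bruijn(["ACGTT", "ACGTA"], k=3)
    for chain in sorted(non_branching_chains(g)):
        print(" -> ".join(chain))
```

In a distributed setting the adjacency map would be partitioned across machines, which is the part this sketch leaves out entirely.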

Identity based Distributed Data Storage using Digital Signature and Diffie Hellmann Key Exchange Mechanism

International Journal of Computer Applications ◽

10.5120/16412-6002 ◽

2014 ◽

Vol 94 (14) ◽

pp. 27-30

Author(s):

R. Allvijay ◽

V. M. Sivagami

Keyword(s):

Data Storage ◽

Digital Signature ◽

Key Exchange ◽

Exchange Mechanism ◽

Distributed Data ◽

Distributed Data Storage ◽

Identity Based

The Using of gLite Middleware for Organization of Distributed Data Storage

Izvestiya of Saratov University New Series Series Mathematics Mechanics Informatics ◽

10.18500/1816-9791-2009-9-1-85-91 ◽

2009 ◽

Vol 9 (1) ◽

pp. 85-91

Author(s):

V.M. Solovyev ◽

M.G. Scherbakov

Keyword(s):

Data Storage ◽

Distributed Data ◽

Distributed Data Storage

Fault-tolerant load balancing in cloud computing: A systematic literature review

IEEE Access ◽

10.1109/access.2021.3139730 ◽

2021 ◽

pp. 1-1

Author(s):

Vahid Mohammadian ◽

Aso Darwesh

Keyword(s):

Cloud Computing ◽

Load Balancing ◽

Literature Review ◽

Systematic Literature Review ◽

Fault Tolerant
