Multi-path Coverage of all Final States for Model-Based Testing Theory using Spark In-memory Design

2020 ◽  
Author(s):  
Wilfried Yves Hamilton Adoni ◽  
Moez Krichen ◽  
Tarik Nahhal ◽  
Abdeltif Elbyed

This paper presents an efficient and robust distributed framework for finite state machine coverage in the field of model-based testing theory. Covering all final states in a large-scale automaton is inherently compute-intensive and memory-exhausting, with impractical time complexity due to the explosion of the number of states. It is therefore important to propose a faster solution that reduces the time complexity by exploiting big data concepts based on Spark RDD computation. To cope with this situation, we propose a parallel and distributed approach based on the Spark in-memory design, which exploits the A* algorithm for optimal coverage. Experiments performed on a multi-node cluster show that the proposed framework achieves a significant gain in computation time.
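The abstract above combines A* search with state coverage of a finite state machine. As a rough illustration of that building block, here is a minimal A* sketch that finds a shortest labelled path from an initial state to any final state; the unit transition cost, the zero default heuristic, and all names are illustrative assumptions, not the paper's actual Spark-based implementation:

```python
import heapq

def astar_cover(transitions, start, finals, h=lambda s: 0):
    """A* search for a shortest transition path from `start` to any
    final state of a finite state machine.
    transitions: dict state -> list of (label, next_state)
    h: admissible heuristic estimate of remaining steps (0 = Dijkstra).
    Returns the list of transition labels along an optimal path, or None."""
    frontier = [(h(start), 0, start, [])]      # (f, g, state, path)
    best = {start: 0}                          # cheapest known cost per state
    while frontier:
        f, g, state, path = heapq.heappop(frontier)
        if state in finals:
            return path
        for label, nxt in transitions.get(state, []):
            ng = g + 1                         # unit cost per transition
            if ng < best.get(nxt, float("inf")):
                best[nxt] = ng
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt, path + [label]))
    return None
```

A distributed variant would partition the automaton across Spark workers, but this sequential core conveys the optimal-coverage idea.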


Author(s):  
Linwei Li ◽  
Liangchen Guo ◽  
Zhenying He ◽  
Yinan Jing ◽  
X. Sean Wang

Text clustering is a widely studied problem in the text mining domain. Clustering algorithms based on the Dirichlet Multinomial Mixture (DMM) model have shown good performance in coping with high-dimensional sparse text data, obtaining reasonable results in both clustering accuracy and computational efficiency. However, the time complexity of DMM model training is proportional to the average document length and the number of clusters, making it inefficient to scale up to long texts and large corpora, which are common in real-world applications such as document organization, retrieval, and recommendation. In this paper, we leverage a symmetric prior setting for the Dirichlet distribution and build indices to decrease the time complexity of the sampling-based training for DMM from O(K∗L) to O(K∗U), where K is the number of clusters, L the average length of a document, and U the average number of unique words in each document. We introduce a Metropolis-Hastings sampling algorithm, which further reduces the sampling time complexity from O(K∗U) to O(U) in the nearly-converged training stages. Moreover, we also parallelize the DMM model training to obtain further acceleration by using an uncollapsed Gibbs sampler. We combine all these optimizations into a highly efficient implementation, called X-DMM, which enables the DMM model to scale up for long and large-scale text clustering. We evaluate the performance of X-DMM on several real-world datasets, and the experimental results show that X-DMM achieves a substantial speedup compared with existing state-of-the-art algorithms without degrading clustering accuracy.
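The O(K∗L) → O(K∗U) reduction mentioned above comes from scoring each document against the clusters via its unique-word counts rather than token by token. The following sketch shows the per-document scoring step of a standard collapsed DMM Gibbs sampler with symmetric Dirichlet priors; the data-structure names (`m_z`, `n_z`, `n_zw`) are illustrative assumptions, not the X-DMM code:

```python
import math
from collections import Counter

def dmm_doc_scores(doc, K, alpha, beta, V, m_z, n_z, n_zw):
    """Unnormalized log-probabilities of assigning one document to each of
    K clusters under a collapsed DMM sampler with symmetric Dirichlet priors.
    m_z[k]  -- number of documents currently in cluster k
    n_z[k]  -- total word tokens currently in cluster k
    n_zw[k] -- dict mapping word -> count in cluster k
    V       -- vocabulary size
    Iterating over the document's unique words with their counts (via
    log-gamma ratios) makes the cost O(K*U) instead of O(K*L)."""
    counts = Counter(doc)                       # U unique words
    L = len(doc)
    scores = []
    for k in range(K):
        s = math.log(m_z[k] + alpha)            # cluster-popularity term
        for w, c in counts.items():             # O(U) per cluster
            nkw = n_zw[k].get(w, 0)
            s += math.lgamma(nkw + beta + c) - math.lgamma(nkw + beta)
        s -= math.lgamma(n_z[k] + V * beta + L) - math.lgamma(n_z[k] + V * beta)
        scores.append(s)
    return scores
```

A full sampler would normalize these scores, draw a cluster, and update the counts; the Metropolis-Hastings variant in the paper avoids even the O(K) loop near convergence.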


2013 ◽  
Vol 2013 ◽  
pp. 1-7 ◽  
Author(s):  
DongSheng Yin ◽  
DeBo Liu

Generally, the time complexity of algorithms for content-based image retrieval is extremely high. In order to retrieve images from large-scale databases efficiently, a new retrieval method based on the Hadoop distributed framework is proposed. First, a database of image features is built using the Speeded Up Robust Features (SURF) algorithm and Locality-Sensitive Hashing; the search is then performed on the Hadoop platform in a specially designed parallel way. Considerable experimental results show that the method can effectively retrieve images by content on large-scale clusters and image sets.
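The feature index described above pairs descriptors with Locality-Sensitive Hashing so that similar images land in the same bucket. A minimal sketch of one common LSH family, random-hyperplane (sign-of-projection) hashing, is shown below; the 64-dimensional descriptors and 8-plane setup are illustrative assumptions, not the paper's configuration:

```python
import numpy as np

def lsh_keys(features, planes):
    """Random-hyperplane LSH: project each feature vector onto a fixed set
    of random hyperplanes and keep only the signs, producing a bit-string
    bucket key. Vectors with a small angle between them tend to collide."""
    bits = features @ planes.T > 0             # (n_vectors, n_planes) booleans
    return ["".join("1" if b else "0" for b in row) for row in bits]

# hypothetical 64-dim SURF-like descriptors hashed with 8 hyperplanes
rng = np.random.default_rng(0)
planes = rng.standard_normal((8, 64))
db = rng.standard_normal((1000, 64))
keys = lsh_keys(db, planes)                    # one bucket key per image feature
```

At query time, only the database entries sharing the query's bucket key (or a few neighboring keys) need to be compared exactly, which is what makes the parallel Hadoop search tractable.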


2014 ◽  
Vol 56 (7) ◽  
pp. 749-762 ◽  
Author(s):  
Gerson Sunyé ◽  
Eduardo Cunha de Almeida ◽  
Yves Le Traon ◽  
Benoit Baudry ◽  
Jean-Marc Jézéquel

2020 ◽  
Vol 140 (4) ◽  
pp. 272-280
Author(s):  
Wataru Ohnishi ◽  
Hiroshi Fujimoto ◽  
Koichi Sakata

2018 ◽  
Author(s):  
Pavel Pokhilko ◽  
Evgeny Epifanovsky ◽  
Anna I. Krylov

Using a single-precision floating-point representation reduces the size of the data and the computation time by a factor of two relative to the double precision conventionally used in electronic structure programs. For large-scale calculations, such as those encountered in many-body theories, the reduced memory footprint alleviates memory and input/output bottlenecks. The reduced data size can lead to additional gains due to improved parallel performance on CPUs and various accelerators. However, using single precision can potentially reduce the accuracy of computed observables. Here we report an implementation of coupled-cluster and equation-of-motion coupled-cluster methods with single and double excitations in single precision. We consider both the standard implementation and one using Cholesky decomposition or resolution-of-the-identity representations of the electron-repulsion integrals. Numerical tests illustrate that when single precision is used in correlated calculations, the loss of accuracy is insignificant, and a pure single-precision implementation can be used for computing energies, analytic gradients, excited states, and molecular properties. In addition to pure single-precision calculations, our implementation allows one to follow a single-precision calculation by clean-up iterations, fully recovering double-precision results while retaining significant savings.
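The "clean-up iterations" idea above, doing the bulk of the work in single precision and then recovering double-precision accuracy with a few cheap correction steps, is the same pattern as classic mixed-precision iterative refinement for linear systems. The sketch below illustrates that general pattern on Ax = b; it is an analogy, not the authors' coupled-cluster solver, and for simplicity it refactorizes instead of reusing a stored single-precision factorization:

```python
import numpy as np

def refine_solve(A, b, iters=3):
    """Solve Ax = b mostly in single precision, then run double-precision
    'clean-up' (iterative refinement) steps: compute the residual in double
    precision, solve for a correction in single precision, and update x."""
    A32 = A.astype(np.float32)
    x = np.linalg.solve(A32, b.astype(np.float32)).astype(np.float64)
    for _ in range(iters):
        r = b - A @ x                           # residual in double precision
        dx = np.linalg.solve(A32, r.astype(np.float32))
        x = x + dx.astype(np.float64)           # correction from cheap solve
    return x
```

For well-conditioned problems, each clean-up step shrinks the error by roughly the single-precision rounding level, so a handful of steps recovers double-precision results while the expensive solves stay in float32.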


2011 ◽  
Vol 34 (6) ◽  
pp. 1012-1028 ◽  
Author(s):  
Huai-Kou MIAO ◽  
Sheng-Bo CHEN ◽  
Hong-Wei ZENG
