An Efficient Algorithm for Automating Classification of Chemical Reactions into Classes in Ugi’s Reaction Scheme

Author(s):  
Sanjay Ram ◽  
Somnath Pal

There are two approaches to the classification of chemical reactions: model-driven and data-driven. In this paper, the authors develop an efficient algorithm, based on the model-driven approach of Ugi and co-workers, for classifying chemical reactions. The algorithm takes the reaction matrix of a chemical reaction as input and outputs its appropriate class. Since reaction matrices are symmetric, a matrix implementation of Ugi's scheme using an upper/lower triangular matrix has O(n²) space complexity; the time complexity of such an implementation is O(n⁴) in both the worst and average cases. The proposed algorithm uses two fixed-size look-up tables in a novel way and requires only constant space; its time complexity is linear in both the worst and average cases.
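The space trade-off described above can be made concrete: a symmetric n × n reaction matrix needs only its upper triangle, i.e. n(n+1)/2 entries. The sketch below shows the standard index mapping for such triangular storage; it illustrates only the O(n²) baseline the abstract compares against, not the authors' constant-space look-up-table method.

```python
def triu_index(i, j, n):
    """Map (i, j) in a symmetric n x n matrix to an index into a
    1-D array holding only the upper triangle (n*(n+1)//2 entries)."""
    if i > j:            # symmetry: A[i][j] == A[j][i]
        i, j = j, i
    # row i starts at i*n - i*(i-1)//2; then offset by (j - i)
    return i * n - i * (i - 1) // 2 + (j - i)

def get(tri, i, j, n):
    return tri[triu_index(i, j, n)]

# Pack a symmetric 3x3 reaction-style matrix into its upper triangle.
full = [[0, 1, 0],
        [1, 0, 2],
        [0, 2, 0]]
n = 3
tri = [full[i][j] for i in range(n) for j in range(i, n)]
assert all(get(tri, i, j, n) == full[i][j]
           for i in range(n) for j in range(n))
```

Triangular packing halves the storage but is still quadratic in n, which is what the look-up-table approach avoids.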


Author(s):  
Shyantani Maiti ◽  
Sanjay Ram ◽  
Somnath Pal

The first step toward predicting the outcome of a chemical reaction is to classify existing chemical reactions; on that basis, the possible outcome of an unknown reaction can be predicted. There are two approaches to the classification of chemical reactions: model-driven and data-driven. In the model-driven approach, chemical structures are usually stored in a computer as molecular graphs, which can also be represented as matrices. The preferred matrix representation for storing a molecular graph is the bond-electron matrix (BE-matrix). Ugi and his co-workers showed that the reaction matrix (R-matrix) of a chemical reaction can be obtained from the BE-matrices of its educts and products. Ugi's scheme comprises 30 reaction classes, yet several reactions could not be classified into any of them. In this work, about 4000 reactions from The Chemical Thesaurus (a chemical reaction database) were studied, and 24 new classes emerged, leading to an extension of Ugi's scheme. An efficient algorithm based on the extended scheme has been developed for classifying chemical reactions. Since reaction matrices are symmetric, a matrix implementation of the extended Ugi scheme using a conventional upper/lower triangular matrix has O(n²) space complexity, and the time complexity of such an implementation is O(n²) in the worst case. The authors' proposed algorithm uses two fixed-size look-up tables in a novel way and requires only constant space. Although its worst-case time complexity is still O(n²), it outperforms the conventional matrix implementation when the number of atoms or components in the chemical reaction is 4 or more.
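The BE-/R-matrix relationship mentioned above (the R-matrix is the entrywise difference of the product and educt BE-matrices, per the Dugundji-Ugi model) can be sketched with toy values. The matrices below are purely illustrative placeholders for a schematic A-B + C → A + B-C change, not a reaction from The Chemical Thesaurus.

```python
# Toy BE-matrices, atoms ordered [A, B, C]: off-diagonal entries are
# bond orders, diagonal entries free-electron counts -- illustrative only.
be_educts = [
    [0, 1, 0],
    [1, 0, 0],
    [0, 0, 2],
]
be_products = [
    [2, 0, 0],
    [0, 0, 1],
    [0, 1, 0],
]

def r_matrix(be_in, be_out):
    """R-matrix of a reaction: entrywise difference of the product
    and educt BE-matrices."""
    n = len(be_in)
    return [[be_out[i][j] - be_in[i][j] for j in range(n)] for i in range(n)]

R = r_matrix(be_educts, be_products)
# R inherits symmetry from the (symmetric) BE-matrices.
assert all(R[i][j] == R[j][i] for i in range(3) for j in range(3))
```

The symmetry of R is what makes the triangular-storage comparison in the abstract meaningful.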


1996 ◽  
Vol 05 (01n02) ◽  
pp. 127-141 ◽  
Author(s):  
MONINDER SINGH

One of the main factors limiting the use of path consistency algorithms in real-life applications is their high space complexity. Han and Lee proposed a path consistency algorithm, PC-4, with O(n³a³) space complexity, which makes it practicable only for small problems. I present a new path consistency algorithm, PC-5, which has O(n³a²) space complexity while retaining the worst-case time complexity of PC-4. Moreover, the new algorithm exhibits a much better average-case time complexity. It is based on the idea (due to Bessière) that, at any time, only a minimal amount of support has to be found and recorded for a labeling to establish its viability; a new support must be sought only if the current support is eliminated. I also show that PC-5 can be improved further to yield an algorithm, PC-5++, with even better average-case performance and the same space complexity.
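The single-support idea that PC-5 builds on can be sketched in miniature. For brevity, the sketch below shows it for one binary constraint rather than full path consistency, and the names are hypothetical: each value records one support and resumes its scan only after that support is deleted, instead of recounting all supports as PC-4 does.

```python
def find_support(value, other_domain, allowed, start=0):
    """Scan other_domain from index `start` for the first value that,
    paired with `value`, satisfies the constraint `allowed`."""
    for k in range(start, len(other_domain)):
        if (value, other_domain[k]) in allowed:
            return k
    return None

dom_y = [1, 2, 3]
allowed = {(0, 2), (0, 3)}            # permitted (x, y) pairs
s = find_support(0, dom_y, allowed)   # first support found: y = 2
assert dom_y[s] == 2
# Suppose y = 2 is later pruned: resume the scan *after* the old support
# rather than restarting -- each domain value is examined at most once
# overall, which is what improves the average case.
s2 = find_support(0, dom_y, allowed, start=s + 1)
assert dom_y[s2] == 3
```

PC-5 applies the same resumable-scan discipline to path (pairwise) labelings, which is where the a³ → a² space saving comes from.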


2011 ◽  
Vol 03 (04) ◽  
pp. 457-471 ◽  
Author(s):  
B. BALAMOHAN ◽  
P. FLOCCHINI ◽  
A. MIRI ◽  
N. SANTORO

In a network environment supporting mobile entities (called robots or agents), a black hole is a harmful site that destroys any incoming entity without leaving any visible trace. The black hole search problem is the task of a team of k > 1 mobile entities, starting from the same safe location and executing the same algorithm, to determine within finite time the location of the black hole. In this paper, we consider the black hole search problem in asynchronous ring networks of n nodes, and focus on time complexity. It is known that any algorithm for black hole search in a ring requires at least 2(n - 2) time in the worst case. The best known algorithm achieves this bound with a team of n - 1 agents, with an average time cost of 2(n - 2), equal to the worst case. In this paper, we first show how the same number of agents, using 2 extra time units in the worst case, can solve the problem in only [Formula: see text] time on average. We then prove that the optimal average-case complexity of [Formula: see text] can be achieved without increasing the worst case by using 2(n - 1) agents. Finally, we design an algorithm that achieves asymptotically optimal worst- and average-case time complexities employing an optimal team of k = 2 agents, thus improving on earlier results that required O(n) agents.


2020 ◽  
Author(s):  
Ahsan Sanaullah ◽  
Degui Zhi ◽  
Shaojie Zhang

Durbin's PBWT, a scalable data structure for haplotype matching, has been successfully applied to identical-by-descent (IBD) segment identification and genotype imputation. Once the PBWT of a haplotype panel is constructed, it supports efficient retrieval of all shared long segments among all individuals (long matches) and efficient query between an external haplotype and the panel. However, the standard PBWT is an array-based static data structure and does not support dynamic updates of the panel. Here, we generalize the static PBWT to a dynamic data structure, d-PBWT, where the reverse prefix sorting at each position is represented by linked lists. We developed efficient algorithms for insertion and deletion of individual haplotypes. In addition, we verified that d-PBWT can support all algorithms of PBWT. In doing so, we systematically investigated variations of the set maximal match and long match query algorithms: while all of them have average-case time complexity independent of the database size, they differ in their worst-case complexity, in their linear-time dependence on the size of the genome, and in their reliance on additional data structures.
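The reverse prefix sorting at the heart of PBWT can be sketched as follows. This is the standard array-based construction (the static PBWT), shown only to illustrate the invariant that d-PBWT maintains with linked lists: the positional prefix array at each site is advanced by a stable partition on the allele.

```python
def pbwt_prefix_arrays(haplotypes):
    """Return the positional prefix arrays a_0..a_M for a panel of
    binary haplotype strings, all of the same length M. Array a_k
    orders haplotype indices by their reversed prefixes up to site k."""
    m = len(haplotypes[0])
    a = list(range(len(haplotypes)))
    arrays = [a[:]]
    for k in range(m):
        zeros = [i for i in a if haplotypes[i][k] == "0"]
        ones = [i for i in a if haplotypes[i][k] == "1"]
        a = zeros + ones      # stable partition preserves the sort invariant
        arrays.append(a[:])
    return arrays

panel = ["010", "110", "011", "000"]
arrays = pbwt_prefix_arrays(panel)
# After site 0, haplotypes starting with '0' (indices 0, 2, 3) precede
# the one starting with '1' (index 1), in their previous relative order.
assert arrays[1] == [0, 2, 3, 1]
```

In d-PBWT, each of these arrays becomes a linked list, so a haplotype can be spliced in or out of every position without rebuilding the arrays.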


Author(s):  
Adrijan Božinovski ◽  
George Tanev ◽  
Biljana Stojčevska ◽  
Veno Pačovski ◽  
Nevena Ackovska

This paper presents the time complexity analysis of the Binary Tree Roll algorithm. The time complexity is analyzed theoretically, and the results are then confirmed empirically. The theoretical analysis consists of finding recurrence relations for the time complexity and solving them using various methods. The empirical analysis consists of exhaustively testing all trees with a given number of nodes and counting the minimum and maximum steps necessary to complete the roll algorithm. The time complexity is shown, both theoretically and empirically, to be linear in the best case and quadratic in the worst case, whereas the average case is shown to be dominantly linear for trees with a relatively small number of nodes and dominantly quadratic otherwise.
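The exhaustive empirical setup can be sketched by enumerating every binary tree topology with n nodes (their number is the nth Catalan number). The tuple encoding below is a hypothetical illustration of such a harness, not the authors' implementation.

```python
def all_trees(n):
    """Yield every binary tree shape with n nodes as a nested tuple
    (left_subtree, right_subtree); None denotes the empty tree."""
    if n == 0:
        yield None
        return
    for left_size in range(n):            # split n-1 nodes among subtrees
        for left in all_trees(left_size):
            for right in all_trees(n - 1 - left_size):
                yield (left, right)

# Catalan numbers 1, 1, 2, 5, 14, ... count the shapes per n.
assert [sum(1 for _ in all_trees(n)) for n in range(5)] == [1, 1, 2, 5, 14]
```

Running the roll on every shape and recording the minimum and maximum step counts yields the best- and worst-case curves the paper reports.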


Author(s):  
Adrijan Božinovski ◽  
George Tanev ◽  
Biljana Stojčevska ◽  
Veno Pačovski ◽  
Nevena Ackovska

This paper presents the space complexity analysis of the Binary Tree Roll algorithm. The space complexity is analyzed theoretically, and the results are then confirmed empirically. The theoretical analysis consists of determining the amount of memory occupied during the execution of the algorithm and deriving functions for it, in terms of the number of nodes n of the tree, for the worst- and best-case scenarios. The empirical analysis consists of measuring the maximum and minimum amounts of memory occupied during the execution of the algorithm, for all binary tree topologies with a given number of nodes. The space complexity is shown, both theoretically and empirically, to be logarithmic in the best case and linear in the worst case, whereas the average case is shown to be dominantly logarithmic.


Author(s):  
Mikhail V. Berlinkov ◽  
Cyril Nicaud

In this paper we address the question of synchronizing random automata in the critical setting of almost-group automata. Group automata are automata in which every letter acts as a permutation on the set of states, and they are not synchronizing (unless they have a single state). In almost-group automata, one of the letters acts as a permutation on [Formula: see text] states, while the other letters act as permutations. We prove that this small change is enough for automata to become synchronizing with high probability. More precisely, we establish that the probability that a strongly connected almost-group automaton is not synchronizing is [Formula: see text], for a [Formula: see text]-letter alphabet. We also present an efficient algorithm that decides whether a strongly connected almost-group automaton is synchronizing. For a natural model of computation, we establish a [Formula: see text] worst-case lower bound for this problem ([Formula: see text] for the average case), which is almost matched by our algorithm.
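Deciding whether an automaton is synchronizing can be sketched with the classical pair-merging criterion: a DFA is synchronizing iff every pair of states can be collapsed to one state by some word. The sketch below is the textbook check via backward reachability over state pairs, not the authors' specialized algorithm for almost-group automata.

```python
from collections import deque

def is_synchronizing(delta, n_states, alphabet):
    """delta[(q, a)] -> next state. Walk inverse edges over state pairs,
    starting from the already-merged diagonal pairs (q, q)."""
    inv = {(q, a): [] for q in range(n_states) for a in alphabet}
    for q in range(n_states):
        for a in alphabet:
            inv[(delta[(q, a)], a)].append(q)
    mergeable = {(q, q) for q in range(n_states)}
    queue = deque(mergeable)
    while queue:
        p, q = queue.popleft()
        for a in alphabet:
            for p2 in inv[(p, a)]:
                for q2 in inv[(q, a)]:
                    pair = (min(p2, q2), max(p2, q2))
                    if pair not in mergeable:
                        mergeable.add(pair)
                        queue.append(pair)
    return all((min(p, q), max(p, q)) in mergeable
               for p in range(n_states) for q in range(n_states))

# Cerny-style 4-state example: 'b' is a cyclic permutation, 'a' is the
# identity except on one state -- an almost-group-flavoured automaton.
delta = {}
for q in range(4):
    delta[(q, "b")] = (q + 1) % 4
    delta[(q, "a")] = q
delta[(3, "a")] = 0
assert is_synchronizing(delta, 4, "ab")

# A pure group automaton (every letter a permutation) with more than one
# state is never synchronizing, matching the remark above:
group = {(q, a): ((q + 1) % 4 if a == "b" else q)
         for q in range(4) for a in "ab"}
assert not is_synchronizing(group, 4, "ab")
```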


2013 ◽  
Vol 457-458 ◽  
pp. 992-997
Author(s):  
Shao Yun Song ◽  
Bao Hua Zhang

Apriori and its improved algorithms can generally be classified into two kinds: SQL-based and memory-based. To improve the efficiency of association rule mining, after analyzing the efficiency bottlenecks in some algorithms of the second kind, an improved efficient algorithm is proposed. Two matrices are introduced into the algorithm: one is used to map the database, and the other stores information related to frequent 2-itemsets. Through operations on these two matrices, the algorithm's time complexity and space complexity decrease significantly. Experiments indicate that the method has better performance.
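The matrix idea can be sketched as follows, with hypothetical toy data: the transaction database is mapped to a 0/1 matrix (rows = transactions, columns = items), and supports of 2-itemsets are then computed from column pairs without rescanning the database. This is only a sketch of the general technique, not the paper's exact two-matrix scheme.

```python
from itertools import combinations

transactions = [{"a", "b", "c"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
items = sorted({i for t in transactions for i in t})

# Transaction-by-item 0/1 matrix mapping the database.
M = [[1 if it in t else 0 for it in items] for t in transactions]

def pair_support(M, j, k):
    """Support of the 2-itemset (items[j], items[k]) via column AND."""
    return sum(row[j] & row[k] for row in M)

min_support = 3
frequent2 = {
    (items[j], items[k])
    for j, k in combinations(range(len(items)), 2)
    if pair_support(M, j, k) >= min_support
}
assert frequent2 == {("a", "c"), ("b", "c")}
```

Storing the pairwise counts in a second matrix, as the paper does, avoids recomputing them when candidate itemsets are extended.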


2018 ◽  
Vol 25 (1) ◽  
pp. 123-134 ◽  
Author(s):  
Nodari Vakhania

The computational complexity of an algorithm is traditionally measured for the worst and the average case. The worst-case estimation guarantees a certain worst-case behavior of a given algorithm, although it might be rough, since in "most instances" the algorithm may have a significantly better performance. The probabilistic average-case analysis claims to derive an average performance of an algorithm, say, for an "average instance" of the problem in question. That instance may be far away from the average of the problem instances arising in a given real-life application, and so the average-case analysis would also provide an unrealistic estimation. We suggest that, in general, a wider use of probabilistic models could yield a more accurate estimation of algorithm efficiency. For instance, the quality of the solutions delivered by an approximation algorithm may also be estimated in the "average" probabilistic case. Such an approach would estimate the quality of the solutions delivered by the algorithm for the problem instances most common in a given application. As we illustrate, probabilistic modeling can also be used to derive an accurate time complexity performance measure, distinct from the traditional probabilistic average-case time complexity measure. Such an approach could, in particular, be useful when the traditional average-case estimation is still rough or is not possible at all.

