Constructions for Clumps Statistics.

Frédérique Bassino; Julien Clément; Julien Fayolle; Pierre Nicodème

doi:10.46298/dmtcs.3563

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3563 ◽

2008 ◽

Vol DMTCS Proceedings vol. AI,... (Proceedings) ◽

Author(s):

Frédérique Bassino ◽

Julien Clément ◽

Julien Fayolle ◽

Pierre Nicodème

Keyword(s):

Poisson Distribution ◽

Unsolved Problem ◽

Probabilistic Approach ◽

Compound Poisson Distribution ◽

Exact Results ◽

Combinatorial Method ◽

Asymptotic Results ◽

Finite Set ◽

International Audience ◽

Overlapping Sets

We consider a component of word statistics known as a clump: starting from a finite set of words, clumps are maximal sets of overlapping occurrences of these words. This object was first studied by Schbath with the aim of counting the number of occurrences of words in random texts. Later work with a similar probabilistic approach used the Chen-Stein approximation for a compound Poisson distribution, where the number of clumps follows a law close to Poisson. Until now there has been no combinatorial counterpart to this approach, and we fill the gap here. We also provide a construction for the as yet unsolved problem of clumps of an arbitrary finite set of words. In contrast with the probabilistic approach, which only provides asymptotic results, the combinatorial method provides exact results that are useful when considering short sequences.
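
To make the notion of a clump concrete, here is a minimal Python sketch that locates clumps in a given text. It is an illustration only, not the construction from the paper: the function names are hypothetical, and it assumes that two occurrences overlap when they share at least one text position, so occurrences that merely touch end to start fall into different clumps.

```python
def find_occurrences(text, words):
    """Return sorted (start, end) pairs, end exclusive, for every occurrence of every word."""
    occurrences = []
    for word in set(words):
        if not word:
            continue  # ignore the empty word
        start = text.find(word)
        while start != -1:
            occurrences.append((start, start + len(word)))
            start = text.find(word, start + 1)  # step by 1 to catch self-overlaps
    return sorted(occurrences)


def find_clumps(text, words):
    """Group occurrences into clumps (assumption: overlap = sharing a position).

    Each clump is a dict with the covered span [start, end) and its occurrences.
    """
    clumps = []
    for start, end in find_occurrences(text, words):
        if clumps and start < clumps[-1]["end"]:
            # The occurrence begins inside the current clump: extend it.
            clumps[-1]["end"] = max(clumps[-1]["end"], end)
            clumps[-1]["occurrences"].append((start, end))
        else:
            # No shared position with the previous clump: start a new one.
            clumps.append({"start": start, "end": end, "occurrences": [(start, end)]})
    return clumps
```

For example, with the single word "aba" and the text "ababaxxaba", the occurrences at positions 0 and 2 form one clump and the occurrence at position 7 forms a second one.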

Supermodularity on chains and complexity of maximum constraint satisfaction

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3420 ◽

2005 ◽

Vol DMTCS Proceedings vol. AE,... (Proceedings) ◽

Author(s):

Vladimir Deineko ◽

Peter Jonsson ◽

Mikael Klasson ◽

Andrei Krokhin

Keyword(s):

Polynomial Time ◽

Constraint Satisfaction ◽

Constraint Satisfaction Problem ◽

Finite Domain ◽

Finite Collection ◽

Simple Description ◽

Weighted Constraints ◽

Finite Set ◽

International Audience ◽

Overlapping Sets

In the maximum constraint satisfaction problem ($\mathrm{Max \; CSP}$), one is given a finite collection of (possibly weighted) constraints on overlapping sets of variables, and the goal is to assign values from a given finite domain to the variables so as to maximise the number (or the total weight) of satisfied constraints. This problem is $\mathrm{NP}$-hard in general, so it is natural to study how restricting the allowed types of constraints affects the complexity of the problem. In this paper, we show that any $\mathrm{Max \; CSP}$ problem with a finite set of allowed constraint types that includes all constants (i.e. constraints of the form $x=a$) is either solvable in polynomial time or $\mathrm{NP}$-complete. Moreover, we present a simple description of all polynomial-time solvable cases of our problem. This description uses the well-known combinatorial property of supermodularity.
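
The problem statement can be illustrated with a small exhaustive solver. This is a hedged sketch that only makes the definition concrete: it enumerates every assignment, so it is exponential in the number of variables and is unrelated to the polynomial-time cases characterised in the paper. The name `max_csp` and the `(weight, scope, predicate)` encoding of a constraint are assumptions made for this example.

```python
from itertools import product


def max_csp(num_vars, domain, constraints):
    """Brute-force Max CSP.

    constraints: list of (weight, scope, predicate), where scope is a tuple of
    variable indices and predicate takes the values of those variables.
    Returns (best total weight, one assignment achieving it).
    """
    best_value, best_assignment = None, None
    for assignment in product(domain, repeat=num_vars):
        # Total weight of the constraints satisfied by this assignment.
        value = sum(
            weight
            for weight, scope, predicate in constraints
            if predicate(*(assignment[i] for i in scope))
        )
        if best_value is None or value > best_value:
            best_value, best_assignment = value, assignment
    return best_value, best_assignment
```

For instance, three pairwise disequality constraints on Boolean variables plus the constant constraint $x_0 = 1$, all of weight 1, cannot all be satisfied at once; the solver reports a best total weight of 3.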

Log-Compound-Poisson Distribution Model of Intermittency in Turbulence

Zeitschrift für Naturforschung A ◽

10.1515/zna-1998-10-1105 ◽

1998 ◽

Vol 53 (10-11) ◽

pp. 828-832

Author(s):

Feng Quing-Zeng

Keyword(s):

Energy Dissipation ◽

Poisson Distribution ◽

Turbulent Energy ◽

Compound Poisson Distribution ◽

Distribution Model ◽

Velocity Difference ◽

Scaling Exponents ◽

Compound Poisson ◽

Developed Turbulence ◽

Experimental Values

The log-compound-Poisson distribution for the breakdown coefficients of turbulent energy dissipation is proposed, and the scaling exponents for the velocity difference moments in fully developed turbulence are obtained, which agree well with experimental values up to measurable orders. The underlying physics of this model is directly related to the burst phenomenon in turbulence, and a detailed discussion is given in the last section.

Functionals of random mappings: exact and asymptotic results

Advances in Applied Probability ◽

10.2307/1427616 ◽

1991 ◽

Vol 23 (3) ◽

pp. 437-455 ◽

Cited By ~ 7

Author(s):

P. J. Donnelly ◽

W. J. Ewens ◽

S. Padmadisastra

Keyword(s):

Frequency Spectrum ◽

Exact Results ◽

Asymptotic Results ◽

Number Of Components ◽

Random Mapping ◽

Random Mappings ◽

Simulation Results ◽

Central Tool

A random mapping partitions the set {1, 2, ···, m} into components, where i and j are in the same component if some functional iterate of i equals some functional iterate of j. We consider various functionals of these partitions and of samples from them, including the number of components of ‘small’ size and of size O(m) as m → ∞, the size of the largest component, the number of components, and various symmetric functionals of the normalized component sizes. In many cases exact results, while available, are uninformative, and we consider various approximations. Numerical and simulation results are also presented. A central tool for many calculations is the ‘frequency spectrum’, both exact and asymptotic.
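
The component structure described above can be computed directly. The following is a minimal sketch under stated assumptions, not the authors' method: it indexes the set as 0, ..., m-1 rather than 1, ..., m, draws each image uniformly and independently, and uses hypothetical function names. Two elements whose iterates eventually meet lie in the same weakly connected component of the functional graph, so joining each i with f(i) in a union-find structure recovers the partition.

```python
import random
from collections import Counter


def component_sizes(f):
    """Given a mapping as a list with f[i] in range(len(f)),
    return the component sizes in decreasing order."""
    parent = list(range(len(f)))

    def find(x):
        # Find the representative of x, with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, image in enumerate(f):
        root_i, root_image = find(i), find(image)
        if root_i != root_image:
            parent[root_i] = root_image  # i and f(i) share a component
    counts = Counter(find(i) for i in range(len(f)))
    return sorted(counts.values(), reverse=True)


def random_mapping(m, rng):
    """Uniform random mapping on range(m) (assumption: independent uniform images)."""
    return [rng.randrange(m) for _ in range(m)]
```

Calling `component_sizes(random_mapping(1000, random.Random(1)))` gives one sample of the component sizes, from which the number of components and the size of the largest component can be read off.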

Modelling Heterogeneity in Survival Analysis by the Compound Poisson Distribution

The Annals of Applied Probability ◽

10.1214/aoap/1177005583 ◽

1992 ◽

Vol 2 (4) ◽

pp. 951-972 ◽

Cited By ~ 116

Author(s):

Odd O. Aalen

Keyword(s):

Survival Analysis ◽

Poisson Distribution ◽

Compound Poisson Distribution ◽

Compound Poisson

A Distribution for Multivariate Frailty Based on the Compound Poisson Distribution with Random Scale

Lifetime Data Analysis ◽

10.1007/s10985-004-5639-z ◽

2005 ◽

Vol 11 (1) ◽

pp. 41-59 ◽

Cited By ~ 21

Author(s):

Tron Anders Moger ◽

Odd O. Aalen

Keyword(s):

Poisson Distribution ◽

Compound Poisson Distribution ◽

Compound Poisson ◽

Random Scale

Prediction and estimation for the compound Poisson distribution

Herbert Robbins Selected Papers ◽

10.1007/978-1-4612-5110-1_5 ◽

1985 ◽

pp. 70-71

Author(s):

Herbert Robbins

Keyword(s):

Poisson Distribution ◽

Compound Poisson Distribution ◽

Compound Poisson

Counting occurrences for a finite set of words: an inclusion-exclusion approach

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3543 ◽

2007 ◽

Vol DMTCS Proceedings vol. AH,... (Proceedings) ◽

Author(s):

Frédérique Bassino ◽

Julien Clément ◽

J. Fayolle ◽

P. Nicodème

Keyword(s):

Generating Function ◽

Exclusion Principle ◽

Complete Proof ◽

Detailed Proof ◽

Finite Set ◽

International Audience ◽

The One ◽

Maple Package

In this paper, we give the multivariate generating function counting texts according to their length and to the number of occurrences of words from a finite set. The application of the inclusion-exclusion principle to word counting due to Goulden and Jackson (1979, 1983) is used to derive the result. Unlike some other techniques which suppose that the set of words is reduced (i.e., no word is a factor of another), the finite set can be chosen arbitrarily. Noonan and Zeilberger (1999) already provided a MAPLE package treating the non-reduced case, without giving an expression of the generating function or a detailed proof. We give a complete proof validating the use of the inclusion-exclusion principle and compare the complexity of the method proposed here with that of the automata-based method for solving the problem.

Graphs with many vertex-disjoint cycles

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.587 ◽

2012 ◽

Vol 14 no. 2 (Graph Theory) ◽

Author(s):

Dieter Rautenbach ◽

Friedrich Regen

Keyword(s):

Graph Theory ◽

Upper Bound ◽

Linear Time ◽

Simple Extension ◽

Disjoint Cycles ◽

Cyclomatic Number ◽

Finite Set ◽

Extension Rule ◽

International Audience ◽

Vertex Disjoint

We study graphs $G$ in which the maximum number of vertex-disjoint cycles $\nu(G)$ is close to the cyclomatic number $\mu(G)$, which is a natural upper bound for $\nu(G)$. Our main result is the existence of a finite set $P(k)$ of graphs for all $k \in \mathbb{N}_0$ such that every 2-connected graph $G$ with $\mu(G)-\nu(G) = k$ arises by applying a simple extension rule to a graph in $P(k)$. As an algorithmic consequence we describe algorithms calculating $\min\{\mu(G)-\nu(G), k+1\}$ in linear time for fixed $k$.
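
The abstract does not spell out the cyclomatic number; under the standard definition it is the number of edges minus the number of vertices plus the number of connected components. The sketch below computes that quantity for an edge list, assuming vertices are labelled 0, ..., n-1 (the function name is hypothetical). It does not attempt to compute the maximum number of vertex-disjoint cycles, which is the hard part of the problem.

```python
def cyclomatic_number(num_vertices, edges):
    """Cyclomatic number |E| - |V| + c of an undirected graph given as an edge list."""
    parent = list(range(num_vertices))

    def find(x):
        # Find the representative of x, with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    components = num_vertices
    for u, v in edges:
        root_u, root_v = find(u), find(v)
        if root_u != root_v:
            parent[root_u] = root_v
            components -= 1  # the edge joins two components
    return len(edges) - num_vertices + components
```

For the complete graph on four vertices this returns 3, while any two cycles in that graph share a vertex, so the gap between the two parameters is 2 there.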

Sorting using complete subintervals and the maximum number of runs in a randomly evolving sequence: Extended abstract.

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3548 ◽

2007 ◽

Vol DMTCS Proceedings vol. AH,... (Proceedings) ◽

Author(s):

Svante Janson

Keyword(s):

Order Term ◽

Combinatorial Problem ◽

Random Order ◽

Priority Queues ◽

Sorting Algorithm ◽

Asymptotic Results ◽

First Order ◽

Asymptotically Normal ◽

International Audience ◽

Space Requirements

We study the space requirements of a sorting algorithm where only items that at the end will be adjacent are kept together. This is equivalent to the following combinatorial problem: Consider a string of fixed length n that starts as a string of 0's, and then evolves by changing each 0 to 1, with the n changes done in random order. What is the maximal number of runs of 1's? We give asymptotic results for the distribution and mean. It turns out that, as in many problems involving a maximum, the maximum is asymptotically normal, with fluctuations of order $n^{1/2}$, and to the first order well approximated by the number of runs at the instant when the expectation is maximized, in this case when half the elements have changed to 1; there is also a second order term of order $n^{1/3}$. We also treat some variations, including priority queues and sock-sorting.
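
The evolving-string process is easy to simulate, which can help build intuition for the quantity being studied. The following is a minimal sketch with hypothetical function names, and it makes no attempt to reproduce the asymptotic analysis. It tracks the number of runs of 1's incrementally: flipping a position whose two neighbours are both 0 creates a run, flipping one whose two neighbours are both 1 merges two runs, and any other flip leaves the count unchanged.

```python
import random


def max_runs_for_order(order):
    """Flip positions to 1 in the given order (a permutation of range(n));
    return the largest number of runs of 1's seen along the way."""
    n = len(order)
    bits = [0] * n
    runs = best = 0
    for pos in order:
        left = pos > 0 and bits[pos - 1] == 1
        right = pos < n - 1 and bits[pos + 1] == 1
        if left and right:
            runs -= 1  # the new 1 merges two existing runs
        elif not left and not right:
            runs += 1  # the new 1 starts an isolated run
        bits[pos] = 1
        best = max(best, runs)
    return best


def max_runs_random(n, rng):
    """Same quantity for a uniformly random order of the n changes."""
    order = list(range(n))
    rng.shuffle(order)
    return max_runs_for_order(order)
```

Averaging `max_runs_random(n, random.Random(seed))` over many seeds gives an empirical estimate of the mean for a given n.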

Enumerating alternating tree families

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3624 ◽

2008 ◽

Vol DMTCS Proceedings vol. AJ,... (Proceedings) ◽

Author(s):

Markus Kuba ◽

Alois Panholzer

Keyword(s):

Nous Obtenons ◽

Asymptotic Results ◽

Root Node ◽

Ordered Trees ◽

Exponential Generating Function ◽

Random Node ◽

Enumeration Problems ◽

International Audience ◽

Nous Examinons ◽

Labelled Trees

We study two enumeration problems for $\textit{up-down alternating trees}$, i.e., rooted labelled trees $T$, where the labels $v_1, v_2, v_3, \ldots$ on every path starting at the root of $T$ satisfy $v_1 < v_2 > v_3 < v_4 > \cdots$. First we consider various tree families of interest in combinatorics (such as unordered, ordered, $d$-ary and Motzkin trees) and study the number $T_n$ of different up-down alternating labelled trees of size $n$. We obtain for all tree families considered an implicit characterization of the exponential generating function $T(z)$, leading to asymptotic results for the coefficients $T_n$ for various tree families. Second we consider the particular family of up-down alternating labelled ordered trees and study the influence of such an alternating labelling on the average shape of the trees by analyzing the parameters $\textit{label of the root node}$, $\textit{degree of the root node}$ and $\textit{depth of a random node}$ in a random tree of size $n$. This leads to exact enumeration results and limiting distribution results.
