Scalable Algorithms for Adaptive Statistical Designs

Author(s):  
Robert Oehmke ◽  
Janis Hardwick ◽  
Quentin F. Stout
2000 ◽  
Vol 8 (3) ◽  
pp. 183-193

We present a scalable, high-performance solution to multidimensional recurrences that arise in adaptive statistical designs. Adaptive designs are an important class of learning algorithms for a stochastic environment, and we focus on the problem of optimally assigning patients to treatments in clinical trials. While adaptive designs have significant ethical and cost advantages, they are rarely utilized because of the complexity of optimizing and analyzing them. Computational challenges include massive memory requirements, few calculations per memory access, and multiply-nested loops with dynamic indices. We analyze the effects of various parallelization options, and while standard approaches do not work well, with effort an efficient, highly scalable program can be developed. This allows us to solve problems thousands of times more complex than those solved previously, which helps make adaptive designs practical. Further, our work applies to many other problems involving neighbor recurrences, such as generalized string matching.
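The recurrences behind such designs can be illustrated with a toy example. The sketch below is a stdlib-only Python illustration of the kind of neighbor recurrence the abstract describes — backward induction for optimally allocating patients between two Bernoulli treatment arms under uniform Beta priors — not the authors' parallel implementation; function and variable names are ours, and a serial memoized recursion stands in for their scalable multidimensional solver.

```python
from functools import lru_cache

def optimal_expected_successes(n):
    """Backward induction for a two-armed Bernoulli bandit with
    Beta(1,1) priors on each arm: maximize the expected number of
    successes over n patients.  The state is the count of successes
    and failures observed so far on each arm."""

    @lru_cache(maxsize=None)
    def value(s1, f1, s2, f2):
        remaining = n - (s1 + f1 + s2 + f2)
        if remaining == 0:
            return 0.0
        # Posterior mean success probability of each arm: Beta(1+s, 1+f).
        p1 = (s1 + 1) / (s1 + f1 + 2)
        p2 = (s2 + 1) / (s2 + f2 + 2)
        # Assigning the next patient to arm 1 or arm 2 links each state
        # to its "neighbor" states -- the multiply-nested recurrence.
        v1 = p1 * (1 + value(s1 + 1, f1, s2, f2)) \
             + (1 - p1) * value(s1, f1 + 1, s2, f2)
        v2 = p2 * (1 + value(s1, f1, s2 + 1, f2)) \
             + (1 - p2) * value(s1, f1, s2, f2 + 1)
        return max(v1, v2)

    return value(0, 0, 0, 0)
```

The state space grows polynomially in n but with a large constant, which hints at the memory pressure and low arithmetic intensity the paper addresses at scale.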


Author(s):  
Pallavi Sinha ◽  
Vikas K. Singh ◽  
Abhishek Bohra ◽  
Arvind Kumar ◽  
Jochen C. Reif ◽  
...  

Key message: Integrating genomics technologies and breeding methods to tweak core parameters of the breeder's equation could accelerate delivery of climate-resilient and nutrient-rich crops for future food security.

Abstract: Accelerating genetic gain in crop improvement programs with respect to climate resilience and nutrition traits, and the realization of the improved gain in farmers' fields, require integration of several approaches. This article focuses on innovative approaches to address core components of the breeder's equation. A prerequisite to enhancing genetic variance (σ2g) is the identification or creation of favorable alleles/haplotypes and their deployment for improving key traits. Novel alleles for new and existing target traits need to be accessed and added to the breeding population while maintaining genetic diversity. Selection intensity (i) in the breeding program can be improved by testing a larger population size, enabled by statistical designs with minimal replications and high-throughput phenotyping. Selection priorities and criteria to select an appropriate portion of the population also assume an important role. The most important component of the breeder's equation is heritability (h2). Heritability estimates depend on several factors, including the size and type of population and the statistical methods used. The present article starts with a brief discussion of potential ways to enhance σ2g in the population. We highlight statistical methods and experimental designs that could improve trait heritability estimation. We also offer a perspective on reducing the breeding cycle time (t), which could be achieved through the selection of appropriate parents, optimizing the breeding scheme, rapid fixation of target alleles, and combining speed breeding with breeding programs to optimize trials for release. Finally, we summarize knowledge from multiple disciplines for enhancing genetic gains for climate resilience and nutritional traits.
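The four components named above fit together in the standard form of the breeder's equation, ΔG = (i · h² · σP) / t. The sketch below is a minimal illustration of that relationship with made-up example values, not figures from the article; the function name and numbers are ours.

```python
def genetic_gain(i, h2, sigma_p, t):
    """Breeder's equation: expected genetic gain per unit time,
    dG = (i * h^2 * sigma_p) / t, where
      i       = selection intensity (in phenotypic sd units),
      h2      = narrow-sense heritability,
      sigma_p = phenotypic standard deviation of the trait,
      t       = breeding cycle time (e.g. years)."""
    return i * h2 * sigma_p / t

# Illustrative values only: halving the cycle time t doubles the
# annual gain, which is why speed breeding targets t directly.
baseline = genetic_gain(i=2.0, h2=0.5, sigma_p=1.0, t=1.0)
faster   = genetic_gain(i=2.0, h2=0.5, sigma_p=1.0, t=0.5)
```

The same algebra shows why each lever matters independently: doubling i, h², or σP has the same multiplicative effect on gain as halving t.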


2021 ◽  
pp. 1-11
Author(s):  
V.S. Anoop ◽  
P. Deepak ◽  
S. Asharaf

Online social networks are considered to be among the most disruptive platforms, where people communicate with each other on topics ranging from funny cat videos to cancer support. The widespread diffusion of mobile platforms such as smartphones causes the number of messages shared on such platforms to grow rapidly, so more intelligent and scalable algorithms are needed for efficient extraction of useful information. This paper proposes a method for retrieving relevant information from social network messages using a distributional semantics-based framework powered by topic modeling. The proposed framework combines Latent Dirichlet Allocation and distributional representations of phrases (Phrase2Vec) for effective information retrieval from online social networks. Extensive and systematic experiments on messages collected from Twitter (tweets) show that this approach outperforms some state-of-the-art approaches in terms of precision and accuracy, and that better information retrieval is possible using the proposed method.
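The core idea — scoring messages by similarity in both topic space and phrase-embedding space — can be sketched as follows. This is a stdlib-only toy of the scoring step only, not the paper's pipeline: the topic distributions and phrase vectors are assumed to come from upstream LDA and Phrase2Vec models, the `alpha` blending weight and all field names are our own, and a real system would use trained models (e.g. via gensim) rather than hand-built vectors.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def rank_messages(query, messages, alpha=0.5):
    """Rank messages by a weighted blend of topic-space similarity
    (LDA topic distributions) and embedding-space similarity
    (Phrase2Vec-style vectors) to the query.

    Each item is a dict with precomputed 'topics' and 'vec' fields;
    alpha controls the weight given to the topic-model signal."""
    def score(m):
        return (alpha * cosine(query["topics"], m["topics"])
                + (1 - alpha) * cosine(query["vec"], m["vec"]))
    return sorted(messages, key=score, reverse=True)
```

Blending the two signals lets topically related messages surface even when they share few surface phrases with the query, and vice versa.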


2021 ◽  
Vol 47 (2) ◽  
pp. 1-34
Author(s):  
Umberto Villa ◽  
Noemi Petra ◽  
Omar Ghattas

We present an extensible software framework, hIPPYlib, for the solution of large-scale deterministic and Bayesian inverse problems governed by partial differential equations (PDEs) with (possibly) infinite-dimensional parameter fields (which are high-dimensional after discretization). hIPPYlib overcomes the prohibitively expensive nature of Bayesian inversion for this class of problems by implementing state-of-the-art scalable algorithms for PDE-based inverse problems that exploit the structure of the underlying operators, notably the Hessian of the log-posterior. The key property of the algorithms implemented in hIPPYlib is that the solution of the inverse problem is computed at a cost, measured in linearized forward PDE solves, that is independent of the parameter dimension. The mean of the posterior is approximated by the MAP point, which is found by minimizing the negative log-posterior with an inexact matrix-free Newton-CG method. The posterior covariance is approximated by the inverse of the Hessian of the negative log-posterior evaluated at the MAP point. The construction of the posterior covariance is made tractable by invoking a low-rank approximation of the Hessian of the log-likelihood. Scalable tools for sample generation are also discussed. hIPPYlib makes all of these advanced algorithms easily accessible to domain scientists and provides an environment that expedites the development of new algorithms.
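The Laplace-approximation recipe described above — Newton's method to the MAP point, then inverse Hessian as posterior covariance — can be illustrated in one dimension. This is a toy sketch, not hIPPYlib's API: hIPPYlib operates on discretized PDE parameter fields with a matrix-free inexact Newton-CG solver and low-rank Hessian approximations, whereas here a scalar division stands in for the CG solve and the gradient/Hessian are supplied analytically.

```python
def laplace_approximation(grad, hess, m0, tol=1e-10, max_iter=50):
    """1-D illustration of the Laplace approach: find the MAP point
    by Newton's method on the negative log-posterior, then approximate
    the posterior variance by the inverse Hessian at the MAP point.

    grad, hess: first and second derivatives of the negative
    log-posterior; m0: initial guess for the parameter."""
    m = m0
    for _ in range(max_iter):
        step = grad(m) / hess(m)   # Newton step; a CG solve at scale
        m -= step
        if abs(step) < tol:
            break
    return m, 1.0 / hess(m)        # MAP point, posterior variance

# Example: Gaussian prior N(0, 1) and one noisy observation d = 2 with
# unit noise gives J(m) = 0.5*(m-2)^2 + 0.5*m^2, so J'(m) = 2m - 2 and
# J''(m) = 2; the MAP point is 1 and the posterior variance is 0.5.
map_pt, post_var = laplace_approximation(lambda m: 2.0 * m - 2.0,
                                         lambda m: 2.0, m0=0.0)
```

In the linear-Gaussian case the Laplace approximation is exact, as here; for nonlinear forward maps it is only a local Gaussian surrogate around the MAP point.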


Author(s):  
Hector Geffner

During the 60s and 70s, AI researchers explored intuitions about intelligence by writing programs that displayed intelligent behavior. Many good ideas came out of this work, but programs written by hand were not robust or general. After the 80s, research increasingly shifted to the development of learners capable of inferring behavior and functions from experience and data, and solvers capable of tackling well-defined but intractable models like SAT, classical planning, Bayesian networks, and POMDPs. The learning approach has achieved considerable success but results in black boxes that do not have the flexibility, transparency, and generality of their model-based counterparts. Model-based approaches, on the other hand, require models and scalable algorithms. Model-free learners and model-based solvers indeed have close parallels with Systems 1 and 2 in current theories of the human mind: the first, a fast, opaque, and inflexible intuitive mind; the second, a slow, transparent, and flexible analytical mind. In this paper, I review developments in AI and draw on these theories to discuss the gap between model-free learners and model-based solvers, a gap that needs to be bridged in order to have intelligent systems that are robust and general.

