Fast Performance Modeling across Different Database Versions Using Partitioned Co-Kriging

Rong Cao; Liang Bao; Shouxin Wei; Jiarui Duan; Xi Wu; Yeye Du; Ren Sun

doi:10.3390/app11209669

Fast Performance Modeling across Different Database Versions Using Partitioned Co-Kriging

Applied Sciences ◽

10.3390/app11209669 ◽

2021 ◽

Vol 11 (20) ◽

pp. 9669

Author(s):

Rong Cao ◽

Liang Bao ◽

Shouxin Wei ◽

Jiarui Duan ◽

Xi Wu ◽

...

Keyword(s):

Performance Prediction ◽

System Performance ◽

Prediction Accuracy ◽

Performance Modeling ◽

State Of The Art ◽

Database Systems ◽

Target Domain ◽

Reliable Performance ◽

Measurement Effort ◽

Control Functional

Database systems have a large number of configuration parameters that control functional and non-functional properties (e.g., performance and cost). Different configurations may lead to different performance values. To understand and predict the effect of configuration parameters on system performance, several learning-based strategies have been recently proposed. However, existing approaches usually assume a fixed database version such that learning has to be repeated once the database version changes. Repeating measurement and learning for each version is expensive and often practically infeasible. Instead, we propose the Partitioned Co-Kriging (PCK) approach that transfers knowledge from an older database version (source domain) to learn a reliable performance prediction model fast for a newer database version (target domain). Our method is based on the key observations that performance responses typically exhibit similarities across different database versions. We conducted extensive experiments under 5 different database systems with different versions to demonstrate the superiority of PCK. Experimental results show that PCK outperforms six state-of-the-art baseline algorithms in terms of prediction accuracy and measurement effort.

Download Full-text

Performance Evaluation of Distributed Database Systems

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v13i9.2383 ◽

2014 ◽

Vol 13 (9) ◽

pp. 4859-4867

Author(s):

Khaled Saleh Maabreh

Keyword(s):

Performance Evaluation ◽

System Performance ◽

Operation Mode ◽

Database Systems ◽

Distributed Database ◽

Evaluation Study ◽

Distributed Database Systems ◽

Different Types ◽

Distributed Database System ◽

Distributed Database Management

Distributed database management systems manage a huge amount of data as well as large and increasingly growing number of users through different types of queries. Therefore, efficient methods for accessing these data volumes will be required to provide a high and an acceptable level of system performance. Â Data in these systems are varying in terms of types from texts to images, audios and videos that must be available through an optimized level of replication. Distributed database systems have many parameters like data distribution degree, operation mode and the number of sites and replication. These parameters have played a major role in any performance evaluation study. This paper investigates the main parameters that may affect the system performance, which may help with configuring the distributed database system for enhancing the overall system performance.

Download Full-text

Equivariant Adversarial Network for Image-to-image Translation

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3458280 ◽

2021 ◽

Vol 17 (2s) ◽

pp. 1-14

Author(s):

Masoumeh Zareapoor ◽

Jie Yang

Keyword(s):

State Of The Art ◽

Generative Models ◽

Generative Model ◽

Target Domain ◽

Adversarial Network ◽

Proposed Model ◽

Image Translation ◽

Great Performance ◽

Representative Model ◽

The Ideal

Image-to-Image translation aims to learn an image from a source domain to a target domain. However, there are three main challenges, such as lack of paired datasets, multimodality, and diversity, that are associated with these problems and need to be dealt with. Convolutional neural networks (CNNs), despite of having great performance in many computer vision tasks, they fail to detect the hierarchy of spatial relationships between different parts of an object and thus do not form the ideal representative model we look for. This article presents a new variation of generative models that aims to remedy this problem. We use a trainable transformer, which explicitly allows the spatial manipulation of data within training. This differentiable module can be augmented into the convolutional layers in the generative model, and it allows to freely alter the generated distributions for image-to-image translation. To reap the benefits of proposed module into generative model, our architecture incorporates a new loss function to facilitate an effective end-to-end generative learning for image-to-image translation. The proposed model is evaluated through comprehensive experiments on image synthesizing and image-to-image translation, along with comparisons with several state-of-the-art algorithms.

Download Full-text

Domain Adaptation Using a Three-Way Decision Improves the Identification of Autism Patients from Multisite fMRI Data

Brain Sciences ◽

10.3390/brainsci11050603 ◽

2021 ◽

Vol 11 (5) ◽

pp. 603

Author(s):

Chunlei Shi ◽

Xianwei Xin ◽

Jiacai Zhang

Keyword(s):

Machine Learning ◽

Domain Adaptation ◽

Recognition Accuracy ◽

State Of The Art ◽

Autism Spectrum ◽

Fmri Data ◽

Target Domain ◽

Sample Distribution ◽

Machine Learning Methods ◽

First Time

Machine learning methods are widely used in autism spectrum disorder (ASD) diagnosis. Due to the lack of labelled ASD data, multisite data are often pooled together to expand the sample size. However, the heterogeneity that exists among different sites leads to the degeneration of machine learning models. Herein, the three-way decision theory was introduced into unsupervised domain adaptation in the first time, and applied to optimize the pseudolabel of the target domain/site from functional magnetic resonance imaging (fMRI) features related to ASD patients. The experimental results using multisite fMRI data show that our method not only narrows the gap of the sample distribution among domains but is also superior to the state-of-the-art domain adaptation methods in ASD recognition. Specifically, the ASD recognition accuracy of the proposed method is improved on all the six tasks, by 70.80%, 75.41%, 69.91%, 72.13%, 71.01% and 68.85%, respectively, compared with the existing methods.

Download Full-text

Numerical simulation of turbulence flow in a Kaplan turbine -Evaluation on turbine performance prediction accuracy-

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/22/2/022006 ◽

2014 ◽

Vol 22 (2) ◽

pp. 022006 ◽

Cited By ~ 3

Author(s):

P Ko ◽

S Kurosawa

Keyword(s):

Numerical Simulation ◽

Performance Prediction ◽

Prediction Accuracy ◽

Kaplan Turbine ◽

Turbulence Flow ◽

Turbine Performance

Download Full-text

Digital loop system performance prediction using an impulse noise database

10.1109/glocom.1989.64212 ◽

2003 ◽

Author(s):

L. Alles ◽

K. Cuffling ◽

R. Bertrand ◽

R. Quintal

Keyword(s):

Performance Prediction ◽

System Performance ◽

Impulse Noise ◽

Loop System

Download Full-text

Comparison of Wind Turbine Performance Prediction and Measurement

Journal of Solar Energy Engineering ◽

10.1115/1.3266290 ◽

1982 ◽

Vol 104 (2) ◽

pp. 84-88 ◽

Cited By ~ 11

Author(s):

J. L. Tangler

Keyword(s):

Test Data ◽

Wind Turbines ◽

Performance Prediction ◽

State Of The Art ◽

Correlation Study ◽

Performance Models ◽

Rocky Flats ◽

Turbine Performance ◽

Horizontal Axis Wind Turbines ◽

Development Center

The purpose of this work was to evaluate the state-of-the-art of performance prediction for small horizontal-axis wind turbines. This effort was undertaken since few of the existing performance methods used to predict rotor power output have been validated with reliable test data. The program involved evaluating several existing performance models from four contractors by comparing their predictions for two wind turbines with actual test data. Test data were acquired by Rocky Flats Test and Development Center and furnished to the contractors after submission of their prediction reports. The results of the correlation study will help identify areas in which existing rotor performance models are inadequate and, where possible, the reasons for the models shortcomings. In addition, several problems associated with obtaining accurate test data will be discussed.

Download Full-text

Hyperspectral Imager System Performance Modeling

Hyperspectral Satellites and System Design ◽

10.1201/9780429266201-7 ◽

2020 ◽

pp. 323-356

Author(s):

Shen-En Qian

Keyword(s):

System Performance ◽

Performance Modeling

Download Full-text

Neural Embedding-Based Metrics for Pre-retrieval Query Performance Prediction

10.32920/ryerson.14654253.v1 ◽

2021 ◽

Author(s):

Arabzadehghahyazi Negar

Keyword(s):

Performance Prediction ◽

State Of The Art ◽

Learning To Rank ◽

The State ◽

Test Collection ◽

Query Performance ◽

Performance Predictors ◽

Level Statistics ◽

Ablation Study ◽

Individual Specificity

file:///C:/Users/MWF/Downloads/Arabzadehghahyazi, Negar.Pre-retrieval Query Performance Prediction (QPP) methods are oblivious to the performance of the retrieval model as they predict query difficulty prior to observing the set of documents retrieved for the query. Among pre-retrieval query performance predictors, specificity-based metrics investigate how corpus, query and corpus-query level statistics can be used to predict the performance of the query. In this thesis, we explore how neural embeddings can be utilized to define corpus-independent and semantics-aware specificity metrics. Our metrics are based on the intuition that a term that is closely surrounded by other terms in the embedding space is more likely to be specific while a term surrounded by less closely related terms is more likely to be generic. On this basis, we leverage geometric properties between embedded terms to define four groups of metrics: (1) neighborhood-based, (2) graph-based, (3) cluster-based and (4) vector-based metrics. Moreover, we employ learning-to-rank techniques to analyze the importance of individual specificity metrics. To evaluate the proposed metrics, we have curated and publicly share a test collection of term specificity measurements defined based on Wikipedia category hierarchy and DMOZ taxonomy. We report on our extensive experiments on the effectiveness of our metrics through metric comparison, ablation study and comparison against the state-of-the-art baselines. We have shown that our proposed set of pre-retrieval QPP metrics based on the properties of pre-trained neural embeddings are more effective for performance prediction compared to the state-of-the-art methods. We report our findings based on Robust04, ClueWeb09 and Gov2 corpora and their associated TREC topics.

Download Full-text

ДОСЯГНЕННЯ ЕФЕКТИВНОГО РОЗПОДІЛЕНОГО ПЛАНУВАННЯ ЗА ДОПОМОГОЮ ЧЕРГ ПОВІДОМЛЕНЬ У ХМАРІ ДЛЯ БАГАТОЗАДАЧНИХ ОБЧИСЛЕНЬ ТА ВИСОКОПРОДУКТИВНИХ ОБЧИСЛЕНЬ

INTERNATIONAL ACADEMY JOURNAL Web of Scholar ◽

10.31435/rsglobal_wos/30122020/7323 ◽

2020 ◽

Author(s):

Старовойтенко Олексій Володимирович

Keyword(s):

Distributed Systems ◽

System Performance ◽

State Of The Art ◽

Computation Time ◽

Cloud Services ◽

Distributed Scheduling ◽

Message Processing ◽

Computational Problem ◽

Control Units ◽

And Control

Due to the growth of data and the number of computational tasks, it is necessary to ensure the required level of system performance. Performance can be achieved by scaling the system horizontally / vertically, but even increasing the amount of computing resources does not solve all the problems. For example, a complex computational problem should be decomposed into smaller subtasks, the computation time of which is much shorter. However, the number of such tasks may be constantly increasing, due to which the processing on the services is delayed or even certain messages will not be processed. In many cases, message processing should be coordinated, for example, message A should be processed only after messages B and C. Given the problems of processing a large number of subtasks, we aim in this work - to design a mechanism for effective distributed scheduling through message queues. As services we will choose cloud services Amazon Webservices such as Amazon EC2, SQS and DynamoDB. Our FlexQueue solution can compete with state-of-the-art systems such as Sparrow and MATRIX. Distributed systems are quite complex and require complex algorithms and control units, so the solution of this problem requires detailed research.

Download Full-text

Machine learning based system performance prediction model for reactor control

Annals of Nuclear Energy ◽

10.1016/j.anucene.2017.11.014 ◽

2018 ◽

Vol 113 ◽

pp. 270-278 ◽

Cited By ~ 6

Author(s):

Yuyun Zeng ◽

Jingquan Liu ◽

Kaichao Sun ◽

Lin-wen Hu

Keyword(s):

Machine Learning ◽

Prediction Model ◽

Performance Prediction ◽

System Performance ◽

Reactor Control

Download Full-text