An Activation Method of Topic Dictionary to Expand Training Data for Trend Rule Discovery

2014 ◽  
Vol 2014 ◽  
pp. 1-11 ◽  
Author(s):  
Shigeaki Sakurai ◽  
Kyoko Makino ◽  
Shigeru Matsumoto

This paper improves a method that predicts whether evaluation objects such as companies and products will be attractive in the near future. Attractiveness is evaluated by trend rules, which represent relationships among evaluation objects, keywords, and numerical changes related to those objects. The rules are inductively acquired from text sequences and numerical sequences. The method assigns evaluation objects to the text sequences by activating a topic dictionary, which describes keywords representing the numerical changes. This activation expands the amount of training data, and the expansion is anticipated to yield more valid trend rules. The paper applies the method to the task of predicting attractive stock brands from news headlines and stock price sequences, and shows through numerical experiments that the method improves the detection performance for evaluation objects.
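The dictionary-activation step described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the keywords, change labels, and company names are invented, and the real method operates on text and numerical sequences rather than isolated headlines.

```python
# Hypothetical topic dictionary: keywords that signal a numerical change.
topic_dictionary = {"surge": "up", "rally": "up", "slump": "down"}

def activate(headlines, objects):
    """Assign evaluation objects to headlines via the topic dictionary,
    yielding new labelled training rows (object, text, change)."""
    rows = []
    for text in headlines:
        # The first matching keyword determines the numerical-change label.
        change = next((c for k, c in topic_dictionary.items() if k in text), None)
        if change is None:
            continue
        for obj in objects:
            if obj in text:
                rows.append((obj, text, change))
    return rows

rows = activate(["AcmeCorp shares surge on earnings"], ["AcmeCorp", "BetaInc"])
```

Each matched headline contributes one row per evaluation object it names, which is how the dictionary expands the training set.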

2020 ◽  
Vol 2020 (10) ◽  
pp. 310-1-310-7
Author(s):  
Khalid Omer ◽  
Luca Caucci ◽  
Meredith Kupinski

This work reports on convolutional neural network (CNN) performance on an image texture classification task as a function of linear image processing and number of training images. Detection performance of single- and multi-layer CNNs (sCNN/mCNN) is compared to optimal observers. Performance is quantified by the area under the receiver operating characteristic (ROC) curve, also known as the AUC: AUC = 1.0 corresponds to perfect detection and AUC = 0.5 to guessing. The Ideal Observer (IO) maximizes AUC but is prohibitive in practice because it depends on high-dimensional image likelihoods. The IO performance is invariant to any full-rank, invertible linear image processing. This work demonstrates the existence of full-rank, invertible linear transforms that can degrade both sCNN and mCNN performance even in the limit of large quantities of training data. A subsequent invertible linear transform changes the images’ correlation structure again and can improve this AUC. Stationary textures sampled from zero-mean and unequal-covariance Gaussian distributions allow closed-form analytic expressions for the IO and optimal linear compression. Linear compression is a mitigation technique for high-dimension low sample size (HDLSS) applications. By definition, compression strictly decreases or maintains IO detection performance. For small quantities of training data, linear image compression prior to the sCNN architecture can increase AUC from 0.56 to 0.93. Results indicate an optimal compression ratio for CNNs that depends on task difficulty, compression method, and number of training images.
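The AUC and ideal-observer concepts above can be made concrete with a toy one-dimensional version of the task: two zero-mean Gaussian classes that differ only in variance, as in the paper's texture model. In 1-D the log-likelihood ratio is monotone in x², so x² is an equivalent IO test statistic; the sample size and seed below are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000
t0 = rng.normal(0.0, 1.0, n) ** 2   # class 0: variance 1, IO statistic x**2
t1 = rng.normal(0.0, 2.0, n) ** 2   # class 1: variance 4

# Empirical AUC = probability that a class-1 score exceeds a class-0 score.
auc = (t1[:, None] > t0[None, :]).mean()
```

For this variance ratio the analytic AUC is 1 − (2/π)·arctan(1/2) ≈ 0.70, well above guessing but far from perfect, which is why the task is a useful benchmark for comparing CNNs against the IO.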


2020 ◽  
Vol 13 (1) ◽  
pp. 34
Author(s):  
Rong Yang ◽  
Robert Wang ◽  
Yunkai Deng ◽  
Xiaoxue Jia ◽  
Heng Zhang

The random cropping data augmentation method is widely used to train convolutional neural network (CNN)-based target detectors to detect targets in optical images (e.g., COCO datasets). It can expand the scale of the dataset dozens of times while adding only a small computational cost to training the neural network detector. In addition, random cropping can greatly enhance the spatial robustness of the model, because it makes the same target appear in different positions of the sample image. Random cropping and random flipping have become the standard configuration for tasks with limited training data, which makes it natural to introduce them into the training of CNN-based synthetic aperture radar (SAR) image ship detectors. However, in this paper, we show that directly introducing traditional random cropping into the training of a CNN-based SAR image ship detector can introduce substantial noise into the gradient during back-propagation, which hurts detection performance. To eliminate this noise in the training gradient, a simple and effective training method based on a feature map mask is proposed. Experiments show that the proposed method effectively eliminates the gradient noise introduced by random cropping and significantly improves detection performance under a variety of evaluation indicators without increasing inference cost.
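The masking idea can be sketched in a few lines. This is a simplified stand-in for the paper's feature-map mask (the names and the averaging convention are assumptions): per-position losses outside the valid region of a cropped sample are zeroed, so back-propagation receives no gradient from that region.

```python
import numpy as np

def masked_loss(per_position_loss, valid_mask):
    """Zero the loss at positions outside the valid crop region, then
    average over the remaining valid positions only."""
    masked = per_position_loss * valid_mask
    return masked.sum() / max(valid_mask.sum(), 1)

loss_map = np.array([[4.0, 2.0],
                     [8.0, 6.0]])
mask = np.array([[1.0, 0.0],
                 [1.0, 0.0]])   # right column lies outside the crop
result = masked_loss(loss_map, mask)
```

Because the masked positions contribute exactly zero to the sum, their gradient is zero as well, which is the mechanism that removes the cropping-induced noise.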


2022 ◽  
pp. 1-13
Author(s):  
Denis Paperno

Can recurrent neural nets, inspired by human sequential data processing, learn to understand language? We construct simplified datasets reflecting core properties of natural language as modeled in formal syntax and semantics: recursive syntactic structure and compositionality. We find LSTM and GRU networks to generalise well to compositional interpretation, but only in the most favorable learning settings, with a well-paced curriculum, extensive training data, and left-to-right (but not right-to-left) composition.
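A simplified language of the kind described can be illustrated with a toy grammar. This example is invented, not one of the paper's datasets: recursive negation over booleans, whose meaning is computed compositionally from the parts.

```python
def interpret(tokens):
    """Compositional meaning of 'not'* ('true'|'false'):
    each 'not' flips the truth value of what follows."""
    value = tokens[-1] == "true"
    for tok in tokens[:-1]:
        if tok != "not":
            raise ValueError("ungrammatical")
        value = not value
    return value

# A few (expression, meaning) training pairs a network would learn from.
examples = [(expr, interpret(expr)) for expr in
            (["true"], ["not", "true"], ["not", "not", "false"])]
```

Datasets built this way test exactly the two properties named above: arbitrary recursion depth and a meaning that is a function of the meanings of the parts.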


2007 ◽  
Vol 19 (1) ◽  
pp. 258-282 ◽  
Author(s):  
Ping Zhong ◽  
Masao Fukushima

Multiclass classification is an important and ongoing research subject in machine learning. Current support vector methods for multiclass classification implicitly assume that the parameters in the optimization problems are known exactly. However, in practice, the parameters have perturbations since they are estimated from the training data, which are usually subject to measurement noise. In this article, we propose linear and nonlinear robust formulations for multiclass classification based on the M-SVM method. The preliminary numerical experiments confirm the robustness of the proposed method.
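For context, the nominal (non-robust) objective that the robust formulations build on can be sketched as the multiclass hinge loss in the style of Crammer-Singer M-SVMs. This sketch omits exactly what the paper adds, namely bounds on perturbations of the estimated parameters; the names are assumptions.

```python
import numpy as np

def multiclass_hinge(W, X, y):
    """Mean multiclass hinge loss. W: (n_classes, n_features) weight rows,
    X: (n_samples, n_features), y: integer class labels."""
    scores = X @ W.T                       # (n_samples, n_classes)
    correct = scores[np.arange(len(y)), y]
    margins = 1.0 + scores - correct[:, None]
    margins[np.arange(len(y)), y] = 0.0    # exclude the true class
    return np.maximum(margins, 0.0).max(axis=1).mean()

W = np.eye(2)  # toy weights: class r scores feature r
```

A robust variant would replace each score with its worst case over an uncertainty set for W and X, so the margin constraint holds under the measurement noise the abstract describes.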


2021 ◽  
Author(s):  
Lianteng Song ◽  
Zhonghua Liu ◽  
Chaoliu Li ◽  
Congqian Ning ◽  
...  

Geomechanical properties are essential for safe drilling, successful completion, and exploration of both conventional and unconventional reservoirs, e.g. deep shale gas and shale oil. Typically, these properties can be calculated from sonic logs. However, in shale reservoirs it is time-consuming and challenging to obtain reliable logging data because of borehole complexity and lack of information, which often results in log deficiency and a high cost of recovering incomplete datasets. In this work, we propose the bidirectional long short-term memory (BiLSTM) network, a supervised neural network algorithm widely used for prediction on sequential data, to estimate geomechanical parameters. Prediction from log data is conducted from two aspects: 1) single-well prediction, where the log data from a single well are divided into training data and testing data for cross validation; and 2) cross-well prediction, where a group of wells from the same geographical region is likewise divided into a training set and a testing set. The logs used in this work were collected from 11 wells in the Jimusaer Shale and include gamma ray, bulk density, resistivity, etc. We compared five machine learning algorithms, among which BiLSTM showed the best performance, with an R-squared of more than 90% and an RMSE of less than 10. The predicted results can be used directly to calculate geomechanical properties, whose accuracy is also improved in contrast to conventional methods.
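The bidirectional mechanism behind BiLSTM can be sketched with a plain tanh recurrence to keep the example short (a real BiLSTM adds gating): one pass reads the log sequence top-down, the other bottom-up, and the two hidden states are concatenated at each depth sample. All shapes and weights below are illustrative.

```python
import numpy as np

def rnn_pass(x, Wx, Wh):
    """One directional pass of a simple tanh RNN over sequence x."""
    h = np.zeros(Wh.shape[0])
    out = []
    for xt in x:
        h = np.tanh(Wx @ xt + Wh @ h)
        out.append(h)
    return np.stack(out)

def bidirectional(x, Wx, Wh):
    """Concatenate forward and (re-reversed) backward hidden states."""
    fwd = rnn_pass(x, Wx, Wh)
    bwd = rnn_pass(x[::-1], Wx, Wh)[::-1]
    return np.concatenate([fwd, bwd], axis=1)

rng = np.random.default_rng(1)
x = rng.normal(size=(20, 4))        # 20 depth samples, 4 log curves
Wx = rng.normal(size=(8, 4)) * 0.1  # 8 hidden units
Wh = rng.normal(size=(8, 8)) * 0.1
features = bidirectional(x, Wx, Wh)
```

Each depth sample thus gets features informed by logs both above and below it, which is the advantage of the bidirectional pass for this estimation task.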


Sensors ◽  
2019 ◽  
Vol 19 (3) ◽  
pp. 597 ◽  
Author(s):  
Joshua Dickey ◽  
Brett Borghetti ◽  
William Junek

The detection of seismic events at regional and teleseismic distances is critical to Nuclear Treaty Monitoring. Traditionally, detecting regional and teleseismic events has required the use of an expensive multi-instrument seismic array; however, in this work, we present DeepPick, a novel seismic detection algorithm capable of array-like detection performance from a single trace. We achieve this performance through three novel steps: First, a high-fidelity dataset is constructed by pairing array-beam catalog arrival-times with single-trace waveforms from the reference instrument of the array. Second, an idealized characteristic function is created, with exponential peaks aligned to the cataloged arrival times. Third, a deep temporal convolutional neural network is employed to learn the complex non-linear filters required to transform the single-trace waveforms into corresponding idealized characteristic functions. The training data consists of all arrivals in the International Seismological Centre Database for seven seismic arrays over a five-year window from 1 January 2010 to 1 January 2015, yielding a total training set of 608,362 detections. The test set consists of the same seven arrays over a one-year window from 1 January 2015 to 1 January 2016. We report our results by training the algorithm on six of the arrays and testing it on the seventh, so as to demonstrate the generalization and transportability of the technique to new stations. Detection performance against this test set is outstanding, yielding significant improvements in recall over existing techniques. Fixing a type-I error rate of 0.001, the algorithm achieves an overall recall (true positive rate) of 56% against the 141,095 array-beam arrivals in the test set, yielding 78,802 correct detections.
This is more than twice the 37,572 detections made by an STA/LTA detector over the same period, and represents a 35% improvement over the 58,515 detections made by a state-of-the-art kurtosis-based detector. Furthermore, DeepPick provides at least a 4 dB improvement in detector sensitivity across the board, and is more computationally efficient, with run-times an order of magnitude faster than either of the other techniques tested. These results demonstrate the potential of our algorithm to significantly enhance the effectiveness of the global treaty monitoring network.
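The idealized characteristic function of the second step can be sketched directly: exponential peaks aligned to the cataloged arrival samples. The exact peak shape, decay rate, and the max-combination of overlapping peaks are assumptions for illustration.

```python
import numpy as np

def characteristic_function(n_samples, arrivals, decay=0.05):
    """Target trace with an exponential peak (max value 1.0) at each
    cataloged arrival sample index."""
    t = np.arange(n_samples)
    cf = np.zeros(n_samples)
    for a in arrivals:
        cf = np.maximum(cf, np.exp(-decay * np.abs(t - a)))
    return cf

cf = characteristic_function(1000, arrivals=[200, 700])
```

The network then only has to regress each waveform onto such a target, turning phase picking into a supervised sequence-to-sequence problem.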


2019 ◽  
Vol 2019 ◽  
pp. 1-8 ◽  
Author(s):  
Xiaodan Liang ◽  
Zhaodi Ge ◽  
Liling Sun ◽  
Maowei He ◽  
Hanning Chen

For profit maximization, model-based stock price prediction can give valuable guidance to investors. However, because of the high noise in financial data, deep neural networks trained on the raw data often fail to predict the stock price accurately. To address this problem, the wavelet threshold-denoising method, which has been widely applied in signal denoising, is adopted to preprocess the training data. Data preprocessing with the soft/hard threshold method noticeably suppresses noise, and a new multioptimal combination wavelet transform (MOCWT) method is proposed. In this method, a novel threshold-denoising function is presented to reduce the degree of distortion in signal reconstruction. The experimental results showed that the proposed MOCWT outperforms the traditional methods in terms of prediction accuracy.
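The soft and hard threshold operators referred to above are standard pointwise functions and can be written out directly; a full denoising pipeline would apply them to the detail coefficients of a wavelet decomposition (e.g. via PyWavelets) before reconstructing the signal, which this sketch omits.

```python
import numpy as np

def soft_threshold(c, t):
    """Shrink coefficients toward zero by t; zero those below t in magnitude."""
    return np.sign(c) * np.maximum(np.abs(c) - t, 0.0)

def hard_threshold(c, t):
    """Keep coefficients above t in magnitude unchanged; zero the rest."""
    return np.where(np.abs(c) > t, c, 0.0)

coeffs = np.array([-3.0, -0.5, 0.2, 2.0])
```

Soft thresholding yields a continuous (less distorted) reconstruction but biases large coefficients; hard thresholding keeps them exact but introduces discontinuities, the trade-off that motivates the paper's combined threshold-denoising function.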


Information ◽  
2020 ◽  
Vol 11 (2) ◽  
pp. 63 ◽  
Author(s):  
Benjamin Guedj ◽  
Bhargav Srinivasa Desikan

We propose a new supervised learning algorithm for classification and regression problems where two or more preliminary predictors are available. We introduce KernelCobra, a non-linear learning strategy for combining an arbitrary number of initial predictors. KernelCobra builds on the COBRA algorithm introduced by Biau et al. (2016), which combined estimators based on a notion of proximity of predictions on the training data. While the COBRA algorithm used a binary threshold to declare which training data were close and to be used, we generalise this idea by using a kernel to better encapsulate the proximity information. Such a smoothing kernel provides more representative weights to each of the training points which are used to build the aggregate and final predictor, and KernelCobra systematically outperforms the COBRA algorithm. While COBRA is intended for regression, KernelCobra deals with classification and regression. KernelCobra is included as part of the open source Python package Pycobra (0.2.4 and onward), introduced by Srinivasa Desikan (2018). Numerical experiments were undertaken to assess the performance (in terms of pure prediction and computational complexity) of KernelCobra on real-life and synthetic datasets.
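The kernelised aggregation idea can be sketched as follows. This is a simplified regression-only illustration of the principle, not the Pycobra implementation: each training point is weighted by a Gaussian kernel of the distance between its vector of initial predictions and the query's, replacing COBRA's binary "close enough" threshold. The bandwidth and data are invented.

```python
import numpy as np

def kernel_aggregate(train_preds, train_y, query_preds, bandwidth=1.0):
    """train_preds: (n, m) predictions of m initial predictors on n training
    points; query_preds: (m,) their predictions on the query point."""
    d2 = ((train_preds - query_preds) ** 2).sum(axis=1)
    w = np.exp(-d2 / (2.0 * bandwidth ** 2))   # smooth proximity weights
    return float((w * train_y).sum() / w.sum())

train_preds = np.array([[1.0, 1.2],    # two initial predictors agree: ~1
                        [5.0, 4.8]])   # and here: ~5
train_y = np.array([1.0, 5.0])
estimate = kernel_aggregate(train_preds, train_y, np.array([1.1, 1.1]))
```

Because the weights decay smoothly rather than dropping to zero at a threshold, every training point contributes in proportion to how similarly the initial predictors treat it, which is the source of the claimed improvement over COBRA.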


Author(s):  
Tao Gui ◽  
Qi Zhang ◽  
Lujun Zhao ◽  
Yaosong Lin ◽  
Minlong Peng ◽  
...  

In recent years, long short-term memory (LSTM) has been successfully used to model sequential data of variable length. However, LSTM can still experience difficulty in capturing long-term dependencies. In this work, we tried to alleviate this problem by introducing a dynamic skip connection, which can learn to directly connect two dependent words. Since there is no dependency information in the training data, we propose a novel reinforcement learning-based method to model the dependency relationship and connect dependent words. The proposed model computes the recurrent transition functions based on the skip connections, which provides a dynamic skipping advantage over RNNs that always tackle entire sentences sequentially. Our experimental results on three natural language processing tasks demonstrate that the proposed method can achieve better performance than existing methods. In the number prediction experiment, the proposed model outperformed LSTM with respect to accuracy by nearly 20%.
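The skip-connection mechanism can be sketched with a plain tanh recurrence (a real model would use an LSTM cell). In the paper the skip distance is chosen by a learned reinforcement-learning policy; in this illustration it is simply supplied as input, and all names and shapes are assumptions.

```python
import numpy as np

def skip_rnn(x, Wx, Wh, skips):
    """At step t, the recurrent connection reaches back skips[t] states
    instead of always using the immediately preceding one."""
    hs = [np.zeros(Wh.shape[0])]           # hs[0] is the initial state
    for t, s in enumerate(skips):
        h_prev = hs[max(0, len(hs) - s)]   # dynamic skip connection
        hs.append(np.tanh(Wx @ x[t] + Wh @ h_prev))
    return np.stack(hs[1:])

rng = np.random.default_rng(2)
x = rng.normal(size=(6, 3))
Wx = rng.normal(size=(5, 3)) * 0.1
Wh = rng.normal(size=(5, 5)) * 0.1
states = skip_rnn(x, Wx, Wh, skips=[1, 1, 2, 1, 3, 1])
```

Skipping intermediate states shortens the gradient path between distant dependent words, which is why the mechanism helps with long-term dependencies.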

