Identification of Weakly Pitch-Shifted Voice Based on Convolutional Neural Network

International Journal of Digital Multimedia Broadcasting ◽

10.1155/2020/8927031 ◽

2020 ◽

Vol 2020 ◽

pp. 1-10

Author(s):

Yongchao Ye ◽

Lingjie Lao ◽

Diqun Yan ◽

Rangding Wang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Network Topology ◽

Activation Function ◽

Detection Methods ◽

Detection Rates ◽

Feature Map ◽

Dynamic Coefficients ◽

Input Feature ◽

High Detection

Pitch shifting is a common voice editing technique in which the original pitch of a digital voice is raised or lowered. It is likely to be abused by the malicious attacker to conceal his/her true identity. Existing forensic detection methods are no longer effective for weakly pitch-shifted voice. In this paper, we proposed a convolutional neural network (CNN) to detect not only strongly pitch-shifted voice but also weakly pitch-shifted voice of which the shifting factor is less than ±4 semitones. Specifically, linear frequency cepstral coefficients (LFCC) computed from power spectrums are considered and their dynamic coefficients are extracted as the discriminative features. And the CNN model is carefully designed with particular attention to the input feature map, the activation function and the network topology. We evaluated the algorithm on voices from two datasets with three pitch shifting software. Extensive results show that the algorithm achieves high detection rates for both binary and multiple classifications.

Download Full-text

Sparse convolutional neural network acceleration with lossless input feature map compression for resource‐constrained systems

IET Computers & Digital Techniques ◽

10.1049/cdt2.12038 ◽

2021 ◽

Author(s):

Jisu Kwon ◽

Joonho Kong ◽

Arslan Munir

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Constrained Systems ◽

Resource Constrained ◽

Feature Map ◽

Input Feature

Download Full-text

Research on Activation Function in Deep Convolutional Neural Network

Proceedings of the 2020 Conference on Artificial Intelligence and Healthcare ◽

10.1145/3433996.3434001 ◽

2020 ◽

Author(s):

Hong Hua Xiu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Activation Function ◽

Deep Convolutional Neural Network

Download Full-text

Object recognition algorithm based on optimized nonlinear activation function-global convolutional neural network

The Visual Computer ◽

10.1007/s00371-020-02033-x ◽

2021 ◽

Author(s):

Feng-Ping An ◽

Jun-e Liu ◽

Lei Bai

Keyword(s):

Neural Network ◽

Object Recognition ◽

Convolutional Neural Network ◽

Activation Function ◽

Recognition Algorithm ◽

Nonlinear Activation Function

Download Full-text

A Convolutional Neural Network based Model with Improved Activation Function and Optimizer for Effective Intrusion Detection and Classification

2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE) ◽

10.1109/icacite51222.2021.9404584 ◽

2021 ◽

Author(s):

Solaiman Kabir ◽

Sadman Sakib ◽

Md. Akib Hossain ◽

Safi Islam ◽

Muhammad Iqbal Hossain

Keyword(s):

Neural Network ◽

Intrusion Detection ◽

Convolutional Neural Network ◽

Activation Function

Download Full-text

Automatic Handgun Detection with Deep Learning in Video Surveillance Images

Applied Sciences ◽

10.3390/app11136085 ◽

2021 ◽

Vol 11 (13) ◽

pp. 6085

Author(s):

Jesus Salido ◽

Vanesa Lomas ◽

Jesus Ruiz-Santaquiteria ◽

Oscar Deniz

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Video Surveillance ◽

Automatic Detection ◽

Public Spaces ◽

Detection Methods ◽

Training Dataset ◽

Average Precision ◽

Terrorist Acts

There is a great need to implement preventive mechanisms against shootings and terrorist acts in public spaces with a large influx of people. While surveillance cameras have become common, the need for monitoring 24/7 and real-time response requires automatic detection methods. This paper presents a study based on three convolutional neural network (CNN) models applied to the automatic detection of handguns in video surveillance images. It aims to investigate the reduction of false positives by including pose information associated with the way the handguns are held in the images belonging to the training dataset. The results highlighted the best average precision (96.36%) and recall (97.23%) obtained by RetinaNet fine-tuned with the unfrozen ResNet-50 backbone and the best precision (96.23%) and F1 score values (93.36%) obtained by YOLOv3 when it was trained on the dataset including pose information. This last architecture was the only one that showed a consistent improvement—around 2%—when pose information was expressly considered during training.

Download Full-text

A Single Target Grasp Detection Network Based on Convolutional Neural Network

Computational Intelligence and Neuroscience ◽

10.1155/2021/5512728 ◽

2021 ◽

Vol 2021 ◽

pp. 1-12

Author(s):

Longzhi Zhang ◽

Dongmei Wu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Transfer Learning ◽

High Accuracy ◽

Experimental Results ◽

Detection Accuracy ◽

Single Object ◽

Single Target ◽

High Detection ◽

Good Detection

Grasp detection based on convolutional neural network has gained some achievements. However, overfitting of multilayer convolutional neural network still exists and leads to poor detection precision. To acquire high detection accuracy, a single target grasp detection network that generalizes the fitting of angle and position, based on the convolution neural network, is put forward here. The proposed network regards the image as input and grasping parameters including angle and position as output, with the detection manner of end-to-end. Particularly, preprocessing dataset is to achieve the full coverage to input of model and transfer learning is to avoid overfitting of network. Importantly, a series of experimental results indicate that, for single object grasping, our network has good detection results and high accuracy, which proves that the proposed network has strong generalization in direction and category.

Download Full-text

Yamatani Activation: Edge Homogeneous Response Super Resolution Neural Network

10.36227/techrxiv.11861187.v1 ◽

2020 ◽

Author(s):

Takuma Yoshimura

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Dynamic Range ◽

Super Resolution ◽

Activation Function

In this research, I propose a two-variable activation function "Yamatani" that satisfies the first-degree homogeneity, and realize a super-resolution convolutional neural network that is independent of the dynamic range and symmetrical about the luminance inversion.

Download Full-text

An Advanced Relevance Feedback Method to Improve Performance of CBIR using Convolutional Neural Network and Comprehensive Values

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.b2741.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 5427-5438

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Image Retrieval ◽

Convolutional Neural Network ◽

Large Scale ◽

Activation Function ◽

Image Feature ◽

Similarity Measurement ◽

Query Image ◽

Image Production

Content-Based Image Retrieval (CBIR) is extensively used technique for image retrieval from large image databases. However, users are not satisfied with the conventional image retrieval techniques. In addition, the advent of web development and transmission networks, the number of images available to users continues to increase. Therefore, a permanent and considerable digital image production in many areas takes place. Quick access to the similar images of a given query image from this extensive collection of images pose great challenges and require proficient techniques. From query by image to retrieval of relevant images, CBIR has key phases such as feature extraction, similarity measurement, and retrieval of relevant images. However, extracting the features of the images is one of the important steps. Recently Convolutional Neural Network (CNN) shows good results in the field of computer vision due to the ability of feature extraction from the images. Alex Net is a classical Deep CNN for image feature extraction. We have modified the Alex Net Architecture with a few changes and proposed a novel framework to improve its ability for feature extraction and for similarity measurement. The proposal approach optimizes Alex Net in the aspect of pooling layer. In particular, average pooling is replaced by max-avg pooling and the non-linear activation function Maxout is used after every Convolution layer for better feature extraction. This paper introduces CNN for features extraction from images in CBIR system and also presents Euclidean distance along with the Comprehensive Values for better results. The proposed framework goes beyond image retrieval, including the large-scale database. The performance of the proposed work is evaluated using precision. The proposed work show better results than existing works.

Download Full-text

A Modified Activation Function for Deep Convolutional Neural Network and Its Application to Condition Monitoring

Proceedings of IncoME-V & CEPE Net-2020 - Mechanisms and Machine Science ◽

10.1007/978-3-030-75793-9_83 ◽

2021 ◽

pp. 895-909

Author(s):

Ibrahim Alqatawneh ◽

Khalid Rabeyee ◽

Chao Zhang ◽

Guojin Feng ◽

Fengshou Gu ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Condition Monitoring ◽

Activation Function ◽

Deep Convolutional Neural Network

Download Full-text

Security of E-Health Systems Using Face Recognition Based on Convolutional Neural Network

International Journal of Extreme Automation and Connectivity in Healthcare ◽

10.4018/ijeach.2020070104 ◽

2020 ◽

Vol 2 (2) ◽

pp. 37-41

Author(s):

Zhixian Chen ◽

Jialin Tang ◽

Xueyuan Gong ◽

Qinglang Su

Keyword(s):

Neural Network ◽

Face Recognition ◽

Convolutional Neural Network ◽

Principal Component ◽

Activation Function ◽

Support Vector ◽

Data Set ◽

Novel Approach ◽

Rectified Linear Unit ◽

The Face

In order to improve the low accuracy of the face recognition methods in the case of e-health, this paper proposed a novel face recognition approach, which is based on convolutional neural network (CNN). In detail, through resolving the convolutional kernel, rectified linear unit (ReLU) activation function, dropout, and batch normalization, this novel approach reduces the number of parameters of the CNN model, improves the non-linearity of the CNN model, and alleviates overfitting of the CNN model. In these ways, the accuracy of face recognition is increased. In the experiments, the proposed approach is compared with principal component analysis (PCA) and support vector machine (SVM) on ORL, Cohn-Kanade, and extended Yale-B face recognition data set, and it proves that this approach is promising.

Download Full-text