Driver Drowsiness Estimation Based on Factorized Bilinear Feature Fusion and a Long-Short-Term Recurrent Convolutional Network

Information ◽  
2020 ◽  
Vol 12 (1) ◽  
pp. 3
Author(s):  
Shuang Chen ◽  
Zengcai Wang ◽  
Wenxin Chen

The effective detection of driver drowsiness is an important measure for preventing traffic accidents. Most existing drowsiness detection methods use only a single facial feature to identify fatigue status, ignoring both the complex correlations among fatigue features and their temporal dynamics, which reduces recognition accuracy. To solve these problems, we propose a driver sleepiness estimation model based on factorized bilinear feature fusion and a long short-term recurrent convolutional network to detect driver drowsiness efficiently and accurately. The proposed framework comprises three modules: fatigue feature extraction, fatigue feature fusion, and driver drowsiness detection. First, we use a convolutional neural network (CNN) to extract deep representations of eye- and mouth-related fatigue features from the face region detected in each video frame. Then, based on the factorized bilinear feature fusion model, we perform a nonlinear fusion of the deep feature representations of the eyes and mouth. Finally, we feed the sequence of fused frame-level features into a long short-term memory (LSTM) unit to capture their temporal information and use a softmax classifier to detect sleepiness. The proposed framework was evaluated on the National Tsing Hua University drowsy driver detection (NTHU-DDD) video dataset. The experimental results show that this method achieves better stability and robustness than competing methods.
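The factorized bilinear fusion step can be sketched as follows. This is a minimal NumPy illustration of factorized bilinear pooling over two modality vectors (eye and mouth features), not the authors' implementation; the projection matrices `U` and `V`, the factor count `k`, and the signed square-root normalization are assumptions drawn from common factorized bilinear pooling practice.

```python
import numpy as np

def factorized_bilinear_fusion(x, y, U, V, out_dim, k):
    """Fuse two feature vectors with low-rank factorized bilinear pooling."""
    # Project both modalities into a shared (out_dim * k)-dim space.
    px = U.T @ x
    py = V.T @ y
    # Element-wise (Hadamard) product captures second-order interactions.
    joint = px * py
    # Sum-pool every k consecutive entries -> out_dim fused features.
    z = joint.reshape(out_dim, k).sum(axis=1)
    # Signed square-root + L2 normalization, as commonly paired with this pooling.
    z = np.sign(z) * np.sqrt(np.abs(z))
    return z / (np.linalg.norm(z) + 1e-12)
```

In the full pipeline, a fused vector like `z` would be produced per frame and the frame sequence fed to the LSTM.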

2020 ◽  
Vol 18 (S3) ◽  
pp. 34-45
Author(s):  
Zhingtang Zhao ◽  
Qingtao Wu

In intelligent computer-aided video abnormal behavior recognition, pedestrian behavior analysis can detect and handle abnormal behaviors in time, which has great practical value for ensuring public safety. We analyze a deep learning video behavior recognition network that performs well in current research. The network first sparsely samples the input video to obtain one frame from each video segment, then uses a two-dimensional convolutional network to extract features from each frame, and finally uses a three-dimensional network to fuse them. This design recognizes both long-term and short-term actions in a video simultaneously. To overcome the heavy computational cost of the 3D convolution part of the network, this paper proposes an improved, mobile 3D convolution structure for this module. To address the low utilization of long-term motion features in video sequences, this paper constructs a deep residual module by introducing long short-term memory networks and residual connections, so as to fully exploit the long-term dynamic features in video sequences. To address the large intra-class variation and small inter-class differences in abnormal behavior videos, this paper proposes a 2CSoftmax function based on a double center loss to optimize the network model, which maximizes inter-class distances while minimizing intra-class distances, thereby enabling the classification of similar actions and improving recognition accuracy.


2021 ◽  
pp. 1-10
Author(s):  
Xiaojun Chen ◽  
Shengbin Jia ◽  
Ling Ding ◽  
Yang Xiang

Knowledge graph reasoning, or completion, aims to infer missing facts by reasoning over the information already present in the knowledge graph. In this work, we explore temporal knowledge graph reasoning, which performs inference on the graph over time. Most existing reasoning models ignore time information when learning entity and relation representations. For example, the fact (Scarlett Johansson, spouseOf, Ryan Reynolds) was true only during 2008-2011. To facilitate temporal reasoning, we present TA-TransRILP, which incorporates temporal information via RNNs and takes advantage of Integer Linear Programming. Specifically, we utilize a character-level long short-term memory network to encode relations together with sequences of temporal tokens, and combine it with a conventional reasoning model. To achieve more accurate reasoning, we further impose temporal consistency constraints on the basic model, which help assess the validity of a fact. We conduct entity prediction and relation prediction on the YAGO11k and Wikidata12k datasets. Experimental results demonstrate that TA-TransRILP makes more accurate predictions by taking time information and temporal consistency constraints into account, outperforming existing methods by a significant margin of about 6-8% on Hits@10.
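The temporal-token encoding can be sketched in plain Python: a relation is expanded into a sequence that appends the digits of the timestamp as typed tokens, which a character-level LSTM can then consume. The token suffixes (`y` for year digits, `m` for month digits) are a hypothetical scheme for illustration; the abstract does not specify the vocabulary.

```python
def temporal_tokens(relation, time_str):
    """Expand a relation and a timestamp like '2008' or '2014-07' into a
    token sequence suitable for a character-level sequence encoder."""
    tokens = [relation]
    year, _, month = time_str.partition("-")
    # Each digit becomes its own token, tagged with its temporal role.
    tokens += [d + "y" for d in year]
    if month:
        tokens += [d + "m" for d in month]
    return tokens
```

Embedding this token sequence and running it through an LSTM yields a time-aware relation representation that a translation-based scoring function can use.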


Sensors ◽  
2020 ◽  
Vol 20 (18) ◽  
pp. 5037
Author(s):  
Hisham ElMoaqet ◽  
Mohammad Eid ◽  
Martin Glos ◽  
Mutaz Ryalat ◽  
Thomas Penzel

Sleep apnea is a common sleep disorder that causes repeated breathing interruptions during sleep. The performance of automated apnea detection methods based on respiratory signals depends on the signals considered and the feature extraction methods. Moreover, feature engineering techniques are highly dependent on experts' experience and prior knowledge about the different physiological signals and conditions of the subjects. To overcome these problems, a novel deep recurrent neural network (RNN) framework is developed for automated feature extraction and detection of apnea events from single respiratory channel inputs. Long short-term memory (LSTM) and bidirectional long short-term memory (BiLSTM) networks are investigated to develop the proposed deep RNN model. The proposed framework is evaluated on three respiration signals: oronasal thermal airflow (FlowTh), nasal pressure (NPRE), and abdominal respiratory inductance plethysmography (ABD). To demonstrate our results, we use polysomnography (PSG) data of 17 patients with obstructive, central, and mixed apnea events. Our results indicate the effectiveness of the proposed framework in the automatic extraction of temporal features and the automated detection of apneic events across the different respiratory signals considered in this study. Using a deep BiLSTM-based detection model, the NPRE signal achieved the highest overall detection results, with a true positive rate (sensitivity) of 90.3%, a true negative rate (specificity) of 83.7%, and an area under the receiver operating characteristic curve of 92.4%. These results contribute a new deep learning approach for automated detection of sleep apnea events from single-channel respiration signals that can potentially serve as a helpful alternative to the traditional PSG method.
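Before an LSTM/BiLSTM detector can score apnea events, the single-channel respiration signal is typically sliced into fixed-length, overlapping epochs. A minimal NumPy sketch of this preprocessing step follows; the epoch length and overlap are hypothetical, as the abstract does not state the windowing used.

```python
import numpy as np

def segment_epochs(signal, fs, epoch_sec=30, overlap=0.5):
    """Slice a 1-D respiration signal (sampled at fs Hz) into overlapping
    fixed-length epochs, stacked as rows of a 2-D array."""
    win = int(fs * epoch_sec)            # samples per epoch
    step = int(win * (1 - overlap))      # hop between epoch starts
    n = (len(signal) - win) // step + 1  # number of full epochs
    return np.stack([signal[i * step : i * step + win] for i in range(n)])
```

Each row would then be fed (optionally after normalization) as one input sequence to the recurrent detector.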


Symmetry ◽  
2022 ◽  
Vol 14 (1) ◽  
pp. 89
Author(s):  
Yang Gao ◽  
Yawu Zhao ◽  
Yuming Ma ◽  
Yihui Liu

Protein secondary structure prediction is an important topic in bioinformatics. This paper proposes a novel model named WS-BiLSTM, which, for the first time, combines a wavelet scattering convolutional network with a long short-term memory network to predict protein secondary structure. The model captures nonlocal interactions between amino acid sequences and remembers long-range interactions between amino acids. In our WS-BiLSTM model, the wavelet scattering convolutional network extracts protein features from a PSSM sliding window; the extracted features are combined with the original PSSM data as the input features of the long short-term memory network to predict protein secondary structure. It is worth noting that, as a member of the continuous wavelet family, the wavelet scattering convolutional network is asymmetric. The Q3 accuracy on the test sets CASP9, CASP10, CASP11, CASP12, CB513, and PDB25 reached 85.26%, 85.84%, 84.91%, 85.13%, 86.10%, and 85.52%, which is 2.15%, 2.16%, 3.5%, 3.19%, 4.22%, and 2.75% higher, respectively, than using the long short-term memory network alone. Comparing our results with state-of-the-art methods shows that the proposed model achieves better results on the CB513 and CASP12 data sets. The experimental results show that the features extracted by the wavelet scattering convolutional network can effectively improve the accuracy of protein secondary structure prediction.
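The Q3 metric reported above is simply per-residue accuracy over the three secondary-structure states (helix H, strand E, coil C). A minimal sketch:

```python
def q3_accuracy(pred, true):
    """Q3: fraction of residues whose 3-state label (H/E/C) is predicted correctly."""
    assert len(pred) == len(true), "sequences must be the same length"
    return sum(p == t for p, t in zip(pred, true)) / len(true)
```

For example, a prediction that gets 3 of 4 residues right scores Q3 = 0.75.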


2019 ◽  
Vol 11 (2) ◽  
pp. 42 ◽  
Author(s):  
Sheeraz Arif ◽  
Jing Wang ◽  
Tehseen Ul Hassan ◽  
Zesong Fei

Human activity recognition is an active field of research in computer vision with numerous applications. Recently, deep convolutional networks and recurrent neural networks (RNNs) have received increasing attention in multimedia studies and have yielded state-of-the-art results. In this work, we propose a new framework that intelligently combines 3D-CNN and LSTM networks. First, we integrate the discriminative information from a video into a map called a 'motion map' using a deep 3-dimensional convolutional network (C3D). A motion map and the next video frame can be integrated into a new motion map, and this technique can be trained by iteratively increasing the training video length; the final network can then generate the motion map of a whole video. Next, a linear weighted fusion scheme fuses the network feature maps into spatio-temporal features. Finally, we use a long short-term memory (LSTM) encoder-decoder for the final predictions. This method is simple to implement and retains discriminative and dynamic information. Improved results on public benchmark datasets demonstrate the effectiveness and practicability of the proposed method.
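The linear weighted fusion step can be sketched as a convex combination of per-stream feature maps; the fixed weights below stand in for whatever learned weights the method actually uses.

```python
import numpy as np

def linear_weighted_fusion(maps, weights):
    """Fuse same-shaped feature maps into one spatio-temporal feature
    via a convex combination (weights are normalized to sum to 1)."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    return sum(wi * m for wi, m in zip(w, maps))
```

The fused map would then be flattened or pooled before entering the LSTM encoder-decoder.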

