Improving Action Recognition via Temporal and Complementary Learning

2021 ◽  
Vol 12 (3) ◽  
pp. 1-24
Author(s):  
Nour Eldin Elmadany ◽  
Yifeng He ◽  
Ling Guan

In this article, we study the problem of video-based action recognition. We improve the action recognition performance by finding an effective temporal and appearance representation. For capturing the temporal representation, we introduce two temporal learning techniques for improving long-term temporal information modeling, specifically Temporal Relational Network and Temporal Second-Order Pooling-based Network. Moreover, we harness the representation using complementary learning techniques, specifically Global-Local Network and Fuse-Inception Network. Performance evaluation on three datasets (UCF101, HMDB-51, and Mini-Kinetics-200) demonstrated the superiority of the proposed framework compared to the 2D Deep ConvNets-based state-of-the-art techniques.

2021 ◽  
Vol 9 (3A) ◽  
Author(s):  
Sheeraz Arif ◽  
◽  
Jing Wang ◽  
Adnan Ahmed Siddiqui ◽  
Rashid Hussain ◽  
...  

Deep convolutional neural network (DCNN) and recurrent neural network (RNN) have been proved as an imperious research area in multimedia understanding and obtained remarkable action recognition performance. However, videos contain rich motion information with varying dimensions. Existing recurrent based pipelines fail to capture long-term motion dynamics in videos with various motion scales and complex actions performed by multiple actors. Consideration of contextual and salient features is more important than mapping a video frame into a static video representation. This research work provides a novel pipeline by analyzing and processing the video information using a 3D convolution (C3D) network and newly introduced deep bidirectional LSTM. Like popular two-stream convent, we also introduce a two-stream framework with one modification; that is, we replace the optical flow stream by saliency-aware stream to avoid the computational complexity. First, we generate a saliency-aware video stream by applying the saliency-aware method. Secondly, a two-stream 3D-convolutional network (C3D) is utilized with two different types of streams, i.e., RGB stream and saliency-aware video stream, to collect both spatial and semantic temporal features. Next, a deep bidirectional LSTM network is used to learn sequential deep temporal dynamics. Finally, time-series-pooling-layer and softmax-layers classify human activity and behavior. The introduced system can learn long-term temporal dependencies and can predict complex human actions. Experimental results demonstrate the significant improvement in action recognition accuracy on different benchmark datasets.


Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2681
Author(s):  
Kedir Mamo Besher ◽  
Juan Ivan Nieto-Hipolito ◽  
Raymundo Buenrostro-Mariscal ◽  
Mohammed Zamshed Ali

With constantly increasing demand in connected society Internet of Things (IoT) network is frequently becoming congested. IoT sensor devices lose more power while transmitting data through congested IoT networks. Currently, in most scenarios, the distributed IoT devices in use have no effective spectrum based power management, and have no guarantee of a long term battery life while transmitting data through congested IoT networks. This puts user information at risk, which could lead to loss of important information in communication. In this paper, we studied the extra power consumed due to retransmission of IoT data packet and bad communication channel management in a congested IoT network. We propose a spectrum based power management solution that scans channel conditions when needed and utilizes the lowest congested channel for IoT packet routing. It also effectively measured power consumed in idle, connected, paging and synchronization status of a standard IoT device in a congested IoT network. In our proposed solution, a Freescale Freedom Development Board (FREDEVPLA) is used for managing channel related parameters. While supervising the congestion level and coordinating channel allocation at the FREDEVPLA level, our system configures MAC and Physical layer of IoT devices such that it provides the outstanding power utilization based on the operating network in connected mode compared to the basic IoT standard. A model has been set up and tested using freescale launchpads. Test data show that battery life of IoT devices using proposed spectrum based power management increases by at least 30% more than non-spectrum based power management methods embedded within IoT devices itself. Finally, we compared our results with the basic IoT standard, IEEE802.15.4. Furthermore, the proposed system saves lot of memory for IoT devices, improves overall IoT network performance, and above all, decrease the risk of losing data packets in communication. The detail analysis in this paper also opens up multiple avenues for further research in future use of channel scanning by FREDEVPLA board.


2021 ◽  
Vol 13 (2) ◽  
pp. 164
Author(s):  
Chuyao Luo ◽  
Xutao Li ◽  
Yongliang Wen ◽  
Yunming Ye ◽  
Xiaofeng Zhang

The task of precipitation nowcasting is significant in the operational weather forecast. The radar echo map extrapolation plays a vital role in this task. Recently, deep learning techniques such as Convolutional Recurrent Neural Network (ConvRNN) models have been designed to solve the task. These models, albeit performing much better than conventional optical flow based approaches, suffer from a common problem of underestimating the high echo value parts. The drawback is fatal to precipitation nowcasting, as the parts often lead to heavy rains that may cause natural disasters. In this paper, we propose a novel interaction dual attention long short-term memory (IDA-LSTM) model to address the drawback. In the method, an interaction framework is developed for the ConvRNN unit to fully exploit the short-term context information by constructing a serial of coupled convolutions on the input and hidden states. Moreover, a dual attention mechanism on channels and positions is developed to recall the forgotten information in the long term. Comprehensive experiments have been conducted on CIKM AnalytiCup 2017 data sets, and the results show the effectiveness of the IDA-LSTM in addressing the underestimation drawback. The extrapolation performance of IDA-LSTM is superior to that of the state-of-the-art methods.


2021 ◽  
Author(s):  
Sophie de Bruin ◽  
Jannis Hoch ◽  
Nina von Uexkull ◽  
Halvard Buhaug ◽  
Nico Wanders

<p>The socioeconomic impacts of changes in climate-related and hydrology-related factors are increasingly acknowledged to affect the on-set of violent conflict. Full consensus upon the general mechanisms linking these factors with conflict is, however, still limited. The absence of full understanding of the non-linearities between all components and the lack of sufficient data make it therefore hard to address violent conflict risk on the long-term. </p><p>Although it is neither desirable nor feasible to make exact predictions, projections are a viable means to provide insights into potential future conflict risks and uncertainties thereof. Hence, making different projections is a legitimate way to deal with and understand these uncertainties, since the construction of diverse scenarios delivers insights into possible realizations of the future.  </p><p>Through machine learning techniques, we (re)assess the major drivers of conflict for the current situation in Africa, which are then applied to project the regions-at-risk following different scenarios. The model shows to accurately reproduce observed historic patterns leading to a high ROC score of 0.91. We show that socio-economic factors are most dominant when projecting conflicts over the African continent. The projections show that there is an overall reduction in conflict risk as a result of increased economic welfare that offsets the adverse impacts of climate change and hydrologic variables. It must be noted, however, that these projections are based on current relations. In case the relations of drivers and conflict change in the future, the resulting regions-at-risk may change too.   By identifying the most prominent drivers, conflict risk mitigation measures can be tuned more accurately to reduce the direct and indirect consequences of climate change on the population in Africa. As new and improved data becomes available, the model can be updated for more robust projections of conflict risk in Africa under climate change.</p>


2021 ◽  
Author(s):  
Nikos Fazakis ◽  
Elias Dritsas ◽  
Otilia Kocsis ◽  
Nikos Fakotakis ◽  
Konstantinos Moustakas

2021 ◽  
pp. 65-67
Author(s):  
Tatyana Nikolaevna Asalkhanova ◽  
◽  
Andrey Alexandrovich Oskolkov ◽  

The JSC «RZD» Long-term program up to 2025 envisages a digital transformation of railway transport. It dedicates a special attention to information and digital technologies of modeling advanced development of infrastructure for provision of growing traffic, increase of production efficiency, provision of expected result of traffic safety and industry economics in whole. The paper presents a design of the information model for infrastructure control within the framework of introduction of the BIM RZD Automated Control System on the example of organization of transport production of permanent way work.


Sign in / Sign up

Export Citation Format

Share Document