Multiple spatio-temporal scales neural network for contextual visual recognition of human actions

Author(s):  
Minju Jung ◽  
Jungsik Hwang ◽  
Jun Tani
Author(s):  
Lin Han ◽  
Lu Han

With the rapid development of China's market economy, brand image is becoming increasingly important for an enterprise seeking to enhance its market competitiveness and secure a favorable market share. However, the brand image of many established companies gradually loses its appeal as society develops and people's aesthetic expectations rise, which forces these companies to change their corporate brand image to regain the favor of the market. Against this background, this article draws on the concepts of fuzzy theory and, from the perspective of visual identity design, explores the development of an intelligent visual identity system for corporate brand image. The aim is to design a visual identity system that differs from those of competitors, in order to give the enterprise a distinctive brand image and improve its market competitiveness. The article first collects a large amount of information through literature investigation and gives a systematic introduction to fuzzy theory, visual recognition technology and the theoretical concepts related to brand image, which lays the theoretical foundation for the later discussion of applying fuzzy theory to the design of an intelligent brand image visual recognition system. The fuzzy theory algorithm is then described in detail, and a fuzzy neural network is proposed, applied to the design of the system, and evaluated in a design experiment. Finally, the designed recognition system was tested on the specific case of the KFC brand logo and achieved an overall accuracy of 96.08%. Within this, color recognition had the highest accuracy, at 96.62%; a comparison of the output values for the training and test samples shows that the color network converges best; and in a comparison test against a BP neural network, the fuzzy neural network achieves the better recognition result.
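To make the fuzzy-theory component more concrete, the sketch below shows a fuzzification step of the kind that typically feeds a fuzzy neural network: a crisp color-channel value is mapped to degrees of membership in overlapping fuzzy sets. The triangular membership shape, the set names and the breakpoints are all assumptions for illustration; the abstract does not describe the membership functions or network structure actually used.

```python
def triangular_membership(x, left, peak, right):
    """Degree in [0, 1] to which x belongs to a triangular fuzzy set."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


# Hypothetical fuzzy sets over a 0-255 red-channel intensity (not from the article).
red_sets = {
    "low": (-1.0, 0.0, 128.0),
    "medium": (0.0, 128.0, 255.0),
    "high": (128.0, 255.0, 256.0),
}


def fuzzify(value, fuzzy_sets):
    """Map a crisp value to its membership degree in every fuzzy set."""
    return {
        name: triangular_membership(value, *params)
        for name, params in fuzzy_sets.items()
    }


# A fairly strong red value belongs partly to "medium" and partly to "high".
memberships = fuzzify(192.0, red_sets)
```

In a fuzzy neural network these membership degrees, rather than the raw pixel values, would be passed on to the rule and output layers.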


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Brett H. Hokr ◽  
Joel N. Bixler

Abstract: Dynamic, in vivo measurement of the optical properties of biological tissues remains an elusive and critically important problem. Here we develop a technique for inverting a Monte Carlo simulation to extract tissue optical properties from the statistical moments of the spatio-temporal response of the tissue by training a 5-layer fully connected neural network. We demonstrate the accuracy of the method across a very wide parameter space on a single-layer homogeneous tissue model and show that the method is insensitive to the parameter selection of the neural network model itself. Finally, we propose an experimental setup capable of measuring the required information in real time in an in vivo environment and present proof-of-concept experimental results.
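As a rough illustration of the kind of input features the abstract describes, the sketch below computes the first few statistical moments of a sampled temporal response. The function name, the choice of mean, variance and skewness, and the synthetic pulse are assumptions for illustration; the abstract does not state which moments the authors feed to their network.

```python
def temporal_moments(times, intensities):
    """Return (mean, variance, skewness) of a sampled response,
    treating the normalized intensities as a distribution over time."""
    total = sum(intensities)
    if total <= 0:
        raise ValueError("response must have positive total intensity")
    weights = [v / total for v in intensities]
    mean = sum(w * t for w, t in zip(weights, times))
    variance = sum(w * (t - mean) ** 2 for w, t in zip(weights, times))
    if variance == 0:
        return mean, 0.0, 0.0
    third = sum(w * (t - mean) ** 3 for w, t in zip(weights, times))
    skewness = third / variance ** 1.5
    return mean, variance, skewness


# Synthetic example (not measured data): a symmetric pulse centred at t = 2.
times = [0.0, 1.0, 2.0, 3.0, 4.0]
intensities = [0.0, 1.0, 2.0, 1.0, 0.0]
mean, variance, skewness = temporal_moments(times, intensities)
```

A vector of such moments is far smaller than the raw response, which is one plausible reason to use it as the input to a small fully connected network.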


Author(s):  
Sophia Bano ◽  
Francisco Vasconcelos ◽  
Emmanuel Vander Poorten ◽  
Tom Vercauteren ◽  
Sebastien Ourselin ◽  
...  

Abstract
Purpose: Fetoscopic laser photocoagulation is a minimally invasive surgery for the treatment of twin-to-twin transfusion syndrome (TTTS). Using a lens/fibre-optic scope inserted into the amniotic cavity, the abnormal placental vascular anastomoses are identified and ablated to regulate blood flow to both fetuses. A limited field of view, occlusions due to the presence of the fetus, and low visibility make it difficult to identify all vascular anastomoses. Automatic computer-assisted techniques may provide a better understanding of the anatomical structure during surgery for risk-free laser photocoagulation and may help improve mosaics from fetoscopic videos.
Methods: We propose FetNet, a combined convolutional neural network (CNN) and long short-term memory (LSTM) recurrent neural network architecture for the spatio-temporal identification of fetoscopic events. We adapt an existing CNN architecture for spatial feature extraction and integrate it with the LSTM network for end-to-end spatio-temporal inference. We introduce differential learning rates during model training to effectively utilise the pre-trained CNN weights. This may support computer-assisted interventions (CAI) during fetoscopic laser photocoagulation.
Results: We perform a quantitative evaluation of our method using 7 in vivo fetoscopic videos captured from different human TTTS cases. The total duration of these videos was 5551 s (138,780 frames). To test the robustness of the proposed approach, we perform 7-fold cross-validation in which each video in turn is treated as the hold-out (test) set and training is performed on the remaining videos.
Conclusion: FetNet achieved superior performance compared to existing CNN-based methods and provided improved inference because of the spatio-temporal information modelling. Online testing of FetNet on a Tesla V100-DGXS-32GB GPU achieved a frame rate of 114 fps. These results show that our method could potentially provide a real-time solution for CAI and for automating occlusion and photocoagulation identification during fetoscopic procedures.
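The evaluation protocol and the real-time claim can be checked with a few lines of code. The sketch below builds the 7 hold-out folds and works out the frame-rate arithmetic from the figures quoted in the abstract. The video identifiers and the function name are hypothetical, and the source frame rate is derived here from the stated duration and frame count rather than reported by the authors.

```python
def leave_one_video_out(video_ids):
    """Yield (train_ids, test_id) pairs, holding out each video exactly once."""
    for held_out in video_ids:
        train = [v for v in video_ids if v != held_out]
        yield train, held_out


# Hypothetical identifiers for the 7 videos; the abstract does not name them.
videos = [f"video_{i}" for i in range(1, 8)]
folds = list(leave_one_video_out(videos))

# Frame-rate arithmetic from the figures quoted in the abstract.
total_frames = 138_780
total_seconds = 5551
source_fps = total_frames / total_seconds      # about 25 frames per second
inference_fps = 114
realtime_margin = inference_fps / source_fps   # inference speed relative to the video
```

Splitting by whole video, rather than by frame, keeps near-identical consecutive frames from appearing in both the training and test sets. The derived margin of roughly 4.5 times the source frame rate is consistent with the abstract's real-time claim.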

