Deep Learning in Computer Vision: Methods, Interpretation, Causation, and Fairness

Author(s):  
Nikhil Malik ◽  
Param Vir Singh
Author(s):  
Qing Wu ◽  
Yungang Liu ◽  
Qiang Li ◽  
Shaoli Jin ◽  
Fengzhong Li

2021 ◽  
pp. 443-471
Author(s):  
Abu Sufian ◽  
Ekram Alam ◽  
Anirudha Ghosh ◽  
Farhana Sultana ◽  
Debashis De ◽  
...  

2019 ◽  
Author(s):  
Xiaozhe Yao

In recent years, the multimedia community has witnessed an emerging usage of deep learning in computer vision, such as image caption\cite{Vinyals_2015_CVPR}, object detection, instance segmentation and etc, and many companies are adopting deep learning into their commercial products. Developing and Deploying a fast and reliable deep learning system is complex and involves labour-intensive works, like image annotation\cite{Ratner_2017}, data and models versioning and etc. To facilitate such a process, we proposed CVFlow, which is a suite of versatile computer vision libraries and toolkits for handling computer vision tasks, including intelligent annotation toolkit, deep learning package manager, models sharing platform and computer vision serving program.


Author(s):  
Mahmoud Hassaballah ◽  
Ali Ismail Awad

Procedia CIRP ◽  
2021 ◽  
Vol 103 ◽  
pp. 237-242
Author(s):  
Christos Manettas ◽  
Nikolaos Nikolakis ◽  
Kosmas Alexopoulos

2020 ◽  
Vol 53 (5-6) ◽  
pp. 796-806
Author(s):  
Hongchang Li ◽  
Jing Wang ◽  
Jianjun Han ◽  
Jinmin Zhang ◽  
Yushan Yang ◽  
...  

Violent interaction detection is a hot topic in computer vision. However, the recent research works on violent interaction detection mainly focus on the traditional hand-craft features, and does not make full use of the research results of deep learning in computer vision. In this paper, we propose a new robust violent interaction detection framework based on multi-stream deep learning in surveillance scene. The proposed approach enhances the recognition performance of violent action in video by fusing three different streams: attention-based spatial RGB stream, temporal stream, and local spatial stream. The attention-based spatial RGB stream learns the spatial attention regions of persons that have high probability to be action region through soft-attention mechanism. The temporal stream employs optical flow as input to extract temporal features. The local spatial stream learns spatial local features using block images as input. Experimental results demonstrate the effectiveness and reliability of the proposed method on three violent interactive datasets: hockey fights, movies, violent interaction. We also verify the proposed method on our own elevator surveillance video dataset and the performance of the proposed method is satisfied.


Sign in / Sign up

Export Citation Format

Share Document