Blotch Detection in Archive Films Based on Visual Saliency Map

Complexity ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-17
Author(s):  
Yildiz Aydin ◽  
Bekir Dizdaroğlu

Degradations frequently occur in archive films, which embody the historical and cultural heritage of a nation. In this study, the problem of detecting blotches, a defect commonly encountered in archive films, is addressed. A block-based blotch detection method is proposed based on a visual saliency map. The visual saliency map reveals prominent areas in an input frame and thus enables more accurate blotch detection. A simple and effective visual saliency method is adopted in order to reduce the computational complexity of the detection phase. After the visual saliency maps of the given frames are obtained, blotch regions are estimated by considering spatiotemporal patches around the salient pixels, which are subjected to a prethresholding process, without requiring motion estimation. Experimental results show that the proposed block-based blotch detection method significantly reduces false alarm rates compared with the HOG-feature (Yous and Serir, 2017), LBP-feature (Yous and Serir, 2017), and region-matching (Yous and Serir, 2016) methods presented in recent years.
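The abstract does not spell out the patch test, but the overall flow can be sketched as follows, assuming the spectral-residual detector as the "simple and effective" saliency method and a symmetric frame-difference check standing in for the paper's spatiotemporal patch analysis; all thresholds and function names are illustrative:

```python
import numpy as np
import cv2

def spectral_residual_saliency(gray):
    """Simple, low-cost saliency map (Hou & Zhang's spectral residual)."""
    f = np.fft.fft2(gray.astype(np.float64))
    log_amp = np.log1p(np.abs(f))
    # the residual is the log-amplitude minus its local average
    residual = log_amp - cv2.blur(log_amp, (3, 3))
    sal = np.abs(np.fft.ifft2(np.exp(residual + 1j * np.angle(f)))) ** 2
    sal = cv2.GaussianBlur(sal, (11, 11), 2.5)
    return sal / (sal.max() + 1e-12)

def detect_blotches(prev, cur, nxt, sal_thresh=0.4, diff_thresh=25):
    """Flag pixels that are salient in the current frame and differ from
    BOTH temporal neighbours (blotches are temporally impulsive)."""
    candidates = spectral_residual_saliency(cur) > sal_thresh  # prethresholding
    d_prev = np.abs(cur.astype(np.int16) - prev.astype(np.int16)) > diff_thresh
    d_next = np.abs(cur.astype(np.int16) - nxt.astype(np.int16)) > diff_thresh
    return candidates & d_prev & d_next
```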

2020 ◽  
Vol 12 (1) ◽  
pp. 152 ◽  
Author(s):  
Ting Nie ◽  
Xiyu Han ◽  
Bin He ◽  
Xiansheng Li ◽  
Hongxing Liu ◽  
...  

Ship detection in panchromatic optical remote sensing images faces two major challenges: quickly locating candidate regions against complex backgrounds, and describing ships effectively enough to reduce false alarms. Here, a practical method is proposed to address these issues. First, we construct a novel visual saliency detection method based on the hyper-complex Fourier transform of a quaternion to locate regions of interest (ROIs); compared with the phase spectrum of quaternion Fourier transform (PQFT) method, it improves the accuracy of the subsequent discrimination stage for panchromatic images. In addition, Gaussian filtering at several scales is applied to the transformed result to synthesize the best saliency map. An adaptive method based on GrabCut is then used for binary segmentation to extract candidate positions. In the discrimination stage, a rotation-invariant modified local binary pattern (LBP) descriptor combining shape, texture, and moment-invariant features is used to describe the ship targets more powerfully. Finally, false alarms are eliminated through SVM training. Experimental results on panchromatic optical remote sensing images demonstrate that the presented saliency model is superior under various indicators and, based on detailed comparisons with existing work, that the proposed ship detection method is accurate, fast, and highly robust.
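As a rough illustration of the detection stage, the sketch below substitutes a single-band phase-only transform for the quaternion Fourier saliency (a panchromatic image has one band) and Otsu thresholding for the adaptive GrabCut segmentation; every name and parameter here is an assumption, not the paper's implementation:

```python
import numpy as np
import cv2

def phase_saliency(pan, scales=(5, 11, 21)):
    """Phase-only reconstruction as a single-band stand-in for the
    hyper-complex Fourier saliency used in the paper."""
    f = np.fft.fft2(pan.astype(np.float64))
    recon = np.abs(np.fft.ifft2(np.exp(1j * np.angle(f)))) ** 2
    # Gaussian smoothing at several scales; a plain maximum stands in for
    # the paper's "best map" selection
    maps = [cv2.GaussianBlur(recon, (k, k), 0) for k in scales]
    sal = np.maximum.reduce(maps)
    return (sal / (sal.max() + 1e-12) * 255).astype(np.uint8)

def candidate_rois(sal):
    """Binary segmentation of the saliency map into candidate regions;
    Otsu stands in for the adaptive GrabCut initialisation."""
    _, mask = cv2.threshold(sal, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    n, labels, stats, _ = cv2.connectedComponentsWithStats(mask)
    return stats[1:]  # per-component (x, y, w, h, area), background dropped
```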


2011 ◽  
Vol 63-64 ◽  
pp. 350-354 ◽  
Author(s):  
Li Li Lin ◽  
Neng Rong Chen

Background modeling based on the Gaussian mixture model (GMM) is commonly used to detect moving objects against a static background. When the background is dynamic, however, for example because of camera jitter, the false detection rate rises sharply and degrades subsequent tracking. In addition, GMM-based methods cannot effectively remove the shadow regions of moving objects. This paper proposes a moving object detection method based on a GMM and visual saliency maps, which not only removes the disturbance caused by camera jitter but also effectively solves the shadow problem and achieves stable moving object detection.
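A minimal sketch of the combination, assuming OpenCV's MOG2 background subtractor and a precomputed saliency map in [0, 1]; the gating threshold is illustrative:

```python
import cv2
import numpy as np

# MOG2 with shadow detection enabled; shadow pixels are labelled 127
subtractor = cv2.createBackgroundSubtractorMOG2(history=300, detectShadows=True)

def moving_object_mask(frame_bgr, saliency, sal_thresh=0.3):
    """Combine the GMM foreground with a saliency map: jitter-induced false
    foreground is usually non-salient, and MOG2's shadow label (127) is
    dropped by keeping only confident foreground (255)."""
    fg = subtractor.apply(frame_bgr)
    confident_fg = (fg == 255)
    return (confident_fg & (saliency > sal_thresh)).astype(np.uint8) * 255
```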


2013 ◽  
Vol 2013 ◽  
pp. 1-9
Author(s):  
Yuantao Chen ◽  
Weihong Xu ◽  
Fangjun Kuang ◽  
Shangbing Gao

Image segmentation aimed at a high-quality visual saliency map depends heavily on the underlying visual saliency metric. Most existing metrics produce only a coarse saliency map, and a coarse map in turn degrades the segmentation results. This paper presents a randomized visual saliency detection algorithm that quickly generates a detailed saliency map at the same size as the original input image. The method can serve real-time applications such as content-based image scaling. Because the randomized detection of salient video regions requires only a small amount of memory, a detailed, fine-grained visual saliency map can still be obtained; the presented results show that segmenting an image with this saliency map yields near-ideal segmentation results.
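The abstract leaves the randomization unspecified; one plausible reading, sketched below, scores each pixel by its contrast against a small random sample of pixels, which keeps the memory footprint small while producing a full-resolution map. All parameters are illustrative:

```python
import numpy as np

def randomized_saliency(img, n_samples=64, seed=0):
    """Full-resolution saliency from random sampling: each pixel's saliency
    is its mean colour distance to a random subset of pixels, so extra
    memory stays O(n_samples) beyond the output map itself."""
    rng = np.random.default_rng(seed)
    h, w = img.shape[:2]
    flat = img.reshape(h * w, -1).astype(np.float64)
    samples = flat[rng.integers(0, h * w, n_samples)]
    sal = np.zeros(h * w)
    for s in samples:                 # one pass per sample keeps memory low
        sal += np.linalg.norm(flat - s, axis=1)
    sal /= n_samples
    return (sal / sal.max()).reshape(h, w)
```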


Drones ◽  
2020 ◽  
Vol 4 (3) ◽  
pp. 31
Author(s):  
Anne-Flore Perrin ◽  
Lu Zhang ◽  
Olivier Le Meur

Unmanned Aerial Vehicle (UAV) imagery has gained a lot of momentum lately. Indeed, information gathered from a bird's-eye point of view is particularly relevant for numerous applications, from agriculture to surveillance services. We study visual saliency to verify whether there are tangible differences between this imagery and more conventional content. We first characterize typical and UAV content through their human saliency maps in a high-dimensional space encompassing saliency-map statistics, distribution characteristics, and other specifically designed features. Thanks to a large amount of eye-tracking data collected on UAV footage, we highlight the differences between typical and UAV videos and, more importantly, within UAV sequences themselves. We then design a process to extract new visual attention biases from the UAV imagery, leading to the definition of a new dictionary of visual biases. Finally, we conduct a benchmark on two different datasets, whose results confirm that the 20 defined biases are relevant as a low-complexity saliency prediction system.
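As a hint of what describing content by its saliency maps in a high-dimensional space can look like, here is a hedged sketch of a few such statistics; the particular features chosen are assumptions, not the paper's exact descriptor:

```python
import numpy as np

def saliency_descriptor(sal):
    """A few statistics that place a human saliency map in a feature space
    (the selection here is illustrative)."""
    p = sal / (sal.sum() + 1e-12)          # normalise to a distribution
    h, w = p.shape
    ys, xs = np.mgrid[0:h, 0:w]
    cx, cy = (p * xs).sum() / w, (p * ys).sum() / h   # attention centroid
    entropy = -(p * np.log2(p + 1e-12)).sum()          # spread of attention
    center_bias = np.hypot(cx - 0.5, cy - 0.5)         # distance from centre
    return np.array([entropy, center_bias, p.max() * h * w])
```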


Author(s):  
Nana Shan ◽  
Wei Zhou ◽  
Zhemin Duan

Sample Adaptive Offset (SAO) in High Efficiency Video Coding (HEVC) is a technique for improving the quality of reconstructed video. It categorizes pixels and chooses the best mode by adding offsets to the reconstructed video, which dramatically increases computational complexity. Exploiting the dependency between sample adaptive offset and the visual saliency map, an improved SAO method is proposed that minimizes SAO coding time by skipping some rate-distortion (RD) cost calculations. Experimental results show that the proposed method reduces SAO encoding time by 27.02% with negligible performance loss.
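A minimal sketch of the skip decision, assuming a per-CTU saliency test; the CTU structure, threshold, and function name are illustrative, not the paper's implementation:

```python
from dataclasses import dataclass
import numpy as np

@dataclass
class CTU:
    x: int       # top-left corner in luma samples
    y: int
    size: int    # e.g. 64 for a 64x64 coding tree unit

def skip_sao_rd_search(ctu: CTU, saliency: np.ndarray, thresh=0.2) -> bool:
    """Gate the SAO rate-distortion search on saliency: in CTUs whose mean
    saliency is low, offsets are unlikely to be noticed, so the costly RD
    evaluation of band/edge offsets can be skipped."""
    block = saliency[ctu.y:ctu.y + ctu.size, ctu.x:ctu.x + ctu.size]
    return block.mean() < thresh
```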


Author(s):  
Wei Xiong ◽  
Yongli Xu ◽  
Yafei Lv ◽  
Libo Yao

Target detection in synthetic aperture radar (SAR) remote sensing images is a fundamental but challenging problem in satellite image analysis; it plays an important role in a wide range of applications and has received significant attention in recent years. The human visual system, by contrast, detects visual saliency extraordinarily quickly and reliably, yet computationally modeling SAR image scenes remains a challenge. This paper analyzes the defects and shortcomings of traditional visual models when applied to SAR images and then proposes a visual attention model designed specifically for them. The model follows the basic framework of the classical ITTI model but selects and extracts texture and other features that describe SAR images better. We propose a new algorithm for computing the local texture saliency of the input image and construct the corresponding feature saliency maps. Next, a new feature fusion mechanism replaces the linear additive fusion of classical models to obtain the overall saliency map. Finally, taking into account the gray-scale characteristics of the focus of attention (FOA) in each feature's saliency map, the model chooses the best saliency representation; through a multi-scale competition strategy, filtering and threshold segmentation of the saliency maps select the salient regions accurately, completing visual saliency detection in SAR images. Several types of satellite image data, such as TerraSAR-X (TS-X) and Radarsat-2, are used to evaluate the performance of the visual models. The results show that our model outperforms classical visual models: it reduces the false alarms caused by speckle noise, and its detection speed is greatly improved, by 25% to 45%.
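The abstract does not define the texture measure or the fusion rule; the sketch below uses local standard deviation as a stand-in texture saliency and geometric-mean fusion as one example of a non-linear replacement for linear summation:

```python
import cv2
import numpy as np

def local_texture_saliency(sar, win=9):
    """Local standard deviation as a cheap texture-energy measure; a
    stand-in for the paper's (unspecified) texture saliency."""
    img = sar.astype(np.float64)
    mean = cv2.blur(img, (win, win))
    sq_mean = cv2.blur(img * img, (win, win))
    std = np.sqrt(np.maximum(sq_mean - mean * mean, 0.0))
    return std / (std.max() + 1e-12)

def fuse_feature_maps(maps):
    """Multiplicative (geometric-mean) fusion instead of the classical
    linear sum: a region must score on every feature, which suppresses
    speckle responses that fire on only one channel."""
    stacked = np.stack(maps)
    return stacked.prod(axis=0) ** (1.0 / len(maps))
```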


2021 ◽  
Vol 11 (16) ◽  
pp. 7217
Author(s):  
Cristina Luna-Jiménez ◽  
Jorge Cristóbal-Martín ◽  
Ricardo Kleinlein ◽  
Manuel Gil-Martín ◽  
José M. Moya ◽  
...  

Spatial Transformer Networks are a powerful algorithm for learning the main areas of an image, but they could be made more efficient by receiving images with embedded expert knowledge. This paper aims to improve the performance of conventional Spatial Transformers applied to Facial Expression Recognition. Building on the Spatial Transformers' capacity for spatial manipulation within networks, we propose extensions to these models in which effective attentional regions are captured using facial landmarks or facial visual saliency maps. This attentional information is then hardcoded to guide the Spatial Transformers toward the spatial transformations that best fit the proposed regions, yielding better recognition results. For this study, we use two datasets: AffectNet and FER-2013. On AffectNet, we achieve a 0.35-percentage-point absolute improvement over the traditional Spatial Transformer, whereas on FER-2013 our solution achieves a 1.49% increase when the models are fine-tuned with the AffectNet pre-trained weights.
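A hedged PyTorch sketch of the idea: the localisation network of a Spatial Transformer receives the saliency (or landmark) mask as an extra input channel, so the predicted affine transform is steered toward the attended region. Layer sizes and names are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SaliencyGuidedSTN(nn.Module):
    """Spatial Transformer whose localisation net also sees an attention
    mask (facial landmarks or a visual saliency map) as an extra channel."""
    def __init__(self, in_ch=1):
        super().__init__()
        self.loc = nn.Sequential(
            nn.Conv2d(in_ch + 1, 8, kernel_size=7), nn.MaxPool2d(2), nn.ReLU(),
            nn.Conv2d(8, 10, kernel_size=5), nn.MaxPool2d(2), nn.ReLU(),
        )
        self.head = nn.Sequential(nn.Flatten(), nn.LazyLinear(32), nn.ReLU(),
                                  nn.Linear(32, 6))
        # start from the identity transform, as is usual for STNs
        self.head[-1].weight.data.zero_()
        self.head[-1].bias.data.copy_(
            torch.tensor([1, 0, 0, 0, 1, 0], dtype=torch.float))

    def forward(self, x, attention):
        theta = self.head(self.loc(torch.cat([x, attention], dim=1)))
        grid = F.affine_grid(theta.view(-1, 2, 3), x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)
```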


2021 ◽  
Author(s):  
Sai Phani Kumar Malladi ◽  
Jayanta Mukhopadhyay ◽  
Chaker Larabi ◽  
Santanu Chaudhury

Sensors ◽  
2021 ◽  
Vol 21 (4) ◽  
pp. 1280
Author(s):  
Hyeonseok Lee ◽  
Sungchan Kim

Explaining the predictions of deep neural networks makes the networks more understandable and trustworthy, enabling their use in mission-critical tasks. Recent progress in the learning capability of networks stems mainly from their enormous number of parameters, so, unlike classical white-box models, their operation is usually hard to interpret. Generating saliency maps is therefore a popular approach to identifying the input features important to a model's prediction. Existing explanation methods typically use only the output of the model's last convolution layer to generate a saliency map, discarding the information contained in intermediate layers; the resulting explanations are coarse and of limited accuracy. Although accuracy can be improved by iteratively refining the saliency map, doing so is too time-consuming to be practical. To address these problems, we propose a novel approach that explains the model prediction through an attentive surrogate network trained by knowledge distillation. The surrogate network generates a fine-grained saliency map for the model prediction, using meaningful regional information present across all network layers. Experiments demonstrate that the saliency maps result from spatially attentive features learned through distillation and are thus useful for fine-grained classification tasks. Moreover, the proposed method runs at 24.3 frames per second, orders of magnitude faster than existing methods.
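The distillation objective itself is standard; a minimal sketch of the soft-target loss that trains a surrogate (student) to mimic the explained model (teacher) is shown below, with the temperature value illustrative and the paper's attention-specific terms omitted:

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, T=4.0):
    """Soft-target knowledge-distillation loss (Hinton et al.): the student
    matches the softened output distribution of the teacher. The T*T factor
    keeps gradient magnitudes comparable across temperatures."""
    soft_targets = F.softmax(teacher_logits / T, dim=1)
    log_probs = F.log_softmax(student_logits / T, dim=1)
    return F.kl_div(log_probs, soft_targets, reduction="batchmean") * (T * T)
```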


2014 ◽  
Vol 1044-1045 ◽  
pp. 1049-1052 ◽  
Author(s):  
Chin Chen Chang ◽  
I Ta Lee ◽  
Tsung Ta Ke ◽  
Wen Kai Tai

Common methods for reducing image size include scaling and cropping, but both can degrade the quality of the reduced image. In this paper, we propose an image-reduction algorithm that separates the main objects from the background. First, we extract two feature maps from the input image: an enhanced visual saliency map and an improved gradient map. We then integrate these two feature maps into an importance map. Finally, we generate the target image using the importance map. The proposed approach obtains the desired results for a wide range of images.
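A short sketch of the importance-map construction, assuming a spectral-residual map in place of the paper's enhanced saliency map and a Sobel magnitude in place of its improved gradient map; the blending weight is illustrative:

```python
import cv2
import numpy as np

def importance_map(bgr, alpha=0.5):
    """Blend a saliency map and a gradient map into one importance map."""
    gray = cv2.cvtColor(bgr, cv2.COLOR_BGR2GRAY).astype(np.float64)
    # spectral-residual saliency as a stand-in for the enhanced saliency map
    f = np.fft.fft2(gray)
    log_amp = np.log1p(np.abs(f))
    residual = log_amp - cv2.blur(log_amp, (3, 3))
    sal = np.abs(np.fft.ifft2(np.exp(residual + 1j * np.angle(f)))) ** 2
    sal = cv2.GaussianBlur(sal, (11, 11), 2.5)
    sal /= sal.max() + 1e-12
    # Sobel magnitude as a stand-in for the improved gradient map
    grad = np.hypot(cv2.Sobel(gray, cv2.CV_64F, 1, 0),
                    cv2.Sobel(gray, cv2.CV_64F, 0, 1))
    grad /= grad.max() + 1e-12
    return alpha * sal + (1 - alpha) * grad
```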

