Evaluating Color Descriptors for Object and Scene Recognition

Koen E A van de Sande; T Gevers; Cees G M Snoek

doi:10.1109/tpami.2009.154

Evaluating Color Descriptors for Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/tpami.2009.154 ◽

2010 ◽

Vol 32 (9) ◽

pp. 1582-1596 ◽

Cited By ~ 1145

Author(s):

Koen E A van de Sande ◽

T Gevers ◽

Cees G M Snoek

Keyword(s):

Scene Recognition ◽

Color Descriptors

Topological Mapping and Scene Recognition With Lightweight Color Descriptors for an Omnidirectional Camera

IEEE Transactions on Robotics ◽

10.1109/tro.2013.2272250 ◽

2014 ◽

Vol 30 (2) ◽

pp. 310-324 ◽

Cited By ~ 43

Author(s):

Ming Liu ◽

Roland Siegwart

Keyword(s):

Scene Recognition ◽

Omnidirectional Camera ◽

Topological Mapping ◽

Color Descriptors

Evaluation of color descriptors for object and scene recognition

2008 IEEE Conference on Computer Vision and Pattern Recognition ◽

10.1109/cvpr.2008.4587658 ◽

2008 ◽

Cited By ~ 84

Author(s):

Koen E. A. van de Sande ◽

Theo Gevers ◽

Cees G. M. Snoek

Keyword(s):

Scene Recognition ◽

Color Descriptors

Object and Scene Recognition Using Color Descriptors and Adaptive Color KLT

Lecture Notes in Computer Science - Human Interface and the Management of Information. Interacting with Information ◽

10.1007/978-3-642-21669-5_42 ◽

2011 ◽

pp. 355-363

Author(s):

Volkan H. Bagci ◽

Mariofanna Milanova ◽

Roumen Kountchev ◽

Roumiana Kountcheva ◽

Vladimir Todorov

Keyword(s):

Scene Recognition ◽

Color Descriptors

Scene Recognition in Pigeons

PsycEXTRA Dataset ◽

10.1037/e413792005-374 ◽

2000 ◽

Author(s):

Jennifer E. Sutton ◽

William A. Roberts

Keyword(s):

Scene Recognition

Scene Recognition with Infrared, Low-Light, and Sensor-Fused Imagery

10.21236/ada389643 ◽

1999 ◽

Cited By ~ 5

Author(s):

Michael J. Sinai ◽

Jason S. McCarley ◽

William K. Krebs

Keyword(s):

Scene Recognition ◽

Low Light

Towards an Understanding of the Effect of Night Vision Display Imagery on Scene Recognition

The Ergonomics Open Journal ◽

10.2174/1875934300902030150 ◽

2010 ◽

Vol 2 (3) ◽

pp. 150-158

Author(s):

Geoffrey W. Stuart ◽

Philip K. Hughes

Keyword(s):

Scene Recognition ◽

Night Vision

Design of Desktop Audiovisual Entertainment System with Deep Learning and Haptic Sensations

Symmetry ◽

10.3390/sym12101718 ◽

2020 ◽

Vol 12 (10) ◽

pp. 1718

Author(s):

Chien-Hsing Chou ◽

Yu-Sheng Su ◽

Che-Ju Hsu ◽

Kong-Chang Lee ◽

Ping-Hsuan Han

Keyword(s):

Deep Learning ◽

Object Detection ◽

User Experience ◽

Recognition System ◽

Scene Recognition ◽

Single Shot ◽

Auditory Signals ◽

Hot Weather ◽

Viewing Experience ◽

At Home

In this study, we designed a four-dimensional (4D) audiovisual entertainment system called Sense. The system comprises a scene recognition system and hardware modules that provide haptic sensations to users watching movies and animations at home. In the scene recognition system, we used Google Cloud Vision to detect common scene elements in a video, such as fire, explosions, wind, and rain, and to determine whether the scene depicts hot weather, rain, or snow. For animated videos, we additionally applied deep learning with a single shot multibox detector to detect whether the video contained fire-related objects. The hardware module was designed to provide six types of haptic sensations, arranged in line symmetry to provide a better user experience. Based on the object detection results from the scene recognition system, the system generates the corresponding haptic sensations. The system integrates deep learning, auditory signals, and haptic sensations to provide an enhanced viewing experience.

Wireless Channel Scene Recognition Method Based on an Autocorrelation Function and Deep Learning

IEEE Access ◽

10.1109/access.2020.3044167 ◽

2020 ◽

Vol 8 ◽

pp. 226324-226336

Author(s):

Shuguang Ning ◽

Yigang He ◽

Lifen Yuan ◽

Yuan Huang ◽

Shudong Wang ◽

...

Keyword(s):

Deep Learning ◽

Autocorrelation Function ◽

Wireless Channel ◽

Scene Recognition ◽

Recognition Method

Generalized Zero-shot Learning with Multi-source Semantic Embeddings for Scene Recognition

Proceedings of the 28th ACM International Conference on Multimedia ◽

10.1145/3394171.3413568 ◽

2020 ◽

Author(s):

Xinhang Song ◽

Haitao Zeng ◽

Sixian Zhang ◽

Luis Herranz ◽

Shuqiang Jiang

Keyword(s):

Scene Recognition ◽

Generalized Zero

Natural Language Description of Videos for Smart Surveillance

Applied Sciences ◽

10.3390/app11093730 ◽

2021 ◽

Vol 11 (9) ◽

pp. 3730

Author(s):

Aniqa Dilawari ◽

Muhammad Usman Ghani Khan ◽

Yasser D. Al-Otaibi ◽

Zahoor-ur Rehman ◽

Atta-ur Rahman ◽

...

Keyword(s):

Natural Language ◽

Feature Recognition ◽

Scene Recognition ◽

Video Data ◽

Surveillance Video ◽

Video Footage ◽

Parallel Pipeline ◽

September 11 Attacks ◽

Description Framework ◽

High Level

Since the September 11 attacks, security and surveillance measures have changed across the globe. Surveillance cameras are now installed almost everywhere. Though quite handy, these cameras produce a massive volume of video. The major challenge faced by security agencies is analyzing the surveillance video data collected and generated daily. The problems with these videos are twofold: (1) understanding the contents of video streams, and (2) converting the video contents to condensed formats, such as textual interpretations and summaries, to save storage space. In this paper, we propose a video description framework on a surveillance dataset. The framework is based on multitask learning of high-level features (HLFs) using a convolutional neural network (CNN) and natural language generation (NLG) through bidirectional recurrent networks. For each specific task, a parallel pipeline is derived from the base visual geometry group (VGG)-16 model. The tasks include scene recognition, action recognition, object recognition, and human-face-specific feature recognition. Experimental results on the TRECViD, UET Video Surveillance (UETVS), and AGRIINTRUSION datasets show that the model outperforms state-of-the-art methods by a METEOR (Metric for Evaluation of Translation with Explicit ORdering) score of 33.9%, 34.3%, and 31.2%, respectively. Our results show that the framework has distinct advantages over traditional rule-based models for the recognition and generation of natural language descriptions.
