Mining the Semantics of Visual Concepts and Context

Language Label Learning for Visual Concepts Discovered from Video Sequences

Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint - Lecture Notes in Computer Science ◽

10.1007/978-3-540-77343-6_6 ◽

2007 ◽

pp. 91-105

Author(s):

Prithwijit Guha ◽

Amitabha Mukerjee

Keyword(s):

Video Sequences ◽

Visual Concepts

Download Full-text

Discovering meaningful multimedia patterns with audio-visual concepts and associated text

2004 International Conference on Image Processing, 2004. ICIP '04. ◽

10.1109/icip.2004.1421580 ◽

2005 ◽

Cited By ~ 11

Author(s):

L. Xie ◽

L. Kennedy ◽

S.-F. Chang ◽

A. Divakarun ◽

H. Sun ◽

...

Keyword(s):

Visual Concepts

Download Full-text

Harvesting Mid-level Visual Concepts from Large-Scale Internet Images

2013 IEEE Conference on Computer Vision and Pattern Recognition ◽

10.1109/cvpr.2013.115 ◽

2013 ◽

Cited By ~ 86

Author(s):

Quannan Li ◽

Jiajun Wu ◽

Zhuowen Tu

Keyword(s):

Large Scale ◽

Visual Concepts ◽

Internet Images

Download Full-text

Continual Learning of Visual Concepts for Robots through Limited Supervision

Companion of the 2021 ACM/IEEE International Conference on Human-Robot Interaction ◽

10.1145/3434074.3446357 ◽

2021 ◽

Author(s):

Ali Ayub ◽

Alan R. Wagner

Keyword(s):

Visual Concepts ◽

Continual Learning

Download Full-text

Cross-Dataset Learning of Visual Concepts

Adaptive Multimedia Retrieval: Semantics, Context, and Adaptation - Lecture Notes in Computer Science ◽

10.1007/978-3-319-12093-5_4 ◽

2014 ◽

pp. 87-101 ◽

Cited By ~ 1

Author(s):

Christian Hentschel ◽

Harald Sack ◽

Nadine Steinmetz

Keyword(s):

Visual Concepts

Download Full-text

Semantic Analysis of Field Sports Video using a Petri-Net of Audio-Visual Concepts

The Computer Journal ◽

10.1093/comjnl/bxn058 ◽

2008 ◽

Vol 52 (7) ◽

pp. 808-823 ◽

Cited By ~ 3

Author(s):

L. Bai ◽

S. Lao ◽

A. F. Smeaton ◽

N. E. O'Connor ◽

D. Sadlier ◽

...

Keyword(s):

Petri Net ◽

Semantic Analysis ◽

Sports Video ◽

Visual Concepts ◽

Field Sports

Download Full-text

Detection of Visual Concepts and Annotation of Images Using Ensembles of Trees for Hierarchical Multi-Label Classification

Recognizing Patterns in Signals, Speech, Images and Videos - Lecture Notes in Computer Science ◽

10.1007/978-3-642-17711-8_16 ◽

2010 ◽

pp. 152-161 ◽

Cited By ~ 4

Author(s):

Ivica Dimitrovski ◽

Dragi Kocev ◽

Suzana Loskovska ◽

Sašo Džeroski

Keyword(s):

Visual Concepts

Download Full-text

Meta-Remediation as a Mechanism to Address Crowd Decision-Making in the Context of Media Art

International Journal of Creative Interfaces and Computer Graphics ◽

10.4018/ijcicg.2019010104 ◽

2019 ◽

Vol 10 (1) ◽

pp. 43-55

Author(s):

Jose Alberto Raposo Pinheiro ◽

Mirian Tavares

Keyword(s):

Public Space ◽

Short Story ◽

Creative Process ◽

Interaction Model ◽

Digital Art ◽

Public Event ◽

The Public ◽

Art Installation ◽

Visual Concepts ◽

Digital Artifact

uTurn is a digital art installation that allows interaction inside a cinema-like environment or a similar public space — an exhibition system in the context of an audience, retrieving an elected media from the choices made by the majority of the public. The software in its core manages the selection — a meta-remediation that elects a media block, in the form of short-story movies (Vidbits) to be watched by a crowd. The interaction model assumes the need to find a preference in the viewing room in order to identify and choose the next Vidbit. The system allows navigation through media blocks in environments like a cinema room, a summer festival, or a public event. It can be configured to support visual concepts, or to integrate a narrative system in which other types of structures in the story demand that the content follows a segmentation of media. uTurn was exhibited during the 5th Artech International Conference, in 2015. The article addresses the creative process towards the production of the digital artifact using Apple's Quartz Composer.

Download Full-text

Concept Discovery for The Interpretation of Landscape Scenicness

Machine Learning and Knowledge Extraction ◽

10.3390/make2040022 ◽

2020 ◽

Vol 2 (4) ◽

pp. 397-413

Author(s):

Pim Arendsen ◽

Diego Marcos ◽

Devis Tuia

Keyword(s):

Vector Spaces ◽

Large Set ◽

Feature Representations ◽

The United Kingdom ◽

New Concepts ◽

Manifold Alignment ◽

Semantic Concepts ◽

Visual Concepts ◽

Alignment Technique ◽

Concept Activation

In this paper, we study how to extract visual concepts to understand landscape scenicness. Using visual feature representations from a Convolutional Neural Network (CNN), we learn a number of Concept Activation Vectors (CAV) aligned with semantic concepts from ancillary datasets. These concepts represent objects, attributes or scene categories that describe outdoor images. We then use these CAVs to study their impact on the (crowdsourced) perception of beauty of landscapes in the United Kingdom. Finally, we deploy a technique to explore new concepts beyond those initially available in the ancillary dataset: Using a semi-supervised manifold alignment technique, we align the CNN image representation to a large set of word embeddings, therefore giving access to entire dictionaries of concepts. This allows us to obtain a list of new concept candidates to improve our understanding of the elements that contribute the most to the perception of scenicness. We do this without the need for any additional data by leveraging the commonalities in the visual and word vector spaces. Our results suggest that new and potentially useful concepts can be discovered by leveraging neighbourhood structures in the word vector spaces.

Download Full-text

Summarization of Multiple News Videos Considering the Consistency of Audio-Visual Contents

International Journal of Semantic Computing ◽

10.1142/s1793351x19500016 ◽

2019 ◽

Vol 13 (01) ◽

pp. 135-155

Author(s):

Ye Zhang ◽

Ryunosuke Tanishige ◽

Ichiro Ide ◽

Keisuke Doman ◽

Yasutomo Kawanishi ◽

...

Keyword(s):

Multimedia Information ◽

Real World ◽

News Story ◽

Visual Concepts

News videos are valuable multimedia information on real-world events. However, due to the incremental nature of the contents, a sequence of news videos on a related news topic could be redundant and lengthy. Thus, a number of methods have been proposed for their summarization. However, there is a problem that most of these methods do not consider the consistency between the auditory and visual contents. This becomes a problem in the case of news videos, since both contents do not always come from the same source. Considering this, in this paper, we propose a method for summarizing a sequence of news videos considering the consistency of auditory and visual contents. The proposed method first selects key-sentences from the auditory contents (Closed Caption) of each news story in the sequence, and next selects a shot in the news story whose “Visual Concepts” detected from the visual contents are the most consistent with the selected key-sentence. In the end, the audio segment corresponding to each key-sentence is synthesized with the selected shot, and then these clips are concatenated into a summarized video. Results from subjective experiments on summarized videos on several news topics show the effectiveness of the proposed method.

Download Full-text