scholarly journals A Semantic Medical Multimedia Retrieval Approach Using Ontology Information Hiding

2013 ◽  
Vol 2013 ◽  
pp. 1-7 ◽  
Author(s):  
Kehua Guo ◽  
Shigeng Zhang

Searching useful information from unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users’ query intent. Firstly, semantic annotations will be given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represented semantic information will be hidden in the head of the multimedia documents. The main innovations of this approach are cross-type retrieval support and semantic information preservation. Experimental results indicate a good precision and efficiency of our approach for medical multimedia retrieval in comparison with some traditional approaches.

2021 ◽  
Vol 11 (18) ◽  
pp. 8769
Author(s):  
Jun Long ◽  
Longzhi Sun ◽  
Liujie Hua ◽  
Zhan Yang

Cross-modal hashing technology is a key technology for real-time retrieval of large-scale multimedia data in real-world applications. Although the existing cross-modal hashing methods have achieved impressive accomplishment, there are still some limitations: (1) some cross-modal hashing methods do not make full consider the rich semantic information and noise information in labels, resulting in a large semantic gap, and (2) some cross-modal hashing methods adopt the relaxation-based or discrete cyclic coordinate descent algorithm to solve the discrete constraint problem, resulting in a large quantization error or time consumption. Therefore, in order to solve these limitations, in this paper, we propose a novel method, named Discrete Semantics-Guided Asymmetric Hashing (DSAH). Specifically, our proposed DSAH leverages both label information and similarity matrix to enhance the semantic information of the learned hash codes, and the ℓ2,1 norm is used to increase the sparsity of matrix to solve the problem of the inevitable noise and subjective factors in labels. Meanwhile, an asymmetric hash learning scheme is proposed to efficiently perform hash learning. In addition, a discrete optimization algorithm is proposed to fast solve the hash code directly and discretely. During the optimization process, the hash code learning and the hash function learning interact, i.e., the learned hash codes can guide the learning process of the hash function and the hash function can also guide the hash code generation simultaneously. Extensive experiments performed on two benchmark datasets highlight the superiority of DSAH over several state-of-the-art methods.


Author(s):  
Harshitha M ◽  
Ch. Rupa ◽  
K. Pujitha Sai ◽  
A. Pravallika ◽  
V. Kusuma Sowmya

Author(s):  
Shu-Ching Chen

The exponential growth of the technological advancements has resulted in high-resolution devices, such as digital cameras, scanners, monitors, and printers, which enable the capturing and displaying of multimedia data in high-density storage devices. Furthermore, more and more applications need to live with multimedia data. However, the gap between the characteristics of various media types and the application requirements has created the need to develop advanced techniques for multimedia data management and the extraction of relevant information from multimedia databases. Though many research efforts have been devoted to the areas of multimedia databases and data management, it is still far from maturity. The purpose of this article is to discuss how the existing techniques, methodologies, and tools addressed relevant issues and challenges to enable a better understanding in multimedia databases and data management. The focuses include: (1) how to develop a formal structure that can be used to capture the distinguishing content of the media data in a multimedia database (MMDB) and to form an abstract space for the data to be queried; (2) how to develop advanced content analysis and retrieval techniques that can be used to bridge the gaps between the semantic meaning and low-level media characteristics to improve multimedia information retrieval; and (3) how to develop query mechanisms that can handle complex spatial, temporal, and/or spatio-temporal relationships of multimedia data to answer the imprecise and incomplete queries issued to an MMDB.


Author(s):  
Bo Yang

In recent years, the rapid expansion of multimedia applications, partly due to the exponential growth of the Internet, has proliferated over the daily life of computer users (Yang & Hurson, 2006). The integration of wireless communication, pervasive computing, and ubiquitous data processing with multimedia database systems has enabled the connection and fusion of distributed multimedia data sources. In addition, the emerging applications, such as smart classroom, digital library, habitat/environment surveillance, traffic monitoring, and battlefield sensing, have provided increasing motivation for conducting research on multimedia content representation, data delivery and dissemination, data fusion and analysis, and contentbased retrieval. Consequently, research on multimedia technologies is of increasing importance in computer society. In contrast with traditional text-based systems, multimedia applications usually incorporate much more powerful descriptions of human thought—video, audio, and images (Karpouzis, Raouzaiou, Tzouveli, Iaonnou, & Kollias, 2003; Liu, Bao, Yu, & Xu, 2005; Yang & Hurson, 2005). Moreover, the large collections of data in multimedia systems make it possible to resolve more complex data operations such as imprecise query or content-based retrieval. For instance, the image database systems may accept an example picture and return the most similar images of the example (Cox, Miller, & Minka, 2000; Hsu, Chua, & Pung, 2000; Huang, Chang, & Huang, 2003). However, the conveniences of multimedia applications come with challenges to the existing data management schemes: • Efficiency: Multimedia applications generally require more resources; however, the storage space and processing power are limited in many practical systems, for example, mobile devices and wireless networks (Yang & Hurson, 2005). Due to the large data volume and complicated operations of multimedia applications, new methods are needed to facilitate efficient representation, retrieval, and processing of multimedia data while considering the technical constraints. • Semantic Gap: There is a gap between user perception of multimedia entities and physical representation/access mechanism of multimedia data. Users often browse and desire to access multimedia data at the object level (“entities” such as human beings, animals, or buildings). However, the existing multimedia retrieval systems tend to access multimedia data based on their lower-level features (“characteristics” such as color patterns and textures), with little regard to combining these features into data objects. This representation gap often leads to higher processing cost and unexpected retrieval results. The representation of multimedia data according to human’s perspective is one of the focuses in recent research activities; however, few existing systems provide automated identification or classification of objects from general multimedia collections. • Heterogeneity: The collections of multimedia data are often diverse and poorly indexed. In a distributed environment, because of the autonomy and heterogeneity of data sources, multimedia data objects are often represented in heterogeneous formats. The difference in data formats further leads to the difficulty of incorporating multimedia data objects under a unique indexing framework. • Semantic Unawareness: The present research on content-based multimedia retrieval is based on feature vectors—features are extracted from audio/video streams or image pixels, empirically or heuristically, and combined into vectors according to the application criteria. Because of the application-specific multimedia formats, the feature-based paradigm lacks scalability and accuracy.


Author(s):  
Hsin-Yu Ha ◽  
Fausto C. Fleites ◽  
Shu-Ching Chen

Nowadays, only processing visual features is not enough for multimedia semantic retrieval due to the complexity of multimedia data, which usually involve a variety of modalities, e.g. graphics, text, speech, video, etc. It becomes crucial to fully utilize the correlation between each feature and the target concept, the feature correlation within modalities, and the feature correlation across modalities. In this paper, the authors propose a Feature Correlation Clustering-based Multi-Modality Fusion Framework (FCC-MMF) for multimedia semantic retrieval. Features from different modalities are combined into one feature set with the same representation via a normalization and discretization process. Within and across modalities, multiple correspondence analysis is utilized to obtain the correlation between feature-value pairs, which are then projected onto the two principal components. K-medoids algorithm, which is a widely used partitioned clustering algorithm, is selected to minimize the Euclidean distance within the resulted clusters and produce high intra-correlated feature-value pair clusters. Majority vote is applied to subsequently decide which cluster each feature belongs to. Once the feature clusters are formed, one classifier is built and trained for each cluster. The correlation and confidence of each classifier are considered while fusing the classification scores, and mean average precision is used to evaluate the final ranked classification scores. Finally, the proposed framework is applied on NUS-wide Lite data set to demonstrate the effectiveness in multimedia semantic retrieval.


2011 ◽  
Vol 268-270 ◽  
pp. 1040-1045
Author(s):  
Dan Wen Chen ◽  
Li Qiong Deng ◽  
Zhi Min Yuan ◽  
Ling Da Wu

How to combine multi-modal features effectively is a difficult problem in news story correlation analysis, this paper puts forward a new two-stage fusion approach based on visual and textual features fusion to solve this problem. First we use a co-clustering method to get the clustering groups of similar stories with the visual and semantic information of news story. And then, on the base of the result of the first step, we use different weighted strategies to analyze the news story correlation in a further way, which aim at the different type of news story. The methods can get a better result of the news story correlation analysis by experiments.


Author(s):  
Shu-Ching Chen

The exponential growth of the technological advancements has resulted in high-resolution devices, such as digital cameras, scanners, monitors, and printers, which enable the capturing and displaying of multimedia data in high-density storage devices. Furthermore, more and more applications need to live with multimedia data. However, the gap between the characteristics of various media types and the application requirements has created the need to develop advanced techniques for multimedia data management and the extraction of relevant information from multimedia databases. Though many research efforts have been devoted to the areas of multimedia databases and data management, it is still far from maturity. The purpose of this article is to discuss how the existing techniques, methodologies, and tools addressed relevant issues and challenges to enable a better understanding in multimedia databases and data management. The focuses include: (1) how to develop a formal structure that can be used to capture the distinguishing content of the media data in a multimedia database (MMDB) and to form an abstract space for the data to be queried; (2) how to develop advanced content analysis and retrieval techniques that can be used to bridge the gaps between the semantic meaning and low-level media characteristics to improve multimedia information retrieval; and (3) how to develop query mechanisms that can handle complex spatial, temporal, and/or spatio-temporal relationships of multimedia data to answer the imprecise and incomplete queries issued to an MMDB.


Author(s):  
Mohammed A. Moharrum ◽  
Stephan Olariu ◽  
Hussein Abdel-Wahab

The objective of this chapter is to introduce the reader to a general architectural framework for a broad array of retrievals of multimedia data required by various applications. This framework contains more than the traditional client/server architecture and even more than the existing three-tier architectures. This chapter introduces the reader to many critical issues involved in multimedia retrieval over the Internet. A new architectural framework is proposed to cover a variety of multimedia applications over the Internet and the World Wide Web. This framework has the three main objectives of (1) proposing a layered architecture to facilitated design and separate different issues, (2) covering a large number of multimedia applications, and finally, (3) making use of existing and well-established technology, such as Mobile Agents, SQL databases, and cache managements schemes. The proposed architectural framework separates issues involved in multimedia retrieval into five layers, namely: keyword searching and data servers, proxy servers, domain and department archives, mobile user agents, and the users. Through these five layers, various customized solutions to a large array of problems will be proposed and applied. The chapter offers, but is not limited to, solutions for different problems that arise in retrieval of multimedia data. A list of important open problems is identified at the end of the chapter.


Author(s):  
Shi Kuo Chang ◽  
Vincenzo Deufemia ◽  
Giuseppe Polese

In this chapter we present normal forms for the design of multimedia database schemes with reduced manipulation anomalies. To this aim we first discuss how to describe the semantics of multimedia attributes based upon the concept of generalized icons, already used in the modeling of multimedia languages. Then, we introduce new extended dependencies involving different types of multimedia data. Such dependencies are based on domain specific similarity measures that are used to detect semantic relationships between complex data types. Based upon these new dependencies, we have defined five normal forms for multimedia databases, some focusing on the level of segmentation of multimedia attributes, others on the level of fragmentation of tables.


Author(s):  
Jun Long ◽  
Lei Zhu ◽  
Xinpan Yuan ◽  
Longzhi Sun

Online social networking techniques and large-scale multimedia retrieval are developing rapidly, which not only has brought great convenience to our daily life, but generated, collected, and stored large-scale multimedia data as well. This trend has put forward higher requirements and greater challenges on massive multimedia retrieval. In this paper, we investigate the problem of image similarity measurement, which is one of the key problems of multimedia retrieval. Firstly, the definition of similarity measurement of images and the related notions are proposed. Then, an efficient similarity measurement framework is proposed. Besides, we present a novel basic method of similarity measurement named SMIN. To improve the performance of similarity measurement, we carefully design a novel indexing structure called SMI Temp Index (SMII for short). Moreover, we establish an index of potential similar visual words off-line to solve to problem that the index cannot be reused. Experimental evaluations on two real image datasets demonstrate that the proposed approach outperforms state-of-the-arts.


Sign in / Sign up

Export Citation Format

Share Document