Multistage utterance verification for keyword recognition-based online spoken content retrieval

2012 ◽  
Vol 58 (3) ◽  
pp. 1000-1005 ◽  
Author(s):  
Jeong-Sik Park ◽  
Gil-Jin Jang ◽  
Ji-Hwan Kim
2021 ◽  
Vol 13 (2) ◽  
pp. 176
Author(s):  
Peng Zheng ◽  
Zebin Wu ◽  
Jin Sun ◽  
Yi Zhang ◽  
Yaoqin Zhu ◽  
...  

As the volume of remotely sensed data grows significantly, content-based image retrieval (CBIR) becomes increasingly important, especially for cloud computing platforms that facilitate processing and storing big data in a parallel and distributed way. This paper proposes a novel parallel CBIR system for hyperspectral image (HSI) repository on cloud computing platforms under the guide of unmixed spectral information, i.e., endmembers and their associated fractional abundances, to retrieve hyperspectral scenes. However, existing unmixing methods would suffer extremely high computational burden when extracting meta-data from large-scale HSI data. To address this limitation, we implement a distributed and parallel unmixing method that operates on cloud computing platforms in parallel for accelerating the unmixing processing flow. In addition, we implement a global standard distributed HSI repository equipped with a large spectral library in a software-as-a-service mode, providing users with HSI storage, management, and retrieval services through web interfaces. Furthermore, the parallel implementation of unmixing processing is incorporated into the CBIR system to establish the parallel unmixing-based content retrieval system. The performance of our proposed parallel CBIR system was verified in terms of both unmixing efficiency and accuracy.


2007 ◽  
Vol 17 (3) ◽  
pp. 336-346 ◽  
Author(s):  
David Vallet ◽  
Pablo Castells ◽  
Miriam Fernndez ◽  
Phivos Mylonas ◽  
Yannis Avrithis
Keyword(s):  

Impact ◽  
2021 ◽  
Vol 2021 (1) ◽  
pp. 9-11
Author(s):  
Lin-shan Lee

Spoken content refers to all content over the Internet which includes human voice, essentially those in multimedia, such As YouTube videos and online courses. Today such content is retrieved via Google primarily based on human-generated text labels, because Google can only retrieve text over the Internet. The goal of this project is to produce technologies to retrieve accurately and efficiently such spoken content directly based on the included audio sounds instead of text labels, because machines today can listen to human voice just as they can read the text. The long term goal is to create a spoken version of Google, which may revolutionize the ways in which humans access information and improve their knowledge. Professor Lin-shan Lee at National Taiwan University is leading this project. He has been a distinguished leader in the global scientific community for the area of teaching machines to speak and listen to human voice for many years.


Sign in / Sign up

Export Citation Format

Share Document