SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants

Amos Azaria; Keren Nivasch

doi:10.3390/s20195577

SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants

Sensors ◽

10.3390/s20195577 ◽

2020 ◽

Vol 20 (19) ◽

pp. 5577

Author(s):

Amos Azaria ◽

Keren Nivasch

Keyword(s):

Deep Learning ◽

Natural Language ◽

Intelligent Agents ◽

State Of The Art ◽

Intelligent Agent ◽

Current State ◽

The Future ◽

Art Methods ◽

Personal Assistants ◽

Unique Dataset

Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly different command. Giving an agent the ability to detect such user corrections might help it fix its own mistakes and avoid making them in the future. In this work, we consider the problem of automatically detecting user corrections using deep learning. We develop a multimodal architecture called SAIF, which detects such user corrections, taking as inputs the user’s voice commands as well as their transcripts. Voice inputs allow SAIF to take advantage of sound cues, such as tone, speed, and word emphasis. In addition to sound cues, our model uses transcripts to determine whether a command is a correction to the previous command. Our model also obtains internal input from the agent, indicating whether the previous command was executed successfully or not. Finally, we release a unique dataset in which users interacted with an intelligent agent assistant, by giving it commands. This dataset includes labels on pairs of consecutive commands, which indicate whether the latter command is in fact a correction of the former command. We show that SAIF outperforms current state-of-the-art methods on this dataset.

Download Full-text

A Controlled Benchmark of Video Violence Detection Techniques

Information ◽

10.3390/info11060321 ◽

2020 ◽

Vol 11 (6) ◽

pp. 321

Author(s):

Nicola Convertini ◽

Vincenzo Dentamaro ◽

Donato Impedovo ◽

Giuseppe Pirlo ◽

Lucia Sarcinella

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Detection Techniques ◽

Detection Systems ◽

Current State ◽

Violence Detection ◽

Learning Techniques ◽

The Future ◽

Feature Based ◽

Art Techniques

This benchmarking study aims to examine and discuss the current state-of-the-art techniques for in-video violence detection, and also provide benchmarking results as a reference for the future accuracy baseline of violence detection systems. In this paper, the authors review 11 techniques for in-video violence detection. They re-implement five carefully chosen state-of-the-art techniques over three different and publicly available violence datasets, using several classifiers, all in the same conditions. The main contribution of this work is to compare feature-based violence detection techniques and modern deep-learning techniques, such as Inception V3.

Download Full-text

A Survey of Graphical Page Object Detection with Deep Neural Networks

10.20944/preprints202104.0739.v1 ◽

2021 ◽

Author(s):

Jwalin Bhatt ◽

Khurram Azeem Hashmi ◽

Muhammad Zeshan Afzal ◽

Didier Stricker

Keyword(s):

Deep Learning ◽

Object Detection ◽

Conceptual Understanding ◽

Deep Neural Networks ◽

State Of The Art ◽

Learning Approaches ◽

Document Images ◽

Essential Information ◽

Current State ◽

High Level

In any document, graphical elements like tables, figures, and formulas contain essential information. The processing and interpretation of such information require specialized algorithms. Off-the-shelf OCR components cannot process this information reliably. Therefore, an essential step in document analysis pipelines is to detect these graphical components. It leads to a high-level conceptual understanding of the documents that makes digitization of documents viable. Since the advent of deep learning, the performance of deep learning-based object detection has improved many folds. In this work, we outline and summarize the deep learning approaches for detecting graphical page objects in the document images. Therefore, we discuss the most relevant deep learning-based approaches and state-of-the-art graphical page object detection in document images. This work provides a comprehensive understanding of the current state-of-the-art and related challenges. Furthermore, we discuss leading datasets along with the quantitative evaluation. Moreover, it discusses briefly the promising directions that can be utilized for further improvements.

Download Full-text

A Robust Context-Based Deep Learning Approach for Highly Imbalanced Hyperspectral Classification

Computational Intelligence and Neuroscience ◽

10.1155/2021/9923491 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Juan F. Ramirez Rochac ◽

Nian Zhang ◽

Lara A. Thompson ◽

Tolessa Deksissa

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Mineral Exploration ◽

Classification Models ◽

Noise Resistance ◽

Deep Convolutional Neural Networks ◽

Current State ◽

Feature Augmentation ◽

Active Research ◽

Hyperspectral Classification

Hyperspectral imaging is an area of active research with many applications in remote sensing, mineral exploration, and environmental monitoring. Deep learning and, in particular, convolution-based approaches are the current state-of-the-art classification models. However, in the presence of noisy hyperspectral datasets, these deep convolutional neural networks underperform. In this paper, we proposed a feature augmentation approach to increase noise resistance in imbalanced hyperspectral classification. Our method calculates context-based features, and it uses a deep convolutional neuronet (DCN). We tested our proposed approach on the Pavia datasets and compared three models, DCN, PCA + DCN, and our context-based DCN, using the original datasets and the datasets plus noise. Our experimental results show that DCN and PCA + DCN perform well on the original datasets but not on the noisy datasets. Our robust context-based DCN was able to outperform others in the presence of noise and was able to maintain a comparable classification accuracy on clean hyperspectral images.

Download Full-text

Time-Dependent Temperature Measurements in Post-Detonation Combustion: Current State-of-the-Art Methods and Emerging Technologies

10.21236/ad1006208 ◽

2016 ◽

Author(s):

William K. Lewis ◽

Nick G. Glumac ◽

Eduardo G. Yukihara

Keyword(s):

Emerging Technologies ◽

State Of The Art ◽

Temperature Measurements ◽

Time Dependent ◽

Current State ◽

Detonation Combustion ◽

Art Methods

Download Full-text

Child Sexual Abuse: The History, Current State of the Art and the Challenges for the Future: A Pediatric Perspective

Handbook of Child Maltreatment - Child Maltreatment ◽

10.1007/978-94-007-7208-3_4 ◽

2013 ◽

pp. 81-97 ◽

Cited By ~ 1

Author(s):

Astrid Heppenstall Heger

Keyword(s):

Sexual Abuse ◽

Child Sexual Abuse ◽

State Of The Art ◽

Current State ◽

The Future

Download Full-text

Computer-aided detection and characterization of stroke lesion – a short review on the current state-of-the art methods

The Imaging Science Journal ◽

10.1080/13682199.2017.1370879 ◽

2017 ◽

Vol 66 (1) ◽

pp. 1-22 ◽

Cited By ~ 4

Author(s):

R. Karthik ◽

R. Menaka

Keyword(s):

State Of The Art ◽

Short Review ◽

Computer Aided Detection ◽

Current State ◽

Computer Aided ◽

Art Methods ◽

Stroke Lesion

Download Full-text

The Current State-of-the-Art and the Future in Airframe Manufacturing Using Superplastic Forming Technologies

Materials Science Forum ◽

10.4028/www.scientific.net/msf.357-359.17 ◽

2001 ◽

Vol 357-359 ◽

pp. 17-22 ◽

Cited By ~ 9

Author(s):

Daniel G. Sanders

Keyword(s):

State Of The Art ◽

Superplastic Forming ◽

Current State ◽

The Future

Download Full-text

State of the Art Artificial Neural Network, Deep Learning, and the Future Generation

Indian Journal of Computer Science ◽

10.17010/ijcs/2017/v2/i6/120440 ◽

2017 ◽

Vol 2 (6) ◽

Author(s):

Munef Abdullah Ahmed ◽

Stefan Trausan-Matu

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Deep Learning ◽

State Of The Art ◽

Future Generation ◽

The Future ◽

Artificial Neural

Download Full-text

Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning

Methods ◽

10.1016/j.ymeth.2018.07.007 ◽

2018 ◽

Vol 151 ◽

pp. 41-54 ◽

Cited By ~ 23

Author(s):

Nicholas Cummins ◽

Alice Baird ◽

Björn W. Schuller

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Speech Analysis ◽

Current State

Download Full-text

The future of multimodal corpora

Revista Brasileira de Linguística Aplicada ◽

10.1590/s1984-63982011000200006 ◽

2011 ◽

Vol 11 (2) ◽

pp. 391-415 ◽

Cited By ~ 7

Author(s):

Dawn Knight

Keyword(s):

Corpus Linguistics ◽

State Of The Art ◽

Future Developments ◽

The Past ◽

Current State ◽

Linguistic Research ◽

Multimodal Corpora ◽

The Future ◽

Multimodal Corpus ◽

Critical Overview

This paper takes stock of the current state-of-the-art in multimodal corpus linguistics, and proposes some projections of future developments in this field. It provides a critical overview of key multimodal corpora that have been constructed over the past decade and presents a wish-list of future technological and methodological advancements that may help to increase the availability, utility and functionality of such corpora for linguistic research.

Download Full-text