Evolution of Reading Comprehension and Question Answering Systems

CoQA: A Conversational Question Answering Challenge

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00266 ◽

2019 ◽

Vol 7 ◽

pp. 249-266 ◽

Cited By ~ 46

Author(s):

Siva Reddy ◽

Danqi Chen ◽

Christopher D. Manning

Keyword(s):

Reading Comprehension ◽

Human Performance ◽

Question Answering ◽

Information Gathering ◽

Free Form ◽

Question Answering Systems ◽

Questions And Answers ◽

Pragmatic Reasoning

Humans gather information through conversations involving a series of interconnected questions and answers. For machines to assist in information gathering, it is therefore essential to enable them to answer conversational questions. We introduce CoQA, a novel dataset for building Conversational Question Answering systems. Our dataset contains 127k questions with answers, obtained from 8k conversations about text passages from seven diverse domains. The questions are conversational, and the answers are free-form text with their corresponding evidence highlighted in the passage. We analyze CoQA in depth and show that conversational questions have challenging phenomena not present in existing reading comprehension datasets (e.g., coreference and pragmatic reasoning). We evaluate strong dialogue and reading comprehension models on CoQA. The best system obtains an F1 score of 65.4%, which is 23.4 points behind human performance (88.8%), indicating that there is ample room for improvement. We present CoQA as a challenge to the community at https://stanfordnlp.github.io/coqa .

Download Full-text

An Efficient Semantic Analysis Technique for the Question Answering Systems

Journal of Engineering and Applied Sciences ◽

10.36478/jeasci.2019.8289.8292 ◽

2019 ◽

Vol 14 (22) ◽

pp. 8289-8292

Author(s):

Ibrahim Mahmoud Ibrahim Alturani ◽

Mohd Pouzi Bin Hamzah

Keyword(s):

Question Answering ◽

Semantic Analysis ◽

Analysis Technique ◽

Question Answering Systems

Download Full-text

Causal Perception in Question-Answering Systems

Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems ◽

10.1145/3411764.3445444 ◽

2021 ◽

Author(s):

Po-Ming Law ◽

Leo Yu-Ho Lo ◽

Alex Endert ◽

John Stasko ◽

Huamin Qu

Keyword(s):

Question Answering ◽

Causal Perception ◽

Question Answering Systems

Download Full-text

A framework for enriching Data Warehouse analysis with Question Answering systems

Journal of Intelligent Information Systems ◽

10.1007/s10844-014-0351-2 ◽

2014 ◽

Vol 46 (1) ◽

pp. 61-82 ◽

Cited By ~ 2

Author(s):

Antonio Ferrández ◽

Alejandro Maté ◽

Jesús Peral ◽

Juan Trujillo ◽

Elisa De Gregorio ◽

...

Keyword(s):

Data Warehouse ◽

Question Answering ◽

Question Answering Systems

Download Full-text

A semantic-based technique for question lassification in question answering systems — A hybrid approach

2015 18th International Conference on Computer and Information Technology (ICCIT) ◽

10.1109/iccitechn.2015.7488039 ◽

2015 ◽

Author(s):

Md Moinul Hoque ◽

Paulo Quaresma

Keyword(s):

Question Answering ◽

Hybrid Approach ◽

Question Answering Systems

Download Full-text

Composing Questions through Conceptual Authoring

Computational Linguistics ◽

10.1162/coli.2007.33.1.105 ◽

2007 ◽

Vol 33 (1) ◽

pp. 105-133 ◽

Cited By ~ 26

Author(s):

Catalina Hallett ◽

Donia Scott ◽

Richard Power

Keyword(s):

Natural Language ◽

Question Answering ◽

Free Text ◽

Risk Averse ◽

Proof Of Concept ◽

Concept System ◽

Complex Queries ◽

Extensive Training ◽

Question Answering Systems ◽

Medical Histories

This article describes a method for composing fluent and complex natural language questions, while avoiding the standard pitfalls of free text queries. The method, based on Conceptual Authoring, is targeted at question-answering systems where reliability and transparency are critical, and where users cannot be expected to undergo extensive training in question composition. This scenario is found in most corporate domains, especially in applications that are risk-averse. We present a proof-of-concept system we have developed: a question-answering interface to a large repository of medical histories in the area of cancer. We show that the method allows users to successfully and reliably compose complex queries with minimal training.

Download Full-text

A study about the future evaluation of Question-Answering systems

Knowledge-Based Systems ◽

10.1016/j.knosys.2017.09.015 ◽

2017 ◽

Vol 137 ◽

pp. 83-93 ◽

Cited By ~ 7

Author(s):

Alvaro Rodrigo ◽

Anselmo Peñas

Keyword(s):

Question Answering ◽

The Future ◽

Question Answering Systems ◽

Future Evaluation

Download Full-text

A Multilingual Semantic Similarity-Based Approach for Question-Answering Systems

Knowledge Science, Engineering and Management - Lecture Notes in Computer Science ◽

10.1007/978-3-030-29551-6_54 ◽

2019 ◽

pp. 604-614

Author(s):

Wafa Wali ◽

Fatma Ghorbel ◽

Bilel Gragouri ◽

Fayçal Hamdi ◽

Elisabeth Metais

Keyword(s):

Semantic Similarity ◽

Question Answering ◽

Question Answering Systems

Download Full-text

Translucent Answer Predictions in Multi-Hop Reading Comprehension

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6272 ◽

2020 ◽

Vol 34 (05) ◽

pp. 7700-7707

Author(s):

G P Shrivatsa Bhargav ◽

Michael Glass ◽

Dinesh Garg ◽

Shirish Shevade ◽

Saswati Dana ◽

...

Keyword(s):

Reading Comprehension ◽

Question Answering ◽

State Of The Art ◽

Local Context ◽

The Novel ◽

Loose Coupling ◽

Loosely Coupled ◽

Neural Architecture ◽

Coupled Networks ◽

Novel Design

Research on the task of Reading Comprehension style Question Answering (RCQA) has gained momentum in recent years due to the emergence of human annotated datasets and associated leaderboards, for example CoQA, HotpotQA, SQuAD, TriviaQA, etc. While state-of-the-art has advanced considerably, there is still ample opportunity to advance it further on some important variants of the RCQA task. In this paper, we propose a novel deep neural architecture, called TAP (Translucent Answer Prediction), to identify answers and evidence (in the form of supporting facts) in an RCQA task requiring multi-hop reasoning. TAP comprises two loosely coupled networks – Local and Global Interaction eXtractor (LoGIX) and Answer Predictor (AP). LoGIX predicts supporting facts, whereas AP consumes these predicted supporting facts to predict the answer span. The novel design of LoGIX is inspired by two key design desiderata – local context and global interaction– that we identified by analyzing examples of multi-hop RCQA task. The loose coupling between LoGIX and the AP reveals the set of sentences used by the AP in predicting an answer. Therefore, answer predictions of TAP can be interpreted in a translucent manner. TAP offers state-of-the-art performance on the HotpotQA (Yang et al. 2018) dataset – an apt dataset for multi-hop RCQA task – as it occupies Rank-1 on its leaderboard (https://hotpotqa.github.io/) at the time of submission.

Download Full-text

Evaluating Commonsense in Pre-Trained Language Models

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6523 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9733-9740 ◽

Cited By ~ 1

Author(s):

Xuhui Zhou ◽

Yue Zhang ◽

Leyang Cui ◽

Dandan Huang

Keyword(s):

Reading Comprehension ◽

Question Answering ◽

Deep Level ◽

Language Models ◽

Future Research ◽

Correct Prediction ◽

Test Cases ◽

Word Sense ◽

Training Set ◽

Text Data

Contextualized representations trained over large raw text data have given remarkable improvements for NLP tasks including question answering and reading comprehension. There have been works showing that syntactic, semantic and word sense knowledge are contained in such representations, which explains why they benefit such tasks. However, relatively little work has been done investigating commonsense knowledge contained in contextualized representations, which is crucial for human question answering and reading comprehension. We study the commonsense ability of GPT, BERT, XLNet, and RoBERTa by testing them on seven challenging benchmarks, finding that language modeling and its variants are effective objectives for promoting models' commonsense ability while bi-directional context and larger training set are bonuses. We additionally find that current models do poorly on tasks require more necessary inference steps. Finally, we test the robustness of models by making dual test cases, which are correlated so that the correct prediction of one sample should lead to correct prediction of the other. Interestingly, the models show confusion on these test cases, which suggests that they learn commonsense at the surface rather than the deep level. We release a test set, named CATs publicly, for future research.

Download Full-text