Bug Localization with Combination of Deep Learning and Information Retrieval

In recent years, knowledge graphs have been applied in many fields, such as search engines, semantic analysis, and question answering. However, building knowledge graphs faces many obstacles, including methodologies, data, and tools. This paper introduces a novel methodology for building a knowledge graph from heterogeneous documents. We use natural language processing and deep learning methods to build this graph. The knowledge graph can be used in question answering systems and information retrieval, especially in the computing domain.

How Does Execution Information Help with Information-Retrieval Based Bug Localization?

2017 IEEE/ACM 25th International Conference on Program Comprehension (ICPC) ◽

10.1109/icpc.2017.29 ◽

2017 ◽

Cited By ~ 7

Author(s):

Tung Dao ◽

Lingming Zhang ◽

Na Meng

Keyword(s):

Information Retrieval ◽

Bug Localization

An ensemble information retrieval method for the biomedical domain (Preprint)

10.2196/preprints.28272 ◽

2021 ◽

Author(s):

Zhiqiang Liu ◽

Jingkun Feng ◽

Zhihao Yang ◽

Lei Wang

Keyword(s):

Information Retrieval ◽

Deep Learning ◽

Text Classification ◽

Query Expansion ◽

Ensemble Method ◽

Classification Model ◽

Retrieval Performance ◽

Matching Model ◽

Ranking List ◽

Initial Retrieval

BACKGROUND With the development of biomedicine, the number of biomedical documents has increased rapidly, which poses a great challenge for researchers trying to retrieve the information they need. Information retrieval aims to meet this challenge by searching a large collection for the documents relevant to a given query. However, in some specific retrieval tasks the relevance of search results must be evaluated from multiple aspects, which increases the difficulty of biomedical information retrieval.

OBJECTIVE This study aims to find a more systematic method to retrieve relevant scientific literature for a given patient.

METHODS In the initial retrieval stage, we supplement query terms through query expansion strategies and apply query boosting to obtain an initial ranking list of relevant documents. In the re-ranking phase, we employ a text classification model and a relevance matching model to evaluate each document along different dimensions, then combine their outputs through logistic regression to re-rank all the documents in the initial ranking list.

RESULTS The proposed ensemble method improves biomedical retrieval performance. Compared with existing deep learning-based methods, experimental results show that our method achieves state-of-the-art performance on the data collection provided by the TREC 2019 Precision Medicine Track.

CONCLUSIONS In this paper, we propose a novel ensemble method based on deep learning. As shown in the experiments, the strategies used in the initial retrieval phase, such as query expansion and query boosting, are effective. Applying the text classification model and the relevance matching model better captures semantic context information and improves retrieval performance.
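The re-ranking step described in this abstract, where two model scores are combined through logistic regression, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `fit_logistic` and `rerank`, the toy training scores, and the candidate documents are all hypothetical, and the two input scores simply stand in for the outputs of a text classification model and a relevance matching model.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(features, labels, lr=0.5, epochs=500):
    """Fit weights and a bias by batch gradient descent on the log loss."""
    n_features = len(features[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(features, labels):
            z = sum(w * xi for w, xi in zip(weights, x)) + bias
            error = sigmoid(z) - y
            for j, xi in enumerate(x):
                grad_w[j] += error * xi
            grad_b += error
        weights = [w - lr * g / len(features) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(features)
    return weights, bias


def rerank(candidates, weights, bias):
    """Sort (doc_id, classifier_score, matching_score) by combined probability."""
    scored = [
        (doc_id, sigmoid(weights[0] * c + weights[1] * m + bias))
        for doc_id, c, m in candidates
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical training pairs: (classifier score, matching score) -> relevant?
train_x = [(0.9, 0.8), (0.7, 0.9), (0.8, 0.6), (0.3, 0.2), (0.2, 0.4), (0.1, 0.1)]
train_y = [1, 1, 1, 0, 0, 0]
weights, bias = fit_logistic(train_x, train_y)

# Hypothetical initial ranking list with both scores attached to each document.
candidates = [("doc_b", 0.2, 0.3), ("doc_a", 0.9, 0.85), ("doc_c", 0.6, 0.5)]
ranking = rerank(candidates, weights, bias)
for doc_id, probability in ranking:
    print(doc_id, round(probability, 3))
```

Learning the combination weights, rather than fixing them by hand, lets the relative importance of the two models be set from labelled relevance judgments. In practice a library implementation of logistic regression would likely replace the hand-written gradient descent shown here.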

Deep Learning Based Question Answering Search Engine

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit2172139 ◽

2021 ◽

pp. 25-32

Author(s):

Mrunal Malekar

Keyword(s):

Information Retrieval ◽

Deep Learning ◽

Natural Language ◽

Search Engine ◽

Language Processing ◽

Question Answering ◽

Research Work ◽

Construction Company ◽

Exact Answer ◽

Search For Information

Domain-based question answering is concerned with building systems that answer natural language questions specific to a domain. It falls under information retrieval and natural language processing. Information retrieval can find the relevant documents that may contain the answer, but it does not give the exact answer to the question asked. In the presented work, a question answering search engine has been developed that first finds the relevant documents in a large collection of textual documents from a construction company and then goes a step further to extract the answer from the retrieved document. The question answering system uses Elasticsearch for information retrieval (paragraph extraction) and deep learning to answer the question from the short extracted paragraph. It leverages the BERT deep learning model to learn representations that relate the question to the answer. The research work also focuses on improving the search accuracy of the Elasticsearch-based information retrieval engine, which returns the relevant documents that may contain the answer.
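The two-stage design described in this abstract, retrieving a paragraph first and then extracting the answer from it, can be sketched as follows. This is only an illustration under strong assumptions: a small TF-IDF-style scorer stands in for the search engine, a term-overlap sentence picker stands in for the BERT reader, and the function names and toy paragraphs are invented for the example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def retrieve(question, paragraphs, top_k=1):
    """Stage 1 (stand-in for the search engine): rank paragraphs by a
    simple TF-IDF-weighted overlap with the question terms."""
    docs = [Counter(tokenize(p)) for p in paragraphs]
    n_docs = len(docs)

    def idf(term):
        doc_freq = sum(1 for d in docs if term in d)
        return math.log((n_docs + 1) / (doc_freq + 1)) + 1.0

    q_terms = set(tokenize(question))
    scores = [sum(d[t] * idf(t) for t in q_terms) for d in docs]
    ranked = sorted(range(n_docs), key=lambda i: scores[i], reverse=True)
    return [paragraphs[i] for i in ranked[:top_k]]


def read(question, paragraph):
    """Stage 2 (stand-in for the neural reader): return the sentence that
    shares the most terms with the question."""
    q_terms = set(tokenize(question))
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", paragraph) if s.strip()]
    return max(sentences, key=lambda s: len(q_terms & set(tokenize(s))))


def answer(question, paragraphs):
    best_paragraph = retrieve(question, paragraphs, top_k=1)[0]
    return read(question, best_paragraph)


# Invented toy paragraphs, not data from the paper.
paragraphs = [
    "The site office opens at 8 am. Safety helmets are mandatory on site.",
    "Concrete for the foundation is cured for 28 days. Curing starts after the pour.",
    "Invoices are paid within 30 days. Late invoices need approval.",
]
question = "How long is the concrete cured?"
print(answer(question, paragraphs))
```

Keeping the two stages separate means the reader only sees a short paragraph, which is why the quality of the retrieval step bounds the quality of the final answer.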