A Resolving of Word Sense Ambiguity Using Two-level Document Ranking Method in Information Retrieval

Author(s):  
Hyun-Kyu Kang ◽  
Heung Seok Jeon ◽  
Myeong-Cheol Ko ◽  
Jin Soo Kim ◽  
Kiduk Yang


Author(s):
Zahra Mousavi ◽  
Heshaam Faili

Wordnets are now used extensively as a core resource in natural language processing and information retrieval, so their accuracy directly affects the performance of the applications built on them. This paper presents a fully automated method for extending a previously developed Persian wordnet with broader and more accurate coverage of verbal entries. First, Persian verbs are linked to Princeton WordNet (PWN) synsets using a bilingual dictionary. A feature set capturing the semantic behavior of compound verbs, which make up the majority of Persian verbs, is then proposed and used in a supervised classification system to select the correct links for inclusion in the wordnet. A pre-existing Persian wordnet, FarsNet, together with a similarity-based method, is used to produce the training set. The result is the largest automatically developed Persian wordnet, with more than 27,000 words, 28,000 PWN synsets, and 67,000 word-sense pairs, substantially outperforming the previous Persian wordnet, which has about 16,000 words, 22,000 PWN synsets, and 38,000 word-sense pairs.
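The dictionary-based linking and similarity-based filtering steps described above can be sketched roughly as follows. The bilingual dictionary, toy vectors, and the 0.8 threshold below are illustrative assumptions, not the paper's actual resources, features, or classifier.

```python
# Hedged sketch: link verbs to WordNet synsets via a bilingual
# dictionary, then filter candidate links with a similarity score.
# All data below (dictionary, vectors, threshold) is made up.

def candidate_links(verb, bilingual_dict, synsets_by_word):
    """Map a source-language verb to candidate PWN synsets via translation."""
    links = []
    for translation in bilingual_dict.get(verb, []):
        for synset in synsets_by_word.get(translation, []):
            links.append((verb, synset))
    return links

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = sum(a * a for a in u) ** 0.5
    nv = sum(b * b for b in v) ** 0.5
    return dot / (nu * nv) if nu and nv else 0.0

def select_links(links, verb_vecs, synset_vecs, threshold=0.8):
    """Keep only links whose verb/synset vectors are similar enough."""
    return [(v, s) for v, s in links
            if cosine(verb_vecs[v], synset_vecs[s]) >= threshold]

# Toy example: "khordan" translates to both "eat" and "drink",
# but its vector is close only to the eat.v.01 synset vector.
bilingual = {"khordan": ["eat", "drink"]}
synsets = {"eat": ["eat.v.01"], "drink": ["drink.v.01"]}
verb_vecs = {"khordan": [1.0, 0.2]}
synset_vecs = {"eat.v.01": [0.9, 0.3], "drink.v.01": [0.1, 1.0]}

cands = candidate_links("khordan", bilingual, synsets)
kept = select_links(cands, verb_vecs, synset_vecs, threshold=0.8)
```

In the paper this filtering is done by a trained classifier over compound-verb features; the cosine threshold here just stands in for that decision step.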


Author(s):  
Sanjeev Arora ◽  
Yuanzhi Li ◽  
Yingyu Liang ◽  
Tengyu Ma ◽  
Andrej Risteski

Word embeddings are ubiquitous in NLP and information retrieval, but it is unclear what they represent when a word is polysemous. Here it is shown that multiple word senses reside in linear superposition within the word embedding, and that simple sparse coding can recover vectors that approximately capture the senses. The success of the approach, which applies to several embedding methods, is mathematically explained using a variant of the random walk on discourses model (Arora et al., 2016). A novel aspect of the technique is that each extracted word sense is accompanied by one of about 2,000 "discourse atoms" that gives a succinct description of which other words co-occur with that word sense. Discourse atoms are of independent interest and make the method potentially more useful. Empirical tests are used to verify and support the theory.
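As a rough illustration of the sparse-recovery idea, the sketch below greedily selects the few "atom" vectors whose combination best approximates a word vector (plain matching pursuit). The atoms and the word vector are toy data, not the paper's learned discourse atoms, and the paper's actual sparse-coding procedure differs in detail.

```python
# Hedged sketch of sparse recovery via matching pursuit: greedily pick
# the atom most correlated with the residual of the word vector, then
# subtract its contribution and repeat. Toy data throughout.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matching_pursuit(word_vec, atoms, k=2):
    """Return indices of the k unit-norm atoms selected greedily."""
    residual = list(word_vec)
    chosen = []
    for _ in range(k):
        # Pick the unused atom with the largest |<residual, atom>|.
        best = max((i for i in range(len(atoms)) if i not in chosen),
                   key=lambda i: abs(dot(residual, atoms[i])))
        coef = dot(residual, atoms[best])
        residual = [r - coef * a for r, a in zip(residual, atoms[best])]
        chosen.append(best)
    return chosen

# A "polysemous" vector built as a superposition of atoms 0 and 2:
atoms = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
word = [0.7, 0.05, 0.7]
senses = matching_pursuit(word, atoms, k=2)
```

Each recovered atom index plays the role of one sense's discourse atom: the atom vector summarizes the context in which that sense appears.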


Information ◽  
2019 ◽  
Vol 10 (2) ◽  
pp. 39
Author(s):  
Zhenyang Li ◽  
Guangluan Xu ◽  
Xiao Liang ◽  
Feng Li ◽  
Lei Wang ◽  
...  

In recent years, entity-based ranking models have led to exciting breakthroughs in information retrieval research. Compared with traditional retrieval models, entity-based representations enable a better understanding of queries and documents. However, existing entity-based models neglect the importance of entities within a document. This paper explores the effect of entity importance on ranking. Specifically, a dataset analysis is conducted that verifies the correlation between the importance of entities in a document and document ranking. Two entity-based models, a toy model and the Explicit Semantic Ranking (ESR) model, are then enhanced by taking entity importance into account: in contrast to the existing models, the enhanced models weight entities according to their importance. Experimental results show that the enhanced toy model and ESR outperform the two baselines by as much as 4.57% and 2.74% on NDCG@20, respectively. Further experiments reveal that the strength of the enhanced models is most evident on long queries and on queries where ESR fails, confirming the effectiveness of taking entity importance into account.
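The core idea, weighting each matched entity by its importance in the document rather than counting matches uniformly, can be sketched as below. The importance weights and the additive scoring are illustrative stand-ins, not ESR's actual soft-match formulation.

```python
# Hedged sketch: score documents by query-entity overlap, weighting each
# matched entity by a per-document importance score (e.g., salience).
# The weights and scoring function are illustrative, not ESR's model.

def score(query_entities, doc_entity_weights):
    """Sum the importance weights of query entities present in the doc."""
    return sum(doc_entity_weights.get(e, 0.0) for e in query_entities)

def rank(query_entities, docs):
    """Return doc ids ordered by descending entity-weighted score."""
    return sorted(docs, key=lambda d: score(query_entities, docs[d]),
                  reverse=True)

# Toy collection: entity -> importance weight within each document.
query = ["obama", "family_tree"]
docs = {
    "d1": {"obama": 0.9, "president": 0.4},      # one query entity, central
    "d2": {"obama": 0.2, "family_tree": 0.8},    # both query entities
    "d3": {"election": 0.7},                     # no query entities
}
order = rank(query, docs)
```

A uniform-count model would also rank d2 first here, but the weighted variant additionally separates documents that mention a query entity centrally from those that mention it in passing.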

