Words Stemming Based on Structural and Semantic Similarity

Mohammad Hassan Dianati; Mohammad Hadi Sadreddini; Amir Hossein Rasekh; Seyed Mostafa Fakhrahmad; Hossein Taghi-Zadeh

doi:10.18495/comengapp.v3i2.57

Computer Engineering and Applications Journal ◽

10.18495/comengapp.v3i2.57 ◽

2014 ◽

Vol 3 (2) ◽

pp. 89-99 ◽

Cited By ~ 2

Author(s):

Mohammad Hassan Dianati ◽

Mohammad Hadi Sadreddini ◽

Amir Hossein Rasekh ◽

Seyed Mostafa Fakhrahmad ◽

Hossein Taghi-Zadeh

Keyword(s):

Information Retrieval ◽

Semantic Similarity ◽

A Performance

Word stemming is one of the important issues in the field of natural language processing and information retrieval. There are different methods for stemming, most of which are language-dependent and are therefore applicable only to particular languages. Because of the importance of this issue, the method proposed in this paper aims to be language-independent. In the proposed stemmer, a bilingual dictionary is used, and all of the words in the dictionary are first clustered. The clustering of words is based on their structural and semantic similarity. Finally, the stem of a newly encountered word is found by making use of the previously formed clusters. To evaluate the proposed scheme, word stemming is performed on both Persian and English. The encouraging results indicate the good performance of the proposed method compared with its counterparts.
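The abstract does not give the similarity measures or the clustering procedure, so the following is only a minimal sketch of the general idea under stated assumptions: structural similarity is taken to be a shared-prefix ratio, semantic similarity is taken to be the Jaccard overlap of translation sets from a bilingual dictionary, and clustering is a simple greedy pass. The dictionary entries, the translation identifiers, the thresholds, and all function names are hypothetical and are not taken from the paper.

```python
from os.path import commonprefix

# Hypothetical toy bilingual dictionary: word -> set of translation identifiers.
TOY_DICT = {
    "connect": {"t_link"},
    "connected": {"t_link"},
    "connection": {"t_link", "t_relation"},
    "contain": {"t_hold"},
    "container": {"t_hold", "t_box"},
}

def structural_sim(a, b):
    """Assumed measure: shared-prefix length divided by the longer word's length."""
    return len(commonprefix([a, b])) / max(len(a), len(b))

def semantic_sim(a, b, dictionary):
    """Assumed measure: Jaccard overlap of the two words' translation sets."""
    ta, tb = dictionary[a], dictionary[b]
    return len(ta & tb) / len(ta | tb)

def cluster_words(dictionary, s_min=0.5, m_min=0.3):
    """Greedy pass: a word joins the first cluster whose seed passes both thresholds."""
    clusters = []
    # Shorter words come first, so each cluster's seed is its shortest member.
    for word in sorted(dictionary, key=lambda w: (len(w), w)):
        for cluster in clusters:
            seed = cluster[0]
            if (structural_sim(word, seed) >= s_min
                    and semantic_sim(word, seed, dictionary) >= m_min):
                cluster.append(word)
                break
        else:
            clusters.append([word])
    return clusters

def stem(word, clusters, s_min=0.5):
    """Return the seed of the structurally closest cluster, or the word unchanged."""
    best = max(clusters, key=lambda c: structural_sim(word, c[0]))
    return best[0] if structural_sim(word, best[0]) >= s_min else word

clusters = cluster_words(TOY_DICT)
print(clusters)
print(stem("connecting", clusters))
```

In this sketch a new word is matched to a cluster by structure alone, since an unseen word has no dictionary entry to compare semantically; how the paper handles that case is not stated in the abstract.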

The effectiveness of a performance-based assistant in an information retrieval environment

PsycEXTRA Dataset ◽

10.1037/e574242012-021 ◽

1984 ◽

Author(s):

Jay Elkerton ◽

Robert C. Williges

Keyword(s):

Information Retrieval ◽

A Performance

An efficiency model and a performance function for an information retrieval system

Information Storage and Retrieval ◽

10.1016/0020-0271(69)90015-1 ◽

1969 ◽

Vol 5 (3) ◽

pp. 109-122 ◽

Cited By ~ 5

Author(s):

Douglas H. Rothenberg

Keyword(s):

Information Retrieval ◽

Retrieval System ◽

Information Retrieval System ◽

Performance Function ◽

A Performance

LIS4: Lesk Inspired Sense Specific Semantic Similarity using WordNet

Journal of Information & Knowledge Management ◽

10.1142/s0219649221500064 ◽

2021 ◽

pp. 2150006

Author(s):

Saravanakumar Kandasamy ◽

Aswani Kumar Cherukuri

Keyword(s):

Information Retrieval ◽

Natural Language Processing ◽

Natural Language ◽

Semantic Similarity ◽

Language Processing ◽

Gold Standard ◽

Question Answering ◽

Knowledge Based ◽

Benchmark Datasets ◽

Processing Information

Quantifying semantic similarity between concepts is an essential part of domains such as Natural Language Processing, Information Retrieval, and Question Answering, where it helps in understanding texts and their relationships. Over the last few decades, many measures have been proposed that incorporate various corpus-based and knowledge-based resources. WordNet and Wikipedia are two such knowledge-based resources. WordNet's contribution to these domains is enormous because of the richness with which it defines a word and its relationships with other words. In this paper, we propose an approach to quantifying the similarity between concepts that exploits the synsets and gloss definitions of different concepts in WordNet. Our method considers the gloss definitions, the contextual words that help define a word, the synsets of those contextual words, and the confidence of occurrence of a word in another word's definition when calculating the similarity. Evaluation on different gold-standard benchmark datasets shows the efficiency of our system in comparison with existing taxonomical and definitional measures.

Lexical Co-Occurrence and Contextual Window-Based Approach With Semantic Similarity for Query Expansion

Information Retrieval and Management ◽

10.4018/978-1-5225-5191-1.ch070 ◽

2018 ◽

pp. 1552-1575

Author(s):

Jagendra Singh ◽

Rakesh Kumar

Keyword(s):

Information Retrieval ◽

Semantic Similarity ◽

Efficient Method ◽

Query Expansion ◽

Ad Hoc ◽

Hybrid Approach ◽

Information Retrieval System ◽

Optimal Combination ◽

Benchmark Datasets ◽

Pseudo Feedback

Query expansion (QE) is an efficient method for enhancing the efficiency of an information retrieval system. In this work, we try to capture the limitations of the pseudo-feedback-based QE approach and propose a hybrid approach that enhances the efficiency of feedback-based QE by combining corpus-based information, contextual information about query terms, and semantic knowledge of query terms. First, this paper explores the use of different corpus-based lexical co-occurrence approaches to select an optimal combination of query terms from a pool of terms obtained using pseudo-feedback-based QE. Next, we explore a semantic similarity approach based on word2vec for ranking the QE terms obtained from the top pseudo-feedback documents. Further, we combine co-occurrence statistics, contextual window statistics, and semantic similarity-based approaches to select the best expansion terms for query reformulation. The experiments were performed on the FIRE ad hoc and TREC-3 benchmark datasets. Our experimental results show a significant improvement over the baseline method.

Information Retrieval by Semantic Similarity

International Journal on Semantic Web and Information Systems ◽

10.4018/jswis.2006070104 ◽

2006 ◽

Vol 2 (3) ◽

pp. 55-73 ◽

Cited By ~ 97

Author(s):

Angelos Hliaoutakis ◽

Giannis Varelas ◽

Epimenidis Voutsakis ◽

Euripides G.M. Petrakis ◽

Evangelos Milios

Keyword(s):

Information Retrieval ◽

Semantic Similarity

A Hybrid Semantic Similarity Measure for Spatial Information Retrieval

Spatial Cognition and Computation ◽

10.1080/13875860802645087 ◽

2009 ◽

Vol 9 (1) ◽

pp. 30-63 ◽

Cited By ~ 10

Author(s):

Angela Schwering ◽

Werner Kuhn

Keyword(s):

Information Retrieval ◽

Semantic Similarity ◽

Similarity Measure ◽

Spatial Information ◽

Semantic Similarity Measure

Information retrieval from web databases using semantic similarity

2013 International Conference on Green Computing, Communication and Conservation of Energy (ICGCE) ◽

10.1109/icgce.2013.6823556 ◽

2013 ◽

Author(s):

G. Muthugurunathan ◽

R. Sarasu

Keyword(s):

Information Retrieval ◽

Semantic Similarity ◽

Web Databases

Study on Application of Domain Ontology in Semantic Information Retrieval

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.433-435.1662 ◽

2013 ◽

Vol 433-435 ◽

pp. 1662-1665

Author(s):

Huan Hai Yang ◽

Ming Yu Sun

Keyword(s):

Information Retrieval ◽

Semantic Similarity ◽

Calculation Method ◽

Semantic Information ◽

Domain Ontology ◽

Semantic Retrieval ◽

Retrieval Model ◽

Retrieval Method ◽

Similarity Calculation ◽

Semantic Information Retrieval

Considering the weaknesses of the traditional retrieval method based on keyword matching, the paper introduces semantics into information retrieval and proposes a semantic retrieval model based on ontology. The paper offers a construction method for domain ontology, implements semantic reasoning using Jena, and improves a semantic similarity calculation method.

A Semantic Framework for Evaluating Topical Search Methods

CLEI electronic journal ◽

10.19153/cleiej.14.1.2 ◽

2011 ◽

Vol 14 (1) ◽

Cited By ~ 2

Author(s):

Rocío L. Cecchini ◽

Carlos M. Lorenzetti ◽

Ana G. Maguitman ◽

Filippo Menczer

Keyword(s):

Information Retrieval ◽

Semantic Similarity ◽

Evaluation Framework ◽

Evaluation Metrics ◽

Digital Information ◽

Actual Performance ◽

Semantic Framework ◽

Similarity Data ◽

Retrieval Systems ◽

Information Retrieval Systems

The absence of reliable and efficient techniques to evaluate information retrieval systems has become a bottleneck in the development of novel retrieval methods. In traditional approaches, users or hired evaluators provide manual assessments of relevance. However, these approaches are neither efficient nor reliable, since they do not scale with the complexity and heterogeneity of available digital information. Automatic approaches, on the other hand, could be efficient but disregard semantic data, which is usually important for assessing the actual performance of the evaluated methods. This article proposes to use topic ontologies and semantic similarity data derived from these ontologies to implement an automatic semantic evaluation framework for information retrieval systems. The use of semantic similarity data makes it possible to capture the notion of partial relevance, generalizing traditional evaluation metrics and giving rise to novel performance measures such as semantic precision and semantic harmonic mean. The validity of the approach is supported by user studies, and the application of the proposed framework is illustrated with the evaluation of topical retrieval systems. The evaluated systems include a baseline, a supervised version of the Bo1 query refinement method, and two multi-objective evolutionary algorithms for context-based retrieval. Finally, we discuss the advantages of applying evaluation metrics that account for semantic similarity data and partial relevance over existing metrics based on the notion of total relevance.
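The idea of partial relevance can be illustrated with a small sketch. It assumes, as one plausible reading of the abstract rather than the article's stated definition, that semantic precision is the mean semantic similarity between the query topic and the topic of each retrieved document, so that ordinary precision is the special case where similarity is either 0 or 1. The topic labels, the similarity table, and the function names are hypothetical.

```python
# Hypothetical similarity values between a query topic and document topics.
TOY_SIM = {
    ("jazz", "jazz"): 1.0,
    ("jazz", "blues"): 0.5,
    ("jazz", "cooking"): 0.0,
}

def semantic_precision(query_topic, retrieved_topics, sim):
    """Assumed definition: mean similarity of retrieved documents to the query topic."""
    if not retrieved_topics:
        return 0.0
    total = sum(sim[(query_topic, t)] for t in retrieved_topics)
    return total / len(retrieved_topics)

def binary_precision(query_topic, retrieved_topics):
    """Traditional precision: a document counts only if its topic matches exactly."""
    if not retrieved_topics:
        return 0.0
    hits = sum(1 for t in retrieved_topics if t == query_topic)
    return hits / len(retrieved_topics)

retrieved = ["jazz", "blues", "cooking", "jazz"]
sem_p = semantic_precision("jazz", retrieved, TOY_SIM)
bin_p = binary_precision("jazz", retrieved)
print(sem_p, bin_p)
```

Under these toy values the partially relevant "blues" document contributes half credit to the semantic score, whereas the binary score ignores it.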

Semantic Similarity Measures for Medical Information Retrieval

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2020/213922020 ◽

2020 ◽

Vol 9 (2) ◽

pp. 2310-2319

Author(s):

Karim Gasmi

Keyword(s):

Information Retrieval ◽

Semantic Similarity ◽

Medical Information ◽

Similarity Measures ◽

Medical Information Retrieval
