word relatedness
Recently Published Documents


TOTAL DOCUMENTS: 19 (FIVE YEARS: 1)

H-INDEX: 7 (FIVE YEARS: 0)

Author(s):  
Keisuke Inohara ◽  
Akira Utsumi

Abstract: We present a new Japanese dataset, the Japanese Word Similarity and Association Norm (JWSAN), comprising human rating scores of similarity and association for 2145 word pairs, with a clear distinction between word similarity and word association. Computational models of human semantic memory or the mental lexicon, such as distributed semantic models, must predict not only association but also similarity, because people distinguish between the two. However, although the SimLex-999 dataset is publicly available for English, no Japanese similarity dataset draws a clear distinction between these two types of word relatedness. JWSAN is the first large Japanese dataset with both similarity and association ratings, containing noun, verb, and adjective word pairs. It is also characterized by data collected from a sufficient number of age- and gender-controlled assessors: similarity and association ratings were obtained via a web-based survey of 6450 native speakers of Japanese. In addition, the effects of rater gender and age, factors given only scant consideration in past work, were examined. This dataset can serve as a benchmark for improving distributed semantic models of Japanese.
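Benchmarking a distributed semantic model against a similarity norm like JWSAN typically means correlating model cosine similarities with human ratings. Below is a minimal sketch of that protocol; the word vectors and the human scores are toy values invented for illustration, not data from JWSAN itself, and the (no-ties) Spearman implementation is simplified.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def spearman(xs, ys):
    """Spearman rank correlation (simplified: assumes no tied values)."""
    def ranks(vals):
        order = sorted(range(len(vals)), key=lambda i: vals[i])
        r = [0.0] * len(vals)
        for rank, i in enumerate(order):
            r[i] = float(rank + 1)
        return r
    rx, ry = ranks(xs), ranks(ys)
    n = len(xs)
    mean = (n + 1) / 2.0
    cov = sum((a - mean) * (b - mean) for a, b in zip(rx, ry))
    var = sum((a - mean) ** 2 for a in rx)
    return cov / var

# Toy embeddings and hypothetical human similarity ratings (1-7 scale).
vectors = {
    "inu": [0.9, 0.1, 0.2],     # dog
    "neko": [0.8, 0.2, 0.1],    # cat
    "kuruma": [0.1, 0.9, 0.3],  # car
    "basu": [0.2, 0.8, 0.4],    # bus
}
pairs = [("inu", "neko", 6.2), ("kuruma", "basu", 5.8),
         ("inu", "kuruma", 1.5), ("neko", "basu", 1.3)]

model_scores = [cosine(vectors[a], vectors[b]) for a, b, _ in pairs]
human_scores = [r for _, _, r in pairs]
rho = spearman(model_scores, human_scores)  # higher rho = better model fit
```

A model that predicts similarity well should score high against the similarity ratings; running the same correlation against the association ratings separately is what lets a dataset like JWSAN diagnose which kind of relatedness a model actually captures.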


Author(s):  
Subhadra Dutta ◽  
Eric M. O’Rourke

Natural language processing (NLP) is the field of computationally analyzing and interpreting human language. This chapter responds to the growing interest in using machine learning–based NLP approaches for analyzing open-ended employee survey responses. These techniques address scalability and provide real-time insights, making qualitative data collection equally or more desirable in organizations. The chapter walks through the evolution of text analytics in industrial–organizational psychology and discusses supervised and unsupervised machine learning NLP methods relevant to survey text data, such as latent Dirichlet allocation, latent semantic analysis, sentiment analysis, and word relatedness methods. The chapter also lays out preprocessing techniques, weighs the trade-offs of growing NLP capabilities internally versus externally, points readers to available resources, and ends by discussing implications and future directions of these approaches.
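Of the methods the chapter lists, lexicon-based sentiment analysis is the simplest to sketch. The snippet below scores open-ended survey responses against a tiny hand-made seed lexicon; the word lists and responses are hypothetical, and real applications would use an established lexicon (e.g. the General Inquirer) or a trained classifier.

```python
# Hypothetical seed lexicon, for illustration only.
POSITIVE = {"great", "helpful", "flexible", "supportive"}
NEGATIVE = {"slow", "unclear", "stressful", "overworked"}

def sentiment_score(response: str) -> float:
    """Score in [-1, 1]: (#positive - #negative) / #sentiment tokens."""
    tokens = [t.strip(".,!?").lower() for t in response.split()]
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total

responses = [
    "My manager is supportive and the hours are flexible.",
    "Deadlines are unclear and the pace is stressful.",
]
scores = [sentiment_score(r) for r in responses]
```

This kind of scorer illustrates the scalability argument: once the lexicon exists, thousands of responses can be scored in real time, which is impractical with manual coding.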


2019 ◽  
Vol 12 (2) ◽  
pp. 91
Author(s):  
Dinda Sigmawaty ◽  
Mirna Adriani

Queries and rankings with temporal aspects have gained significant attention in the field of Information Retrieval. When searching for articles published over time, the relevant documents usually occur in certain temporal patterns. Given a query that is implicitly time-sensitive, we develop a temporal ranking that uses the important times of the query, drawn from the distribution of query–trend relatedness over time. We also combine the model with the Dual Embedding Space Model (DESM), applying the temporal model according to document timestamps. We apply our model using three temporal word embedding algorithms to learn word relatedness from a news archive in Bahasa Indonesia: (1) QT-W2V-Rank, using Word2Vec; (2) QT-OW2V-Rank, using OrthoTrans-Word2Vec; and (3) QT-DBE-Rank, using Dynamic Bernoulli Embeddings. The highest scores were achieved with static word embeddings learned separately over time (QT-W2V-Rank): 66% average precision and 68% early precision. Furthermore, studies of different characteristics of temporal topics showed that QT-W2V-Rank is also more effective than the baselines at capturing temporal patterns such as spikes, periodicity, and seasonality.
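The DESM-style scoring the abstract combines with temporal ranking boils down to comparing an averaged query embedding with an averaged document embedding. A minimal sketch, assuming toy two-dimensional vectors (the real models are Word2Vec-family embeddings trained per time slice on the Indonesian news archive; the Indonesian words here are only illustrative):

```python
import math

# Toy word vectors standing in for one time slice's trained embeddings.
VECS = {
    "banjir": [0.9, 0.1], "hujan": [0.8, 0.3],   # flood, rain
    "pemilu": [0.1, 0.9], "suara": [0.2, 0.8],   # election, vote
}

def avg_vec(words):
    """Average the embeddings of the in-vocabulary words."""
    dims = len(next(iter(VECS.values())))
    v = [0.0] * dims
    hits = 0
    for w in words:
        if w in VECS:
            hits += 1
            for i, x in enumerate(VECS[w]):
                v[i] += x
    return [x / hits for x in v] if hits else v

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u)) or 1.0
    nv = math.sqrt(sum(b * b for b in v)) or 1.0
    return dot / (nu * nv)

def rank(query, docs):
    """Rank documents by cosine between averaged query and doc vectors."""
    q = avg_vec(query.split())
    scored = [(cosine(q, avg_vec(d.split())), d) for d in docs]
    return [d for _, d in sorted(scored, reverse=True)]

docs = ["banjir hujan", "pemilu suara"]
ranking = rank("banjir", docs)
```

The temporal variants in the paper add a second signal on top of this: the score is weighted by how strongly the query's trend relates to the document's timestamp.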


IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 55261-55268
Author(s):  
Yongjing Yin ◽  
Jiali Zeng ◽  
Hongji Wang ◽  
Keqing Wu ◽  
Bin Luo ◽  
...  

2018 ◽  
Vol 34 (3) ◽  
pp. 939-966
Author(s):  
Rashadul Hasan Rakib ◽  
Aminul Islam ◽  
Evangelos Milios

2017 ◽  
Author(s):  
Guy D. Rosin ◽  
Eytan Adar ◽  
Kira Radinsky

2016 ◽  
Vol 216 ◽  
pp. 816-843 ◽  
Author(s):  
Mohamed Ben Aouicha ◽  
Mohamed Ali Hadj Taieb ◽  
Abdelmajid Ben Hamadou

Author(s):  
Jie Mei ◽  
Xinxin Kou ◽  
Zhimin Yao ◽  
Andrew Rau-Chaplin ◽  
Aminul Islam ◽  
...  

2014 ◽  
Vol 40 (3) ◽  
pp. 539-562 ◽  
Author(s):  
Ahmed Hassan ◽  
Amjad Abu-Jbara ◽  
Wanchen Lu ◽  
Dragomir Radev

Automatically identifying the sentiment polarity of words is an important task that serves as an essential building block of many natural language processing systems, such as text classification, text filtering, product review analysis, survey response analysis, and online discussion mining. We propose a method for identifying the sentiment polarity of words that applies a Markov random walk model to a large word-relatedness graph and produces a polarity estimate for any given word. The model can accurately and quickly assign a polarity sign and magnitude to any word. It can be used both in a semi-supervised setting, where a training set of labeled words is available, and in a weakly supervised setting, where only a handful of seed words defines the two polarity classes. The method is experimentally tested using a gold-standard set of positive and negative words from the General Inquirer lexicon. We also show how our method can be used for three-way classification, which identifies neutral words in addition to positive and negative ones. Our experiments show that the proposed method outperforms the state-of-the-art methods in the semi-supervised setting and is comparable to the best reported values in the weakly supervised setting. In addition, the proposed method is faster and does not need a large corpus. We also present extensions of our method for identifying the polarity of foreign words and out-of-vocabulary words.
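The core idea, spreading polarity from a few seed words through a word-relatedness graph via a random walk, can be sketched with label propagation, where seed polarities are fixed and every other word repeatedly takes the mean of its neighbors. This converges to the absorption values of a walk that stops at a seed, a simple stand-in for the paper's hitting-time formulation; the toy graph below is hand-made, whereas the paper builds its graph from resources such as WordNet and corpus co-occurrence.

```python
# Toy undirected word-relatedness graph (adjacency lists).
GRAPH = {
    "good": ["nice", "fine"],
    "nice": ["good", "pleasant"],
    "fine": ["good", "pleasant"],
    "pleasant": ["nice", "fine"],
    "bad": ["awful", "poor"],
    "awful": ["bad", "poor"],
    "poor": ["bad", "awful"],
}
SEEDS = {"good": 1.0, "bad": -1.0}  # a handful of labeled seed words

def propagate(graph, seeds, iters=50):
    """Fix seed polarities; repeatedly set each non-seed word to the
    mean polarity of its neighbors until scores stabilize."""
    score = {w: seeds.get(w, 0.0) for w in graph}
    for _ in range(iters):
        for w, nbrs in graph.items():
            if w not in seeds:
                score[w] = sum(score[n] for n in nbrs) / len(nbrs)
    return score

polarity = propagate(GRAPH, SEEDS)
# Sign gives the polarity class; magnitude gives the confidence.
```

Because only graph neighborhoods are touched, the seed set can stay tiny (the weakly supervised setting) and no large labeled corpus is needed, which matches the speed and data-efficiency claims in the abstract.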


2014 ◽  
Author(s):  
Nitish Aggarwal ◽  
Kartik Asooja ◽  
Paul Buitelaar
Keyword(s):  
