topic word
Recently Published Documents


TOTAL DOCUMENTS: 18 (FIVE YEARS: 0)

H-INDEX: 2 (FIVE YEARS: 0)

Author(s):  
Zehao Yu

Topic word extraction is the task of identifying single- or multi-word expressions that represent the main topics of a document. In this paper, two improved algorithms for extracting and discovering topic words are proposed: the Rapid Topic-word Detection (RTD) algorithm and the CategoryTextRank (CTextRank) algorithm, which effectively obtain information by extracting and filtering the topic words in a text. The algorithms overcome the shortcoming of traditional topic-word discovery algorithms that require deep linguistic knowledge or domain- or language-specific annotated corpora. Both proposed algorithms can process short as well as long texts. Their biggest advantage is that they are unsupervised machine learning algorithms: they need no training and can process text directly to obtain topic words. Accuracy, recall and F-measure improve markedly with the two algorithms, and the results compare favorably with previously published results on the Inspec and SemEval datasets. The first algorithm, Rapid Topic-word Detection, improves these metrics over PositionRank and TextRank; the second, CategoryTextRank, improves them over TextRank, SingleRank and TF-IDF.
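To make the graph-ranking family that RTD and CTextRank build on concrete (this is a generic TextRank-style sketch, not the paper's own implementations), an unsupervised keyword scorer can rank words by a random walk over a co-occurrence graph; the whitespace tokenization, window size and damping factor below are simplifying assumptions:

```python
from collections import defaultdict

def textrank_keywords(words, window=2, damping=0.85, iters=50):
    """Rank candidate topic words with a simple TextRank-style
    random walk over an undirected, window-based co-occurrence graph."""
    # Link every pair of words that co-occur within the sliding window.
    neighbors = defaultdict(set)
    for i in range(len(words)):
        for j in range(i + 1, min(i + window + 1, len(words))):
            if words[i] != words[j]:
                neighbors[words[i]].add(words[j])
                neighbors[words[j]].add(words[i])
    # Iterative PageRank-style scoring: no training data needed.
    score = {w: 1.0 for w in neighbors}
    for _ in range(iters):
        score = {w: (1 - damping) + damping * sum(
                     score[u] / len(neighbors[u]) for u in neighbors[w])
                 for w in score}
    return sorted(score, key=score.get, reverse=True)

tokens = ("topic word extraction finds topic words that represent "
          "the main topics of a document").split()
print(textrank_keywords(tokens)[:3])
```

Because the score comes only from the text's own co-occurrence structure, the method needs no annotated corpus, which is the property the abstract highlights.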


2020, Vol 10 (11), pp. 3831
Author(s): Sang-Min Park, Sung Joon Lee, Byung-Won On

Detecting the main aspects of a particular product from a collection of review documents is challenging in real applications. To address this problem, we focus on utilizing existing topic models, which can briefly summarize large text documents. Unlike existing approaches, which are limited because they modify the topic model or use seed opinion words as prior knowledge, we propose a novel approach that (1) identifies starting points for learning, (2) cleans dirty topic results through word embedding and unsupervised clustering, and (3) automatically generates the right aspects using topic and head-word embedding. Experimental results show that the proposed methods create cleaner topics, improving Rouge-1 by about 25% compared to the baseline method. In addition, through the three proposed methods, the main aspects suited to the given data are detected automatically.
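Step (2), cleaning dirty topics via word embeddings, can be sketched with a toy outlier filter: words whose embedding lies far from the topic centroid are dropped. The `clean_topic` helper, the 2-D vectors and the similarity threshold are all hypothetical stand-ins for real embeddings and unsupervised clustering:

```python
import math

def cosine(u, v):
    """Cosine similarity between two dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def clean_topic(topic_words, embeddings, threshold=0.6):
    """Drop 'dirty' words whose embedding is far from the topic centroid --
    a toy stand-in for the embedding-plus-clustering cleaning step."""
    dims = len(next(iter(embeddings.values())))
    centroid = [sum(embeddings[w][d] for w in topic_words) / len(topic_words)
                for d in range(dims)]
    return [w for w in topic_words if cosine(embeddings[w], centroid) >= threshold]

# Toy 2-D embeddings: 'battery' is unrelated noise in a camera-lens topic.
emb = {"lens": (0.9, 0.1), "zoom": (0.8, 0.2),
       "focus": (0.85, 0.15), "battery": (-0.1, 0.95)}
print(clean_topic(["lens", "zoom", "focus", "battery"], emb))
```

A real pipeline would use trained embeddings and a clustering algorithm rather than a fixed threshold, but the principle — coherence measured in embedding space decides which topic words survive — is the same.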


2019, Vol 7 (6), pp. 42-47
Author(s): Evgeniya Kovaleva

The article describes how younger schoolchildren develop the ability to cooperate in the course of educational interaction, with the aim of teaching each student successfully and building a cohesive class collective. It substantiates the relevance and effectiveness of this approach to the educational process, which is regarded as best suited to the tasks facing the modern school. Examples are given of Russian-language tasks on the topic “Word-formation” that schoolchildren perform in groups or in pairs.


2019, Vol 46 (1), pp. 23-40
Author(s): Yezheng Liu, Fei Du, Jianshan Sun, Yuanchun Jiang

User-generated content has become an increasingly important data source for analysing user interests in both industry and academic research. Since the proposal of the basic latent Dirichlet allocation (LDA) model, many LDA variants have been developed to learn knowledge from unstructured user-generated content. An intractable limitation of LDA and its variants is that they may generate low-quality topics whose meanings are confusing. To handle this problem, this article proposes an interactive strategy to generate high-quality topics with clear meanings by integrating subjective knowledge derived from human experts with objective knowledge learned by LDA. The proposed interactive latent Dirichlet allocation (iLDA) model develops deterministic and stochastic approaches to obtain a subjective topic-word distribution from human experts, combines the subjective and objective topic-word distributions by a linear weighted-sum method, and provides an inference process that draws topics and words from the comprehensive topic-word distribution. The proposed model is a significant effort to integrate human knowledge with LDA-based models through an interactive strategy. Experiments on two real-world corpora show that the proposed iLDA model can draw high-quality topics with the assistance of subjective knowledge from human experts. It is robust under various conditions and offers fundamental support for applications of LDA-based topic modelling.
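The linear weighted-sum combination of the two topic-word distributions can be illustrated with a small sketch; the `combine_distributions` helper and the toy probabilities are assumptions for illustration, not the paper's implementation:

```python
def combine_distributions(objective, subjective, weight=0.5):
    """Linearly blend an LDA-learned (objective) topic-word distribution
    with an expert-provided (subjective) one, then renormalize so the
    result is a proper probability distribution."""
    words = set(objective) | set(subjective)
    mixed = {w: (1 - weight) * objective.get(w, 0.0)
                + weight * subjective.get(w, 0.0)
             for w in words}
    total = sum(mixed.values())
    return {w: p / total for w, p in mixed.items()}

lda_topic = {"price": 0.5, "cost": 0.3, "noise": 0.2}  # learned by LDA
expert    = {"price": 0.6, "cost": 0.4}                # expert judgement
blend = combine_distributions(lda_topic, expert, weight=0.5)
print(blend)
```

Words the expert considers irrelevant (here `noise`) receive no subjective mass, so their probability shrinks in the blended distribution — the intuition behind cleaning confusing topics with human knowledge.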


IEEE Access, 2019, Vol 7, pp. 44748-44760
Author(s): Ya Xiao, Zhijie Fan, Chengxiang Tan, Qian Xu, Wenye Zhu, ...

2018, Vol 12 (03), pp. 399-423
Author(s): Shaheen Syed, Marco Spruit

Latent Dirichlet Allocation (LDA) has gained much attention from researchers and is increasingly being applied to uncover underlying semantic structures from a variety of corpora. However, nearly all researchers use symmetrical Dirichlet priors, often unaware of the practical implications they bear. This research is the first to explore the effect of symmetrical and asymmetrical Dirichlet priors on topic coherence and human topic ranking when uncovering latent semantic structures from scientific research articles. More specifically, we examine the practical effects of several classes of Dirichlet priors on 2000 LDA models created from abstract and full-text research articles. Our results show that symmetrical or asymmetrical priors on the document–topic distribution or the topic–word distribution for full-text data have little effect on topic coherence scores and human topic ranking. In contrast, asymmetrical priors on the document–topic distribution for abstract data yield a significant increase in topic coherence scores and improved human topic ranking compared to a symmetrical prior. Symmetrical or asymmetrical priors on the topic–word distribution show no real benefit for either abstract or full-text data.
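The difference between a symmetrical and an asymmetrical Dirichlet prior can be made concrete with a small sampling sketch (a Dirichlet draw is a vector of normalized Gamma draws); the concentration values below are illustrative, not the values used in the study:

```python
import random

def dirichlet_sample(alphas, rng):
    """Draw one sample from Dirichlet(alphas) by normalizing
    independent Gamma(alpha_i, 1) draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [d / total for d in draws]

rng = random.Random(42)
k = 5  # number of topics

# Symmetrical prior: the same concentration for every topic.
symmetric = dirichlet_sample([0.1] * k, rng)

# Asymmetrical prior: topic 0 is given more prior mass a priori.
asymmetric = dirichlet_sample([1.0] + [0.1] * (k - 1), rng)

print(symmetric)
print(asymmetric)
```

With a symmetrical prior no topic is favoured before seeing data, while the asymmetrical prior encodes the expectation that some topics (often frequent, general ones) deserve more mass — the distinction whose practical impact the article measures.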


Author(s): Renai Chen, Qing Gao, Weiliang Ji, Fei Long, Qiang Ling
