Map of science with topic modeling: Comparison of unsupervised learning and human‐assigned subject classification

Arho Suominen; Hannes Toivanen

doi:10.1002/asi.23596

Mining FDA drug labels using an unsupervised learning technique - topic modeling

BMC Bioinformatics ◽

10.1186/1471-2105-12-s10-s11 ◽

2011 ◽

Vol 12 (Suppl 10) ◽

pp. S11 ◽

Cited By ~ 48

Author(s):

Halil Bisgin ◽

Zhichao Liu ◽

Hong Fang ◽

Xiaowei Xu ◽

Weida Tong

Keyword(s):

Unsupervised Learning ◽

Topic Modeling ◽

Learning Technique

Download Full-text

Humanistic interpretation and machine learning

Synthese ◽

10.1007/s11229-020-02806-w ◽

2020 ◽

Author(s):

Juho Pääkkönen ◽

Petri Ylikoski

Keyword(s):

Social Sciences ◽

Machine Learning ◽

Unsupervised Learning ◽

Text Analysis ◽

Topic Modeling ◽

Scientific Evidence ◽

Original Text ◽

The Social ◽

Unsupervised Approach ◽

Social Scientific

Abstract This paper investigates how unsupervised machine learning methods might make hermeneutic interpretive text analysis more objective in the social sciences. Through a close examination of the uses of topic modeling—a popular unsupervised approach in the social sciences—it argues that the primary way in which unsupervised learning supports interpretation is by allowing interpreters to discover unanticipated information in larger and more diverse corpora and by improving the transparency of the interpretive process. This view highlights that unsupervised modeling does not eliminate the researchers’ judgments from the process of producing evidence for social scientific theories. The paper shows this by distinguishing between two prevalent attitudes toward topic modeling, i.e., topic realism and topic instrumentalism. Under neither can modeling provide social scientific evidence without the researchers’ interpretive engagement with the original text materials. Thus the unsupervised text analysis cannot improve the objectivity of interpretation by alleviating the problem of underdetermination in interpretive debate. The paper argues that the sense in which unsupervised methods can improve objectivity is by providing researchers with the resources to justify to others that their interpretations are correct. This kind of objectivity seeks to reduce suspicions in collective debate that interpretations are the products of arbitrary processes influenced by the researchers’ idiosyncratic decisions or starting points. The paper discusses this view in relation to alternative approaches to formalizing interpretation and identifies several limitations on what unsupervised learning can be expected to achieve in terms of supporting interpretive work.

Download Full-text

Prior knowledge and correlational structure in unsupervised learning.

Canadian Journal of Experimental Psychology/Revue canadienne de psychologie expérimentale ◽

10.1037/cjep20070012 ◽

2007 ◽

Vol 61 (2) ◽

pp. 109-127 ◽

Cited By ~ 3

Author(s):

John P. Clapper

Keyword(s):

Unsupervised Learning ◽

Prior Knowledge

Download Full-text

Innovative Approach to Information Search by Example of a Patent Analysis of an Important Substitution Plan

Экономическая наука современной России ◽

10.33293/1609-1442-2020-1(88)-143-157 ◽

2020 ◽

pp. 143-157

Author(s):

Maria A. Milkova

Keyword(s):

Information Search ◽

Topic Modeling ◽

Cognitive Biases ◽

A Priori ◽

Import Substitution ◽

Innovative Approach ◽

Iterative Search ◽

Comprehensive Picture ◽

Priori Information ◽

Selection Of

Nowadays the process of information accumulation is so rapid that the concept of the usual iterative search requires revision. Being in the world of oversaturated information in order to comprehensively cover and analyze the problem under study, it is necessary to make high demands on the search methods. An innovative approach to search should flexibly take into account the large amount of already accumulated knowledge and a priori requirements for results. The results, in turn, should immediately provide a roadmap of the direction being studied with the possibility of as much detail as possible. The approach to search based on topic modeling, the so-called topic search, allows you to take into account all these requirements and thereby streamline the nature of working with information, increase the efficiency of knowledge production, avoid cognitive biases in the perception of information, which is important both on micro and macro level. In order to demonstrate an example of applying topic search, the article considers the task of analyzing an import substitution program based on patent data. The program includes plans for 22 industries and contains more than 1,500 products and technologies for the proposed import substitution. The use of patent search based on topic modeling allows to search immediately by the blocks of a priori information – terms of industrial plans for import substitution and at the output get a selection of relevant documents for each of the industries. This approach allows not only to provide a comprehensive picture of the effectiveness of the program as a whole, but also to visually obtain more detailed information about which groups of products and technologies have been patented.

Download Full-text

Unsupervised learning of object identities and their parts in a hierarchical visual memory

Frontiers in Computational Neuroscience ◽

10.3389/conf.neuro.10.2009.14.168 ◽

1970 ◽

Author(s):

Jenia Jitsev ◽

Christoph von der Malsburg

Keyword(s):

Unsupervised Learning ◽

Visual Memory

Download Full-text

Topic Modeling Approach to Understand Changes in Customer Perceptions on Hotel Services in Seoul

Journal of Korea Service Management Society ◽

10.15706/jksms.2016.17.3.010 ◽

2016 ◽

Vol 17 (3) ◽

pp. 217-231 ◽

Cited By ~ 1

Author(s):

김건 ◽

윤혜정

Keyword(s):

Topic Modeling ◽

Customer Perceptions ◽

Modeling Approach

Download Full-text

A Topic Modeling Analysis on the Major Social Issues of the Students’ Human Rights Ordinance in Korea

Asian Journal of Education ◽

10.15753/aje.2017.12.18.4.683 ◽

2017 ◽

Vol 18 (4) ◽

pp. 683-711

Author(s):

Hyun-Jeong Park ◽

Hanna Kim ◽

YuJung Hong

Keyword(s):

Human Rights ◽

Topic Modeling ◽

Social Issues ◽

Modeling Analysis

Download Full-text

Research Trends of Consumer Education Using Topic Modeling

Consumer Policy and Education Review ◽

10.15790/cope.2020.16.2.083 ◽

2020 ◽

Vol 16 (2) ◽

pp. 83-115

Author(s):

Mira Kim ◽

◽

Hye Sun Hwang ◽

Xu Li

Keyword(s):

Topic Modeling ◽

Research Trends ◽

Consumer Education

Download Full-text

The Effect of Reference List in the Article on Topic Modeling using LDA

Korean Journal of Sport Studies ◽

10.23949/kjpe.2019.07.58.6.16 ◽

2019 ◽

Vol 58 (6) ◽

pp. 197-207

Author(s):

Juhae Baeck ◽

Hyungil Kwon ◽

Mihwa Choi ◽

Yi-Hsiu Lin

Keyword(s):

Topic Modeling ◽

Reference List

Download Full-text

Classification of Observations through Combination of the Dimension Reduction and the Cluster Analysis

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v7i8.13 ◽

2017 ◽

Vol 7 (8) ◽

pp. 30

Author(s):

Hyeuk Kim

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Cluster Analysis ◽

Unsupervised Learning ◽

Principal Component ◽

Component Analysis ◽

Baseball Players ◽

Partitioning Around Medoids ◽

Different Characteristics

Unsupervised learning in machine learning divides data into several groups. The observations in the same group have similar characteristics and the observations in the different groups have the different characteristics. In the paper, we classify data by partitioning around medoids which have some advantages over the k-means clustering. We apply it to baseball players in Korea Baseball League. We also apply the principal component analysis to data and draw the graph using two components for axis. We interpret the meaning of the clustering graphically through the procedure. The combination of the partitioning around medoids and the principal component analysis can be used to any other data and the approach makes us to figure out the characteristics easily.

Download Full-text