A Method for Chinese Text Classification Based on Three-Dimensional Vector Space Model

A Kind of Self-Constructed Category Dictionary in Chinese Text Classification

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.644-650.2206 ◽

2014 ◽

Vol 644-650 ◽

pp. 2206-2210

Author(s):

Kun Zhou ◽

Ya Ping Dai ◽

Feng Gao ◽

Ji Hong Zou

Keyword(s):

Vector Space ◽

Chinese Text ◽

Text Classification ◽

Feature Vector ◽

Vector Space Model ◽

Recall Rate ◽

Support Vector ◽

Space Model ◽

Chinese Text Classification ◽

Feature Vector Space

Using the word-segmentation technology of the TRIP database to count in detail every word that appears in a database, this paper proposes a self-constructed category dictionary (SCC-dictionary) for Chinese text classification. To address the high dimensionality and sparseness of the vector space model, it presents a four-dimensional feature vector space model (FFVSM). The text classifier is built with the Support Vector Machine (SVM) algorithm. Experimental results show two achievements: first, the SCC-dictionary can replace a manually written dictionary with the same effect; second, the FFVSM reduces the computing load compared with a high-dimensional feature vector space model while keeping classification precision at 86.87%, recall at 95.12%, and F1 at 90.81%.
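The reported F1 value can be checked against the reported precision and recall, since F1 is their harmonic mean. The short sketch below does only that arithmetic; the function name `f1_score` is illustrative and is not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (both in [0, 1])."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both are zero
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract: precision 86.87%, recall 95.12%.
f1 = f1_score(0.8687, 0.9512)
print(f"F1 = {f1:.4f}")  # prints "F1 = 0.9081", matching the reported 90.81%
```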

Handwriting Detection Model Based on Four-Dimensional Vector Space Model

Journal of Mathematics Research ◽

10.5539/jmr.v10n4p32 ◽

2018 ◽

Vol 10 (4) ◽

pp. 32

Author(s):

Lin Li ◽

Xiuteng Duan ◽

Yutong Li

Keyword(s):

Vector Space ◽

Word Length ◽

Vector Space Model ◽

Criminal Investigation ◽

Sentence Structure ◽

Reference Vector ◽

Dimensional Vector ◽

Dimensional Vector Space ◽

Space Model ◽

Detection Model

Handwriting detection is mainly used in criminal investigation. A four-dimensional vector space model can be used to build a handwriting detection model. This article selects features such as word frequency, language style, average word length, and sentence structure from the texts and quantifies them, turning them into relations between vectors. By quantifying and normalizing the features of an author's articles in advance, we obtain a standard reference vector. We then process the target text database in the same way and compare the result with the standard reference vector in terms of modulus and included angle, which lets us estimate whether the author wrote the texts in the target database. The simulation results show that the model is more accurate and that the author of particular texts can be identified.
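The comparison by modulus and included angle might look like the following minimal sketch. It assumes each text has already been reduced to a four-component feature vector; the feature values, the thresholds `max_angle` and `max_modulus_gap`, and the function `same_author` are all hypothetical and do not come from the article.

```python
import math


def modulus(v):
    """Euclidean length of a vector."""
    return math.sqrt(sum(x * x for x in v))


def included_angle(u, v):
    """Angle in radians between two nonzero vectors."""
    nu, nv = modulus(u), modulus(v)
    if nu == 0 or nv == 0:
        raise ValueError("included angle is undefined for a zero vector")
    cos = sum(a * b for a, b in zip(u, v)) / (nu * nv)
    cos = max(-1.0, min(1.0, cos))  # guard against floating-point drift
    return math.acos(cos)


def same_author(reference, target, max_angle=0.1, max_modulus_gap=0.1):
    """Hypothetical decision rule: both the angle and the modulus gap are small."""
    return (included_angle(reference, target) <= max_angle
            and abs(modulus(reference) - modulus(target)) <= max_modulus_gap)


# Made-up vectors: (word frequency, language style, word length, sentence structure).
reference = [0.42, 0.31, 0.55, 0.64]
close = [0.40, 0.33, 0.54, 0.66]
far = [0.90, 0.05, 0.10, 0.20]

print(same_author(reference, close))  # True
print(same_author(reference, far))    # False
```

Using both quantities matters because the angle alone ignores magnitude: two vectors pointing the same way but with very different lengths would otherwise be treated as a match.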

Identities Generalizing the Theorems of Pappus and Desargues

Symmetry ◽

10.3390/sym13081382 ◽

2021 ◽

Vol 13 (8) ◽

pp. 1382

Author(s):

Roger D. Maddux

Keyword(s):

Projective Plane ◽

Vector Space ◽

Invariant Theory ◽

Lattice Theory ◽

Three Dimensional ◽

Dimensional Vector ◽

Dimensional Vector Space

The Theorems of Pappus and Desargues (for the projective plane over a field) are generalized here by two identities involving determinants and cross products. These identities are proved to hold in the three-dimensional vector space over a field. They are closely related to the Arguesian identity in lattice theory and to Cayley-Grassmann identities in invariant theory.
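As background, the standard correspondence between the projective plane over a field $F$ and the three-dimensional vector space $F^3$ can be written with determinants and cross products as below. This is textbook material rather than the two identities proved in the paper, and the symbols $a, b, c, \ell, m, p$ are illustrative.

```latex
% Points and lines of the projective plane over F are nonzero vectors of F^3,
% taken up to nonzero scalar multiples. Requires amsmath.
\begin{align*}
\det[a, b, c] &= a \cdot (b \times c)
  && \text{(scalar triple product)} \\
a,\ b,\ c \text{ collinear} &\iff \det[a, b, c] = 0 \\
\ell &= a \times b
  && \text{(line through distinct points } a, b\text{)} \\
p &= \ell \times m
  && \text{(intersection of distinct lines } \ell, m\text{)}
\end{align*}
```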

Text Classification Based on Enriched Vector Space Model

Proceedings of the 18th International Conference on Computer Systems and Technologies - CompSysTech'17 ◽

10.1145/3134302.3134343 ◽

2017 ◽

Author(s):

Tsvetanka Georgieva-Trifonova

Keyword(s):

Vector Space ◽

Text Classification ◽

Vector Space Model ◽

Space Model

Improving Term Weighting Schemes for Short Text Classification in Vector Space Model

IEEE Access ◽

10.1109/access.2019.2953918 ◽

2019 ◽

Vol 7 ◽

pp. 166578-166592

Author(s):

Surender Singh Samant ◽

N. L. Bhanu Murthy ◽

Aruna Malapati

Keyword(s):

Vector Space ◽

Text Classification ◽

Vector Space Model ◽

Term Weighting ◽

Weighting Schemes ◽

Short Text ◽

Space Model

Vector Space Model of Text Classification Based on Inertia Contribution of Document

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - Emerging Technologies for Developing Countries ◽

10.1007/978-3-030-05198-3_14 ◽

2018 ◽

pp. 155-165

Author(s):

Demba Kandé ◽

Fodé Camara ◽

Reine Marie Marone ◽

Samba Ndiaye

Keyword(s):

Vector Space ◽

Text Classification ◽

Vector Space Model ◽

Space Model

A Kind of Text Classification Method Based on Fuzzy Vector Space Model and Neural Networks

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.347-350.2856 ◽

2013 ◽

Vol 347-350 ◽

pp. 2856-2859

Author(s):

Jun Hui Pan ◽

Hui Li

Keyword(s):

Neural Networks ◽

Vector Space ◽

Text Classification ◽

Vector Space Model ◽

Classification Method ◽

Space Model ◽

Output Layer ◽

Input Layer ◽

Hidden Layer ◽

Fuzzy Vector

This paper proposes a text classification method based on a fuzzy vector space model and neural networks, addressing the problem that a text can belong to several categories. The method uses fuzzy theory to treat the position at which a feature item occurs in the text as its degree of importance (membership) in reflecting the text's subject, and takes this position information fully into account during feature extraction. The fuzzy feature vectors constructed in this way bring the classification closer to manual classification. The network consists of an input layer, a hidden layer, and an output layer: the input layer receives the classification samples, the hidden layer extracts the implicit pattern features of the input samples, and the output layer outputs the classification results. Finally, experiments on documents from Wan Fang Data demonstrate the effectiveness of the method.

Design and analysis of a general vector space model for data classification in Internet of Things

EURASIP Journal on Wireless Communications and Networking ◽

10.1186/s13638-019-1581-3 ◽

2019 ◽

Vol 2019 (1) ◽

Cited By ~ 3

Author(s):

Jinguo Sang ◽

Shanchen Pang ◽

Yang Zha ◽

Fan Yang

Keyword(s):

Internet Of Things ◽

Vector Space ◽

Text Classification ◽

Vector Space Model ◽

Classification Algorithm ◽

Space Model ◽

Amount Of Information ◽

Access Information ◽

Weighting Methods ◽

General Vector

The amount of information in the Internet of Things is increasing explosively as more and more data are sensed by a large number of sensors. This explosive growth makes it difficult to access information efficiently, so using text classification to reduce the amount of information transferred over the network is an effective approach. This paper proposes a new text classification algorithm based on the vector space model. The algorithm improves the feature selection and weighting methods of traditional text classification algorithms by introducing synonym replacement. The experimental results show that the proposed algorithm considerably improves the precision and recall of classification.
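One way to picture synonym replacement ahead of feature counting is the sketch below. It is only an illustration of the general idea: the `SYNONYMS` table, the whitespace tokenizer, and the function names are assumptions, and the paper's actual feature selection and weighting are not reproduced here.

```python
from collections import Counter

# Hypothetical synonym table mapping each variant to one canonical term.
SYNONYMS = {"automobile": "car", "vehicle": "car", "temp": "temperature"}


def normalize(tokens, synonyms=SYNONYMS):
    """Replace each token with its canonical synonym when one is known."""
    return [synonyms.get(token, token) for token in tokens]


def term_counts(text, synonyms=SYNONYMS):
    """Count terms after lowercasing, whitespace splitting, and synonym replacement."""
    tokens = text.lower().split()
    return Counter(normalize(tokens, synonyms))


counts = term_counts("Automobile sensor reports car temp and vehicle speed")
print(counts["car"])          # 3: "automobile", "car", and "vehicle" are merged
print(counts["temperature"])  # 1
```

Merging synonyms before counting shrinks the vocabulary, so the resulting vectors have fewer dimensions and are less sparse.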

Analysis of Text Classification with various Term Weighting Schemes in Vector Space Model

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.d1938.0891020 ◽

2020 ◽

Vol 9 (10) ◽

pp. 390-393

Keyword(s):

Vector Space ◽

Text Classification ◽

Naive Bayes ◽

Information Gain ◽

Vector Space Model ◽

Naïve Bayes ◽

Weighting Scheme ◽

Term Weighting ◽

Space Model ◽

Weighting Methods

Term Weighting Scheme (TWS) is a key component of the matching mechanism when using the vector space model. In the context of information retrieval (IR) from text documents, this paper describes a new approach to term weighting that improves classification performance. We propose an effective term weighting scheme that gives the highest accuracy compared with the other text classification methods. We compare the performance of KNN and Naïve Bayes classification under different weighting methods, including information gain weighting, SVM, and the proposed method. We implemented many term-weighting methods (TWM) on Amazon data collections in combination with information gain and the SVM, KNN, and Naïve Bayes algorithms.
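For readers unfamiliar with term weighting in the vector space model, the sketch below shows the classic TF-IDF baseline. It is not the scheme proposed in the paper, which the abstract does not specify; the toy documents and the function name `tf_idf` are illustrative.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    weight = (term count / document length) * ln(N / document frequency)
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))  # count each term once per document

    vectors = []
    for doc in docs:
        term_freq = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in term_freq.items()
        })
    return vectors


# Toy tokenized reviews, standing in for a real collection.
docs = [["good", "battery", "life"], ["bad", "battery"], ["good", "screen"]]
weights = tf_idf(docs)
print(weights[0])
```

A term that appears in fewer documents receives a larger weight, so in the first document "life" outweighs "battery", and a term present in every document gets weight zero.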

A Chinese text classification model based on vector space and semantic meaning

Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826) ◽

10.1109/icmlc.2004.1382361 ◽

2005 ◽

Author(s):

Bao-Yi Wang ◽

Shao-Min Zhang

Keyword(s):

Vector Space ◽

Chinese Text ◽

Text Classification ◽

Classification Model ◽

Semantic Meaning ◽

Chinese Text Classification ◽

Model Based
