A Hybrid SOM-Based Document Organization System

Organizing documents is necessary to make good use of them. Organizing online documents is difficult because each document must be assigned to an appropriate group as soon as it appears on the web. Classification is one of the most effective ways to organize documents, and naive Bayes categorization plays a vital role in it. Naive Bayes is one of the simplest probabilistic Bayesian classifiers; it assumes that the effect of an attribute value on a given category is independent of the values of the other attributes. Document classification is an essential organization task and is necessary for the efficient management of textual information systems. Classification methods may be unsupervised, supervised, or semi-supervised. This paper reviews and studies various document organization approaches that use naive Bayesian classification, along with other related existing document organization methods.
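
The independence assumption described in the abstract above can be illustrated with a minimal sketch of a multinomial naive Bayes text classifier. This is not the method of the reviewed paper: the function names `train_naive_bayes` and `classify`, the whitespace tokenization, the add-one (Laplace) smoothing, and the toy training data are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(labeled_docs):
    """Collect per-category document and word counts.

    labeled_docs is a list of (text, category) pairs (hypothetical input format).
    """
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, category in labeled_docs:
        doc_counts[category] += 1
        for word in text.lower().split():
            word_counts[category][word] += 1
            vocab.add(word)
    return {"doc_counts": doc_counts, "word_counts": word_counts, "vocab": vocab}


def classify(model, text):
    """Return the category with the highest log posterior for the text."""
    total_docs = sum(model["doc_counts"].values())
    vocab_size = len(model["vocab"])
    best_category, best_score = None, -math.inf
    for category, n_docs in model["doc_counts"].items():
        # Log prior: the fraction of training documents in this category.
        score = math.log(n_docs / total_docs)
        total_words = sum(model["word_counts"][category].values())
        for word in text.lower().split():
            if word not in model["vocab"]:
                continue  # words never seen in training are ignored
            count = model["word_counts"][category][word]
            # Independence assumption: each word contributes its own
            # likelihood term, smoothed with add-one (Laplace) smoothing.
            score += math.log((count + 1) / (total_words + vocab_size))
        if score > best_score:
            best_category, best_score = category, score
    return best_category


# Toy training data, invented for this example.
training = [
    ("the team won the match", "sports"),
    ("goal scored in the final match", "sports"),
    ("stocks fell as the market closed", "finance"),
    ("the bank raised interest rates", "finance"),
]
model = train_naive_bayes(training)
```

Because each word adds an independent log-likelihood term, classification reduces to summing per-word scores for each category and choosing the largest total.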

Organization of American States: Document Organization, Distribution and Control

International Documents for the 80’s ◽

10.1515/9783110842579-025 ◽

1982 ◽

pp. 168-184

Author(s):

A. C. Keefer

Keyword(s):

Organization Of American States ◽

American States ◽

Document Organization ◽

And Control

Semantic mapping and K-means applied to hybrid SOM-based document organization system construction

Proceedings of the 2008 ACM symposium on Applied computing - SAC '08 ◽

10.1145/1363686.1363945 ◽

2008 ◽

Cited By ~ 1

Author(s):

Renato Fernandes Corrêa ◽

Teresa Bernarda Ludermir

Keyword(s):

Semantic Mapping ◽

System Construction ◽

Document Organization

ORGANIZAÇÃO DOCUMENTAL PARA DISSEMINAÇÃO DA INFORMAÇÃO: a construção de um tesauro experimental sobre os crustáceos usados na culinária alagoana = DOCUMENTAL ORGANIZATION FOR INFORMATION DISSEMINATION: the construction of an experimental thesaurus on the crustaceans used in Alagoas cuisine

Revista Bibliomar ◽

10.18764/2526-6160v20n1.2021.8 ◽

2021 ◽

Vol 20 (1) ◽

pp. 168

Author(s):

Paulo Daniel Marcos dos Santos ◽

Daiana da Conceição Alves de Magalhães ◽

Nelma Camêlo de Araujo

Keyword(s):

Information Dissemination ◽

Semantic Relationship ◽

Specific Subject ◽

Document Organization

A thesaurus is a listing scheme that serves as an instrument of document organization, in which words are semantically related within a specific subject or theme. This relationship is established hierarchically through descriptors that set a standard and give greater specificity to the theme, here “Crustaceans used in Alagoas cuisine”. While producing this article, it became apparent that the literature on seafood on the coast of Alagoas is not arranged in an organized way, so the topic was judged suitable for developing this experimental thesaurus. Following the instructions of the Thesaurus system, it was possible to analyse, retrieve, and index the information, making this theme available as a standardized documentary record and contributing to other researchers and scholars interested in the subject, whether for the crustaceans theme or for the structuring.

Document Organization

Vectorworks for Entertainment Design ◽

10.4324/9780429290671-6 ◽

2020 ◽

pp. 29-54

Author(s):

Kevin Lee Allen

Keyword(s):

Document Organization

A Similarity Rough Set Model for Document Representation and Document Clustering

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2011.p0125 ◽

2011 ◽

Vol 15 (2) ◽

pp. 125-133 ◽

Cited By ~ 3

Author(s):

Nguyen Chi Thanh ◽

Koichi Yamada ◽

Muneyuki Unehara

Keyword(s):

Vector Space ◽

Rough Set ◽

Document Clustering ◽

Vector Space Model ◽

Document Representation ◽

Space Model ◽

Large Sets ◽

Tolerance Rough Set ◽

Document Organization ◽

The One

Document clustering is a text mining technique for unsupervised document organization. It helps users browse and navigate large sets of documents. Ho et al. proposed a Tolerance Rough Set Model (TRSM) [1] to improve the vector space model, which represents documents as vectors of terms, and applied it to document clustering. In this paper we analyze their model and propose a new model for efficient clustering of documents. We introduce the Similarity Rough Set Model (SRSM) as another model for representing documents in document clustering. The model is evaluated by experiments on test collections. The experimental results show that the SRSM document clustering method outperforms the one based on TRSM, and that the results of SRSM are less affected by the parameter value than those of TRSM.
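
To give a rough sense of how a tolerance rough set model can enrich a term-based document representation, the sketch below builds tolerance classes from term co-occurrence and uses them to expand a document's term set. The abstract does not spell out these details, so everything here is an assumption based on a common formulation of TRSM: two terms are treated as tolerant when they co-occur in at least `theta` documents, and a document's upper approximation collects every term whose tolerance class overlaps the document. The function names and the toy corpus are hypothetical, and SRSM itself is not shown.

```python
from collections import Counter
from itertools import combinations


def tolerance_classes(docs, theta):
    """Map each term to the set of terms it co-occurs with in >= theta documents.

    docs is a list of term sets; theta is the co-occurrence threshold
    (the parameter whose influence the abstract mentions).
    """
    cooccur = Counter()
    vocab = set()
    for terms in docs:
        vocab |= terms
        for a, b in combinations(sorted(terms), 2):
            cooccur[(a, b)] += 1
    # Every term belongs to its own tolerance class (reflexivity).
    classes = {term: {term} for term in vocab}
    for (a, b), n in cooccur.items():
        if n >= theta:
            # The relation is symmetric, so both classes are updated.
            classes[a].add(b)
            classes[b].add(a)
    return classes


def upper_approximation(doc_terms, classes):
    """Return all terms whose tolerance class shares a term with the document."""
    return {term for term, cls in classes.items() if cls & doc_terms}


# Toy corpus, invented for this example.
corpus = [
    {"rough", "set", "model"},
    {"rough", "set", "cluster"},
    {"vector", "space", "model"},
]
classes = tolerance_classes(corpus, theta=2)
expanded = upper_approximation({"rough", "model"}, classes)
```

In this toy corpus only "rough" and "set" co-occur twice, so a document that mentions "rough" is expanded to include "set". Clustering would then operate on the expanded term sets rather than on the raw ones.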
