Text retrieval based on the feature conversion of vector space

Parallel text retrieval on a high performance supercomputer using the Vector Space Model

Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '95 ◽

10.1145/215206.215332 ◽

1995 ◽

Cited By ~ 4

Author(s):

P. Efraimidis ◽

C. Glymidakis ◽

B. Mamalis ◽

P. Spirakis ◽

B. Tampakas

Keyword(s):

Vector Space ◽

High Performance ◽

Vector Space Model ◽

Text Retrieval ◽

Space Model ◽

Parallel Text

Download Full-text

INFORMATION RETRIEVAL TUGAS AKHIR DAN PERHITUNGAN KEMIRIPAN DOKUMEN MENGACU PADA ABSTRAK MENGGUNAKAN VECTOR SPACE MODEL

Simetris Jurnal Teknik Mesin Elektro dan Ilmu Komputer ◽

10.24176/simet.v8i1.1016 ◽

2017 ◽

Vol 8 (1) ◽

pp. 355-362

Author(s):

Putri Elfa Mas'udia ◽

Martono Dwi Atmadja ◽

Lis Diana Mustafa

Keyword(s):

Information Retrieval ◽

Vector Space ◽

Real Time ◽

Vector Space Model ◽

Text Retrieval ◽

Space Model ◽

Stop Word ◽

Information Text

Pencarian pada database yang biasa dilakukan mahasiswa hanya mampu mencari judul yang sesuai berdasarkan kata kunci yang diinputkan, misalnya, jika kata kunci yang dimasukkan adalah “sistem cerdas” maka akan ditampilkan semua dokumen yang mengandung kata “sistem cerdas” namun sistem tidak bisa mengukur mana dokumen yang paling mirip. Untuk dapat melakukan pencarian berdasar substansi yang paling mirip, terdapat teknologi yang disebut information Text Retrieval. Dalam penelitian ini akan dikembangkan suatu sistem temu kembali informasi judul tugas akhir dan perhitungan kemiripan dokumen menggunakan vector space model. Sistem secara otomatis akan melakukan indexing secara offline dan temu kembali (retrieval) secara real time. Proses retrieval dimulai dengan mengambil query dari pengguna, menerapkan stop word removal sehingga dihasilkan keyword yang compaq tetapi dapatmewakili query tersebut, kemudian sistem menghitung kemiripan antarakeyword dengan daftar dokumen yang diwakili oleh term-term di dalam index. Dokumen akan ditampilkan diurutkan berdasarkan dokumen yang paling mirip.Dari hasil pengujian terlihat ketika keyword “android” dimasukkan maka akan tampil empat dokumen yang diurutkan sesuai tingkat kemiripannya, yaitu docId 3 dengan tingkat kemiripan 0.9512, docId 4 dengan tingkat kemiripan 0.5020, docId 2 dengan tingkat kemiripan 0.2671, docId 8 dengan tingkat kemiripan 0.1522.

Download Full-text

A generalized vector space model for text retrieval based on semantic relatedness

10.3115/1609179.1609188 ◽

2009 ◽

Cited By ~ 28

Author(s):

George Tsatsaronis ◽

Vicky Panagiotopoulou

Keyword(s):

Vector Space ◽

Vector Space Model ◽

Semantic Relatedness ◽

Text Retrieval ◽

Space Model

Download Full-text

Conversion Problems concerning Automated Mapping from lCD-10 to lCD-9

Methods of Information in Medicine ◽

10.1055/s-0038-1634529 ◽

1998 ◽

Vol 37 (03) ◽

pp. 254-259 ◽

Cited By ~ 9

Author(s):

A. Zaiss ◽

R. Brunner ◽

D. Spinner ◽

R. Klar ◽

S. Schulz

Keyword(s):

Vector Space ◽

Information Content ◽

Classification System ◽

Text Retrieval ◽

Retrieval Method ◽

Consistent System ◽

Automated Mapping ◽

Structural Differences ◽

Icd 10 ◽

Future Revision

AbstractThe increasing parallel use of ICD-9 and ICD-10 complicates the comparability of coded diagnoses. This is the reason why we developed a symmetric table for interactive conversion between ICD-9 and ICD-10, based on a vector space text-retrieval method that resulted in unambiguous mapping from ICD-9 to ICD-10 in 64%, from ICD-10 to ICD-9 in 87% of all three- and four-character classes of the tabular list. Out of the remaining 13% of multi-valued relations, a table for automated mapping from ICD-10 to ICD-9 was created. In 9% of cases, the selection offered no problems. A compromise between preserving information content and maintaining the logical integrity had to be found in 2.4%; in 1.6% automated mapping was impossible because of newly defined concepts and structural differences between ICD-9 and ICD-10 that are not counterbalanced by a consistent system of residual categories. We recommend that in a future revision of the ICD, compatibility with the then existing classification system should be considered.

Download Full-text

On Semi - Pre irresolute Topological Vector Space

Diyala Journal for Pure Science ◽

10.24237/djps.1403.433c ◽

2018 ◽

Vol 14 (3) ◽

pp. 184-192

Author(s):

Radhi Ali ◽

◽

Jalal Hussein Bayati ◽

Suhad Hameed

Keyword(s):

Vector Space ◽

Topological Vector Space

Download Full-text

Extended Vector Space Model with Semantic Relatedness on Java Archive Search Engine

Jurnal Teknik Informatika dan Sistem Informasi ◽

10.28932/jutisi.v1i2.372 ◽

2015 ◽

Vol 1 (2) ◽

Cited By ~ 2

Author(s):

Oscar Karnalim

Keyword(s):

Vector Space ◽

Search Engine ◽

Vector Space Model ◽

Semantic Relatedness ◽

Space Model

Download Full-text

Aplikasi Deteksi Kemiripan Tugas Paper

Matrik Jurnal Manajemen Teknik Informatika dan Rekayasa Komputer ◽

10.30812/matrik.v15i2.39 ◽

2017 ◽

Vol 15 (2) ◽

pp. 5

Author(s):

Anthony Anggrawan ◽

Azhari

Keyword(s):

Information Retrieval ◽

Vector Space ◽

Vector Space Model ◽

Mean Average Precision ◽

Average Precision ◽

Information Searching ◽

Space Model ◽

Model Method

Information searching based on users’ query, which is hopefully able to find the documents based on users’ need, is known as Information Retrieval. This research uses Vector Space Model method in determining the similarity percentage of each student’s assignment. This research uses PHP programming and MySQL database. The finding is represented by ranking the similarity of document with query, with mean average precision value of 0,874. It shows how accurate the application with the examination done by the experts, which is gained from the evaluation with 5 queries that is compared to 25 samples of documents. If the number of counted assignments has higher similarity, thus the process of similarity counting needs more time, it depends on the assignment’s number which is submitted.

Download Full-text

A Ranking model of proximal and structural text retrieval based on region algebra

Proceedings of the conference on SIGGRAPH 2004 course notes - GRAPH '04 ◽

10.3115/1075178.1075185 ◽

2003 ◽

Cited By ~ 3

Author(s):

Katsuya Masuda

Keyword(s):

Text Retrieval ◽

Ranking Model

Download Full-text

Unifying Bayesian Inference and Vector Space Models for Improved Decipherment

10.3115/v1/p15-1081 ◽

2015 ◽

Cited By ~ 1

Author(s):

Qing Dou ◽

Ashish Vaswani ◽

Kevin Knight ◽

Chris Dyer

Keyword(s):

Bayesian Inference ◽

Vector Space ◽

Vector Space Models

Download Full-text

Aplikasi Rekomendasi Buku Pada Katalog Perpustakaan Universitas Multimedia Nusantara Menggunakan Vector Space Model

Jurnal ULTIMATICS ◽

10.31937/ti.v9i2.639 ◽

2018 ◽

Vol 9 (2) ◽

pp. 97-105

Author(s):

Richard Firdaus Oeyliawan ◽

Dennis Gunawan

Keyword(s):

Vector Space ◽

Vector Space Model ◽

Vector Model ◽

Library Management ◽

Space Model ◽

Library Management System ◽

Index Terms ◽

Library Catalogue ◽

Language Sample ◽

F Measure

Library is one of the facilities which provides information, knowledge resource, and acts as an academic helper for readers to get the information. The huge number of books which library has, usually make readers find the books with difficulty. Universitas Multimedia Nusantara uses the Senayan Library Management System (SLiMS) as the library catalogue. SLiMS has many features which help readers, but there is still no recommendation feature to help the readers finding the books which are relevant to the specific book that readers choose. The application has been developed using Vector Space Model to represent the document in vector model. The recommendation in this application is based on the similarity of the books description. Based on the testing phase using one-language sample of the relevant books, the F-Measure value gained is 55% using 0.1 as cosine similarity threshold. The books description and variety of languages affect the F-Measure value gained. Index Terms—Book Recommendation, Porter Stemmer, SLiMS Universitas Multimedia Nusantara, TF-IDF, Vector Space Model

Download Full-text