A common architecture for different text processing techniques in an information retrieval environment

Fast text processing for information retrieval

10.3115/112405.112480

1991

Cited By ~ 2

Author(s): Tomek Strzalkowski, Barbara Vauthey

Keyword(s): Information Retrieval, Text Processing

RESEARCHING METHODS FOR PROCESSING TEXT INFORMATION AND REVIEWING THE STAGES OF AN ARTIFICIAL INTELLIGENCE MODEL CREATION AT PRODUCING CHATBOTS

Automation and modeling in design and management of

10.30987/2658-6436-2021-2-19-23

2021

Vol 2021 (2)

pp. 19-23

Author(s): Anastasiya Ivanova, Aleksandr Kuz'menko, Rodion Filippov, Lyudmila Filippova, Anna Sazonova, ...

Keyword(s): Neural Network, Artificial Intelligence, Machine Learning, Text Processing, Text Format, Machine Learning Methods, Methods And Techniques, Text Information, Processing Techniques, Machine Processing

Building a chatbot based on a neural network requires machine processing of text, which in turn involves various methods and techniques for analyzing phrases and sentences. The article considers the most popular solutions and models for analyzing data in text format: lemmatization, vectorization, and machine learning methods. Particular attention is paid to text processing techniques; after analyzing them, the best method was identified and tested.
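As a rough illustration of the vectorization step this abstract mentions, the following is a minimal bag-of-words sketch. It is not the article's method: the function names `build_vocabulary` and `vectorize` are hypothetical, and tokenization by whitespace is an assumption made only to keep the example short.

```python
def build_vocabulary(docs):
    """Map each distinct lowercase token to a fixed column index."""
    tokens = sorted({tok for doc in docs for tok in doc.lower().split()})
    return {tok: i for i, tok in enumerate(tokens)}


def vectorize(doc, vocab):
    """Return a count vector for one document; unknown tokens are ignored."""
    vec = [0] * len(vocab)
    for tok in doc.lower().split():
        if tok in vocab:
            vec[vocab[tok]] += 1
    return vec


docs = ["hello bot", "hello hello world"]
vocab = build_vocabulary(docs)
vectors = [vectorize(d, vocab) for d in docs]
```

Each phrase becomes a fixed-length list of counts, which is the kind of numeric input a neural network or other machine learning model can consume.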

Information Retrieval Using Xquery Processing Techniques

International Journal of Database Management Systems

10.5121/ijdms.2011.3104

2011

Vol 3 (1)

pp. 50-58

Cited By ~ 2

Author(s): E.J.Thomson Fredrick, G Radhamani

Keyword(s): Information Retrieval, Processing Techniques

Verification of Uncurated Protein Annotations

Information Retrieval in Biomedicine

10.4018/978-1-60566-274-9.ch016

2010

pp. 301-314

Author(s): Francisco M. Couto, Mário J. Silva, Vivian Lee, Emily Dimmer, Evelyn Camon, ...

Keyword(s): Information Retrieval, Molecular Biology, Domain Knowledge, Text Processing, The Other, Research Projects, Evidence Text, Biological Sources, Biology Research, Automatic Text

Molecular biology research projects have produced vast amounts of data, part of which has been preserved in a variety of public databases. However, a large portion of the data contains a significant number of errors and therefore requires careful verification by curators, a painful and costly task, before it is reliable enough to derive valid conclusions from. On the other hand, research in biomedical information retrieval and information extraction is now delivering text mining solutions that can help curators work more efficiently and deliver better data resources. Over the past decades, automatic text processing systems have successfully exploited the biomedical scientific literature to reduce the effort researchers need to keep up to date, but many of these systems still rely on manually integrated domain knowledge, which leads to unnecessary overheads and restrictions in their use. A more efficient approach would acquire the domain knowledge automatically from publicly available biological sources, such as BioOntologies, rather than using manually inserted domain knowledge. An example of this approach is GOAnnotator, a tool that assists the verification of uncurated protein annotations. It provided correct evidence text to the curators at 93% precision and thus achieved promising results. GOAnnotator was implemented as a web tool that is freely available at http://xldb.di.fc.ul.pt/rebil/tools/goa/.

Using text processing techniques to automatically enrich a domain ontology

Proceedings of the international conference on Formal Ontology in Information Systems - FOIS '01

10.1145/505168.505194

2001

Cited By ~ 68

Author(s): Paola Velardi, Paolo Fabriani, Michele Missikoff

Keyword(s): Text Processing, Domain Ontology, Processing Techniques

Appearance of New Terms in Accounting Language: A Preliminary Examination of Accounting Pronouncements and Financial Statements

Journal of Emerging Technologies in Accounting

10.2308/jeta.2008.5.1.17

2008

Vol 5 (1)

pp. 17-36

Cited By ~ 4

Author(s): Margaret R. Garnsey, Ingrid E. Fisher

Keyword(s): Information Retrieval, Natural Language Processing, Natural Language, Language Processing, Preliminary Analysis, Financial Statements, Preliminary Examination, Statistical Natural Language Processing, Processing Techniques, Initial Results

ABSTRACT: Accounting language evolves as the transactions and organizations it provides guidance for change. We provide a preliminary analysis of terms used in official accounting pronouncements and annual corporate financial statements. Initial results show statistical natural language-processing techniques provide a means of identifying new terms as they enter the lexicon. These techniques should be valuable in deriving a complete accounting lexicon as well as in constructing and maintaining an accounting thesaurus to support information retrieval.
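One simple statistical way to flag candidate new terms is to compare vocabulary across two time periods. The sketch below is an assumption-laden illustration and not the authors' procedure: the function names, the `min_count` threshold, and the sample sentences are all hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase word tokens; hyphenated words are kept whole."""
    return re.findall(r"[a-z]+(?:-[a-z]+)*", text.lower())


def new_terms(earlier_docs, later_docs, min_count=2):
    """Terms seen at least min_count times later but never in the earlier corpus."""
    earlier = Counter(tok for doc in earlier_docs for tok in tokenize(doc))
    later = Counter(tok for doc in later_docs for tok in tokenize(doc))
    return sorted(
        term for term, count in later.items()
        if count >= min_count and term not in earlier
    )


earlier_docs = ["Revenue is recognized when earned."]
later_docs = [
    "Goodwill impairment and derivative hedging.",
    "Derivative instruments require hedging disclosures.",
]
candidates = new_terms(earlier_docs, later_docs)
```

The frequency threshold filters out one-off words, so only terms that recur in the later documents are proposed for review.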

Text Processing Techniques in Approaches for Automated Composition of Domain Models

Proceedings of the 16th International Conference on Evaluation of Novel Approaches to Software Engineering

10.5220/0010533904890500

2021

Author(s): Viktorija Gribermane, Erika Nazaruka

Keyword(s): Text Processing, Domain Models, Automated Composition, Processing Techniques

Improving the Accuracy of Text Classification using Stemming Method, A Case of Non-formal Indonesian Conversation

10.21203/rs.3.rs-41431/v2

2020

Author(s): Rianto Rianto, Achmad Benny Mutiara, Eri Prasetyo Wibowo, Paulus Insap Santosa

Keyword(s): Support Vector Machine, Information Retrieval, Text Classification, Experimental Evaluation, Hate Speech, Text Processing, High Accuracy, Support Vector, Support Vector Machine Algorithm, Text Data

Abstract: Stemming has long been used in data pre-processing for information retrieval, where it reduces affixed words to their root forms. However, there are not many stemming methods for non-formal Indonesian text processing. The existing stemming method has high accuracy for formal Indonesian but low accuracy for non-formal Indonesian. Thus, a stemming method that gives a classifier model high accuracy on non-formal Indonesian is still an open challenge. This study introduces a new stemming method to solve problems in pre-processing non-formal Indonesian text data. Furthermore, this study aims to provide comprehensive research on improving the accuracy of text classifier models by strengthening the stemming method. Using the Support Vector Machine algorithm, a text classifier model is developed and its accuracy is checked. The experimental evaluation was done by testing 550 datasets in Indonesian using two different stemming methods. The results show that the text classifier model using the proposed stemming method has higher accuracy than the one using the existing method, with scores of 0.85 and 0.73, respectively. In the future, the proposed stemming method can be used to develop Indonesian text classifier models for various purposes, including text clustering, summarization, hate speech detection, and other text processing applications.
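To make the idea of reducing affixed words to root words concrete, here is a deliberately naive affix-stripping sketch. It is not the stemming method proposed in the study: the short affix lists, the `min_root` guard, and the function name `naive_stem` are illustrative assumptions, and a real Indonesian stemmer needs far more rules and a root-word dictionary.

```python
# Tiny, illustrative affix lists (assumed for this sketch, far from complete).
PREFIXES = ("mem", "di", "ber")
SUFFIXES = ("nya", "kan", "lah")


def naive_stem(word, min_root=3):
    """Strip at most one suffix and one prefix, keeping at least min_root letters."""
    w = word.lower()
    for suffix in SUFFIXES:
        if w.endswith(suffix) and len(w) - len(suffix) >= min_root:
            w = w[: -len(suffix)]
            break
    for prefix in PREFIXES:
        if w.startswith(prefix) and len(w) - len(prefix) >= min_root:
            w = w[len(prefix):]
            break
    return w


words = ["membaca", "dibaca", "bacalah", "baca", "bukunya"]
stems = [naive_stem(w) for w in words]
vocabulary_before = len(set(words))
vocabulary_after = len(set(stems))
```

Collapsing variants of the same root shrinks the feature vocabulary, which is one reason the choice of stemmer can affect the accuracy of a downstream classifier such as a Support Vector Machine.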

Robust text processing and information retrieval

Proceedings of the workshop on Human Language Technology - HLT '93

10.3115/1075671.1075787

1993

Author(s): Tomek Strzalkowski

Keyword(s): Information Retrieval, Text Processing

Tweet Retrieval and Analysing the Trends

International Journal of Recent Technology and Engineering - 2

10.35940/ijrte.b6102.0710221

2021

Vol 10 (2)

pp. 34-38

Author(s): Utkarsh Malik, Harpreet Kaur, Aditi Chaudhary, ...

Keyword(s): Social Media, Data Science, Text Processing, Political Polarization, Product Analysis, Protest Movements, Social Media Platforms, Media Platform, Processing Techniques, Share Information

The importance of social media in today's technology era cannot be disregarded. The Internet is in almost every hand. People use various social media platforms to express themselves and their thinking about topics such as politics, entertainment, and sports. In the data science industry, trend analysis can be used for several purposes, such as marketing or product analysis. Twitter data has been used to analyze political polarization and the spread of protest movements. Twitter is one of the most popular social media platforms that allow users to spread and share information. Twitter publishes a list of recent topics, named "Trending Topics", which shows what is happening in the world and what people's opinions about those topics are. This Trend Analyzer works on a given set of tweets, generates a graph based on them, and shows the comparative popularity of the hashtags used. The Analyzer examines a set of tweets using Python and text-processing techniques.
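The hashtag comparison described here can be sketched with the Python standard library alone. This is a minimal illustration under stated assumptions, not the paper's Trend Analyzer: the function name `hashtag_counts`, the choice to count a hashtag once per tweet, and the sample tweets are all hypothetical, and plotting is left out.

```python
import re
from collections import Counter

# A hashtag is assumed to be '#' followed by word characters.
HASHTAG = re.compile(r"#(\w+)")


def hashtag_counts(tweets):
    """Count how many tweets mention each hashtag, ignoring case."""
    counts = Counter()
    for tweet in tweets:
        # A set ensures a hashtag repeated within one tweet is counted once.
        counts.update({tag.lower() for tag in HASHTAG.findall(tweet)})
    return counts


tweets = [
    "Loving #Python and #python!",
    "#DataScience with #Python",
    "No tags here",
]
counts = hashtag_counts(tweets)
ranking = counts.most_common()
```

The resulting `ranking` list, ordered from most to least mentioned, is the data a bar chart of comparative hashtag popularity would be drawn from.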
