Natural Language Grammar Induction of Indonesian Language Corpora Using Genetic Algorithm

Design of GA and Ontology based NLP Frameworks for Online Opinion Mining

Recent Patents on Engineering ◽

10.2174/1872212112666180115162726 ◽

2019 ◽

Vol 13 (2) ◽

pp. 159-165

Author(s):

Manik Sharma ◽

Gurvinder Singh ◽

Rajinder Singh

Keyword(s):

Genetic Algorithm ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Opinion Mining ◽

Hybrid Genetic Algorithm ◽

Online Reviews ◽

Middle Tier ◽

Complete Set ◽

Mining Model

Background: For almost every domain, a tremendous degree of data is accessible in an online and offline mode. Billions of users are daily posting their views or opinions by using different online applications like WhatsApp, Facebook, Twitter, Blogs, Instagram etc. Objective: These reviews are constructive for the progress of the venture, civilization, state and even nation. However, this momentous amount of information is useful only if it is collectively and effectively mined. Methodology: Opinion mining is used to extract the thoughts, expression, emotions, critics, appraisal from the data posted by different persons. It is one of the prevailing research techniques that coalesce and employ the features from natural language processing. Here, an amalgamated approach has been employed to mine online reviews. Results: To improve the results of genetic algorithm based opining mining patent, here, a hybrid genetic algorithm and ontology based 3-tier natural language processing framework named GAO_NLP_OM has been designed. First tier is used for preprocessing and corrosion of the sentences. Middle tier is composed of genetic algorithm based searching module, ontology for English sentences, base words for the review, complete set of English words with item and their features. Genetic algorithm is used to expedite the polarity mining process. The last tier is liable for semantic, discourse and feature summarization. Furthermore, the use of ontology assists in progressing more accurate opinion mining model. Conclusion: GAO_NLP_OM is supposed to improve the performance of genetic algorithm based opinion mining patent. The amalgamation of genetic algorithm, ontology and natural language processing seems to produce fast and more precise results. The proposed framework is able to mine simple as well as compound sentences. However, affirmative preceded interrogative, hidden feature and mixed language sentences still be a challenge for the proposed framework.

Download Full-text

Genetic algorithm based sentence packaging in natural language text generation

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/537/4/042003 ◽

2019 ◽

Vol 537 ◽

pp. 042003

Author(s):

Dmitry Devyatkin ◽

Vadim Isakov ◽

Alexander Shvets

Keyword(s):

Genetic Algorithm ◽

Natural Language ◽

Text Generation ◽

Natural Language Text ◽

Language Text

Download Full-text

Fuzzy Linguistic Summarization with Genetic Algorithm: An Application with Operational and Financial Healthcare Data

International Journal of Uncertainty Fuzziness and Knowledge-Based Systems ◽

10.1142/s021848851750026x ◽

2017 ◽

Vol 25 (04) ◽

pp. 599-620 ◽

Cited By ~ 3

Author(s):

Tunahan Altintop ◽

Ronald R. Yager ◽

Diyar Akay ◽

Fatih Emre Boran ◽

Muhammet Ünal

Keyword(s):

Genetic Algorithm ◽

Natural Language ◽

Real Data ◽

Threshold Value ◽

Healthcare Services ◽

Vital Role ◽

Healthcare Data ◽

Medical Healthcare ◽

Fuzzy Linguistic ◽

First Time

It is now well recognized that knowledge extracted from rich healthcare data play a vital role for delivery, management and planning of healthcare services. So far, however, there is not much study done on the domain of operational and financial healthcare data since, up to now, a great deal of works are dedicated to clinical/medical healthcare data for the purposes of diagnosis and treatment of diseases. In this paper, an attempt is made, by applying fuzzy linguistic summarization, for the first time to discover knowledge from operational and financial healthcare data. Fuzzy linguistic summarization, in its simplest term, provides natural language based summaries from a dataset in a human consistent way along with a degree of truth attached to each summary. While basically valuable, its benefit can be increased by only generating summaries with a degree of truth above than an indicated threshold value. A genetic algorithm is developed within this context in order to eliminate less promising and useless linguistic summaries. We assess the proposed approach experimentally on a real data and evaluate the generated summaries to gain actionable insights from them.

Download Full-text

Variational Bayesian Grammar Induction for Natural Language

Grammatical Inference: Algorithms and Applications - Lecture Notes in Computer Science ◽

10.1007/11872436_8 ◽

2006 ◽

pp. 84-96 ◽

Cited By ~ 8

Author(s):

Kenichi Kurihara ◽

Taisuke Sato

Keyword(s):

Natural Language ◽

Grammar Induction ◽

Variational Bayesian

Download Full-text

Natural language grammar induction with a generative constituent-context model

Pattern Recognition ◽

10.1016/j.patcog.2004.03.023 ◽

2005 ◽

Vol 38 (9) ◽

pp. 1407-1419 ◽

Cited By ~ 18

Author(s):

Dan Klein ◽

Christopher D. Manning

Keyword(s):

Natural Language ◽

Grammar Induction ◽

Context Model

Download Full-text

An inductive method with genetic algorithm for learning phrase-structure-rule of natural language

Wuhan University Journal of Natural Sciences ◽

10.1007/bf02900899 ◽

1996 ◽

Vol 1 (3-4) ◽

pp. 640-644 ◽

Cited By ~ 1

Author(s):

Houfeng Wang ◽

Dawei Dai

Keyword(s):

Genetic Algorithm ◽

Natural Language ◽

Phrase Structure ◽

Inductive Method

Download Full-text

Stochastic mutation approach for grammar induction using Genetic Algorithm

2010 2nd International Conference on Electronic Computer Technology ◽

10.1109/icectech.2010.5479969 ◽

2010 ◽

Cited By ~ 1

Author(s):

N. S. Choubey ◽

M. U. Kharat

Keyword(s):

Genetic Algorithm ◽

Grammar Induction

Download Full-text

Generating Creative Language - Theories, Practice and Evaluation

10.31237/osf.io/9f3zm ◽

2020 ◽

Author(s):

Mika Hämäläinen

Keyword(s):

Genetic Algorithm ◽

Natural Language ◽

Ad Hoc ◽

Theoretical Perspective ◽

Theoretical Work ◽

Computational Creativity ◽

Practical Solution ◽

Language Generation ◽

Communicative Context ◽

Creative Language

This thesis presents approaches to computationally creative natural language generation focusing on theoretical foundations, practical solutions and evaluation. I defend that a theoretical definition is crucial for computational creativity and that the practical solution must closely follow the theoretical definition. Finally, evaluation must be based on the underlying theory and what was actually modelled in the practical solution. A theoretical void in the existing theoretical work on computational creativity is identified. The existing theories do not explicitly take into account the communicative nature of natural language. Therefore, a new theoretical framework is elaborated that identifies how computational creativity can take place in a setting that has a clear communicative goal. This introduces a communicative-creative trade off that sets limits to creativity in such a communicative context. My framework divides creativity in three categories: message creativity, contextual creativity and communicative creativity. Any computationally creative NLG approach not taking communicativity into account is called mere surface generation.I propose a novel master-apprentice approach for creative language generation. The approach consists of a genetic algorithm, the fitness functions of which correspond to different parameters defined as important for the creative task in question from a theoretical perspective. The output of the genetic algorithm together with possible human authored data are used to train the apprentice, which is a sequence-to-sequence neural network model. The role of the apprentice in the system is to approximate creative autonomy.Evaluation is approached from three different perspectives in this work: ad-hoc and abstract, theory-based and abstract, and theory-based and concrete. The first perspective is the most common one in the current literature and its shortcomings are demonstrated and discussed. This starts a gradual shift towards more meaningful evaluation by first using proper theories to define the task being modelled and finally reducing the room for subjective interpretation by suggesting the use of concrete evaluation questions.

Download Full-text

Towards Adversarial Genetic Text Generation

10.5121/csit.2021.110407 ◽

2021 ◽

Author(s):

Deniz Kavi

Keyword(s):

Genetic Algorithm ◽

Natural Language ◽

Language Processing ◽

Text Classification ◽

Future Research ◽

Grading System ◽

Text Generation ◽

Recent Success ◽

Clustering Model ◽

Better Than

Text generation is the task of generating natural language, and producing outputs similar to or better than human texts. Due to deep learning’s recent success in the field of natural language processing, computer generated text has come closer to becoming indistinguishable to human writing. Genetic Algorithms have not been as popular in the field of text generation. We propose a genetic algorithm combined with text classification and clustering models which automatically grade the texts generated by the genetic algorithm. The genetic algorithm is given poorly generated texts from a Markov chain, these texts are then graded by a text classifier and a text clustering model. We then apply crossover to pairs of texts, with emphasis on those that received higher grades. Changes to the grading system and further improvements to the genetic algorithm are to be the focus of future research.

Download Full-text

Natural Language Grammar Induction using a Constituent-Context Model

Advances in Neural Information Processing Systems 14 ◽

10.7551/mitpress/1120.003.0009 ◽

2002 ◽

Keyword(s):

Natural Language ◽

Grammar Induction ◽

Context Model

Download Full-text