Recognition of Tibetan Maximal-length Noun Phrases Based on Syntax Tree

Author(s):  
Congjun Long ◽  
Xuewen Zhou ◽  
Maoke Zhou

Frequently corresponding to syntactic components, the Maximal-length Noun Phrase (MNP) possesses abundant syntactic and semantic information and acts a certain semantic role in sentences. Recognition of MNP plays an important role in Natural Language Processing and lays the foundation for analyzing and understanding sentence structure and semantics. By comparing the essence of different MNPs, this article defines the MNP in the Tibetan language from the perspective of syntax tree. A total of 6,038 sentences are extracted from the syntax tree corpus, the structure type, boundary feature, and frequency of MNPs are analyzed, and the MNPs are recognized by applying the sequence tagging model and the syntactic analysis model. The accuracy, recall, and F1 score of the recognition results of applying sequence tagging model are 87.14%, 84.72%, and 85.92%, respectively. The accuracy, recall, and F1 score of the recognition results of applying syntactic analysis model are 87.66%, 87.63%, and 87.65%, respectively.

Author(s):  
Changshun Du ◽  
Lei Huang

Text sentiment analysis is one of the most important tasks in the field of public opinion monitoring, service evaluation and satisfaction analysis under network environments. Compared with the traditional Natural Language Processing analysis tools, convolution neural networks can automatically learn useful features from sentences and improve the performance of the affective analysis model. However, the original convolution neural network model ignores sentence structure information which is very important for text sentiment analysis. In this paper, we add piece-wise pooling to the convolution neural network, which allows the model to obtain the sentence structure. And the main features of different sentences are extracted to analyze the emotional tendencies of the text. At the same time, the user’s feedback involves many different fields, and there is less labeled data. In order to alleviate the sparsity of the data, this paper also uses the generative adversarial network to make common feature extractions, so that the model can obtain the common features associated with emotions in different fields, and improves the model’s Generalization ability with less training data. Experiments on different datasets demonstrate the effectiveness of this method.


2020 ◽  
pp. 116-128
Author(s):  
Isaac Oduro ◽  
Olivia Donkor

This paper discusses relative clause formation in Akan proverbs and normal sentences with particular attention to their similarities and differences. It explores the comparison of the relative clauses in Akan sentences and other specialized genres such as the proverbs. The paper further analyzes the relative clause occurring in both sentence and proverb structures in order to establish sameness and dichotomy in the syntactic uniqueness in both structures. Purposive sampling technique was employed to select the proverbs for this study. In all, twenty-three (23) proverbs were selected for the study. The study adopted the functional grammar approach in the analysis. The study revealed that the relative clause formation in some Akan proverbs and Akan normal sentences has both overt and covert antecedent noun phrases (ANPs). The headless antecedent noun phrase which is seen as a pronominal also undergoes binary mutation in order to account for the antecedent noun phrase and the relativizer which introduces the relative clause. There are also differences in the syntactic positions of the relative clause more especially the sentence structure type. Finally, there is a difference in the syntactic position of the resumptive pronouns in both structures.


Anaphora resolution is a procedure of replacing pronouns with its referring nouns which may be available in the same sentence or in different sentences within the same document. Even multiple approaches are available for anaphora resolution; there may be some space for semantic approaches which provides solution better than syntactic or corpus based approaches. In syntactic approaches, all noun phrases from the previous or current sentences are checked for getting constrains agreement with the anaphor and score value is calculated for a pair of Noun-Pronoun based on these constrains. The pair getting the highest positive score is treated as the result. In some cases the sentence structure is not changed, hence the score corresponding to the feature values and syntactic structures are same. Here the actual replacement of pronoun with noun depends on the meaning of the sentence especially the meaning of the verb. To resolve such situations, here we are proposing a method which is a combination of syntactic and semantic approaches based on ThemeSets or thematic sets. Using ThemeSets we are exploiting the role of verbal lexemes associated with the noun or pronoun for the resolution of anaphora. Anaphora resolution in semantic way has great importance in the modern era of artificial intelligence which enhance multi-dimensional research in the area of natural language processing in a better way.


2021 ◽  
Author(s):  
A.K.M. Nuhil Mehdy

Privacy and its importance to society have been studied for centuries. While our understanding and continued theory building to hypothesize how users make privacy disclosure decisions has increased over time, the struggle to find a one-size solution that satisfies the requirements of each individual remains unsolved. Depending on culture, gender, age, and other situational factors, the concept of privacy and users' expectations of how their privacy should be protected varies from person to person. The goal of this dissertation is to design and develop tools and algorithms to support personal privacy management for end-users. The foundation of this research is based on ensuring the appropriate flow of information based on a user's unique set of personalized rules, policies, and principles. This goal is achieved by building a context-aware and user-centric privacy framework that applies insights from the users' privacy decision-making process, natural language processing (NLP), and formal specification and verification techniques. We conducted a survey (N=401) based on the theory of planned behavior (TPB) to measure the way users' perceptions of privacy factors and intent to disclose information are affected by three situational factors embodied by hypothetical scenarios: information type, recipients' role, and trust source. To help build usable privacy tools, we developed multiple NLP models based on novel architectures and ground truth datasets, that can precisely recognize privacy disclosures through text by utilizing state-of-the-art semantic and syntactic analysis, the hidden pattern of sentence structure, tone of the author, and metadata from the content. We also designed a methodology to formally model, validate, and verify personalized privacy disclosure behavior based on the analysis of the users' situational decision-making process. A robust model checking tool (UPPAAL) is used to represent users' self-reported privacy disclosure behavior by an extended form of finite state automata (FSA). Further, reachability analysis is performed for the verification of privacy properties through computation tree logic (CTL) formulas. Most importantly, we study the correctness, explainability, usability, and acceptance of the proposed methodologies. This dissertation, through extensive amounts of experimental results, contributes several insights to the area of user-tailored privacy modeling and personalized privacy systems.


2018 ◽  
pp. 35-38
Author(s):  
O. Hyryn

The article deals with natural language processing, namely that of an English sentence. The article describes the problems, which might arise during the process and which are connected with graphic, semantic, and syntactic ambiguity. The article provides the description of how the problems had been solved before the automatic syntactic analysis was applied and the way, such analysis methods could be helpful in developing new analysis algorithms. The analysis focuses on the issues, blocking the basis for the natural language processing — parsing — the process of sentence analysis according to their structure, content and meaning, which aims to analyze the grammatical structure of the sentence, the division of sentences into constituent components and defining links between them.


2014 ◽  
Vol 19 (1) ◽  
pp. 1-18 ◽  
Author(s):  
EDITH KAAN ◽  
JOSEPH KIRKHAM ◽  
FRANK WIJNEN

According to recent views of L2-sentence processing, L2-speakers do not predict upcoming information to the same extent as do native speakers. To investigate L2-speakers’ predictive use and integration of syntactic information across clauses, we recorded event-related potentials (ERPs) from advanced L2-learners and native speakers while they read sentences in which the syntactic context did or did not allow noun-ellipsis (Lau, E., Stroud, C., Plesch, S., & Phillips, C. (2006). The role of structural prediction in rapid syntactic analysis. Brain and Language, 98, 74–88.) Both native and L2-speakers were sensitive to the context when integrating words after the potential ellipsis-site. However, native, but not L2-speakers, anticipated the ellipsis, as suggested by an ERP difference between elliptical and non-elliptical contexts preceding the potential ellipsis-site. In addition, L2-learners displayed a late frontal negativity for ungrammaticalities, suggesting differences in repair strategies or resources compared with native speakers.


2021 ◽  
Author(s):  
Carolinne Roque e Faria ◽  
Cinthyan Renata Sachs Camerlengo de Barb

Technology is becoming expressively popular among agribusiness producers and is progressing in all agricultural area. One of the difficulties in this context is to handle data in natural language to solve problems in the field of agriculture. In order to build up dialogs and provide rich researchers, the present work uses Natural Language Processing (NLP) techniques to develop an automatic and effective computer system to interact with the user and assist in the identification of pests and diseases in the soybean farming, stored in a database repository to provide accurate diagnoses to simplify the work of the agricultural professional and also for those who deal with a lot of information in this area. Information on 108 pests and 19 diseases that damage Brazilian soybean was collected from Brazilian bibliographic manuals with the purpose to optimize the data and improve production, using the spaCy library for syntactic analysis of NLP, which allowed the pre-process the texts, recognize the named entities, calculate the similarity between the words, verify dependency parsing and also provided the support for the development requirements of the CAROLINA tool (Robotized Agronomic Conversation in Natural Language) using the language belonging to the agricultural area.


2021 ◽  
Vol 2 (1) ◽  
pp. 1-19
Author(s):  
Muhammad Muchlish Huda ◽  
Samsul Arifin ◽  
Miftakhul Ma’arif

In the context of composing Arabic sentences, the rules of kaifiatul ikhbar are included in the rules which are basic and foundation. The kaifiatul ikhbar rules are formed from the composition of the mubtada and khobar and are used in various forms of sentences, including the sentence structure of the marriage consent. Accuracy in pronouncing the kabul marriage license sentence including the arrangement of the preacher and khobar becomes important considering this kabul consent will be a legal requirement or cancellation of a marriage contract. This study attempts to analyze and present several forms of kabul mariage agreement and syntactic analysis, especially in the kaida of kaifiatul ikhbar. By using a library approach and linguistic analysis from its syntactic aspects, the results of this study indicate that there are 11 forms of kaifiyatul ikhbar with various syntactic analysis specifications. This shows that the form of kaifiatul ikhbar in the marriage contract is actually not only one forms, but with a variety of sentence forms


2020 ◽  
Vol 7 (1) ◽  
pp. 86
Author(s):  
Tri Yulianty Karyaningsih

This paper aims to discuss the comparison between possessive constructions in Russian and Indonesian noun phrases. Since both of the languages have different grammatical systems, their possessive constructions may also be different. The differences are discussed using a contrastive analysis approach. However, the similarities between them are also taken into consideration following one of the practical purposes of contrastive analysis, namely, to aid the translation process. The theory employed in this research is eclectic. The research method employed in this research is descriptive method with contrastive analysis model. In addition, for translation analysis, word-for-word and literal methods are used here. The data in this research are collected from the Russian National Corpus and some selected literary works in Russian and Indonesian. The result suggests that there are some structural differences and similarities between Russian and Indonesian in terms of word order, attributive categories, and grammatical categories of the elements constituting noun phrases. The results of this comparison can be referred to in the translation of possessive construction of both languages so that the closest equivalent is found following the rules of each language. 


Sign in / Sign up

Export Citation Format

Share Document