An application and e aluation of the C/NC-value approach for the automatic term recognition of multi-word units in Japanese

Hideki Mima; Sophia Ananiadou

doi:10.1075/term.6.2.04mim

An application and e aluation of the C/NC-value approach for the automatic term recognition of multi-word units in Japanese

Terminology ◽

10.1075/term.6.2.04mim ◽

2000 ◽

Vol 6 (2) ◽

pp. 175-194 ◽

Cited By ~ 15

Author(s):

Hideki Mima ◽

Sophia Ananiadou

Keyword(s):

Independent Method ◽

The Internet ◽

Recognition Method ◽

Knowledge Mining ◽

Independent Term ◽

Statistical Knowledge ◽

C Value ◽

Automatic Term Recognition ◽

Domain Independent ◽

Technical Terms

Technical terms are important for knowledge mining, especially as vast amounts of multi-lingual documents are available over the Internet. Thus, a domain and language-independent method for term recognition is necessary to automatically recognize terms from Internet documents. The C-/NC-value method is an efficient domain-independent multi-word term recognition method which combines linguistic and statistical knowledge. Although the C-value/NC-value method is originally based on the recognition of nested terms in English, our aim is to evaluate the application of the method to other languages and to show its feasibility for multi-language environments. In this article, we describe the application of the C/NC-value method to Japanese texts. Several experiments analysing the performance of the method using the NACSIS Japanese AI-domain corpus demonstrate that the method can be utilized to realize a practical domain-and language-independent term rec- ognition system.

Download Full-text

The C-value/NC-value domain-independent method for multi-word term extraction

Journal of Natural Language Processing ◽

10.5715/jnlp.6.3_145 ◽

1999 ◽

Vol 6 (3) ◽

pp. 145-179 ◽

Cited By ~ 59

Author(s):

Katerina T. Frantzi ◽

Sophia Ananiadou

Keyword(s):

Independent Method ◽

Term Extraction ◽

C Value ◽

Domain Independent

Download Full-text

Automatic Term Recognition Method for Military Domain

Journal of Physics Conference Series ◽

10.1088/1742-6596/2078/1/012031 ◽

2021 ◽

Vol 2078 (1) ◽

pp. 012031

Author(s):

Ani Song ◽

Xiaoxia Jia ◽

Wei Jiang

Keyword(s):

Word Segmentation ◽

Word Formation ◽

Recognition Method ◽

Data Set ◽

The Core ◽

C Value ◽

Automatic Term Recognition ◽

Term Candidate ◽

Candidate Set ◽

Better Than

Abstract With the development of military intelligence, higher requirements are put forward for automatic term recognition in military field. In view of the characteristics of flexible and diverse naming of military requirement documents without annotated corpus, the method of this paper uses the existing military domain core database, and matches the data set and core database by Aho-Corasic algorithm and word segmentation technology, so that the terms to be recognized in the data set can be divided into three types. The possible rules of word formation of military terms are summarized and phrases that conform to the rules of word formation are found in the documents as the term candidate set. The core library and TF-IDF method are used to calculate the value of the candidate terms, and the candidate terms whose value is greater than the threshold are selected iteratively as the real terms. The experimental results show that the F1 value of this method reaches 0.719, which is better than the traditional C-value method. Therefore, the method proposed in this paper can achieve better automatic term recognition effect for military requirement documents without annotation.

Download Full-text

Otolaryngology and the Internet

Otolaryngology ◽

10.1016/s0194-5998(96)70147-x ◽

1996 ◽

Vol 115 (1) ◽

pp. 123-131 ◽

Cited By ~ 4

Author(s):

Michael Rothschild

Keyword(s):

Clinical Medicine ◽

Neck Surgery ◽

Academic Departments ◽

The Internet ◽

Extensive Experience ◽

The Past ◽

National Organizations ◽

On Line ◽

Cultural Entity ◽

Technical Terms

During the past 25 years, the Internet has grown tremendously. Starting as four academic computers linked by the Department of Defense, it has become a major technical and cultural entity that is accessible to millions of persons outside the realm of government and academia. The field of medicine has been well served by this telecommunications system, in which many applications have been developed to assist in research, clinical medicine, and education. More recently, resources of specific interest to otolaryngologists have been implemented at various academic departments and national organizations. This review is intended to simplify the Internet for otolaryngologists who do not have extensive experience in computers or telecommunication. The Internet is described in basic, minimally technical terms, and specific examples are provided of ways that on-line resources can be used in the practice of otolaryngology-head and neck surgery.

Download Full-text

A New Domain-Independent Method for Improving Reliability and Reducing risk Based on Algebraic Inequalities

Proceedings of the 29th European Safety and Reliability Conference (ESREL) ◽

10.3850/978-981-11-2724-3_0773-cd ◽

2019 ◽

Author(s):

Michael Todinov

Keyword(s):

Independent Method ◽

Domain Independent

Download Full-text

Automatic term recognition based on statistics of compound nouns

Terminology ◽

10.1075/term.6.2.05nak ◽

2000 ◽

Vol 6 (2) ◽

pp. 195-210 ◽

Cited By ~ 42

Author(s):

Hiroshi Nakagawa

Keyword(s):

Experimental Evaluation ◽

Extraction Method ◽

Recognition Task ◽

Experimental Results ◽

Test Collection ◽

Term Extraction ◽

Academic Conference ◽

Compound Nouns ◽

Automatic Term Recognition ◽

Technical Terms

The NTCIR1 TMREC group called for participation of the term recognition task which is a part of NTCIR1 held in 1999. As an activity of TMREC, they have provided us with the test collection of the term recognition task. The goal of this task is to automatically recognize and extract terms from the text corpus which consists of 1,870 abstracts gathered from the NACSIS Academic Conference Database. This article describes the term extraction method we have proposed to extract terms consisting of simple and compound nouns and the experimental evaluation of the proposed method with this NTCIR TMREC test collection. The basic idea of scoring a simple noun N of our term extraction method is to count how many nouns are conjoined with N to make compound nouns. Then we extend this score to measure the score of compound nouns because most of technical terms are compound nouns. Our method has a parameter to tune the degree of preference either for longer compound nouns or for shorter compound nouns. As for term candidates, in addition to noun sequences, we may add variations such as patterns of "A no B" that roughly means "B of A" or "A’ś B" and/or "A na B" where "A na" is an adjective. Experimental results of our method are promising, namely recall of 0.83, precision of 0.46 and F-value of 0.59 for exactly matched extracted terms when we take into account top scoring 16,000 extracted terms.

Download Full-text

Domain-independent method of detecting inconsistencies in SBVR-based business rules

Proceedings of the International Workshop on Formal Methods for Analysis of Business Systems - ForMABS 2016 ◽

10.1145/2975941.2975943 ◽

2016 ◽

Cited By ~ 9

Author(s):

Pavan Kumar Chittimalli ◽

Kritika Anand

Keyword(s):

Independent Method ◽

Business Rules ◽

Domain Independent

Download Full-text

A Microservice-Based Framework for Developing Internet of Things and People Applications

Proceedings ◽

10.3390/proceedings2019031085 ◽

2019 ◽

Vol 31 (1) ◽

pp. 85 ◽

Cited By ~ 1

Author(s):

Aurora Macías ◽

Elena Navarro ◽

Pascual González

Keyword(s):

Internet Of Things ◽

The Internet ◽

Reference Architecture ◽

Design Development ◽

Application Layer ◽

New Paradigm ◽

P Application ◽

Collection And Management ◽

Domain Independent ◽

The Internet Of Things

The Internet of things (IoT) is characterized by billions of heterogeneous, distributed, and intelligent objects—both from the digital and the physical worlds—running applications and services. Objects are connected through heterogeneous platforms providing support for the collection and management of data that need to be understood. Since IoT systems are composed by a variety of objects and services, a key aspect for engineering them is their architecture. The new paradigm called Internet of people (IoP) is not unaware of this need. In IoP, humans play an important role so that design considering aspects as context becomes critical for making the most of these applications. This work presents a context-aware, serverless, microservice-based, and cloud-centric framework for the Internet of things and people (IoT-P) applications that extends the three-layer classic IoT reference architecture. It integrates most of the aspects considered by the architecture of IoT solutions emerging from different perspectives, being also domain independent. This work focuses on the application paradigm of IoT neglected by most proposals. This framework, combined with a previous work, offers a higher separation of concerns (SoC) degree than other proposals, by splitting the application layer into different sublayers or subsystems based on their responsibilities and tracing atomic components to serverless microservices, to facilitate the design, development, and deployment of IoT-P applications. An IoT-P application in the healthcare domain is presented to illustrate how this framework can be put into practice.

Download Full-text

Using Pareto optimality to coordinate distributed agents

Artificial intelligence for engineering design analysis and manufacturing ◽

10.1017/s0890060400002821 ◽

1995 ◽

Vol 9 (4) ◽

pp. 269-281 ◽

Cited By ~ 38

Author(s):

Charles J. Petrie ◽

Teresa A. Webster ◽

Mark R. Cutkosky

Keyword(s):

Pareto Optimality ◽

Software Tools ◽

The Internet ◽

Engineering Systems ◽

Electrical Cable ◽

Distributed Agents ◽

Cable Harness ◽

Domain Independent ◽

Independent Property

AbstractPareto optimality is a domain-independent property that can be used to coordinate distributed engineering agents. Within a model of design called Redux, some aspects of dependency-directed backtracking can be interpreted as tracking Pareto optimality. These concepts are implemented in a framework, called Next-Link, that coordinates legacy engineering systems. This framework allows existing software tools to communicate with each other and a Redux agent over the Internet. The functionality is illustrated with examples from the domain of electrical cable harness design.

Download Full-text

IoTCrawler: Challenges and Solutions for Searching the Internet of Things

Sensors ◽

10.3390/s21051559 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1559

Author(s):

Thorben Iggena ◽

Eushay Bin Bin Ilyas ◽

Marten Fischer ◽

Ralf Tönjes ◽

Tarek Elsaleh ◽

...

Keyword(s):

Internet Of Things ◽

Rapid Development ◽

Data Sources ◽

The Internet ◽

Privacy And Security ◽

Extensive Evaluation ◽

Data Fragmentation ◽

Independent Design ◽

Domain Independent ◽

The Internet Of Things

Due to the rapid development of the Internet of Things (IoT) and consequently, the availability of more and more IoT data sources, mechanisms for searching and integrating IoT data sources become essential to leverage all relevant data for improving processes and services. This paper presents the IoT search framework IoTCrawler. The IoTCrawler framework is not only another IoT framework, it is a system of systems which connects existing solutions to offer interoperability and to overcome data fragmentation. In addition to its domain-independent design, IoTCrawler features a layered approach, offering solutions for crawling, indexing and searching IoT data sources, while ensuring privacy and security, adaptivity and reliability. The concept is proven by addressing a list of requirements defined for searching the IoT and an extensive evaluation. In addition, real world use cases showcase the applicability of the framework and provide examples of how it can be instantiated for new scenarios.

Download Full-text

Using Algebraic Inequalities to Reduce Uncertainty and Risk

ASCE-ASME J Risk and Uncert in Engrg Sys Part B Mech Engrg ◽

10.1115/1.4048403 ◽

2020 ◽

Vol 6 (4) ◽

Author(s):

Michael Todinov

Keyword(s):

Independent Method ◽

Large Batch ◽

Right Hand ◽

Real Process ◽

Left And Right ◽

Series Systems ◽

Domain Independent ◽

Counterintuitive Result ◽

Parallel Series

Abstract The paper discusses applications of the domain-independent method of algebraic inequalities, for reducing uncertainty and risk. Algebraic inequalities have been used for revealing the intrinsic reliability of competing systems and ranking the systems in terms of reliability in the absence of knowledge related to the reliabilities of their components. An algebraic inequality has also been used to establish the principle of the well-ordered parallel-series systems which, in turn, has been applied to maximize the reliability of common parallel-series systems. The paper introduces linking an abstract inequality to a real process by a meaningful interpretation of the variables entering the inequality and its left- and right-hand parts. The meaningful interpretation of a simple algebraic inequality led to a counterintuitive result. If two varieties of items are present in a large batch, the probability of selecting randomly two items of different variety is smaller than the probability of selecting randomly two items of the same variety.

Download Full-text