An application and e aluation of the C/NC-value approach for the automatic term recognition of multi-word units in Japanese

Terminology ◽  
2000 ◽  
Vol 6 (2) ◽  
pp. 175-194 ◽  
Author(s):  
Hideki Mima ◽  
Sophia Ananiadou

Technical terms are important for knowledge mining, especially as vast amounts of multi-lingual documents are available over the Internet. Thus, a domain and language-independent method for term recognition is necessary to automatically recognize terms from Internet documents. The C-/NC-value method is an efficient domain-independent multi-word term recognition method which combines linguistic and statistical knowledge. Although the C-value/NC-value method is originally based on the recognition of nested terms in English, our aim is to evaluate the application of the method to other languages and to show its feasibility for multi-language environments. In this article, we describe the application of the C/NC-value method to Japanese texts. Several experiments analysing the performance of the method using the NACSIS Japanese AI-domain corpus demonstrate that the method can be utilized to realize a practical domain-and language-independent term rec- ognition system.

2021 ◽  
Vol 2078 (1) ◽  
pp. 012031
Author(s):  
Ani Song ◽  
Xiaoxia Jia ◽  
Wei Jiang

Abstract With the development of military intelligence, higher requirements are put forward for automatic term recognition in military field. In view of the characteristics of flexible and diverse naming of military requirement documents without annotated corpus, the method of this paper uses the existing military domain core database, and matches the data set and core database by Aho-Corasic algorithm and word segmentation technology, so that the terms to be recognized in the data set can be divided into three types. The possible rules of word formation of military terms are summarized and phrases that conform to the rules of word formation are found in the documents as the term candidate set. The core library and TF-IDF method are used to calculate the value of the candidate terms, and the candidate terms whose value is greater than the threshold are selected iteratively as the real terms. The experimental results show that the F1 value of this method reaches 0.719, which is better than the traditional C-value method. Therefore, the method proposed in this paper can achieve better automatic term recognition effect for military requirement documents without annotation.


1996 ◽  
Vol 115 (1) ◽  
pp. 123-131 ◽  
Author(s):  
Michael Rothschild

During the past 25 years, the Internet has grown tremendously. Starting as four academic computers linked by the Department of Defense, it has become a major technical and cultural entity that is accessible to millions of persons outside the realm of government and academia. The field of medicine has been well served by this telecommunications system, in which many applications have been developed to assist in research, clinical medicine, and education. More recently, resources of specific interest to otolaryngologists have been implemented at various academic departments and national organizations. This review is intended to simplify the Internet for otolaryngologists who do not have extensive experience in computers or telecommunication. The Internet is described in basic, minimally technical terms, and specific examples are provided of ways that on-line resources can be used in the practice of otolaryngology-head and neck surgery.


Terminology ◽  
2000 ◽  
Vol 6 (2) ◽  
pp. 195-210 ◽  
Author(s):  
Hiroshi Nakagawa

The NTCIR1 TMREC group called for participation of the term recognition task which is a part of NTCIR1 held in 1999. As an activity of TMREC, they have provided us with the test collection of the term recognition task. The goal of this task is to automatically recognize and extract terms from the text corpus which consists of 1,870 abstracts gathered from the NACSIS Academic Conference Database. This article describes the term extraction method we have proposed to extract terms consisting of simple and compound nouns and the experimental evaluation of the proposed method with this NTCIR TMREC test collection. The basic idea of scoring a simple noun N of our term extraction method is to count how many nouns are conjoined with N to make compound nouns. Then we extend this score to measure the score of compound nouns because most of technical terms are compound nouns. Our method has a parameter to tune the degree of preference either for longer compound nouns or for shorter compound nouns. As for term candidates, in addition to noun sequences, we may add variations such as patterns of "A no B" that roughly means "B of A" or "A’ś B" and/or "A na B" where "A na" is an adjective. Experimental results of our method are promising, namely recall of 0.83, precision of 0.46 and F-value of 0.59 for exactly matched extracted terms when we take into account top scoring 16,000 extracted terms.


Proceedings ◽  
2019 ◽  
Vol 31 (1) ◽  
pp. 85 ◽  
Author(s):  
Aurora Macías ◽  
Elena Navarro ◽  
Pascual González

The Internet of things (IoT) is characterized by billions of heterogeneous, distributed, and intelligent objects—both from the digital and the physical worlds—running applications and services. Objects are connected through heterogeneous platforms providing support for the collection and management of data that need to be understood. Since IoT systems are composed by a variety of objects and services, a key aspect for engineering them is their architecture. The new paradigm called Internet of people (IoP) is not unaware of this need. In IoP, humans play an important role so that design considering aspects as context becomes critical for making the most of these applications. This work presents a context-aware, serverless, microservice-based, and cloud-centric framework for the Internet of things and people (IoT-P) applications that extends the three-layer classic IoT reference architecture. It integrates most of the aspects considered by the architecture of IoT solutions emerging from different perspectives, being also domain independent. This work focuses on the application paradigm of IoT neglected by most proposals. This framework, combined with a previous work, offers a higher separation of concerns (SoC) degree than other proposals, by splitting the application layer into different sublayers or subsystems based on their responsibilities and tracing atomic components to serverless microservices, to facilitate the design, development, and deployment of IoT-P applications. An IoT-P application in the healthcare domain is presented to illustrate how this framework can be put into practice.


Author(s):  
Charles J. Petrie ◽  
Teresa A. Webster ◽  
Mark R. Cutkosky

AbstractPareto optimality is a domain-independent property that can be used to coordinate distributed engineering agents. Within a model of design called Redux, some aspects of dependency-directed backtracking can be interpreted as tracking Pareto optimality. These concepts are implemented in a framework, called Next-Link, that coordinates legacy engineering systems. This framework allows existing software tools to communicate with each other and a Redux agent over the Internet. The functionality is illustrated with examples from the domain of electrical cable harness design.


Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1559
Author(s):  
Thorben Iggena ◽  
Eushay Bin Bin Ilyas ◽  
Marten Fischer ◽  
Ralf Tönjes ◽  
Tarek Elsaleh ◽  
...  

Due to the rapid development of the Internet of Things (IoT) and consequently, the availability of more and more IoT data sources, mechanisms for searching and integrating IoT data sources become essential to leverage all relevant data for improving processes and services. This paper presents the IoT search framework IoTCrawler. The IoTCrawler framework is not only another IoT framework, it is a system of systems which connects existing solutions to offer interoperability and to overcome data fragmentation. In addition to its domain-independent design, IoTCrawler features a layered approach, offering solutions for crawling, indexing and searching IoT data sources, while ensuring privacy and security, adaptivity and reliability. The concept is proven by addressing a list of requirements defined for searching the IoT and an extensive evaluation. In addition, real world use cases showcase the applicability of the framework and provide examples of how it can be instantiated for new scenarios.


Author(s):  
Michael Todinov

Abstract The paper discusses applications of the domain-independent method of algebraic inequalities, for reducing uncertainty and risk. Algebraic inequalities have been used for revealing the intrinsic reliability of competing systems and ranking the systems in terms of reliability in the absence of knowledge related to the reliabilities of their components. An algebraic inequality has also been used to establish the principle of the well-ordered parallel-series systems which, in turn, has been applied to maximize the reliability of common parallel-series systems. The paper introduces linking an abstract inequality to a real process by a meaningful interpretation of the variables entering the inequality and its left- and right-hand parts. The meaningful interpretation of a simple algebraic inequality led to a counterintuitive result. If two varieties of items are present in a large batch, the probability of selecting randomly two items of different variety is smaller than the probability of selecting randomly two items of the same variety.


Sign in / Sign up

Export Citation Format

Share Document