Automatic generation of hypertext knowledge bases

Author(s):  
Udo Hahn ◽  
Ulrich Reimer
Doklady BGUIR ◽  
2020 ◽  
Vol 18 (5) ◽  
pp. 44-52
Author(s):  
Li Wenzu

This article proposes an approach for designing a general subsystem of automatic generation of questions in intelligent learning systems. The designed subsystem allows various types of questions to be automatically generated based on information from the knowledge bases and save the generated questions in the subsystem knowledge base for future use. The main part of the subsystem is the automatic generation module of questions, which allows one to generate questions of various types based on existing question generation strategies in combination with the structural characteristics of knowledge bases built using OSTIS technology. In this article, a variety of strategies for automatically generated questions are proposed, the use of which allows various types of questions to be automatically generated, such as multiple-choice questions, fill-in-the-blank questions, questions of definition interpretation and etc. The most important part of the subsystem is the knowledge base, which stores the ontology of questions, including the question instances themselves. In this article, the knowledge base is constructed based on OSTIS technical standards. The type classification of automatically generated questions was developed, as well as the subject area for storing generated questions and the corresponding ontology described in the knowledge base of the subsystem. The generated questions are stored in the subsystem knowledge base in the form of SC-code, which is the OSTIS technology standard. When testing users, these automatically generated questions are converted to the corresponding natural language form through the natural language interface. Compared with the existing approaches, the approach proposed in this article has certain advantages, and the subsystem designed using this approach can be used in various OSTISbased systems driven by OSTIS technology.


2020 ◽  
Vol 10 (4) ◽  
pp. 477-488
Author(s):  
I.N. Fomin ◽  

The concept of intellectual support of the process of automated formation of calculation models of power supply in information systems of energy enterprises, based on the principles of knowledge management, is proposed. The concept includes carrying out ontological data analysis and the formation of the corresponding knowledge bases. The stages of the knowledge management process about the parameters of the calculated power supply models have been decomposed in order to develop tools for their automatic generation in billing systems. Sets of parameters of power supply objects and parameters characterizing the volume of energy consumption, price indicators and parameters for calculating the cost of consumed electricity, which, along with the relations between entities and their functional connections, determine the structure of ontologies, have been determined. For the first time, the definitions of the computational model of power supply are given as a semantic model consisting of a set of basic concepts of the electric power industry, and as a system of knowledge about methods of storing and processing information about the values of energy consumption. An ontology of the process of forming calculation models using the Protégé ontological editor is built. Requirements for the knowledge base of the system for supporting the formation of calculation models in information systems are formulated and the possibility of using data mining technologies with mechanisms for checking the consistency, sufficiency and continuity of knowledge through the use of methods for forming fuzzy rules is substantiated. This makes it possible to substantiate the possibility of applying the principles of fuzzy logic for the automated generation of calculation models of power supply in billing systems.


2018 ◽  
Vol 18 (1) ◽  
pp. 95-108 ◽  
Author(s):  
Klesti Hoxha ◽  
Artur Baxhaku

Abstract Named Entity Recognition (NER) is an important task in many NLP pipelines. It has become especially important for knowledge bases that power many of the nowadays information retrieval systems. In order to cope with the high demand for annotated training corpora for supervised NER systems, automatic generation approaches have been proposed. In this paper we report on the first automatically generated NE annotated corpus for Albanian. News articles from Albanian news media were used as a document source. They were automatically tagged using a custom generated gazetteer from the Albanian Wikipedia. Our evaluation results show that this corpus can be used as a baseline corpus for human annotated ones or as a training corpus where no other is available.


Author(s):  
D. I. Egoshkin ◽  
N. A. Guk ◽  
S. F. Siryk

In this article the problem of automatic generation of a knowledge base which consists of production rules for training dataset using fuzzy logic methods and a rule for comparing the values of an output variable is considered. An algorithm for the formation of fuzzy production rules is proposed. An actual problem of development and improvement of artificial intelligence algorithms and fuzzy logic application for solving a wider range of problems is considered. With the help of such systems are possible to eliminate the difficulties of formalizing knowledge about technological processes; also it is possible to organize recognition of nonstandard and emergency situations without using precise mathematical models and classical decision theory based on the tool of mathematical equations. The development of this area is relevant, as the number of tasks are constantly increasing, and the amount of knowledge becomes too large to handle them manually. The construction of an exact mathematical model for poorly formalized objects and processes are very difficult task, due to the lack of complete information. The situation becomes even more complicated if the properties of the object or process change dynamically. Therefore, the development of mathematical methods and algorithms that allow structuring the system of rules and determining the order of their calls to control consistency and completeness to optimize the number of rules, are an actual task. Modern approaches to the automation of these processes are considered. These approaches significantly improve the work of expert systems, but they allow to work only with static knowledge bases, limit the number of logical inferences and are not applicable for cases when it is necessary to add new logical rules to the existing system. In this article, an approach is developed that makes it possible to expand the knowledge base of the expert system with new rules in the process of exploitation. The developed algorithm has following advantages: high speed of problem solving; the ability that allows expanding the number of system responses without changing the scope of the rules and the program itself; expanding the range of application of fuzzy logic algorithms. The developed algorithm has following disadvantages: if the system's response database has objects that are similar to each other, they can have the same center of gravity, which in turn leads to additional checks; the minimum distance for mapping the object should be selected experimentally. The application of this algorithm can be seen on the website of the program, which classifies, maps an arbitrary user in a set of comic book characters database "CMD - Combat Marvel DC" [8]. The approach that was proposed has been successfully implemented using the C/C ++ and JavaScript languages, and JSON open-standard file format that uses human-readable text to transmit data objects consisting of attribute–value pairs and array data types. Software that was used for development: NetBeans IDE, MinGW, GNU Compiler Collection, WhiteStarUML, GitHub, WebGL, Chrome, Mozilla Firefox, Opera


1988 ◽  
Vol 9 (2-3) ◽  
pp. 182-188 ◽  
Author(s):  
Udo Hahn ◽  
Ulrich Reimer

Author(s):  
Luisa Lugli ◽  
Stefania D’Ascenzo ◽  
Roberto Nicoletti ◽  
Carlo Umiltà

Abstract. The Simon effect lies on the automatic generation of a stimulus spatial code, which, however, is not relevant for performing the task. Results typically show faster performance when stimulus and response locations correspond, rather than when they do not. Considering reaction time distributions, two types of Simon effect have been individuated, which are thought to depend on different mechanisms: visuomotor activation versus cognitive translation of spatial codes. The present study aimed to investigate whether the presence of a distractor, which affects the allocation of attentional resources and, thus, the time needed to generate the spatial code, changes the nature of the Simon effect. In four experiments, we manipulated the presence and the characteristics of the distractor. Findings extend previous evidence regarding the distinction between visuomotor activation and cognitive translation of spatial stimulus codes in a Simon task. They are discussed with reference to the attentional model of the Simon effect.


1994 ◽  
Vol 33 (05) ◽  
pp. 454-463 ◽  
Author(s):  
A. M. van Ginneken ◽  
J. van der Lei ◽  
J. H. van Bemmel ◽  
P. W. Moorman

Abstract:Clinical narratives in patient records are usually recorded in free text, limiting the use of this information for research, quality assessment, and decision support. This study focuses on the capture of clinical narratives in a structured format by supporting physicians with structured data entry (SDE). We analyzed and made explicit which requirements SDE should meet to be acceptable for the physician on the one hand, and generate unambiguous patient data on the other. Starting from these requirements, we found that in order to support SDE, the knowledge on which it is based needs to be made explicit: we refer to this knowledge as descriptional knowledge. We articulate the nature of this knowledge, and propose a model in which it can be formally represented. The model allows the construction of specific knowledge bases, each representing the knowledge needed to support SDE within a circumscribed domain. Data entry is made possible through a general entry program, of which the behavior is determined by a combination of user input and the content of the applicable domain knowledge base. We clarify how descriptional knowledge is represented, modeled, and used for data entry to achieve SDE, which meets the proposed requirements.


Sign in / Sign up

Export Citation Format

Share Document