scholarly journals Organizing Tagged Knowledge: Similarity Measures and Semantic Fluency in Structure Mining

2020 ◽  
Vol 142 (3) ◽  
Author(s):  
Thurston Sexton ◽  
Mark Fuge

Abstract Recovering a system’s underlying structure from its historical records (also called structure mining) is essential to making valid inferences about that system’s behavior. For example, making reliable predictions about system failures based on maintenance work order data requires determining how concepts described within the work order are related. Obtaining such structural information is challenging, requiring system understanding, synthesis, and representation design. This is often either too difficult or too time consuming to produce. Consequently, a common approach to quickly elicit tacit structural knowledge from experts is to gather uncontrolled keywords as record labels—i.e., “tags.” One can then map those tags to concepts within the structure and quantitatively infer relationships between them. Existing models of tag similarity tend to either depend on correlation strength (e.g., overall co-occurrence frequencies) or on conditional strength (e.g., tag sequence probabilities). A key difficulty in applying either model is understanding under what conditions one is better than the other for overall structure recovery. In this paper, we investigate the core assumptions and implications of these two classes of similarity measures on structure recovery tasks. Then, using lessons from this characterization, we borrow from recent psychology literature on semantic fluency tasks to construct a tag similarity measure that emulates how humans recall tags from memory. We show through empirical testing that this method combines strengths of both common modeling paradigms. We also demonstrate its potential as a preprocessor for structure mining tasks via a case study in semi-supervised learning on real excavator maintenance work orders.

2005 ◽  
Vol 51 (12) ◽  
pp. 325-329 ◽  
Author(s):  
X. Wang ◽  
X. Bai ◽  
J. Qiu ◽  
B. Wang

The performance of a pond–constructed wetland system in the treatment of municipal wastewater in Kiaochow city was studied; and comparison with oxidation ponds system was conducted. In the post-constructed wetland, the removal of COD, TN and TP is 24%, 58.5% and 24.8% respectively. The treated effluent from the constructed wetland can meet the Chinese National Agricultural and Irrigation Standard. The comparison between pond–constructed wetland system and oxidation pond system shows that total nitrogen removal in a constructed wetland is better than that in an oxidation pond and the TP removal is inferior. A possible reason is the low dissolved oxygen concentration in the wetland. Constructed wetlands can restrain the growth of algae effectively, and can produce obvious ecological and economical benefits.


Author(s):  
Ilona Bidzan-Bluma

Objective: It is estimated that twin-to-twin transfusion syndrome (TTTS) occurs in 10–15% of monochorionic twin pregnancies. One of the fetuses takes on the role of donor and the other of recipient. The treatment administered involves serial amnioreduction and laser photocoagulation of the communicating blood vessels. After TTTS, children may have deficiencies in psychomotor functioning, in particular in cognitive functions, expressive language, and motor skills. Few scientific reports indicate that twins after TTTS do not demonstrate significant differences in tests which measure intellectual functioning. Methods: The cognitive functioning of twins in the late childhood period was compared using the following tools: an analysis of their medical history, an interview with their parents, and neuropsychological tests allowing the evaluation of their whole profile of cognitive functions. Case Study: Cognitive functioning in the late childhood period was analyzed in a pair of 11-year-old male twins (juvenile athletes), a donor and a recipient, who had developed TTTS syndrome in the prenatal period. Results: Comparison of the cognitive functioning profile of the donor and recipient revealed that children with a history of TTTS develop normally in terms of cognitive and motor functioning in late childhood. A comparative analysis of the donor and recipient was more favorable for the recipient, who had a higher level of general intelligence, visual–motor memory, and semantic fluency. Conclusions: The fact that both the donor and the recipient chose to pursue athletics suggests that gross motor skills are their strongest suit. Playing sports as a method of rehabilitation of cognitive function of children born prematurely after TTTS could contribute to the improvement of cognitive functioning.


2021 ◽  
Vol 126 (3) ◽  
pp. 2311-2327
Author(s):  
Yuto Chikazawa ◽  
Marie Katsurai ◽  
Ikki Ohmukai

AbstractResearchers often use their native languages to present and exchange ideas. To construct an individual author’s complete profile, a list of their English and non-English academic publications must be constructed. This paper presents a practical approach for multilingual author matching across different academic databases. Our approach automatically links the academic records of a target database to a researcher identifier of a source database. First, we extracted a comprehensive set of records in the target database, whose author names were identical to the researcher names in the source database. Then, we calculated multiple author similarity measures, which can be adopted in certain entity pairs from different language databases. Finally, we aggregated the measures to output an improved score that indicates the likelihood of each record as being the researcher’s work. Our method was found to be easy to implement, and its performance was evaluated in real database management settings. Experiments were conducted using DBLP and PubMed as the target English databases. As the Japanese database, KAKEN was the source for identifying researcher information. The results demonstrated each similarity measure’s performance, from which we observed that the score aggregation achieved stable performance. Our method can lessen human efforts to associate various scholarly contributions.


2001 ◽  
Vol 44 (6) ◽  
pp. 109-117 ◽  
Author(s):  
M. A. Mathegana ◽  
L. K. Chauke ◽  
F. A.O. Otieno

The primary purpose of an improved water supply and sanitation is the achievement of acceptable health and hygiene standards as well as the sustainable improvement of the environment. Many governments recognize this and so they budget for large sums of money to improve these services to the communities. The purpose of this study was to investigate the different gaps in environmental health and hygiene practices with the aim of suggesting a strategy of improving this in the Northern Province of South Africa. To do this, 231 households and 30 schools were surveyed. Workshops and visits to different government departments were also used. This paper reports the results from this study which indicate that the situation in schools was not any better than that in households, with more than 90% of the villages still dependent on the unimproved pit latrines and 56,6% relying on standpipes which were (70% of the time) non-operational. The main problems identified seem to those associated with implementation and maintenance. The study concludes that with proper training of the water committees and their active involvement with the government and NGOs, environmental health and hygiene problems can be minimized or eliminated.


2020 ◽  
Vol 2020 ◽  
pp. 1-10
Author(s):  
Dandan Yang

This paper investigates the three-way clustering involving fuzzy covering, thresholds acquisition, and boundary region processing. First of all, a valid fuzzy covering of the universe is constructed on the basis of an appropriate fuzzy similarity relation, which helps capture the structural information and the internal connections of the dataset from the global perspective. Due to the advantages of valid fuzzy covering, we explore the valid fuzzy covering instead of the raw dataset for RFCM algorithm-based three-way clustering. Subsequently, from the perspective of semantic interpretation of balancing the uncertainty changes in fuzzy sets, a method of partition thresholds acquisition combining linear and nonlinear fuzzy entropy theory is proposed. Furthermore, boundary regions in three-way clustering correspond to the abstaining decisions and generate uncertain rules. In order to improve the classification accuracy, the k-nearest neighbor (kNN) algorithm is utilized to reduce the objects in the boundary regions. The experimental results show that the performance of the proposed three-way clustering based on fuzzy covering and kNN-FRFCM algorithm is better than the compared algorithms in most cases.


2002 ◽  
Vol os9 (1) ◽  
pp. 9-13 ◽  
Author(s):  
Raman Bedi ◽  
Jackie A Champion ◽  
Roger Davies

Introduction In order to promote training and education in special-needs dentistry an attempt was made to introduce problem-based learning (PBL) as a method of postgraduate dental education. The aim of this paper was to review the principles of PBL and report on a case study using this methodology. Method The case study was of a PBL session, on the subject of ‘problems of obtaining appropriate dental care for people with epilepsy’, undertaken at a national conference. Delegates were asked to complete a pre- and post-session questionnaire on PBL and their attitudes to the session. Results The session received a mixed response. Only 33 (35%) thought the session was valuable and only 20 (31%) thought it was better than conventional teaching methods and yet over half (55%) said they would like to attend more PBL in special-needs dentistry. Professionals complementary to dentistry were more likely to find the PBL session of value and to prefer the method to a more conventional format than dentists were (chi-square=5.5, df=1, p<0.05 and chi-square=5.9, df=1, p<0.05 respectively). Conclusion Valuable feedback was received from delegates. This will enable improvements to be made in future courses so that the effectiveness of PBL can be optimised.


2012 ◽  
Vol 518-523 ◽  
pp. 5886-5893
Author(s):  
Lu Cang Wang ◽  
Wei Li ◽  
Jing Gao

“The Project of Nomadic Settlement” is one of the major construction tasks for “Gannan Important Water Supply Ecological Functional Area of Yellow River”. Because of the distribution of population and settlements have obvious discreteness and wavering in alpine pasture, it is necessary to plan and guide agricultural and grazing villages during the process of the construction of nomadic settlements, spatial displacement and integration of population and settlement. The nomadic habitation mode in Luqu county undergoes four stages. At present, it adopts four settlement modes, that is, centralized settlement mode in the county town, settlement mode in the village, settlement along the highway mode and dispersed settlement mode, involving a total of 2,645households,13,783people and be arranged in 21 settlements. The paper adopts 14 indicators related conditions of economic development, social development conditions, geographic conditions, measures the overall strength of 24 administrative villages in Luqu, the whole villages are divided into four grade. The results show that the suburban villages are better than the surrounding villages and towns, pure pastoral farming are better than farming-pastoral villages. Accordingly, 24 villages are divided into four types—community-based villages, developing villages, controlling villages, and revoking-merging villages. Finally, it also proposes the path on village plan guidelines.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Oscar Daniel Rivera Baena ◽  
Maria Valentina Clavijo Mesa ◽  
Carmen Elena Patino Rodriguez ◽  
Fernando Jesus Guevara Carazas

PurposeThis paper aims to determine the stage of the life cycle where the trucks of a waste collection fleet from a Colombian city are located through a reliability approach. The reliability analysis and the evaluation of curve of operational costs allow to know the moment in which it is necessary to make decisions regarding an asset, its maintenance or possible replacement.Design/methodology/approachFor a dataset presented as maintenance work orders, the time to failures (TTFs) for each vehicle in the fleet were calculated. Then, a probability density function for those TTFs was fitted to locate each vehicle in a region of the bathtub curve and to calculate the reliability of the whole fleet. A general functional analysis was also developed to understand the function of the vehicles.FindingsIt was possible to determine that the largest proportion of the fleet was in the final stage of the life cycle, in this sense, the entire fleet represent critical assets which in most of cases could be worth replacement or overhaul.Originality/valueIn this study, an address is exposed for the identification of critical equipment by reliability and statistical analysis. This analysis is also integrated with the maintenance management process. This is a broadly interested topic since it allows to support the maintenance and operational decision-making process, indicating the focus of resource allocation all over the entire asset life cycle.


Sign in / Sign up

Export Citation Format

Share Document