Kikori-KS: An Effective and Efficient Keyword Search System for Digital Libraries in XML

Author(s):  
Toshiyuki Shimizu ◽  
Norimasa Terada ◽  
Masatoshi Yoshikawa

The movement of clients from desktop to mobility devices, made a major stage in the portable trade. All the up and coming advancements, parts, delicate products are very composed by the portable. As versatility is unavoidable prerequisite by the clients, the outline of programming with less battery utilization are generally invited. The calculation procedure is relative to the battery utilization. The calculation at the cell phones genuinely influences the series of the portable. Hence making the calculation at the cloud has an awesome arrangement in diminishing the battery utilization. The delegate calculation inquiry is a productive approach to safeguard the battery of the mobile devices. Indeed, even the encryption/unscrambling of records takes control so proposing IOPE for scrambling the document which is a basic plan


2021 ◽  
pp. 1-12
Author(s):  
Anita Ramalingam ◽  
Subalalitha Chinnaudayar Navaneethakrishnan

Thirukkural, a Tamil classic literature, which was written in 300 BCE is a didactic literature. Though Thirukkural comprises 1330 couplets which are organized into three sections and 133 chapters, in order to retrieve meaningful Thirukkural for a given query in search systems, a better organization of the Thirukkural is needed. This paper lays such a foundation by classifying the Thirukkural into ten new categories called superclasses that is helpful for building a better Information Retrieval (IR) system. The classifier is trained using Multinomial Naïve Bayes algorithm. Each superclass is further classified into two subcategories based on the didactic information. The proposed classification framework is evaluated using precision, recall and F-score metrics and achieved an overall F-score of 82.33% and a comparison analysis has been done with the Support Vector Machine, Logistic Regression and Random Forest algorithms. An IR system is built on top of the proposed system and the performance comparison has been done with the Google search and a locally built keyword search. The proposed classification framework has achieved a mean average precision score of 89%, whereas the Google search and keyword search have yielded 59% and 68% respectively.


2018 ◽  
Vol 8 (2) ◽  
pp. 57-77 ◽  
Author(s):  
Abubakar Roko ◽  
Shyamala Doraisamy ◽  
Azreen Azman ◽  
Azrul Hazri Jantan

In this article, an indexing scheme that includes the named entity category for each indexed term is proposed. Based on this, two methods are proposed, one to infer the semantics of an XML element based on its data content, called the confidence value of the element, and the second method computes the proximity scores of the query terms. The confidence value of an element is obtained based on the probability of a named entity category in the data content of the underlying XML element. The proximity score of the query terms measures the proximity and ordering of the query term within an XML element. The article then shows how a ranking function uses the confidence value of an XML element and proximity score to mitigate the impact of higher frequency terms and compute the relevance between a keyword query and an XML fragment. Finally, a keyword search system is introduced and experiments show that the proposed system outperforms existing approaches in terms of search quality and achieve a higher efficiency.


2017 ◽  
Vol 35 (3) ◽  
pp. 398-409
Author(s):  
Gracielle Mendonça Rodrigues Gomes ◽  
Beatriz Valadares Cendon

Purpose The study aims to propose the use of the semiotics inspection method (SIM) which is an interpretative and qualitative method from semiotics engineering (SE) for the evaluation of the communicability of systems and to evaluate digital libraries and information retrieval systems (IRS). The paper presents the results of the application of this method in the evaluation of the quality of the communicability of the interface and search system of the Coordination for the Improvement of Higher Education Personnel (CAPES) Portal of e-Journals, a major scientific digital library in Brazil. There are proposed solutions to improve this system included. Design/methodology/approach The study used the SIM to evaluate the system. Two evaluators inspected the system. They performed the comparison and the analysis of three types of metamessages (metalinguistic, static and dynamic). The metamessages generated by the evaluators were contrasted to find inconsistencies and ambiguities in the CAPES Portal of e-Journals. Finally, the last step of the method was the final assessment about the inspection. Findings The evaluators identified 52 problems of communicability. These problems were ranked according to severity ratings established by Nielsen (1994). They were grouped in ten types of problems present in the interface and in the search system of the CAPES Portal of e-Journals. Originality value This research contributes theoretically to the field of information retrieval and to the area of human–computer interaction and, in particular, to the theory of SE by adapting SE methods that allow the evaluation of communicability to the context of the scientific IRS. Results obtained through scientific methods should contribute to development of the interface and search tools of IRS to better support query formulation and retrieval of relevant information and more efficiently satisfy the information needs of individuals.


2014 ◽  
Vol 70 (6) ◽  
pp. 970-996 ◽  
Author(s):  
Paula Goodale ◽  
Paul David Clough ◽  
Samuel Fernando ◽  
Nigel Ford ◽  
Mark Stevenson

Purpose – The purpose of this paper is to investigate the effects of cognitive style on navigating a large digital library of cultural heritage information; specifically, the paper focus on the wholist/analytic dimension as experienced in the field of educational informatics. The hypothesis is that wholist and analytic users have characteristically different approaches when they explore, search and interact with digital libraries, which may have implications for system design. Design/methodology/approach – A detailed interactive IR evaluation of a large cultural heritage digital library was undertaken, along with the Riding CSA test. Participants carried out a range of information tasks, and the authors analysed their task performance, interactions and attitudes. Findings – The hypothesis on the differences in performance and behaviour between wholist and analytic users is supported. However, the authors also find that user attitudes towards the system are opposite to expectations and that users give positive feedback for functionality that supports activities in which they are cognitively weaker. Research limitations/implications – There is scope for testing results in a larger scale study, and/or with different systems. In particular, the findings on user attitudes warrant further investigation. Practical implications – Findings on user attitudes suggest that systems which support areas of weakness in users’ cognitive abilities are valued, indicating an opportunity to offer diverse functionality to support different cognitive weaknesses. Originality/value – A model is proposed suggesting a converse relationship between behaviour and attitudes; to support individual users displaying search/navigation behaviour mapped onto the strengths of their cognitive style, but placing greater value on interface features that support aspects in which they are weaker.


Sign in / Sign up

Export Citation Format

Share Document