Typology-based semantic labeling of numeric tabular data

Semantic Web ◽  
2020 ◽  
Vol 12 (1) ◽  
pp. 5-20
Author(s):  
Ahmad Alobaid ◽  
Emilia Kacprzak ◽  
Oscar Corcho

A lot of tabular data are being published on the Web. Semantic labeling of such data may help in their understanding and exploitation. However, many challenges need to be addressed to do this automatically. With numbers, it can be even harder due to the possible difference in measurement accuracy, rounding errors, and even the frequency of their appearance. Multiple approaches have been proposed in the literature to tackle the problem of semantic labeling of numeric values in existing tabular datasets. However, they also suffer from several shortcomings: closely coupled with entity-linking, rely on table context, need to profile the knowledge graph, and require manual training of the model. Above all, however, they all treat different types of numeric values evenly. In this paper, we tackle these problems and validate our hypothesis: whether taking into account the typology of numeric data in semantic labeling yields better results.

Semantic Web ◽  
2021 ◽  
pp. 1-23
Author(s):  
Steven J. Baskauf ◽  
Jessica K. Baskauf

The W3C Generating RDF from Tabular Data on the Web Recommendation provides a mechanism for mapping CSV-formatted data to any RDF graph model. Since the Wikibase data model used by Wikidata can be expressed as RDF, this Recommendation can be used to document tabular snapshots of parts of the Wikidata knowledge graph in a simple form that is easy for humans and applications to read. Those snapshots can be used to document how subgraphs of Wikidata have changed over time and can be compared with the current state of Wikidata using its Query Service to detect vandalism and value added through community contributions.


1986 ◽  
Vol 19 (5) ◽  
pp. 63-66 ◽  
Author(s):  
R C Gold

The article traces the use of flowmeters in the production and processing of crude oil and its refined products. Emphasis is placed on meters used for accounting purposes and wherever possible, the significance of measurement uncertainty at the various stages of oil production is expressed in financial terms. The article concludes that improved measurement accuracy is more likely to be achieved from improved calibration and operating procedures than from the use of different types of flowmeters.


2014 ◽  
Vol 30 (4) ◽  
pp. 15-17 ◽  

Purpose – This paper aims to review the latest management developments across the globe and pinpoint practical implications from cutting-edge research and case studies. Design/methodology/approach – This briefing is prepared by an independent writer who adds their own impartial comments and places the articles in context. Findings – Becoming increasingly reliant on the web as a principal source of finding information is altering our brains and the way that we obtain and hold knowledge. We are becoming less reliant on our memories to hold knowledge, instead using technology – and search engines like Google in particular – to deposit and retrieve information. Practical implications – The paper provides strategic insights and practical thinking that have influenced some of the world's leading organizations. Social implications – The paper provides strategic insights and practical thinking that can have a broader social impact. Originality/value – The briefing saves busy executives and researchers hours of reading time by selecting only the very best, most pertinent information and presenting it in a condensed and easy-to-digest format.


2019 ◽  
Vol 2 ◽  
pp. 1-8
Author(s):  
Jan Wilkening ◽  
Keni Han ◽  
Mathias Jahnke

<p><strong>Abstract.</strong> In this article, we present a method for visualizing multi-dimensional spatio-temporal data in an interactive web-based geovisualization. Our case study focuses on publicly available weather data in Germany. After processing the data with Python and desktop GIS, we integrated the data as web services in a browser-based application. This application displays several weather parameters with different types of visualisations, such as static maps, animated maps and charts. The usability of the web-based geovisualization was evaluated with a free-examination and a goal-directed task, using eye-tracking analysis. The evaluation focused on the question how people use static maps, animated maps and charts, dependent on different tasks. The results suggest that visualization elements such as animated maps, static maps and charts are particularly useful for certain types of tasks, and that more answering time correlates with less accurate answers.</p>


Author(s):  
Emrah Inan ◽  
Burak Yonyul ◽  
Fatih Tekbacak

Most of the data on the web is non-structural, and it is required that the data should be transformed into a machine operable structure. Therefore, it is appropriate to convert the unstructured data into a structured form according to the requirements and to store those data in different data models by considering use cases. As requirements and their types increase, it fails using one approach to perform on all. Thus, it is not suitable to use a single storage technology to carry out all storage requirements. Managing stores with various type of schemas in a joint and an integrated manner is named as 'multistore' and 'polystore' in the database literature. In this paper, Entity Linking task is leveraged to transform texts into wellformed data and this data is managed by an integrated environment of different data models. Finally, this integrated big data environment will be queried and be examined by presenting the method.


2020 ◽  
Vol 13 (1) ◽  
pp. 79-85
Author(s):  
Caterina Viganò ◽  
Laura Molteni ◽  
Alberto Varinelli ◽  
Calogero Virzì ◽  
Sara Russo ◽  
...  

Background: The technological evolution has given the opportunities to develop new models of education, like online teaching. However, Internet Problematic Use and Internet Addiction are becoming frequently represented among adolescents with a prevalence that varies worldwide from 2% to 20% of the high school population. Objective: The aim of this study was to analyse the risk of Internet Addiction in a High Schools student sample comparing two different types of schools (online and traditional teaching) and analyzing the associations between pathological use of Internet and socio-demographic factors connected to the different educational orientations and to the daily usage of Internet. Methods: Students were enrolled from four different orientation school programs (different high school, technical and economical Institute, vocational schools). Each student completed a self-reported test to collect socio-demographic data and th Internet Addiction Test (IAT) from K. Young to assess the risk of Internet Addiction. The Mann-Whitney test for quantitative variables was used for statistical analysis. Results: 522 students were enrolled, 243 students from online teaching and 279 from traditional teaching schools. Internet Addiction was observed in 1,16% of the total sample, while 53.83% of subjects was at risk of development Internet Addiction. No significant difference was found between the two different types of teaching, nor considering gender. Considering the amount of time spent on the web in portion of the sample at risk of developing Internet Addiction, the Traditional Teaching group spent between 4 and 7 hours a day on the Web, while the Online Teaching group between 1 to 3 hours/daily. However, no statistically significant difference was found. Conclusion: Although our data demonstrate that there is no clear association between online education and problematic use of Internet, the excessive use of Internet is linked to a massive waste of personal energy in terms of time and social life.


Author(s):  
Paolo Massa

This chapter discusses the concept of trust and how trust is used and modeled in online systems currently available on the Web or on the Internet. It starts by describing the concept of information overload and introducing trust as a possible and powerful way to deal with it. It then provides a classification of the systems that currently use trust and, for each category, presents the most representative examples. In these systems, trust is considered as the judgment expressed by one user about another user, often directly and explicitly, sometimes indirectly through an evaluation of the artifacts produced by that user or his/her activity on the system. We hence use the term “trust” to indicate different types of social relationships between two users, such as friendship, appreciation, and interest. These trust relationships are used by the systems in order to infer some measure of importance about the different users and influence their visibility on the system. We conclude with an overview of the open and interesting challenges for online systems that use and model trust information.


Author(s):  
Zahid Ashraf Wani ◽  
Huma Shafiq

Nowadays, we all rely on cyberspace for our information needs. We make use of different types of search tools. Some of them have specialization in a specific format or two, while few can crawl a good portion of the web irrespective of formats. Therefore, it is very imperative for information professionals to have thorough understandings of these tools. As such, the chapter is an endeavor to delve deep and highlight various trends in online information retrieval from primitive to modern ones. The chapter also made an effort to envisage the future requirements and expectation keeping in view the ever-increasing dependence on diverse species of information retrieval tools.


Author(s):  
Pankaj Kamthan

In recent years, there has been a steady shift in the nature of Web applications. The vehicle of this transition of Web applications is us, the people. The ability to post photographs or videos, exchange music snippets with peers, and annotate a piece of information, are but a few exemplars of this phenomenon. Indeed, the pseudonym Web 2.0 (O’Reilly, 2005) has been used to describe the apparent “socialization” of the Web. In spite of the significant prospects offered by humancentric Web applications, the mere fact that virtually anyone can set up such applications claiming to sell products and services or upload/post unscrutinized information on a topic as being “definitive,” raises the issues of credibility from a consumers’ viewpoint. Therefore, establishing credibility is essential for an organization’s reputation and for building consumers’ trust. The rest of the article is organized as follows. We first provide the background necessary for later discussion. This is followed by the introduction of a framework within which different types of credibility in the context of human-centric Web applications can be systematically addressed and thereby improved. Next, challenges and directions for future research are outlined. Finally, concluding remarks are given.


Sign in / Sign up

Export Citation Format

Share Document