UML for Document Modeling: Designing Document Structures for Massive and Systematic Production of XML-based Web Contents

Author(s):  
Alejandro Bia ◽  
Jaime Gómez
Author(s):  
Ronald Haentjens Dekker ◽  
David J. Birnbaum

The XML tree paradigm has several well-known limitations for document modeling and processing. Some of these have received a lot of attention (especially overlap), and some have received less (e.g., discontinuity, simultaneity, transposition, white space as crypto-overlap). Many of these have work-arounds, also well known, but—as is implicit in the term “work-around”—these work-arounds have disadvantages. Because they get the job done, however, and because XML has a large user community with diverse levels of technological expertise, it is difficult to overcome inertia and move to a technology that might offer a more comprehensive fit with the full range of document structures with which researchers need to interact both intellectually and programmatically. A high-level analysis of why XML has the limitations it has can enable us to explore how an alternative model of Text as Graph (TAG) might address these types of structures and tasks in a more natural and idiomatic way than is available within an XML paradigm.


2013 ◽  
Vol 28 (4) ◽  
pp. 640-659 ◽  
Author(s):  
M. Grassi ◽  
C. Morbidoni ◽  
M. Nucci ◽  
S. Fonda ◽  
F. Piazza
Keyword(s):  

2004 ◽  
Vol 45 (1) ◽  
pp. 19-34 ◽  
Author(s):  
Norihito Fujita ◽  
Yuichi Ishikawa ◽  
Atsushi Iwata ◽  
Rauf Izmailov

2013 ◽  
Vol 427-429 ◽  
pp. 2554-2557
Author(s):  
Jin Kun Pan ◽  
Dong Sheng Li

With the popularity of location-based services, Web contents are being geo-tagged and spatial keyword queries that retrieve objects satisfying both spatial and keyword conditions are gaining in prevalence. The existing spatial keyword queries focus on exact match or prefix match of the keywords cannot satisfy the demand of wildcard based imprecise match in many realistic scenes. Aiming to solve this problem, two methods which are fit for different situation are put forward: the inverted file and R-tree integrated index which fits for the situation that requires high time efficiency and the Prefix Bloom Filter and R-tree integrated index which fits for the situation requiring high space efficiency. The effectiveness of the two indexes is valid through experiments.


Author(s):  
Abdul Razaque ◽  
Bakhytzhan Valiyev ◽  
Bandar Alotaibi ◽  
Munif Alotaibi ◽  
Saule Amanzholova ◽  
...  

The Dark Web is known as a place triggering a variety of criminal activities. Anonymization techniques enable illegal operations, leading to the loss of confidential information and its further use as bait, a trade product or even a crime tool. Despite technical progress, there is still not enough awareness of the Dark Web and its secret activity. In this study, we introduced the Dark Web Enhanced Analysis (DWEA), in order to analyze and gather information about the content accessed on the Dark Net based on data characteristics. The research was performed to identify how the Dark Web has been influenced by recent global events, such as the COVID-19 epidemic. The research included the usage of a crawler, which scans the network and collects data for further analysis with machine learning. The result of this work determines the influence of the COVID-19 epidemic on the Dark Net.


Author(s):  
David Martín ◽  
Ortzi Torices ◽  
Hugo Salas ◽  
Carlos Lamsfus ◽  
Aurkene Alzua-Sorzabal

2019 ◽  
pp. 161-179
Author(s):  
Erdal Ozkaya ◽  
Rafiqul Islam
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document