scholarly journals Text Semantic Annotation: A Distributed Methodology Based on Community Coherence

Algorithms ◽  
2020 ◽  
Vol 13 (7) ◽  
pp. 160
Author(s):  
Christos Makris ◽  
Georgios Pispirigos ◽  
Michael Angelos Simos

Text annotation is the process of identifying the sense of a textual segment within a given context to a corresponding entity on a concept ontology. As the bag of words paradigm’s limitations become increasingly discernible in modern applications, several information retrieval and artificial intelligence tasks are shifting to semantic representations for addressing the inherent natural language polysemy and homonymy challenges. With extensive application in a broad range of scientific fields, such as digital marketing, bioinformatics, chemical engineering, neuroscience, and social sciences, community detection has attracted great scientific interest. Focusing on linguistics, by aiming to identify groups of densely interconnected subgroups of semantic ontologies, community detection application has proven beneficial in terms of disambiguation improvement and ontology enhancement. In this paper we introduce a novel distributed supervised knowledge-based methodology employing community detection algorithms for text annotation with Wikipedia Entities, establishing the unprecedented concept of community Coherence as a metric for local contextual coherence compatibility. Our experimental evaluation revealed that deeper inference of relatedness and local entity community coherence in the Wikipedia graph bears substantial improvements overall via a focus on accuracy amelioration of less common annotations. The proposed methodology is propitious for wider adoption, attaining robust disambiguation performance.

Information ◽  
2020 ◽  
Vol 11 (4) ◽  
pp. 199 ◽  
Author(s):  
Christos Makris ◽  
Georgios Pispirigos ◽  
Ioannis Orestis Rizos

Presently, due to the extended availability of gigantic information networks and the beneficial application of graph analysis in various scientific fields, the necessity for efficient and highly scalable community detection algorithms has never been more essential. Despite the significant amount of published research, the existing methods—such as the Girvan–Newman, random-walk edge betweenness, vertex centrality, InfoMap, spectral clustering, etc.—have virtually been proven incapable of handling real-life social graphs due to the intrinsic computational restrictions that lead to mediocre performance and poor scalability. The purpose of this article is to introduce a novel, distributed community detection methodology which in accordance with the community prediction concept, leverages the reduced complexity and the decreased variance of the bagging ensemble methods, to unveil the subjacent community hierarchy. The proposed approach has been thoroughly tested, meticulously compared against different classic community detection algorithms, and practically proven exceptionally scalable, eminently efficient, and promisingly accurate in unfolding the underlying community structure.


2021 ◽  
Vol 54 (3) ◽  
pp. 1-35
Author(s):  
Matteo Magnani ◽  
Obaida Hanteer ◽  
Roberto Interdonato ◽  
Luca Rossi ◽  
Andrea Tagarelli

A multiplex network models different modes of interaction among same-type entities. In this article, we provide a taxonomy of community detection algorithms in multiplex networks. We characterize the different algorithms based on various properties and we discuss the type of communities detected by each method. We then provide an extensive experimental evaluation of the reviewed methods to answer three main questions: to what extent the evaluated methods are able to detect ground-truth communities, to what extent different methods produce similar community structures, and to what extent the evaluated methods are scalable. One goal of this survey is to help scholars and practitioners to choose the right methods for the data and the task at hand, while also emphasizing when such choice is problematic.


2021 ◽  
pp. 1-17
Author(s):  
Mohammed Al-Andoli ◽  
Wooi Ping Cheah ◽  
Shing Chiang Tan

Detecting communities is an important multidisciplinary research discipline and is considered vital to understand the structure of complex networks. Deep autoencoders have been successfully proposed to solve the problem of community detection. However, existing models in the literature are trained based on gradient descent optimization with the backpropagation algorithm, which is known to converge to local minima and prove inefficient, especially in big data scenarios. To tackle these drawbacks, this work proposed a novel deep autoencoder with Particle Swarm Optimization (PSO) and continuation algorithms to reveal community structures in complex networks. The PSO and continuation algorithms were utilized to avoid the local minimum and premature convergence, and to reduce overall training execution time. Two objective functions were also employed in the proposed model: minimizing the cost function of the autoencoder, and maximizing the modularity function, which refers to the quality of the detected communities. This work also proposed other methods to work in the absence of continuation, and to enable premature convergence. Extensive empirical experiments on 11 publically-available real-world datasets demonstrated that the proposed method is effective and promising for deriving communities in complex networks, as well as outperforming state-of-the-art deep learning community detection algorithms.


2018 ◽  
Vol 32 (14) ◽  
pp. 1850166 ◽  
Author(s):  
Lilin Fan ◽  
Kaiyuan Song ◽  
Dong Liu

Semi-supervised community detection is an important research topic in the field of complex network, which incorporates prior knowledge and topology to guide the community detection process. However, most of the previous work ignores the impact of the noise from prior knowledge during the community detection process. This paper proposes a novel strategy to identify and remove the noise from prior knowledge based on harmonic function, so as to make use of prior knowledge more efficiently. Finally, this strategy is applied to three state-of-the-art semi-supervised community detection methods. A series of experiments on both real and artificial networks demonstrate that the accuracy of semi-supervised community detection approach can be further improved.


2015 ◽  
Vol 719-720 ◽  
pp. 1198-1202
Author(s):  
Ming Yang Zhou ◽  
Zhong Qian Fu ◽  
Zhao Zhuo

Practical networks have community and hierarchical structure. These complex structures confuse the community detection algorithms and obscure the boundaries of communities. This paper proposes a delicate method which synthesizes spectral analysis and local synchronization to detect communities. Communities emerge automatically in the multi-dimension space of nontrivial eigenvectors. Its performance is compared to that of previous methods and applied to different practical networks. Our results perform better than that of other methods. Besides, it’s more robust for networks whose communities have different edge density and follow various degree distributions. This makes the algorithm a valuable tool to detect and analysis large practical networks with various community structures.


Sign in / Sign up

Export Citation Format

Share Document