hyperlink analysis
Recently Published Documents


TOTAL DOCUMENTS

39
(FIVE YEARS 0)

H-INDEX

8
(FIVE YEARS 0)

2020 ◽  
Vol 38 (5/6) ◽  
pp. 1073-1093
Author(s):  
Chaoqun Wang ◽  
Zhongyi Hu ◽  
Raymond Chiong ◽  
Yukun Bao ◽  
Jiang Wu

Purpose The aim of this study is to propose an efficient rule extraction and integration approach for identifying phishing websites. The proposed approach can elucidate patterns of phishing websites and identify them accurately. Design/methodology/approach Hyperlink indicators along with URL-based features are used to build the identification model. In the proposed approach, very simple rules are first extracted based on individual features to provide meaningful and easy-to-understand rules. Then, the F-measure score is used to select high-quality rules for identifying phishing websites. To construct a reliable and promising phishing website identification model, the selected rules are integrated using a simple neural network model. Findings Experiments conducted using self-collected and benchmark data sets show that the proposed approach outperforms 16 commonly used classifiers (including seven non–rule-based and four rule-based classifiers as well as five deep learning models) in terms of interpretability and identification performance. Originality/value Investigating patterns of phishing websites based on hyperlink indicators using the efficient rule-based approach is innovative. It is not only helpful for identifying phishing websites, but also beneficial for extracting simple and understandable rules.


2019 ◽  
Vol 9 (4) ◽  
pp. 36-49
Author(s):  
Vasantha Thangasamy

Information available on the internet is wide, diverse, and dynamic. Since an enormous amount of information is available online, finding similarity between webpages using efficient hyperlink analysis is a challenging task. In this article, the researcher proposes an improved PageSim algorithm which measurse the importance of a webpage based on the PageRank values of connected webpage. Therefore, the proposed algorithm uses heterogeneous propagation of the PageRank score, based on the prestige measure of each webpage. The existing and the improved PageSim algorithms are implemented with a sample web graph. Real time Citation Networks, namely the ZEWAIL Citation Network and the DBLP Citation Network are used to test and compare the existing and improved PageSim algorithms. By using this proposed algorithm, it has been found that a similarity score between two different webpages significantly increases based on common information features and significantly decreases based on distinct factors.


2017 ◽  
Vol 22 (2) ◽  
pp. 027004 ◽  
Author(s):  
Yang Zhao ◽  
Rui-Na Dai ◽  
Xiang Xiao ◽  
Zong Zhang ◽  
Lian Duan ◽  
...  

2016 ◽  
Vol 19 (9) ◽  
pp. 1331-1348 ◽  
Author(s):  
Harsh Taneja

This article argues that maps of the Web’s structure based solely on technical infrastructure such as hyperlinks may bear little resemblance to maps based on Web usage, as cultural factors drive the latter to a larger extent. To test this thesis, the study constructs two network maps of 1000 globally most popular Web domains, one based on hyperlinks and the other using an “audience-centric” approach with ties based on shared audience traffic between these domains. Analyses of the two networks reveal that unlike the centralized structure of the hyperlink network with few dominant “core” Websites, the audience network is more decentralized and clustered to a larger extent along geo-linguistic lines.


2015 ◽  
Vol 173 ◽  
pp. 16-26 ◽  
Author(s):  
Julián Alarte ◽  
David Insa ◽  
Josep Silva ◽  
Salvador Tamarit

Sign in / Sign up

Export Citation Format

Share Document