web robot detection
Recently Published Documents


TOTAL DOCUMENTS

17
(FIVE YEARS 1)

H-INDEX

8
(FIVE YEARS 0)

2021 ◽  
Vol 2010 (1) ◽  
pp. 012161
Author(s):  
Xue Chen ◽  
Yang Song ◽  
Wei Xiong ◽  
Yutao Lu ◽  
Xingen Wang

2020 ◽  
Vol 50 (11) ◽  
pp. 4017-4028
Author(s):  
Athanasios Lagopoulos ◽  
Grigorios Tsoumakas

Author(s):  
A. A. Menshchikov ◽  
Yu. A. Gatchin

Recent research suggests that robotic traffic on web resources now exceeds user traffic in both volume and intensity. Web robots threaten data privacy and copyright, and they degrade performance, security, and usage statistics, so efficient methods for detecting and defending against them are needed. Existing techniques rely on syntactic and analytical processing of web server logs. This article proposes analysing the graph of web robot visits, taking into account visit times as well as the topical connectivity of the visited pages. We present an algorithm for data selection and cleansing and for extracting semantic features of the pages of a web resource, together with the proposed detection parameters. We describe in detail how the ground truth is formed and the principles by which existing sessions are labelled as legitimate or robotic, and we propose using the capabilities of a web server to identify sessions uniquely. The clustering procedure and the selection of a suitable classification model are discussed; for each model studied, hyperparameters are selected and the results are cross-validated. We analyse detection performance and accuracy and compare them with existing approaches. Empirical results on real web resources show that the proposed method achieves better web robot detection accuracy and precision than existing approaches.
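The pipeline this abstract describes (grouping log records into sessions, extracting timing and topical-connectivity features, then classifying) can be sketched as follows. This is a minimal illustration, not the authors' method: the feature names, the `topic` field, and the rule thresholds are all assumptions; the paper itself uses clustering plus a trained classifier with tuned hyperparameters rather than a fixed rule.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# Hypothetical log record: (session_id, timestamp, path, topic).
# The paper identifies sessions via web server capabilities; here we
# simply assume each record already carries a session identifier.

def extract_features(records):
    """Group log records by session and compute simple per-session features."""
    sessions = defaultdict(list)
    for sid, ts, path, topic in records:
        sessions[sid].append((ts, path, topic))
    features = {}
    for sid, hits in sessions.items():
        hits.sort(key=lambda h: h[0])
        times = [h[0] for h in hits]
        gaps = [(b - a).total_seconds() for a, b in zip(times, times[1:])]
        topics = [h[2] for h in hits]
        # Rough stand-in for the paper's semantic-connectivity idea:
        # fraction of consecutive visits that share a topic.
        same = sum(1 for a, b in zip(topics, topics[1:]) if a == b)
        coherence = same / max(len(topics) - 1, 1)
        features[sid] = {
            "requests": len(hits),
            "mean_gap_s": sum(gaps) / len(gaps) if gaps else 0.0,
            "topic_coherence": coherence,
        }
    return features

def looks_robotic(f, min_requests=10, max_gap_s=1.0, max_coherence=0.3):
    """Toy rule: many rapid requests with low topical coherence look robotic."""
    return (f["requests"] >= min_requests
            and f["mean_gap_s"] <= max_gap_s
            and f["topic_coherence"] <= max_coherence)
```

In the paper these session features would feed a clustering step and a cross-validated classifier; the hand-set thresholds above only stand in for that learned decision boundary.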


2017 ◽  
Vol 87 ◽  
pp. 129-140 ◽  
Author(s):  
Mahdieh Zabihimayvan ◽  
Reza Sadeghi ◽  
H. Nathan Rude ◽  
Derek Doran

2016 ◽  
Vol 33 (6) ◽  
pp. 592-606 ◽  
Author(s):  
Derek Doran ◽  
Swapna S. Gokhale

2016 ◽  
Vol 34 (3) ◽  
pp. 500-520 ◽  
Author(s):  
Joseph W. Greene

Purpose
The purpose of this paper is to investigate the impact of web robots on usage statistics collected by Open Access (OA) institutional repositories (IRs), and techniques for mitigating their effects.

Design/methodology/approach
A close review of the literature provides a comprehensive list of web robot detection techniques. Reviews of system documentation and open source code are carried out, along with personal interviews, to compare the robot detection techniques used in the major IR platforms. An empirical test based on a simple random sample of downloads, with 96.20 per cent certainty, measures the accuracy of an IR's web robot detection at a large Irish university.

Findings
While web robot detection is not ignored in IRs, there are areas where the two main systems could be improved. The technique tested here successfully detected 94.18 per cent of web robots visiting the site over a two-year period (recall), with a precision of 98.92 per cent. Due to the high level of robot activity in repositories, correctly labelling more robots has an outsized effect on the accuracy of usage statistics.

Research limitations/implications
This study is performed on one repository using a single system. Future studies across multiple sites and platforms are needed to determine the accuracy of web robot detection in OA repositories generally.

Originality/value
This is the only study to date to have investigated web robot detection in IRs. It puts forward the first empirical benchmarking of accuracy in IR usage statistics.
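The recall (94.18 per cent) and precision (98.92 per cent) figures reported in the Findings are standard classification metrics over the robot class. As a small illustration of how such a benchmark is computed from labelled download records (the function name and boolean encoding are ours, not the paper's):

```python
def precision_recall(labels, predictions):
    """Precision and recall for the 'robot' class.

    labels, predictions: parallel sequences of booleans, True = robot.
    Precision = TP / (TP + FP): of the downloads flagged as robotic,
    how many truly were. Recall = TP / (TP + FN): of the truly robotic
    downloads, how many were flagged.
    """
    tp = sum(1 for y, p in zip(labels, predictions) if y and p)
    fp = sum(1 for y, p in zip(labels, predictions) if not y and p)
    fn = sum(1 for y, p in zip(labels, predictions) if y and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall
```

High precision with slightly lower recall, as reported here, means the detector rarely mislabels human downloads but still misses some robots, and each missed robot inflates the repository's usage counts.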


2012 ◽  
Vol 38 (2) ◽  
pp. 118-126 ◽  
Author(s):  
Shinil Kwon ◽  
Young-Gab Kim ◽  
Sungdeok Cha
