SMART CRAWLER: A TWO-STAGE CRAWLER FOR EFFICIENTLY HARVESTING DEEP-WEB INTERFACES

Keyword(s):  
Deep Web ◽  
2016 ◽  
Vol 9 (4) ◽  
pp. 608-620 ◽  
Author(s):  
Feng Zhao ◽  
Jingyu Zhou ◽  
Chang Nie ◽  
Heqing Huang ◽  
Hai Jin
Keyword(s):  
Deep Web ◽  

2016 ◽  
Vol 5 (4) ◽  
pp. 20
Author(s):  
SHARMA NIKITHA ◽  
DEVI V. SOWMYA ◽  
Keyword(s):  

2016 ◽  
Vol 1 (1) ◽  
pp. 40-44
Author(s):  
Suchetadevi M. Gaikwad ◽  
Sanjay B. Thakare

As the deep web grows, there has been increased interest in methods that help locate deep-web interfaces efficiently. However, because of the huge volume and varying nature of the deep web, achieving both wide coverage and high efficiency is a difficult problem. We propose a three-stage framework, an Enhanced Crawler, for efficiently gathering deep-web interfaces. In the first stage, the enhanced crawler performs site-based searching for center pages using automated search engines, which avoids visiting a large number of pages and wasting time. In the second stage, the enhanced crawler achieves fast in-site browsing by fetching the most relevant links with an adaptive link ranking. As a further enhancement, our system ranks and prioritizes websites and also uses a link tree data structure to achieve deep coverage. In the third stage, our system provides a pre-query processing mechanism that helps users write their search queries easily by offering character-by-character keyword search with ranked indexing.
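The third stage (character-by-character keyword search with ranked indexing) can be illustrated with a minimal sketch. The abstract does not say how the index is built, so everything here is an assumption made for illustration: the class name RankedPrefixIndex, the suggest method, the use of a sorted keyword list with binary search, and the sample keywords and scores are all hypothetical.

```python
from bisect import bisect_left


class RankedPrefixIndex:
    """Toy ranked keyword index (hypothetical; not the authors' implementation)."""

    def __init__(self, keyword_scores):
        # keyword_scores maps keyword -> relevance score (illustrative values only).
        self._scores = {k.lower(): s for k, s in keyword_scores.items()}
        # Sorted keys let a binary search find the first candidate for a prefix.
        self._keys = sorted(self._scores)

    def suggest(self, prefix, limit=5):
        """Return up to `limit` keywords starting with `prefix`, best score first."""
        prefix = prefix.lower()
        if not prefix:
            return []
        start = bisect_left(self._keys, prefix)
        matches = []
        # Keys sharing a prefix are contiguous in sorted order, so stop at the
        # first key that no longer matches.
        for key in self._keys[start:]:
            if not key.startswith(prefix):
                break
            matches.append(key)
        # Rank by descending score; break ties alphabetically.
        matches.sort(key=lambda k: (-self._scores[k], k))
        return matches[:limit]


index = RankedPrefixIndex({
    "deep web": 0.9,
    "deep learning": 0.4,
    "deep-web interface": 0.8,
    "crawler": 0.7,
})

# Simulate a user typing one character at a time.
for typed in ("d", "de", "deep-", "c"):
    print(typed, "->", index.suggest(typed))
```

Each keystroke triggers a new lookup against the same index, so the suggestions narrow as the prefix grows. A trie would be another reasonable choice for the same purpose.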

