html parser
Recently Published Documents


TOTAL DOCUMENTS

5
(FIVE YEARS 0)

H-INDEX

0
(FIVE YEARS 0)

2020 ◽  
Vol 31 (3) ◽  
pp. 89
Author(s):  
Suhad Malalla ◽  
Ayat Hadi Ali

The random key plays a basic role in the design of any cryptographic algorithms. In this paper, a proposed model for generating a random key is presented, which will be used for security purposes. Chicken swarm optimization (CSO) and HTML parser are used for scattering the bits of the key. The statistical tests of randomness have given good and acceptable security results. The proposed key generation method was programmed by JavaScript language.


Web has understood an astonishing change in human access to learning and information. The need of plotting an upgraded program for the ostensibly tried. The present structure helps the apparently tried people to use the information in the web satisfactorily, by changing over the substance in the site page to voice for their better use. The customer can look through the substance and indispensable information from the web by simply composing the URL. The site page substances are removed by JSOUP HTML parser. The isolated substance will be scrutinized out by Text to Speech (TTS) engine. The weights in this system are the apparently tried need to type URL the required in the change box, there are no contrasting options to control TTS and there are no decisions for investigating through site pages. The proposed structure is to arrange talk affirmation engine.


Author(s):  
James Wijaya
Keyword(s):  

Dengan adanya perkembangan teknologi informasi, orang-orang dapat mengakses berbagai informasi dari berbagai halaman web dengan menggunakan internet. Web Santapan Rohani adalah salah satu contoh website yang dapat digunakan oleh orang-orang terlebih khusus umat Kristiani untuk membaca renungan harian atau untuk melakukan saat teduh. Penelitian ini bertujuan menciptakan suatu teknologi ekstraksi informasi dari web Santapan Rohani yang berisikan renungan harian sehingga dapat membantu untuk analisa bagi penelitian-penelitian berikutnya yang dapat dikembangkan dari kehadiran teknologi ini. Halaman web memiliki bentuk yang semi-structured dan berisikan informasi berupa teks, gambar, video, URL, dan sebagainya. Hal ini menjadi kendala untuk dapat melakukan ekstraksi informasi dari halaman web. HTML Agility Pack merupakan salah satu tools terbaik yang dapat digunakan untuk melakukan HTML Parser dari suatu halaman web. Dengan menggunakan HTML Agility Pack dapat mempermudah untuk melakukan ekstraksi informasi dari berbagai halaman web, terlebih khusus untuk melakukan ekstraksi informasi pada renungan harian dari Web Santapan Rohani.


2013 ◽  
Vol 774-776 ◽  
pp. 1802-1806
Author(s):  
Zhi Ming Zhang ◽  
Shuai Shuai Huang ◽  
Ping Li

With the rapid development of Internet, and surge in the amount of information on the Internet, how to accurately and quickly get the information of the users really need, such as the title, links, and pictures, is the hotspot. This paper proposed a fast web information extraction method based on html parser, this paper validated the effect of the proposed method by extracting commodities information of e-commerce website, the results show that the accuracy of the information extraction by our method is higher than the extraction method based on regular expressions, and the extraction time is greatly shortened.


2013 ◽  
Vol 397-400 ◽  
pp. 1972-1978
Author(s):  
Song Pu Wu ◽  
Qing Wang

An adaptive web information extraction approach is presented in this paper. Most of the traditional web information extraction approaches depend on the templates of web sites. If the templates are changed, the information extraction rules should be redesigned. To reduce the maintenance costs and improve the adaptability of information extractors, an adaptive web information extraction approach is proposed based on the STU-DOM tree. The webpage is parsed into DOM Trees based on HTML Parser. Then DOM trees are filtered into STU-DOM trees to confirm blocks which contain keywords of a certain topic. The proposed approach is applied to webpages and the results show that the approach not only extracts information efficiently, but also is irrelevant to site structures.


Sign in / Sign up

Export Citation Format

Share Document