scholarly journals Document classification systems in heterogeneous computing environments

Author(s):  
Nasibeh Nasiri ◽  
Philip Colangelo ◽  
Oren Segal ◽  
Martin Margala ◽  
Wim Vanderbauwhede
2018 ◽  
Vol 19 (1) ◽  
Author(s):  
Shakuntala Baichoo ◽  
Yassine Souilmi ◽  
Sumir Panji ◽  
Gerrit Botha ◽  
Ayton Meintjes ◽  
...  

2020 ◽  
Vol 11 (4) ◽  
pp. 149-193
Author(s):  
Shalini Puri ◽  
Satya Prakash Singh

Today, rapid digitization requires efficient bilingual non-image and image document classification systems. Although many bilingual NLP and image-based systems provide solutions for real-world problems, they primarily focus on text extraction, identification, and recognition tasks with limited document types. This article discusses a journey of these systems and provides an overview of their methods, feature extraction techniques, document sets, classifiers, and accuracy for English-Hindi and other language pairs. The gaps found lead toward the idea of a generic and integrated bilingual English-Hindi document classification system, which classifies heterogeneous documents using a dual class feeder and two character corpora. Its non-image and image modules include pre- and post-processing stages and pre-and post-segmentation stages to classify documents into predefined classes. This article discusses many real-life applications on societal and commercial issues. The analytical results show important findings of existing and proposed systems.


1996 ◽  
Vol 4 (2-3) ◽  
pp. 97-117 ◽  
Author(s):  
R. Aversa ◽  
N. Mazzocca ◽  
U. Villano

2005 ◽  
Vol 15 (04) ◽  
pp. 423-438
Author(s):  
RENATO P. ISHII ◽  
RODRIGO F. DE MELLO ◽  
LUCIANO J. SENGER ◽  
MARCOS J. SANTANA ◽  
REGINA H. C. SANTANA ◽  
...  

This paper presents a new model for the evaluation of the impacts of processing operations resulting from the communication among processes. This model quantifies the traffic volume imposed on the communication network by means of the latency parameters and the overhead. Such parameters represent the load that each process imposes over the network and the delay on CPU, as a consequence of the network operations. This delay is represented on the model by means of metric measurements slowdown. The equations that quantify the costs involved in the processing operation and message exchange are defined. In the same way, equations to determine the maximum network bandwidth are used in the decision-making scheduling. The proposed model uses a constant that delimitates the communication network maximum allowed usage, this constant defines two possible scheduling techniques: group scheduling or through communication network. Such techniques are incorporated to the DPWP policy, generating an extension of this policy. Experimental and simulation results confirm the performance enhancement of parallel applications under supervision of the extended DPWP policy, compared to the executions supervised by the original DPWP.


Sign in / Sign up

Export Citation Format

Share Document