scholarly journals DWSA: An Intelligent Document Structural Analysis Model for Information Extraction and Data Mining

Electronics ◽  
2021 ◽  
Vol 10 (19) ◽  
pp. 2443
Author(s):  
Tan  Yue ◽  
Yong Li ◽  
Zonghai Hu

The structure of a document contains rich information such as logical relations in context, hierarchy, affiliation, dependence, and applicability. It will greatly affect the accuracy of document information processing, particularly of legal documents and business contracts. Therefore, intelligent document structural analysis is important to information extraction and data mining. However, unlike the well-studied field of text semantic analysis, current work in document structural analysis is still scarce. In this paper, we propose an intelligent document structural analysis framework through data pre-processing, feature engineering, and structural classification with a dynamic sample weighting algorithm. As a typical application, we collect more than 11,000 insurance document content samples and carry out the machine learning experiments to check the efficiency of our framework. Meanwhile, to address the sample imbalance problem in the hierarchy classification task, a dynamic sample weighting algorithm is incorporated into our Dynamic Weighting Structural Analysis (DWSA) framework, in which the weights of different category tags according to the structural levels are iterated dynamically in training. Our results show that the DWSA has significantly improved the comprehensive accuracy and the classification F1-score of each category. The comprehensive accuracy is as high as 94.68% (3.36% absolute improvement) and the Macro F1-score is 88.29% (5.1% absolute improvement).

2019 ◽  
Vol 267 ◽  
pp. 02001
Author(s):  
Liangli Xiao ◽  
Yan Liu ◽  
Zhuang Du ◽  
Zhao Yang ◽  
Kai Xu

This study combines specific high-rise shear wall residential projects with the Revit to demonstrate BIM application processes. The use of R-Star CAD may help to realize the link barrier of the building information model and the structural analysis software PKPM. Sequentially, the information supplement of the structural analysis model is completed by extracting the structural information with the Revit secondary development. By the collaborative design platform based on BIM technology, the paper examines the collision check of structural model, conducts collision analysis on other professional models and modifies the design scheme for conflict points. After the statistics of material usage, an optimized design is proposed. The findings of this paper could contribute to provide some reference for the specific application of BIM in structural design and realize the application of BIM technology in the process of building structure design.


Author(s):  
Houssam Nassif ◽  
Ryan Woods ◽  
Elizabeth Burnside ◽  
Mehmet Ayvaci ◽  
Jude Shavlik ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document