A Novel Image Text Extraction Method Based on K-Means Clustering

Author(s):  
Yan Song ◽  
Anan Liu ◽  
Lin Pang ◽  
Shouxun Lin ◽  
Yongdong Zhang ◽  
...  
2014 ◽  
Vol 989-994 ◽  
pp. 3768-3772
Author(s):  
Xuan Qi Chen ◽  
Biao He ◽  
Guo Cheng Wang ◽  
Yao Xin Li

This paper presents a new method to achieve effective text extraction using mathematical morphology. Firstly, the document is segmented and divided into several parts based on the layout. And then, every part is dilated to big connected regions, whose biggest skeleton will be extracted and serve as a structure element (SE). Finally, a proposed region-concatenated operation with the SE will be employed, whose result can be the input of subsequent OCR system. Experimentally, the proposed method is robust to noise, the text orientation, font style and size, language and layout.


Sign in / Sign up

Export Citation Format

Share Document