In this paper, an efficient approach to segment Persian off-line handwritten text-line into characters is presented. The proposed algorithm first traces the baseline of the input text-line image and straightens it. Subsequently, it over-segments each word/subwords using features extracted from histogram analysis and then removes extra segmentation points using some baseline dependent as well as language dependent rules. We tested the proposed character segmentation scheme with 2 different datasets. On a test set of 899 Persian words/subwords created by us, 90.26% of the characters were segmented correctly. From another dataset of 200 handwritten Arabic word images we obtained 93.49% correct segmentation accuracy.
Conference proceeding
A baseline dependent approach for Persian handwritten character segmentation
Proceedings of the 20th International Conference on Pattern Recognition, pp.1977-1980
2010 20th International Conference on Pattern Recognition (Istanbul, Turkey, 23/08/2010 - 26/08/2010)
2010
Metrics
30 Record Views
Abstract
Details
- Title
- A baseline dependent approach for Persian handwritten character segmentation
- Creators
- Ali Reza Alaei (Author) - Griffith UniversityP Nagabhushan (Author) - Indian Institute of Information Technology Allahabad, IndiaUmapada Pal (Author) - Indian Statistical Institute
- Publication Details
- Proceedings of the 20th International Conference on Pattern Recognition, pp.1977-1980
- Conference
- 2010 20th International Conference on Pattern Recognition (Istanbul, Turkey, 23/08/2010 - 26/08/2010)
- Publisher
- IEEE; USA
- Number of pages
- 1977-1980
- Identifiers
- 2016; 991012822014202368
- Academic Unit
- Information Technology; Faculty of Science and Engineering; School of Business and Tourism; Faculty of Business, Law and Arts
- Resource Type
- Conference proceeding