Skip to main content

Advertisement

Springer Nature Link
Log in
Menu
Find a journal Publish with us Track your research
Search
Saved research
Cart
  1. Home
  2. Document Analysis Systems VII
  3. Conference paper

Offline Handwritten Arabic Character Segmentation with Probabilistic Model

  • Conference paper
  • pp 402–412
  • Cite this conference paper
Save conference paper
View saved research
Document Analysis Systems VII (DAS 2006)
Offline Handwritten Arabic Character Segmentation with Probabilistic Model
  • Pingping Xiu18,
  • Liangrui Peng18,
  • Xiaoqing Ding18 &
  • …
  • Hua Wang18 

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3872))

Included in the following conference series:

  • International Workshop on Document Analysis Systems
  • 1938 Accesses

  • 16 Citations

Abstract

The research on offline handwritten Arabic character recognition has received more and more attention in recent years, because of the increasing needs of Arabic document digitization. The variation in Arabic handwriting brings great difficulty in character segmentation and recognition, eg., the sub-parts (diacritics) of the Arabic character may shift away from the main part. In this paper, a new probabilistic segmentation model is proposed. First, a contour-based over-segmentation method is conducted, cutting the word image into graphemes. The graphemes are sorted into 3 queues, which are character main parts, sub-parts (diacritics) above or below main parts respectively. The confidence for each character is calculated by the probabilistic model, taking into account both of the recognizer output and the geometric confidence besides with logical constraint. Then, the global optimization is conducted to find optimal cutting path, taking weighted average of character confidences as objective function. Experiments on handwritten Arabic documents with various writing styles show the proposed method is effective.

This paper is supported by National Natural Science Foundation of China (project 60472002).

Download to read the full chapter text

Chapter PDF

Similar content being viewed by others

Survey on Segmentation and Recognition of Handwritten Arabic Script

Article 06 June 2020

Contribution on Arabic Handwriting Recognition Using Deep Neural Network

Chapter © 2021

Arabic Handwritten Recognition Using Deep Learning: A Survey

Article 23 January 2022

Explore related subjects

Discover the latest articles, books and news in related subjects, suggested using machine learning.
  • Arabic
  • Automated Pattern Recognition
  • Biometrics
  • ESCRT
  • Object Recognition
  • Shape Analysis
  • Handwriting Recognition Systems Using Deep Learning Techniques

References

  1. Al-Yousefi, H., Udpa, S.S.: Recognition of Arabic characters. IEEE Transactions on Pattern Analysis and Machine Intelligence (1992)

    Google Scholar 

  2. Amin, A., Mari, J.F.: Machine recognition and correction of printed Arabic text. IEEE Transactions on Systems, Man and Cybernetics (1989)

    Google Scholar 

  3. Amin, A., Al-Sadoun, H.B.: A new segmentation technique of Arabic text. in Pattern Recognition. In: Conference B: Proceedings of 11th IAPR International Conference on Pattern Recognition Methodology and Systems, vol. II (1992)

    Google Scholar 

  4. Sari, T., Souici, L., Sellami, M.: Off-line handwritten Arabic character Segmentation algorithm: ACSA. In: Proceedings of Eighth International Workshop on Frontiers in Handwriting Recognition, pp. 452–457 (2002)

    Google Scholar 

  5. Olivier, C., et al.: Segmentation and Coding of Arabic Handwritten Words. In: 13th International Conference on Pattern Recognition, ICPR 1996 (1996)

    Google Scholar 

  6. Jin, J., et al.: Printed Arabic document recognition system. Vision Geometry XIII. In: Latecki, L.J., Mount, D.M., Wu, A.Y. (eds.) Proceedings of the SPIE, vol. 5676, pp. 48–55 (2004)

    Google Scholar 

  7. Cheung, A., Bennamoun, M., Bergmann, N.W.: A recognition-based Arabic optical character recognition system. In: IEEE International Conference on Systems, Man, and Cybernetics (1998)

    Google Scholar 

  8. Pechwitz, M., Maergner, V.: HMM based approach for handwritten Arabic word recognition using the IFN/ENIT - database. In: Proceedings of Seventh International Conference on Document Analysis and Recognition (2003)

    Google Scholar 

  9. Fakir, M., Hassani, M.M., Sodeyama, C.: Recognition of Arabic characters using Karhunen-Loeve transform anddynamic programming. In: Proceedings of 1999 IEEE International Conference on Systems, Man, and Cybernetics. IEEE SMC 1999 (1999)

    Google Scholar 

  10. Dehghan, M., et al.: Holistic handwritten word recognition using discrete HMM and self-organizing feature map. In: Proc. IEEE Int. Conf. Syst. Man Cybern. (2000)

    Google Scholar 

  11. Bortolozzi, F., et al.: Recent advances in handwriting recognition. In: Proceedings of the IWDA 2005 (2005)

    Google Scholar 

  12. Sarfraz, M., Nawaz, S.N., Al-Khuraidly, A.: Offline Arabic Text Recognition System. In: 2003 International Conference on Geometric Modeling and Graphics, GMAG 2003 (2003)

    Google Scholar 

  13. Najoua, B.A., Noureddine, E.: A robust approach for Arabic printed character segmentation. In: Proceedings of the Third International Conference on Document Analysis and Recognition (1995)

    Google Scholar 

  14. Motawa, D., Amin, A., Sabourin, R.: Segmentation of Arabic cursive script. In: Proceedings of the Fourth International Conference on Document Analysis and Recognition (1997)

    Google Scholar 

  15. Bushofa, B.M.F., Spann, M.: Segmentation of Arabic characters using their contour information. In: The 1997 13th International Conference on Digital Signal Processing, DSP. Part 2 (of 2) (1997)

    Google Scholar 

  16. Romeo-Pakker, K., Miled, H., Lecourtier, Y.: A new approach for Latin/Arabic character segmentation. In: Proceedings of the Third International Conference on Document Analysis and Recognition (1995)

    Google Scholar 

  17. Tolba, M.F., Shaddad, E.: On the automatic reading of printed Arabic characters. In: Proceedings of IEEE International Conference on Systems, Man and Cybernetics, pp. 496–498 (1990)

    Google Scholar 

  18. Maergner, V.: SARAT-a system for the recognition of Arabic printed text. In: Conference B: Proceedings of 11th IAPR International Conference on Pattern Recognition Methodology and Systems. Pattern Recognition, vol. II (1992)

    Google Scholar 

  19. Elgammal, A.M., Ismail, M.A.: A Graph-Based Segmentation and Feature-Extraction Framework for Arabic Text Recognition. In: Sixth InternationalConference on Document Analysis and Recognition, ICDAR 2001 (2001)

    Google Scholar 

  20. Lethelier, E., Leroux, M., Poste, M.G.L.: An automatic reading system for handwritten numeral amounts on French checks. In: Proceedings of the Third International Conference on Document Analysis and Recognition (1995)

    Google Scholar 

  21. Wang, H., et al.: New statistical method for machine-printed Arabic character recognition. In: Smith, E.H.B., Taghva, K. (eds.) Proceedings of SPIE. Document Recognition and Retrieval XII, vol. 5676 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

  1. Dept. of Electronic Engineering, Tsinghua University, State Key Laboratory of Intelligent Technology and Systems, 100084, Beijing, China

    Pingping Xiu, Liangrui Peng, Xiaoqing Ding & Hua Wang

Authors
  1. Pingping Xiu
    View author publications

    Search author on:PubMed Google Scholar

  2. Liangrui Peng
    View author publications

    Search author on:PubMed Google Scholar

  3. Xiaoqing Ding
    View author publications

    Search author on:PubMed Google Scholar

  4. Hua Wang
    View author publications

    Search author on:PubMed Google Scholar

Editor information

Editors and Affiliations

  1. Institute of Computer Science and Applied Mathematics, University of Bern, Neubrückstrasse 10, CH-3012, Bern, Switzerland

    Horst Bunke

  2. DocRec Ltd, 34 Strathaven Place, 7001, Atawhai, Nelson, New Zealand

    A. Lawrence Spitz

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xiu, P., Peng, L., Ding, X., Wang, H. (2006). Offline Handwritten Arabic Character Segmentation with Probabilistic Model. In: Bunke, H., Spitz, A.L. (eds) Document Analysis Systems VII. DAS 2006. Lecture Notes in Computer Science, vol 3872. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11669487_36

Download citation

  • .RIS
  • .ENW
  • .BIB
  • DOI: https://doi.org/10.1007/11669487_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32140-8

  • Online ISBN: 978-3-540-32157-6

  • eBook Packages: Computer ScienceComputer Science (R0)Springer Nature Proceedings Computer Science

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Optical Character Recognition
  • Logical Rule
  • Integral Character
  • Arabic Text
  • Stroke Width

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us

Policies and ethics

Societies and partnerships

  • The International Association for Pattern Recognition
    The International Association for Pattern Recognition (opens in a new tab)

Search

Navigation

  • Find a journal
  • Publish with us
  • Track your research

Footer Navigation

Discover content

  • Journals A-Z
  • Books A-Z

Publish with us

  • Journal finder
  • Publish your research
  • Language editing
  • Open access publishing

Products and services

  • Our products
  • Librarians
  • Societies
  • Partners and advertisers

Our brands

  • Springer
  • Nature Portfolio
  • BMC
  • Palgrave Macmillan
  • Apress
  • Discover

Corporate Navigation

  • Your US state privacy rights
  • Accessibility statement
  • Terms and conditions
  • Privacy policy
  • Help and support
  • Legal notice
  • Cancel contracts here

162.0.217.198

Not affiliated

Springer Nature

© 2026 Springer Nature