IDENTIFICATION OF MOST SUITABLE BINARISATION METHODS FOR ACEHNESE ANCIENT MANUSCRIPTS RESTORATION SOFTWARE USER GUIDE

Authors

  • Fardian Fardian Electrical Engineering Department, Syiah Kuala University, Banda Aceh, Indonesia
  • Fitri Arnia Electrical Engineering Department, Syiah Kuala University, Banda Aceh, Indonesia
  • Sayed Muchallil Electrical Engineering Department, Syiah Kuala University, Banda Aceh, Indonesia
  • Khairul Munadi Electrical Engineering Department, Syiah Kuala University, Banda Aceh, Indonesia

DOI:

https://doi.org/10.11113/jt.v77.6668

Keywords:

Restoration Software, Binarisation, Document Degradation

Abstract

The Aceh Museum stores many digitized ancient manuscripts from hundreds of years ago. The condition of those manuscripts has degraded into several degradation types such as uneven contrast, show through effects, background spots, and text fading, which cause decreasing readability. A binarisation method is used to decrease the degradation effect on ancient manuscripts. Our research team is currently working on developing application software that consists of five binarisation methods, namely Otsu, Niblack, Sauvola, Lu, and Su for ancient manuscript restoration for the Aceh Museum staff to improve documents’ readability. In practice, a user still finds it difficult to choose the best method because there is no method that works best on every ancient manuscript for different types of degradation. This paper intends to determine a binarisation method that suits most manuscript conditions. The method used in this research includes the identification and classification of degradation types from 200 ancient Aceh digital manuscripts, followed by cropping the manuscripts to the size of 256 x 256 pixels. As many as five cropped areas from each degradation type are selected as research samples. These samples are binarisated using the methods. The last step is finding the most suitable binarisation method for each degradation type and classifying which methods are considered to have good readability, and that achieves at least 80% recall and precision values. From our experiments, we found that the Su binarisation methods demonstrate the best performance overall for every degradation type. Otsu, Lu, and Su are suited for uneven background; Sauvola, Lu, and Su are suited for showthrough effects; Otsu, Sauvola, and Su are suited for background spots; and Otsu and Su are suited for both text and background blurring and ‘fox’.  

References

F. Stanco, L. Tenze and G. Ramponi, 2007.Technique to correct yellowing and foxing in antique books, IET Image Process. 1(2):123133,.

E.Kavallieratou, E.Stamatatos, 2006. Improving the quality of degraded document images, IEEE proceedings of dial, 340-349, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

Ntirogiannis, K. et al. 2012. A Combined approach for the binarization of handwritten document images. Pattern Recognition Letters. 35: 3-15

Fitri, A., M. Fardian, M. Sayed, and K. Munadi. 2014. Improvement of Binarization Perfornance By Applying DCT as Pre-Processing Procedure. Communications, Control and Signal Processing (ISCCSP), 6th International Symposium. 128-132. http://dx.doi.org/10.1109/ISCCSP.2014.6877832

Niblack, W. 1986. Introduction to digital image processing. Prentice Hall, New Jersey, pp 115-116.

Khurshid, K., I. Shiddiq, C. Faure, and N. Vincent. 2009. Comparison of Niblack inspired binarization methods for ancient documents. SPIE Proceedings, 16th Document Recognition and Retrieval Conference, DRR-09, col 7247. 1-10.

Bolan, S., L. Shijian, T. Chew Lim. 2013. Robust Document Image Binarization Technique for Degraded Document Images. IEEE Transactions on Image Processing. 22: 408 - 1417.

Albrecht, H. 2014. Timbuktu Manuscripts Project for the Preservation and Promotion of African Literary Heritage. Department of Culture Studies and Oriental Languages University of Oslo. Accesed: December 28, 2014 from http://www.hf.uio.no/ikos/english/research/projects/timbuktu/

Ogier, J.-M., K. Tombre. 2006. Madonne: Document Image Analysis Techniques for Cultural Heritage Documents. International Conference on Digital Cultural Heritage, Aug. 2006, Vienna, Austria.

Journet, Nicholas, et al. Dedicated texture based tools for characterisation of old books. Document Image Analysis for Libraries, 2006. DIAL'06. Second International Conference on. IEEE, 2006..

Le Bourgeois, Frank, and Hubert Emptoz.2007. "Debora: Digital access to books of the renaissance." International Journal of Document Analysis and Recognition (IJDAR) 9. 2-4: 193-221.

Deng, Fanbo, et al. BinarizationShop: a user-assisted software suite for converting Old Documents To Black-And-White. Proceedings Of The 10th Annual Joint Conference On Digital Libraries. ACM, 2010.

Wang, Q., C. L. Tan. 2001. Matching of Double-Sided Document Images to Remove Interference. Proceedings from the IEEE Computer Vision and Patter Recogintion. 1: 1084-1089

Downloads

Published

2015-12-11

How to Cite

IDENTIFICATION OF MOST SUITABLE BINARISATION METHODS FOR ACEHNESE ANCIENT MANUSCRIPTS RESTORATION SOFTWARE USER GUIDE. (2015). Jurnal Teknologi (Sciences & Engineering), 77(22). https://doi.org/10.11113/jt.v77.6668