Document Type

Conference Proceeding

Publication Date

2011

Publication Title

Proceedings of the 2011 Workshop on Historical Document Imaging and Processing

Abstract

Thousands of documents written in Syriac script by early Christian theologians are of unknown provenance and uncertain date, partly due to a shortage of human expertise. This paper addresses the problem of attribution by developing a novel algorithm for offline handwriting style identification and document retrieval, demonstrated on a set of documents in the Estrangelo variant of Syriac writing. The method employs a feature vector based upon the estimated affine transformation of actual observed characters, character parts, and voids within characters as compared to a hypothetical average or ideal form. Experiments on seventy-six pages from nineteen Syriac manuscripts written by different scribes show that the method can identify pages written in the same hand with high precision, even with documents that exhibit various challenging forms of degradation.

Creative Commons License

Creative Commons Attribution 4.0 License
This work is licensed under a Creative Commons Attribution 4.0 License.

Rights

© the authors

Comments

Author’s submitted manuscript.

hip11talk.pdf (1396 kB)

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.