Document Type

Conference Proceeding

Publication Date


Publication Title

Workshop on Historical Image Processing


This paper describes a set of hand-isolated character samples selected from securely dated manuscripts written in Syriac between 300 and 1300 C.E., which are being made available for research purposes. The collection can be used for a number of applications, including ground truth for character segmentation and form analysis for paleographical dating. Several applications based upon convolutional neural networks demonstrate the possibilities of the data set.


Syriac, paleography, historical manuscripts, handwriting style


Creative Commons License

Creative Commons Attribution 4.0 International License
This work is licensed under a Creative Commons Attribution 4.0 International License.


“Licensed to Smith College and distributed CC-BY under the Smith College Faculty Open Access Policy.”


CCS CONCEPTS • Applied computing → Digital libraries and archives; Arts and humanities

Author’s submitted manuscript.



To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.