Publication: Developing an automatic layout analysis system for Ottoman population registers
Program
KU-Authors
KU Authors
Co-Authors
Advisor
Publication Date
2020
Language
Turkish
Type
Conference proceeding
Journal Title
Journal ISSN
Volume Title
Abstract
For extracting information from the historical documents, digitization efforts have increased dramatically in the recent decades. Accurate layout analysis will help researchers for developing more robust HTR and OCR techniques which will extract meaningful information from these documents. Variable layouts, low quality and distorted images of historical documents create different problems to deal with when compared to modern document processing. Arabic script features have even more problems for these automatic processing systems. In this study, we have developed a tool for automatically analyzing the layouts of the first Ottoman population registers which are written in Arabic script form. We built a dataset for testing the performance of our system which are chosen from the first population records of the Ottoman Empire between the 1840s and 1860s. We successfully classified two different object types in those documents.
Description
Source:
2020 28th Signal Processing and Communications Applications Conference (Siu)
Publisher:
IEEE
Keywords:
Subject
Engineering, Electrical electronic engineering, Telecommunications