Publication:
Developing an automatic layout analysis system for Ottoman population registers

Placeholder

Organizational Units

Program

KU Authors

Co-Authors

Advisor

Publication Date

2020

Language

Turkish

Type

Conference proceeding

Journal Title

Journal ISSN

Volume Title

Abstract

For extracting information from the historical documents, digitization efforts have increased dramatically in the recent decades. Accurate layout analysis will help researchers for developing more robust HTR and OCR techniques which will extract meaningful information from these documents. Variable layouts, low quality and distorted images of historical documents create different problems to deal with when compared to modern document processing. Arabic script features have even more problems for these automatic processing systems. In this study, we have developed a tool for automatically analyzing the layouts of the first Ottoman population registers which are written in Arabic script form. We built a dataset for testing the performance of our system which are chosen from the first population records of the Ottoman Empire between the 1840s and 1860s. We successfully classified two different object types in those documents.

Description

Source:

2020 28th Signal Processing and Communications Applications Conference (Siu)

Publisher:

IEEE

Keywords:

Subject

Engineering, Electrical electronic engineering, Telecommunications

Citation

Endorsement

Review

Supplemented By

Referenced By

Copy Rights Note

0

Views

0

Downloads

View PlumX Details