Publication:
Automatic estimation of age distributions from the first Ottoman Empire population register series by using deep learning

dc.contributor.departmentDepartment of History
dc.contributor.departmentDepartment of History
dc.contributor.kuauthorKabadayı, Mustafa Erdem
dc.contributor.kuauthorCan, Yekta Said
dc.contributor.kuprofileFaculty Member
dc.contributor.schoolcollegeinstituteCollege of Social Sciences and Humanities
dc.contributor.yokid33267
dc.contributor.yokidN/A
dc.date.accessioned2024-11-09T11:46:51Z
dc.date.issued2021
dc.description.abstractRecently, an increasing number of studies have applied deep learning algorithms for extracting information from handwritten historical documents. In order to accomplish that, documents must be divided into smaller parts. Page and line segmentation are vital stages in the Handwritten Text Recognition systems; it directly affects the character segmentation stage, which in turn deter-mines the recognition success. In this study, we first applied deep learning-based layout analysis techniques to detect individuals in the first Ottoman population register series collected between the 1840s and the 1860s. Then, we employed horizontal projection profile-based line segmentation to the demographic information of these detected individuals in these registers. We further trained a CNN model to recognize automatically detected ages of individuals and estimated age distributions of people from these historical documents. Extracting age information from these historical registers is significant because it has enormous potential to revolutionize historical demography of around 20 successor states of the Ottoman Empire or countries of today. We achieved approximately 60% digit accuracy for recognizing the numbers in these registers and estimated the age distribution with Root Mean Square Error 23.61.
dc.description.fulltextYES
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.issue18
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuEU
dc.description.sponsorshipEuropean Union (EU)
dc.description.sponsorshipHorizon 2020
dc.description.sponsorshipEuropean Research Council (ERC)
dc.description.sponsorshipResearch and Innovation Program
dc.description.sponsorship“Industrialisation and Urban Growth from the mid-nineteenth century Ottoman Empire to Contemporary Turkey in a Comparative Perspective, 1850–2000” Project
dc.description.sponsorshipUrbanOccupationsOETR
dc.description.versionPublisher version
dc.description.volume10
dc.formatpdf
dc.identifier.doi10.3390/electronics10182253
dc.identifier.eissn2079-9292
dc.identifier.embargoNO
dc.identifier.filenameinventorynoIR03184
dc.identifier.linkhttps://doi.org/10.3390/electronics10182253
dc.identifier.quartileQ2
dc.identifier.scopus2-s2.0-85114732462
dc.identifier.urihttps://hdl.handle.net/20.500.14288/543
dc.identifier.wos699649600001
dc.keywordsArabic document processing
dc.keywordsConvolutional neural networks
dc.keywordsDigit detection and recognition
dc.keywordsLine segmentation
dc.keywordsPage segmentation
dc.keywordsProjection profiles
dc.languageEnglish
dc.publisherMultidisciplinary Digital Publishing Institute (MDPI)
dc.relation.grantno679097
dc.relation.urihttp://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/9945
dc.sourceElectronics
dc.subjectComputer science
dc.subjectEngineering
dc.subjectInformation systems
dc.subjectPhysics
dc.titleAutomatic estimation of age distributions from the first Ottoman Empire population register series by using deep learning
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.authorid0000-0003-3206-0190
local.contributor.authoridN/A
local.contributor.kuauthorKabadayı, Mustafa Erdem
local.contributor.kuauthorCan, Yekta Said
relation.isOrgUnitOfPublicationbe8432df-d124-44c3-85b4-be586c2db8a3
relation.isOrgUnitOfPublication.latestForDiscoverybe8432df-d124-44c3-85b4-be586c2db8a3

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
9945.pdf
Size:
2.19 MB
Format:
Adobe Portable Document Format