Publications with Fulltext
Permanent URI for this collectionhttps://hdl.handle.net/20.500.14288/6
Browse
4 results
Search Results
Publication Open Access Automatic CNN-based Arabic numeral spotting and handwritten digit recognition by using deep transfer learning in Ottoman population registers(Multidisciplinary Digital Publishing Institute (MDPI), 2020) Department of History; Kabadayı, Mustafa Erdem; Can, Yekta Said; Faculty Member; Department of History; College of Social Sciences and Humanities; 33267; N/AHistorical manuscripts and archival documentation are handwritten texts which are the backbone sources for historical inquiry. Recent developments in the digital humanities field and the need for extracting information from the historical documents have fastened the digitization processes. Cutting edge machine learning methods are applied to extract meaning from these documents. Page segmentation (layout analysis), keyword, number and symbol spotting, handwritten text recognition algorithms are tested on historical documents. For most of the languages, these techniques are widely studied and high performance techniques are developed. However, the properties of Arabic scripts (i.e., diacritics, varying script styles, diacritics, and ligatures) create additional problems for these algorithms and, therefore, the number of research is limited. In this research, we first automatically spotted the Arabic numerals from the very first series of population registers of the Ottoman Empire conducted in the mid-nineteenth century and recognized these numbers. They are important because they held information about the number of households, registered individuals and ages of individuals. We applied a red color filter to separate numerals from the document by taking advantage of the structure of the studied registers (numerals are written in red). We first used a CNN-based segmentation method for spotting these numerals. In the second part, we annotated a local Arabic handwritten digit dataset from the spotted numerals by selecting uni-digit ones and tested the Deep Transfer Learning method from large open Arabic handwritten digit datasets for digit recognition. We achieved promising results for recognizing digits in these historical documents.Publication Open Access Automatic detection of road types from the third military mapping survey of Austria-Hungary historical map series with deep convolutional neural networks(Institute of Electrical and Electronics Engineers (IEEE), 2021) Department of History; Kabadayı, Mustafa Erdem; Can, Yekta Said; Gerrits, Petrus Johannes; Faculty Member; Department of History; College of Social Sciences and Humanities; 33267; N/A; N/AWith the increased amount of digitized historical documents, information extraction from them gains pace. Historical maps contain valuable information about historical, geographical and economic aspects of an era. Retrieving information from historical maps is more challenging than processing modern maps due to lower image quality, degradation of documents and the massive amount of non-annotated digital map archives. Convolutional Neural Networks (CNN) solved many image processing challenges with great success, but they require a vast amount of annotated data. For historical maps, this means an unprecedented scale of manual data entry and annotation. In this study, we first manually annotated the Third Military Mapping Survey of Austria-Hungary historical map series conducted between 1884 and 1918 and made them publicly accessible. We recognized different road types and their pixel-wise positions automatically by using a CNN architecture and achieved promising results.Publication Open Access Stressed or just running? differentiation of mental stress and physical activity by using machine learning(TÜBİTAK, 2022) Department of History; Can, Yekta Said; Department of History; College of Social Sciences and HumanitiesRecently, modern people have excessive stress in their daily lives. With the advances in physiological sensors and wearable technology, people's physiological status can be tracked, and stress levels can be recognized for providing beneficial services. Smartwatches and smartbands constitute the majority of wearable devices. Although they have an excellent potential for physiological stress recognition, some crucial issues need to be addressed, such as the resemblance of physiological reaction to stress and physical activity, artifacts caused by movements and low data quality. This paper focused on examining and differentiating physiological responses to both stressors and physical activity. Physiological data are collected in the laboratory environment, which contain relaxed, stressful and physically active states and they are differentiated successfully by using machine learning.Publication Open Access Automatic estimation of age distributions from the first Ottoman Empire population register series by using deep learning(Multidisciplinary Digital Publishing Institute (MDPI), 2021) Department of History; Kabadayı, Mustafa Erdem; Can, Yekta Said; Faculty Member; Department of History; College of Social Sciences and Humanities; 33267; N/ARecently, an increasing number of studies have applied deep learning algorithms for extracting information from handwritten historical documents. In order to accomplish that, documents must be divided into smaller parts. Page and line segmentation are vital stages in the Handwritten Text Recognition systems; it directly affects the character segmentation stage, which in turn deter-mines the recognition success. In this study, we first applied deep learning-based layout analysis techniques to detect individuals in the first Ottoman population register series collected between the 1840s and the 1860s. Then, we employed horizontal projection profile-based line segmentation to the demographic information of these detected individuals in these registers. We further trained a CNN model to recognize automatically detected ages of individuals and estimated age distributions of people from these historical documents. Extracting age information from these historical registers is significant because it has enormous potential to revolutionize historical demography of around 20 successor states of the Ottoman Empire or countries of today. We achieved approximately 60% digit accuracy for recognizing the numbers in these registers and estimated the age distribution with Root Mean Square Error 23.61.