Publication: Typo correction in domain-specific texts using FastText
Program
KU-Authors
KU Authors
Co-Authors
Bayrak, Ahmet Tuğrul
Advisor
Publication Date
2020
Language
Turkish
Type
Conference proceeding
Journal Title
Journal ISSN
Volume Title
Abstract
Analyzing customer reviews are quite important for customer satisfaction. Customer reviews might contain spelling mistakes, which causes data pollution and decreases the efficiency of the analyzes. In this study, a domain-specific solution is proposed by using the data related to tourism. Even if there are several applications to correct typos in Turkish, domain-specific solutions are limited. Since a correction should be specific for the meaning of a typo, this study is required. For the study, a FastText model-oriented typo correction algorithm has been developed by using customer reviews in the tourism industry. The results are compared with a commonly used correction application and it is observed that the algorithm developed is more successful for correcting typos in tourism specific phrases.
Description
Source:
Proceedings - 2020 Innovations in Intelligent Systems and Applications Conference, ASYU 2020
Publisher:
Institute of Electrical and Electronics Engineers Inc.
Keywords:
Subject
Optical character recognition, Writing