Publication:
Assessing the predictive power of social media data-fed large language models on vote behavior

dc.contributor.departmentGraduate School of Social Sciences and Humanities
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.kuauthorAtsızelti, Şükrü
dc.contributor.kuauthorBarkhordar, Ehsan
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SOCIAL SCIENCES AND HUMANITIES
dc.date.accessioned2024-12-29T09:36:59Z
dc.date.issued2024
dc.description.abstractThis article investigates how large language models (LLMs) reflect human preferences and exhibit biases influenced by the diversity and nature of their input data. We used survey data related to Turkish presidential elections alongside tweets to assess the predictive performance and bias manifestations of LLMs under three different data inclusion strategies: (1) using only demographic information, (2) integrating demographic information with tweets, and (3) relying solely on tweets. Our findings reveal that prompts enriched with tweets typically achieve higher F1 Macro scores. However, this trend differs significantly when examining classes individually. While user-generated content significantly improves performance in predictions related to Recep Tayyip Erdogan, it does not show the same effect for Kemal Klllçdaroglu. This study shows that different models and prompting styles result in varied biases for each candidate, leading to mixed outcomes. These results underscore the importance of exploring how biases vary across different scenarios, models, and prompting strategies for each case.
dc.description.indexedbyScopus
dc.description.openaccessAll Open Access
dc.description.openaccessBronze Open Access
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuN/A
dc.identifier.doi10.1145/3630744.3659831
dc.identifier.isbn979-840070453-6
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85197181722
dc.identifier.urihttps://doi.org/10.1145/3630744.3659831
dc.identifier.urihttps://hdl.handle.net/20.500.14288/22215
dc.keywordsLarge language models
dc.keywordsPredictive analytics
dc.keywordsSocial media
dc.keywordsVoter behavior
dc.language.isoeng
dc.publisherAssociation for Computing Machinery, Inc
dc.relation.ispartofCompanion Proceedings of the 16th ACM Web Science Conference, Websci Companion 2024 - Reflecting on the Web, AI and Society
dc.subjectComputational linguistics
dc.subjectPopulation statistics
dc.subjectSocial networking
dc.titleAssessing the predictive power of social media data-fed large language models on vote behavior
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorBarkhordar, Ehsan
local.contributor.kuauthorAtsızelti, Şükrü
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1GRADUATE SCHOOL OF SOCIAL SCIENCES AND HUMANITIES
local.publication.orgunit2Graduate School of Sciences and Engineering
local.publication.orgunit2Graduate School of Social Sciences and Humanities
relation.isOrgUnitOfPublicatione192fff1-4efe-45a7-ab71-30233fc185a9
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscoverye192fff1-4efe-45a7-ab71-30233fc185a9
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublicationc5c9bf5f-4655-411c-a602-0d68f2e2ad88
relation.isParentOrgUnitOfPublication.latestForDiscovery434c9663-2b11-4e66-9399-c863e2ebae43

Files