PyFin-sentiment: Towards a machine-learning-based model for deriving sentiment from financial tweets

Moritz Wilksch; Olga Abramova

doi:10.1016/j.jjimei.2023.100171

Publication: Contributions to journals › Journal articles › Research › peer-reviewed

Standard

PyFin-sentiment: Towards a machine-learning-based model for deriving sentiment from financial tweets. / Wilksch, Moritz; Abramova, Olga.
In: International Journal of Information Management Data Insights, Vol. 3, No. 1, 100171, 1 Apr 2023.

BibTeX

@article{7fc1c3958a6e4010b2fad49f5c2eaadb,

title = "PyFin-sentiment: Towards a machine-learning-based model for deriving sentiment from financial tweets",

abstract = "Responding to the poor performance of generic automated sentiment analysis solutions on domain-specific texts, we collect a dataset of 10,000 tweets discussing the topics of finance and investing. We manually assign each tweet its market sentiment, i.e., the investor's anticipation of a stock's future return. Using this data, we show that all existing sentiment models trained on adjacent domains struggle with accurate market sentiment analysis due to the task's specialized vocabulary. Consequently, we design, train, and deploy our own sentiment model. It outperforms all previous models (VADER, NTUSD-Fin, FinBERT, TwitterRoBERTa) when evaluated on Twitter posts. On posts from a different platform, our model performs on par with BERT-based large language models. We achieve this result at a fraction of the training and inference costs due to the model's simple design. We publish the artifact as a python library to facilitate its use by future researchers and practitioners.",

keywords = "Deep learning, Financial market sentiment, Machine learning, Opinion mining, Sentiment analysis, Business informatics, Informatics",

author = "Moritz Wilksch and Olga Abramova",

note = "Publisher Copyright: {\textcopyright} 2023 The Author(s)",

year = "2023",

month = apr,

day = "1",

doi = "10.1016/j.jjimei.2023.100171",

language = "English",

volume = "3",

journal = "International Journal of Information Management Data Insights",

issn = "2667-0968",

publisher = "Elsevier B.V.",

number = "1",

}

RIS

TY - JOUR

T1 - PyFin-sentiment

T2 - Towards a machine-learning-based model for deriving sentiment from financial tweets

AU - Wilksch, Moritz

AU - Abramova, Olga

PY - 2023/4/1

Y1 - 2023/4/1

N2 - Responding to the poor performance of generic automated sentiment analysis solutions on domain-specific texts, we collect a dataset of 10,000 tweets discussing the topics of finance and investing. We manually assign each tweet its market sentiment, i.e., the investor's anticipation of a stock's future return. Using this data, we show that all existing sentiment models trained on adjacent domains struggle with accurate market sentiment analysis due to the task's specialized vocabulary. Consequently, we design, train, and deploy our own sentiment model. It outperforms all previous models (VADER, NTUSD-Fin, FinBERT, TwitterRoBERTa) when evaluated on Twitter posts. On posts from a different platform, our model performs on par with BERT-based large language models. We achieve this result at a fraction of the training and inference costs due to the model's simple design. We publish the artifact as a python library to facilitate its use by future researchers and practitioners.

KW - Deep learning

KW - Financial market sentiment

KW - Machine learning

KW - Opinion mining

KW - Sentiment analysis

KW - Business informatics

KW - Informatics

UR - http://www.scopus.com/inward/record.url?scp=85150288650&partnerID=8YFLogxK

U2 - 10.1016/j.jjimei.2023.100171

DO - 10.1016/j.jjimei.2023.100171

M3 - Journal articles

AN - SCOPUS:85150288650

VL - 3

JO - International Journal of Information Management Data Insights

JF - International Journal of Information Management Data Insights

SN - 2667-0968

IS - 1

M1 - 100171

ER -

In the same journal

Collective response to the health crisis among German twitter users: A structural

Abramova, O., Batzel, K. & Modesti, D., Nov 2022, In: International Journal of Information Management Data Insights. 2, 2, 100126.

Publication: Contributions to journals › Journal articles › Research › peer-reviewed

Other publications by these authors

Inclusion of Autistic IT Workforce in Action: An Auticon Approach

Abramova, O., Recker, J., Schemm, U. & Barwitzki, L. D., Sep 2025, In: Information Systems Journal. 35, 5, pp. 1439-1459, 21 pp.

Publication: Contributions to journals › Journal articles › Research › peer-reviewed

Behind Videoconferencing Fatigue at Work: The Taxing Effects of Self-View and the Mediating Role of Public Self-Awareness

Abramova, O. & Gladkaya, M., Apr 2025, In: Business and Information Systems Engineering. 67, 2, pp. 227-245, 19 pp.

Publication: Contributions to journals › Journal articles › Research › peer-reviewed

Effects of an orderly vs. cluttered online environment on user behavior

Abramova, O. & Voronin, G., 2024, 45th International Conference on Information Systems, ICIS 2024: Digital Platforms for Emerging Societies. The Association for Information Systems (AIS), 17 pp. 3162. (45th International Conference on Information Systems, ICIS 2024).

Publication: Contributions to collected editions › Articles in conference proceedings › Research › peer-reviewed

The differential effects of self-view in virtual meetings when speaking vs. listening

Abramova, O., Gladkaya, M. & Krasnova, H., 2025, In: European Journal of Information Systems. 34, 2, pp. 230-248, 19 pp.

Publication: Contributions to journals › Journal articles › Research › peer-reviewed

The Predictive Power of Social Media Sentiment for Short-Term Stock Movements

Wilksch, M. & Abramova, O., 2022, Wirtschaftsinformatik 2022 Proceedings. Laumer, S. & Matzner, M. (eds.). The Association for Information Systems (AIS), 13 pp.

Publication: Contributions to collected editions › Articles in conference proceedings › Research

DOI

https://doi.org/10.1016/j.jjimei.2023.100171
Final published version
