PyFin-sentiment: Towards a machine-learning-based model for deriving sentiment from financial tweets

Research output: Journal contributions › Journal articles › Research › peer-reviewed

Responding to the poor performance of generic automated sentiment analysis solutions on domain-specific texts, we collect a dataset of 10,000 tweets discussing the topics of finance and investing. We manually assign each tweet its market sentiment, i.e., the investor's anticipation of a stock's future return. Using this data, we show that all existing sentiment models trained on adjacent domains struggle with accurate market sentiment analysis due to the task's specialized vocabulary. Consequently, we design, train, and deploy our own sentiment model. It outperforms all previous models (VADER, NTUSD-Fin, FinBERT, TwitterRoBERTa) when evaluated on Twitter posts. On posts from a different platform, our model performs on par with BERT-based large language models. We achieve this result at a fraction of the training and inference costs due to the model's simple design. We publish the artifact as a Python library to facilitate its use by future researchers and practitioners.

Original language: English
Article number: 100171
Journal: International Journal of Information Management Data Insights
Volume: 3
Issue number: 1
Number of pages: 10
Publication status: Published - 01.04.2023
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2023 The Author(s)
