Revisiting Supervised Contrastive Learning for Microblog Classification

Junbo Huang; Ricardo Usbeck

doi:10.18653/v1/2024.emnlp-main.876

Revisiting Supervised Contrastive Learning for Microblog Classification

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Junbo Huang
Ricardo Usbeck

Professorship for Information Systems, in particular Artificial Intelligence and Explainability

Microblog content (e.g., Tweets) is noisy due to its informal use of language and its lack of contextual information within each post. To tackle these challenges, state-of-the-art microblog classification models rely on pre-training language models (LMs). However, pre-training dedicated LMs is resource-intensive and not suitable for small labs. Supervised contrastive learning (SCL) has shown its effectiveness with small, available resources. In this work, we examine the effectiveness of fine-tuning transformer-based language models, regularized with a SCL loss for English microblog classification. Despite its simplicity, the evaluation on two English microblog classification benchmarks (TweetEval and Tweet Topic Classification) shows an improvement over baseline models. The result shows that, across all subtasks, our proposed method has a performance gain of up to 11.9 percentage points. All our models are open source.

Original language	English
Title of host publication	The 2024 Conference on Empirical Methods in Natural Language Processing : Proceedings of the Conference; November 12-16, 2024
Editors	Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Number of pages	10
Place of Publication	Kerrville
Publisher	Association for Computational Linguistics
Publication date	2024
Pages	15644-15653
ISBN (electronic)	979-8-89176-164-3
DOIs	https://doi.org/10.18653/v1/2024.emnlp-main.876
Publication status	Published - 2024
Event	Conference on Empirical Methods in Natural Language Processing - EMNLP 2024 - Hyatt Regency Miami Hotel, Miami, United States Duration: 12.11.2024 → 16.11.2024 Conference number: 29 https://2024.emnlp.org/

Bibliographical note

Publisher Copyright:
© 2024 Association for Computational Linguistics.

Research areas

Business informatics

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Salnikov, M., Sakhovskiy, A., Nikishina, I., Usmanova, A., Kraft, A., Möller, C., Banerjee, D., Huang, J., Jiang, L., Abdullah, R., Yan, X., Tutubalina, E., Usbeck, R. & Panchenko, A., 2026, Natural Language Processing and Information Systems: 30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, Proceedings. Ichise, R. (ed.). Springer Science and Business Media Deutschland, p. 95-110 16 p. (Lecture Notes in Computer Science; vol. 15836 LNCS).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Möller, C. & Usbeck, R., 2025, The Semantic Web: 22nd European Semantic Web Conference, ESWC 2025 Portoroz, Slovenia, June 1–5, 2025 Proceedings, Part I. Curry, E., Acosta, M., Poveda-Villalón, M., van Erp, M., Ojo, A., Hose, K., Shimizu, C. & Lisena, P. (eds.). Cham: Springer Nature Switzerland AG, Vol. 1. p. 460-480 21 p. (Lecture Notes in Computer Science ; vol. 15718).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Automating SPARQL Query Translations between DBpedia and Wikidata

Bartels, M. C., Banerjee, D. & Usbeck, R., 14.07.2025, Linking Meaning: Semantic Technologies Shaping the Future of AI: Cover 74617 Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Spahiu, B., Vahdati, S., Salatino, A., Pellegrini, T. & Havur, G. (eds.). IOS Press BV, p. 176-193 18 p. (Studies on the Semantic Web; vol. 62).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research

Bridge-Generate: Scholarly Hybrid Question Answering

Taffa, T. A. & Usbeck, R., 23.05.2025, WWW Companion 2025 - Companion Proceedings of the ACM Web Conference 2025: Companion Proceedings of the ACM Web Conference 2025, April 28-May 2, 2025 Sydney, NSW, Australia. Long, G., Blumestein, M., Chang, Y., Lewin-Eytan, L., Huang, H. & Yom-Tov, E. (eds.). New York: Association for Computing Machinery, Inc, p. 1321-1325 5 p.

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

HySQA: Hybrid Scholarly Question Answering

Taffa, T., Banerjee, D., Assabie, Y. & Usbeck, R., 26.08.2025, Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Vol. 62. p. 247 17 p. (Studies on the Semantic Web).

Research output: Contributions to collected editions/works › Chapter › peer-review

DOI

https://doi.org/10.18653/v1/2024.emnlp-main.876
Final published version

Revisiting Supervised Contrastive Learning for Microblog Classification

Authors

Bibliographical note

Research areas

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Automating SPARQL Query Translations between DBpedia and Wikidata

Bridge-Generate: Scholarly Hybrid Question Answering

HySQA: Hybrid Scholarly Question Answering

DOI

Recently viewed

Researchers

Projects

Activities

Prizes

Publications