WikiEvents - A Novel Resource for NLP Downstream Tasks

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

WikiEvents - A Novel Resource for NLP Downstream Tasks. / Michaelis, Lars; Huang, Junbo; Usbeck, Ricardo.
ESWC 2023 Workshops and Tutorials Joint Proceedings: Joint Proceedings of the ESWC 2023 Workshops and Tutorials, Hersonissos, Greece, May 28-29, 2023.. ed. / Mehwish Alam; Cassia Trojahn; Sven Hertling; Catia Pesquita; Christian Aebeloe; Hidir Aras; Amr Azzam; Juan Cano; John Domingue; Simon Gottschalk; Olaf Hartig; Katja Hose; Sabrina Kirrane; Pasquale Lisena; Francesco Osborne; Philipp Rohde; Luc Steels; Ruben Taelman; Aisling Third; Ilaria Tiddi; Rima Türker. Vol. 3443 Sun Site Central Europe (RWTH Aachen University), 2023. (CEUR Workshop Proceedings).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Michaelis, L, Huang, J & Usbeck, R 2023, WikiEvents - A Novel Resource for NLP Downstream Tasks. in M Alam, C Trojahn, S Hertling, C Pesquita, C Aebeloe, H Aras, A Azzam, J Cano, J Domingue, S Gottschalk, O Hartig, K Hose, S Kirrane, P Lisena, F Osborne, P Rohde, L Steels, R Taelman, A Third, I Tiddi & R Türker (eds), ESWC 2023 Workshops and Tutorials Joint Proceedings: Joint Proceedings of the ESWC 2023 Workshops and Tutorials, Hersonissos, Greece, May 28-29, 2023.. vol. 3443, CEUR Workshop Proceedings, Sun Site Central Europe (RWTH Aachen University), Joint of the 20th European Semantic Web Conference - Workshops and Tutorials, ESWC-JP 2023, Hersonissos, Greece, 28.05.23. <https://ceur-ws.org/Vol-3443/ESWC_2023_SEMMES_WikiEvents.pdf>

APA

Michaelis, L., Huang, J., & Usbeck, R. (2023). WikiEvents - A Novel Resource for NLP Downstream Tasks. In M. Alam, C. Trojahn, S. Hertling, C. Pesquita, C. Aebeloe, H. Aras, A. Azzam, J. Cano, J. Domingue, S. Gottschalk, O. Hartig, K. Hose, S. Kirrane, P. Lisena, F. Osborne, P. Rohde, L. Steels, R. Taelman, A. Third, I. Tiddi, ... R. Türker (Eds.), ESWC 2023 Workshops and Tutorials Joint Proceedings: Joint Proceedings of the ESWC 2023 Workshops and Tutorials, Hersonissos, Greece, May 28-29, 2023. (Vol. 3443). (CEUR Workshop Proceedings). Sun Site Central Europe (RWTH Aachen University). https://ceur-ws.org/Vol-3443/ESWC_2023_SEMMES_WikiEvents.pdf

Vancouver

Michaelis L, Huang J, Usbeck R. WikiEvents - A Novel Resource for NLP Downstream Tasks. In Alam M, Trojahn C, Hertling S, Pesquita C, Aebeloe C, Aras H, Azzam A, Cano J, Domingue J, Gottschalk S, Hartig O, Hose K, Kirrane S, Lisena P, Osborne F, Rohde P, Steels L, Taelman R, Third A, Tiddi I, Türker R, editors, ESWC 2023 Workshops and Tutorials Joint Proceedings: Joint Proceedings of the ESWC 2023 Workshops and Tutorials, Hersonissos, Greece, May 28-29, 2023.. Vol. 3443. Sun Site Central Europe (RWTH Aachen University). 2023. (CEUR Workshop Proceedings).

Bibtex

@inbook{fca2ffb3552944479d4405f7280d1586,
title = "WikiEvents - A Novel Resource for NLP Downstream Tasks",
abstract = "Efficient Natural Language Processing (NLP) models require large amounts of training data. Manually creating training data is time-consuming. We present WikiEvents, an automatically curated dataset based on Wikipedia{\textquoteright}s Current Events portal. WikiEvents is a novel knowledge graph that aims to provide data for various event-centric NLP tasks, such as event-related location extraction and entity linking. Therefore, WikiEvents includes event summaries with linked entities and locations. WikiEvents also provides spatial and temporal information about extracted events for various use case analyses. We leverage the NLP Interchange Format (NIF) ontology and an event-specific novel ontology - CoyPu. We evaluate the suitability regarding NLP tasks by (1) training three BERT models on event-related location extraction with data queried from WikiEvents and (2) comparing WikiEvents to the existing entity linking dataset AIDA-YAGO2. Qualitative, event-related research capabilities are explored by querying data from WikiEvents for multiple use cases and visualizing it.",
keywords = "CoyPu, Dataset, Event Detection, Event Extraction, Events, Knowledge Graph, NIF, NLP, Business informatics, Informatics",
author = "Lars Michaelis and Junbo Huang and Ricardo Usbeck",
note = "Publisher Copyright: {\textcopyright} 2023 Copyright for this paper by its authors.; Joint of the 20th European Semantic Web Conference - Workshops and Tutorials, ESWC-JP 2023 ; Conference date: 28-05-2023 Through 29-05-2023",
year = "2023",
language = "English",
volume = "3443",
series = "CEUR Workshop Proceedings",
publisher = "Sun Site Central Europe (RWTH Aachen University)",
editor = "Mehwish Alam and Cassia Trojahn and Sven Hertling and Catia Pesquita and Christian Aebeloe and Hidir Aras and Amr Azzam and Juan Cano and John Domingue and Simon Gottschalk and Olaf Hartig and Katja Hose and Sabrina Kirrane and Pasquale Lisena and Francesco Osborne and Philipp Rohde and Luc Steels and Ruben Taelman and Aisling Third and Ilaria Tiddi and Rima T{\"u}rker",
booktitle = "ESWC 2023 Workshops and Tutorials Joint Proceedings",
address = "Germany",
url = "https://2023.eswc-conferences.org/about/",

}

RIS

TY - CHAP

T1 - WikiEvents - A Novel Resource for NLP Downstream Tasks

AU - Michaelis, Lars

AU - Huang, Junbo

AU - Usbeck, Ricardo

N1 - Publisher Copyright: © 2023 Copyright for this paper by its authors.

PY - 2023

Y1 - 2023

N2 - Efficient Natural Language Processing (NLP) models require large amounts of training data. Manually creating training data is time-consuming. We present WikiEvents, an automatically curated dataset based on Wikipedia’s Current Events portal. WikiEvents is a novel knowledge graph that aims to provide data for various event-centric NLP tasks, such as event-related location extraction and entity linking. Therefore, WikiEvents includes event summaries with linked entities and locations. WikiEvents also provides spatial and temporal information about extracted events for various use case analyses. We leverage the NLP Interchange Format (NIF) ontology and an event-specific novel ontology - CoyPu. We evaluate the suitability regarding NLP tasks by (1) training three BERT models on event-related location extraction with data queried from WikiEvents and (2) comparing WikiEvents to the existing entity linking dataset AIDA-YAGO2. Qualitative, event-related research capabilities are explored by querying data from WikiEvents for multiple use cases and visualizing it.

AB - Efficient Natural Language Processing (NLP) models require large amounts of training data. Manually creating training data is time-consuming. We present WikiEvents, an automatically curated dataset based on Wikipedia’s Current Events portal. WikiEvents is a novel knowledge graph that aims to provide data for various event-centric NLP tasks, such as event-related location extraction and entity linking. Therefore, WikiEvents includes event summaries with linked entities and locations. WikiEvents also provides spatial and temporal information about extracted events for various use case analyses. We leverage the NLP Interchange Format (NIF) ontology and an event-specific novel ontology - CoyPu. We evaluate the suitability regarding NLP tasks by (1) training three BERT models on event-related location extraction with data queried from WikiEvents and (2) comparing WikiEvents to the existing entity linking dataset AIDA-YAGO2. Qualitative, event-related research capabilities are explored by querying data from WikiEvents for multiple use cases and visualizing it.

KW - CoyPu

KW - Dataset

KW - Event Detection

KW - Event Extraction

KW - Events

KW - Knowledge Graph

KW - NIF

KW - NLP

KW - Business informatics

KW - Informatics

UR - http://www.scopus.com/inward/record.url?scp=85168666356&partnerID=8YFLogxK

M3 - Article in conference proceedings

AN - SCOPUS:85168666356

VL - 3443

T3 - CEUR Workshop Proceedings

BT - ESWC 2023 Workshops and Tutorials Joint Proceedings

A2 - Alam, Mehwish

A2 - Trojahn, Cassia

A2 - Hertling, Sven

A2 - Pesquita, Catia

A2 - Aebeloe, Christian

A2 - Aras, Hidir

A2 - Azzam, Amr

A2 - Cano, Juan

A2 - Domingue, John

A2 - Gottschalk, Simon

A2 - Hartig, Olaf

A2 - Hose, Katja

A2 - Kirrane, Sabrina

A2 - Lisena, Pasquale

A2 - Osborne, Francesco

A2 - Rohde, Philipp

A2 - Steels, Luc

A2 - Taelman, Ruben

A2 - Third, Aisling

A2 - Tiddi, Ilaria

A2 - Türker, Rima

PB - Sun Site Central Europe (RWTH Aachen University)

T2 - Joint of the 20th European Semantic Web Conference - Workshops and Tutorials, ESWC-JP 2023

Y2 - 28 May 2023 through 29 May 2023

ER -

Recently viewed

Publications

  1. Biodiversity–stability relationships strengthen over time in a long-term grassland experiment
  2. Notting Hill Gate
  3. Performativierung des Raums
  4. How perceived security risk influences acceptance of virtual shopping walls
  5. Hysteresis Analysis and Control of a Metal-Polymer Hybrid Soft Actuator
  6. Which Potential Linguistic Challenges do Pre-Service Teachers Identify in a Mathematical Expository Text?
  7. Förderung von Gesundheitskompetenzen mit Location-based Games. Eine partizipative Entwicklung
  8. Wirtschaftsinformatik
  9. Some studies on the thermal-expansion behavior of C-Fiber, SiCp, and in-situ Mg2Si-reinforced AZ31 Mg alloy-based hybrid composites
  10. Betriebliche Umweltinformationssysteme
  11. Reinforcing Systems of Exclusion
  12. Kunst im Nationalsozialismus
  13. Effect of heat treatment on the microstructure and creep behavior of Mg-Sn-Ca alloys
  14. Tectono-climatic controls of the early rift alluvial succession
  15. Is the wild rabbit (Oryctolagus cuniculus) a threatened species in Spain? Sociological constraints in the conservation of species
  16. Is Lean Production Really Lean?
  17. Motivation related to work
  18. Polizei und Gewalt: Editoral
  19. Alteration of share capital
  20. Qualitätssicherung in der Lehrerbildung
  21. Hot working mechanisms and texture development in Mg-3Sn-2Ca-0.4Al alloy
  22. Development of a Sustainability Balanced Scorecard
  23. Geschlechtsspezifische Perspektiven auf das Unternehmertum
  24. Crop variety and prey richness affect spatial patterns of human-wildlife conflicts in Iran's Hyrcanian forests
  25. Biorefineries in Germany
  26. When mortality knocks
  27. A Hybrid Extended Kalman Filter as an Observer for a Pot-Electro-Magnetic Actuator
  28. Über die (Un)Möglichkeit Co-Creation zu managen
  29. Choreographen der Gewalt
  30. e-learning für das Fach Statistik
  31. Actor-Network Theory II
  32. Effect of biaxial compressive stress state on the microstructure evolution and deformation compatibility of rolled sheet Mg alloy AZ31 at room temperature
  33. Is Export Diversification good for Productivity? First Evidence for Manufacturing Enterprises in Germany
  34. Tablets im Sportunterricht!? Echt? Wow!