WikiEvents - A Novel Resource for NLP Downstream Tasks
Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review
Standard
ESWC 2023 Workshops and Tutorials Joint Proceedings: Joint Proceedings of the ESWC 2023 Workshops and Tutorials, Hersonissos, Greece, May 28-29, 2023. ed. / Mehwish Alam; Cassia Trojahn; Sven Hertling; Catia Pesquita; Christian Aebeloe; Hidir Aras; Amr Azzam; Juan Cano; John Domingue; Simon Gottschalk; Olaf Hartig; Katja Hose; Sabrina Kirrane; Pasquale Lisena; Francesco Osborne; Philipp Rohde; Luc Steels; Ruben Taelman; Aisling Third; Ilaria Tiddi; Rima Türker. Vol. 3443 Sun Site Central Europe (RWTH Aachen University), 2023. (CEUR Workshop Proceedings).
RIS
TY - CHAP
T1 - WikiEvents - A Novel Resource for NLP Downstream Tasks
AU - Michaelis, Lars
AU - Huang, Junbo
AU - Usbeck, Ricardo
N1 - Publisher Copyright: © 2023 Copyright for this paper by its authors.
PY - 2023
Y1 - 2023
N2 - Efficient Natural Language Processing (NLP) models require large amounts of training data. Manually creating training data is time-consuming. We present WikiEvents, an automatically curated dataset based on Wikipedia’s Current Events portal. WikiEvents is a novel knowledge graph that aims to provide data for various event-centric NLP tasks, such as event-related location extraction and entity linking. Therefore, WikiEvents includes event summaries with linked entities and locations. WikiEvents also provides spatial and temporal information about extracted events for various use case analyses. We leverage the NLP Interchange Format (NIF) ontology and an event-specific novel ontology - CoyPu. We evaluate the suitability regarding NLP tasks by (1) training three BERT models on event-related location extraction with data queried from WikiEvents and (2) comparing WikiEvents to the existing entity linking dataset AIDA-YAGO2. Qualitative, event-related research capabilities are explored by querying data from WikiEvents for multiple use cases and visualizing it.
KW - CoyPu
KW - Dataset
KW - Event Detection
KW - Event Extraction
KW - Events
KW - Knowledge Graph
KW - NIF
KW - NLP
KW - Business informatics
KW - Informatics
UR - http://www.scopus.com/inward/record.url?scp=85168666356&partnerID=8YFLogxK
M3 - Article in conference proceedings
AN - SCOPUS:85168666356
VL - 3443
T3 - CEUR Workshop Proceedings
BT - ESWC 2023 Workshops and Tutorials Joint Proceedings
A2 - Alam, Mehwish
A2 - Trojahn, Cassia
A2 - Hertling, Sven
A2 - Pesquita, Catia
A2 - Aebeloe, Christian
A2 - Aras, Hidir
A2 - Azzam, Amr
A2 - Cano, Juan
A2 - Domingue, John
A2 - Gottschalk, Simon
A2 - Hartig, Olaf
A2 - Hose, Katja
A2 - Kirrane, Sabrina
A2 - Lisena, Pasquale
A2 - Osborne, Francesco
A2 - Rohde, Philipp
A2 - Steels, Luc
A2 - Taelman, Ruben
A2 - Third, Aisling
A2 - Tiddi, Ilaria
A2 - Türker, Rima
PB - Sun Site Central Europe (RWTH Aachen University)
T2 - Joint of the 20th European Semantic Web Conference - Workshops and Tutorials, ESWC-JP 2023
Y2 - 28 May 2023 through 29 May 2023
ER -