GANDR - Georelating Dataset, Metrics, and Evaluation

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Georelating has been introduced to learn geospatial representations of events from textual reports, which requires the interpretation of spatial relations. To foster the development and evaluation of Georelating systems, we construct the silver-standard Georelating Annotated Natural Disaster Reports dataset GANDR and benchmark our LLM agent architecture as a baseline (areal F1 = 0.609, fuzzy cell match score = 0.833) for this new task.GANDR comprises synthetic disaster reports referencing 1,000 US and 1,000 EU cities, annotated with Discrete Global Grid System (DGGS) cells for efficient geospatial integration. We propose a set of five complementary metrics capitalizing on the DGGS annotations for efficient and comprehensive evaluation.Analysis reveals the potential of reasoning LLMs integrated with geographical knowledge bases to address variation across spatial relations such as (inter-)cardinal directions. We highlight the estimation of the impact area's size as a key challenge of Georelating.
Original languageEnglish
Title of host publicationGeoAI '25: Proceedings of the 8th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery
EditorsShawn Newsam, Lexie Yang
Number of pages10
Place of PublicationNew York, NY, USA
PublisherAssociation for Computing Machinery
Publication date19.12.2025
Pages61-71
ISBN (print)9798400721793
DOIs
Publication statusPublished - 19.12.2025

    Research areas

  • georelating, LLMs, spatial reasoning, DGGS, dataset, metrics
  • Informatics

DOI