Information Extraction from Invoices: A Graph Neural Network Approach for Datasets with High Layout Variety

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Extracting information from invoices is a highly structured, recurrent task in auditing. Automating this task would yield efficiency improvements, while simultaneously improving audit quality. The challenge for this endeavor is to account for the text layout on invoices and the high variety of layouts across different issuers. Recent research has proposed graphs to structurally represent the layout on invoices and to apply graph convolutional networks to extract the information pieces of interest. However, the effectiveness of graph-based approaches has so far been shown only on datasets with a low variety of invoice layouts. In this paper, we introduce a graph-based approach to information extraction from invoices and apply it to a dataset of invoices from multiple vendors. We show that our proposed model extracts the specified key items from a highly diverse set of invoices with a macro F 1 score of 0.8753.

OriginalspracheEnglisch
TitelInnovation Through Information Systems - Volume II : A Collection of Latest Research on Technology Issues
HerausgeberFrederik Ahlemann, Reinhard Schütte, Stefan Stieglitz
Anzahl der Seiten16
ErscheinungsortCham
VerlagSpringer Science and Business Media Deutschland
Erscheinungsdatum2021
Seiten5-20
ISBN (Print)978-3-030-86796-6
ISBN (elektronisch)978-3-030-86797-3
DOIs
PublikationsstatusErschienen - 2021

Bibliographische Notiz

Publisher Copyright:
© 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Links

DOI

Zuletzt angesehen

Publikationen

  1. Gab es wirklich eine Sintflut?
  2. Ankunft einer Katze
  3. Handbuch Integrated Reporting
  4. Where pragmatics and dialectology meet
  5. Plasma arcing during contact separation of HVDC relays
  6. Application of Adaptive Element-Free Galerkin Method to Simulate Friction Stir Welding of Aluminum
  7. An overview of European programs to support energy projects in Africa and strategies to involve the private sector
  8. Donor Upgrading Strategies
  9. Lekcja 21-22
  10. Fertilized graminoids intensify negative drought effects on grassland productivity
  11. Exploring the influence of testimonial source on attitudes towards e-mental health interventions among university students
  12. Effects of introspective vs. extraspective instruction in scaling of hedonic properties of flavouring ingredients by Chinese and German subjects
  13. Energy transitions in small-scale regions – What we can learn from a regional innovation systems perspective.
  14. Instrumentality
  15. An Introduction to Corporate Environmental Management
  16. Innovation is not enough
  17. Analyzing social interactions
  18. SAP exchange infrastructure for developers
  19. Eine Gesellschaft des Interviews / A Society of the Interview
  20. Liveness Formats
  21. Where is paradise? The EU's navigation system Galileo - Some comments on inherent risks (or paradise lost)
  22. Antibiotics in the Aquatic Environment
  23. Soziokultur
  24. Conceptual frameworks and methods for advancing invasion ecology
  25. Grassroots Innovations for Inclusive Development
  26. A Kinetic Approach to the study of Ideal Multipole Resonance Probe
  27. School Will Never End
  28. Soziale Tatsachen