Information Extraction from Invoices: A Graph Neural Network Approach for Datasets with High Layout Variety

Publication: Contributions to collected works, Articles in conference proceedings, Research, peer-reviewed

Extracting information from invoices is a highly structured, recurrent task in auditing. Automating this task would yield efficiency gains while also improving audit quality. The challenge is to account for the text layout on invoices and the high variety of layouts across different issuers. Recent research has proposed representing the invoice layout as a graph and applying graph convolutional networks to extract the information pieces of interest. However, the effectiveness of graph-based approaches has so far been shown only on datasets with a low variety of invoice layouts. In this paper, we introduce a graph-based approach to information extraction from invoices and apply it to a dataset of invoices from multiple vendors. We show that our proposed model extracts the specified key items from a highly diverse set of invoices with a macro F1 score of 0.8753.
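For readers unfamiliar with the reported metric: a macro F1 score is the unweighted mean of the per-class F1 scores, so rare key items count as much as frequent ones. The sketch below illustrates the computation on toy data. The label names (`total`, `date`, `other`) and the function `macro_f1` are hypothetical and do not come from the paper; this is not the authors' evaluation code.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (illustrative sketch)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class was wrong
            fn[truth] += 1   # true class was missed
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical token labels, for illustration only
y_true = ["total", "date", "other", "other", "total"]
y_pred = ["total", "other", "other", "other", "total"]
score = macro_f1(y_true, y_pred)
```

In this toy case the missed `date` token pulls the macro average down sharply even though four of five tokens are labeled correctly, which is why the metric suits datasets where the key items are much rarer than background text.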

Original language: English
Title: Innovation Through Information Systems - Volume II: A Collection of Latest Research on Technology Issues
Editors: Frederik Ahlemann, Reinhard Schütte, Stefan Stieglitz
Number of pages: 16
Place of publication: Cham
Publisher: Springer Science and Business Media Deutschland
Publication date: 2021
Pages: 5-20
ISBN (print): 978-3-030-86796-6
ISBN (electronic): 978-3-030-86797-3
Publication status: Published - 2021

Bibliographic note

Publisher Copyright:
© 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.
