Information Extraction from Invoices: A Graph Neural Network Approach for Datasets with High Layout Variety

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Extracting information from invoices is a highly structured, recurrent task in auditing. Automating this task would yield efficiency improvements, while simultaneously improving audit quality. The challenge for this endeavor is to account for the text layout on invoices and the high variety of layouts across different issuers. Recent research has proposed graphs to structurally represent the layout on invoices and to apply graph convolutional networks to extract the information pieces of interest. However, the effectiveness of graph-based approaches has so far been shown only on datasets with a low variety of invoice layouts. In this paper, we introduce a graph-based approach to information extraction from invoices and apply it to a dataset of invoices from multiple vendors. We show that our proposed model extracts the specified key items from a highly diverse set of invoices with a macro F 1 score of 0.8753.

OriginalspracheEnglisch
TitelInnovation Through Information Systems - Volume II : A Collection of Latest Research on Technology Issues
HerausgeberFrederik Ahlemann, Reinhard Schütte, Stefan Stieglitz
Anzahl der Seiten16
ErscheinungsortCham
VerlagSpringer Science and Business Media Deutschland
Erscheinungsdatum2021
Seiten5-20
ISBN (Print)978-3-030-86796-6
ISBN (elektronisch)978-3-030-86797-3
DOIs
PublikationsstatusErschienen - 2021

Bibliographische Notiz

Publisher Copyright:
© 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Links

DOI

Zuletzt angesehen

Publikationen

  1. RIGID PRICES AS A RESULT FROM PROFIT MAXIMIZATION
  2. Free to blame? Belief in free will is related to victim blaming
  3. Basin efficiency approach and its effect on streamflow quality, Zerafshan River Uzbekistan
  4. Concatenated Commons and Operational Aesthetics
  5. Effect of laser peening process parameters and sequences on residual stress profiles
  6. Keep calm and follow the news
  7. Post-foundationalism and the Possibility of Critique
  8. Investigations on hot tearing of Mg-Al binary alloys by using a new quantitative method
  9. Six Steps towards a Spatial Design for Large-Scale Pollinator Surveillance Monitoring
  10. A Multisite Preregistered Paradigmatic Test of the Ego-Depletion Effect
  11. How do individual farmers’ objectives influence the evaluation of rangeland management strategies under a variable climate?
  12. Timing, fragmentation of work and income inequality
  13. Modeling of Friction-Induced Vibrations during Tightening of Bolted Joints
  14. Planning for Sea Spaces II
  15. Stabilizing the grid with regional virtual power plants
  16. Tree species richness strengthens relationships between ants and the functional composition of spider assemblages in a highly diverse forest
  17. Exploring the Paradigms of Private Law
  18. Jenseits des Kopftuchs
  19. Design of a Master of Science Sustainable Chemistry
  20. Introduction to the symposium on feminist perspectives on human–nature relations
  21. Commentary: Mitroff's Ethical Management
  22. The effect of industrialization and globalization on domestic land-use
  23. Reinforcing Systems of Exclusion
  24. Digital health literacy and information-seeking on the internet in relation to COVID-19 among university students in Greece
  25. Critical evaluation of commonly used methods to determine the concordance between sonography and magnetic resonance imaging: A comparative study
  26. Differences in labor supply to monopsonistic firms and the gender pay gap

Presse / Medien

  1. Die Kunst des Möglichen