Extraction of information from invoices - challenges in the extraction pipeline

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Data from invoices are key information for business processes. In order to use the data and create business value, the information must be captured in a digital and structured form. Leveraging digital tools and AI/ML is state-of-The-Art in the extraction of information from invoices. However, the existing approaches are trained on specific languages and layouts, and while focusing on the performance of individual metrics, they neglect the demonstration of the pipeline from raw data to processable information. In this paper, we investigate the types of information on invoices and address the challenges in the extraction pipeline. We contribute by providing a morphological framework for the problematization and design of a pipeline as part of a design science study.

Original languageEnglish
Title of host publicationINFORMATIK 2023 : Designing Futures: Zukünfte gestalten, 26. – 29. September 2023, Berlin
EditorsMaike Klein, Daniel Krupka, Cornelia Winter, Volker Wohlgemuth
Number of pages16
Place of PublicationBonn
PublisherGesellschaft für Informatik e.V.
Publication date2023
Pages1777-1792
ISBN (electronic)978-3-88579-731-9
DOIs
Publication statusPublished - 2023
Event53. Annual Meeting of the German Informatics Society (GI) - INFORMATICS 2023: Designing Futures - Zukünfte Gestalten - Online & HTW Berlin, Berlin, Germany
Duration: 26.09.202329.09.2023
Conference number: 53
https://informatik2023.gi.de/

Bibliographical note

Publisher Copyright:
© 2023 Gesellschaft fur Informatik (GI). All rights reserved.

DOI

Recently viewed

Researchers

  1. Marco Waage

Publications

  1. Magnesium-based metal matrix nanocomposites—processing and properties
  2. I Am Not A Hacker
  3. Pragmatics broadly viewed
  4. Dimensions, dialectic, discourse
  5. Non-acceptances in context
  6. What Role for Public Participation in Implementing the EU Floods Directive? A comparison with the Water Framework Directive, early evidence from Germany, and a research agenda
  7. Contrasting requests in Inner Circle Englishes
  8. Learning Analytics an Hochschulen
  9. From teacher-centered instruction to peer tutoring in the heterogeneous international classroom
  10. Nitrogen uptake by grassland communities
  11. Kommentar zu Ute Tellmann
  12. Operationalizing Network Theory for Ecosystem Service Assessments
  13. Thanking and responding to thanks in American English: Language patterning and contextual appropriateness
  14. Competition in fragmented markets
  15. Mapping the Order of New Migration
  16. Is There a Way Back or Can the Internet Remember its Own History?
  17. Case study: The development of a multi-material heat sink by Additive Manufacturing using Aerosint technology
  18. Synthesis and future research directions linking tree diversity to growth, survival, and damage in a global network of tree diversity experiments
  19. Predicting the future performance of soccer players
  20. Testing for a break in the persistence in yield spreads of EMU government bonds
  21. Deep drawing of high-strength tailored blanks by using tailored tools
  22. Fluorometer controlled apparatus designed for long-duration algal-feeding experiments and environmental effect studies with mussels
  23. An assessment of the published results of animal relocations
  24. Numerical Investigation of the Effect of Rolling on the Localized Stress and Strain Induction for Wire + Arc Additive Manufactured Structures