Extraction of information from invoices - challenges in the extraction pipeline

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Data from invoices are key information for business processes. To use the data and create business value, the information must be captured in a digital, structured form. Leveraging digital tools and AI/ML is state of the art in extracting information from invoices. However, existing approaches are trained on specific languages and layouts, and while they focus on performance on individual metrics, they neglect to demonstrate the pipeline from raw data to processable information. In this paper, we investigate the types of information on invoices and address the challenges in the extraction pipeline. We contribute a morphological framework for the problematization and design of a pipeline as part of a design science study.

Original language: English
Title of host publication: INFORMATIK 2023 : Designing Futures: Zukünfte gestalten, 26. – 29. September 2023, Berlin
Editors: Maike Klein, Daniel Krupka, Cornelia Winter, Volker Wohlgemuth
Number of pages: 16
Place of Publication: Bonn
Publisher: Gesellschaft für Informatik e.V.
Publication date: 2023
Pages: 1777-1792
ISBN (electronic): 978-3-88579-731-9
Publication status: Published - 2023
Event: 53. Annual Meeting of the German Informatics Society (GI) - INFORMATICS 2023: Designing Futures - Zukünfte Gestalten - Online & HTW Berlin, Berlin, Germany
Duration: 26.09.2023 - 29.09.2023
Conference number: 53
https://informatik2023.gi.de/

Bibliographical note

Publisher Copyright:
© 2023 Gesellschaft für Informatik (GI). All rights reserved.
