Extraction of information from invoices - challenges in the extraction pipeline

Publication: Contributions to collected editions › Articles in conference proceedings › Research › Peer-reviewed

Authors

Data from invoices are key information for business processes. To use the data and create business value, the information must be captured in a digital and structured form. Leveraging digital tools and AI/ML is state of the art in the extraction of information from invoices. However, existing approaches are trained on specific languages and layouts, and, while focusing on performance on individual metrics, they neglect to demonstrate the pipeline from raw data to processable information. In this paper, we investigate the types of information on invoices and address the challenges in the extraction pipeline. We contribute a morphological framework for the problematization and design of a pipeline as part of a design science study.

Original language: English
Title: INFORMATIK 2023 : Designing Futures: Zukünfte gestalten, 26. – 29. September 2023, Berlin
Editors: Maike Klein, Daniel Krupka, Cornelia Winter, Volker Wohlgemuth
Number of pages: 16
Place of publication: Bonn
Publisher: Gesellschaft für Informatik e.V.
Publication date: 2023
Pages: 1777-1792
ISBN (electronic): 978-3-88579-731-9
DOIs
Publication status: Published - 2023
Event: 53. Jahrestagung der Gesellschaft für Informatik e.V. (GI) - INFORMATIK 2023: Designing Futures - Zukünfte Gestalten - Online & HTW Berlin, Berlin, Germany
Duration: 26.09.2023 to 29.09.2023
Conference number: 53
https://informatik2023.gi.de/

Bibliographical note

Publisher Copyright:
© 2023 Gesellschaft für Informatik (GI). All rights reserved.
