Scholarly Question Answering Using Large Language Models in the NFDI4DataScience Gateway

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

This paper introduces a scholarly Question Answering (QA) system on top of the NFDI4DataScience Gateway, employing a Retrieval Augmented Generation-based (RAG) approach. The NFDI4DS Gateway, as a foundational framework, offers a unified and intuitive interface for querying various scientific databases using federated search. The RAG-based scholarly QA, powered by a Large Language Model (LLM), facilitates dynamic interaction with search results, enhancing filtering capabilities and fostering a conversational engagement with the Gateway search. The effectiveness of both the Gateway and the scholarly QA system is demonstrated through experimental analysis.

Original languageEnglish
Title of host publicationNatural Scientific Language Processing and Research Knowledge Graphs - 1st International Workshop, NSLP 2024, Proceedings
EditorsGeorg Rehm, Stefan Dietze, Sonja Schimmler, Frank Krüger
Number of pages16
PublisherSpringer Science and Business Media Deutschland
Publication date2024
Pages3-18
ISBN (print)978-3-031-65793-1
ISBN (electronic)978-3-031-65794-8
DOIs
Publication statusPublished - 2024
Event1st International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs - NSLP 2024 - Hersonissos, Hersonissos, Greece
Duration: 27.05.202427.05.2024
Conference number: 1
https://nfdi4ds.github.io/nslp2024/

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

    Research areas

  • Federated Search, Large Language Models, NFDI4DS Gateway, Retrieval Augmented Generation, Scholarly Question Answering
  • Informatics
  • Business informatics

Recently viewed

Publications

  1. Towards a spatial understanding of identity play
  2. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  3. Experimentally established correlation of friction surfacing process temperature and deposit geometry
  4. Interpreting Strings, Weaving Threads
  5. Changes in the Complexity of Limb Movements during the First Year of Life across Different Tasks
  6. Stimulating Computing
  7. Introducing split orders and optimizing operational policies in robotic mobile fulfillment systems
  8. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  9. What can conservation strategies learn from the ecosystem services approach?
  10. Data based analysis of order processing strategies to support the positioning between conflicting economic and logistic objectives
  11. An Orthogonal Wavelet Denoising Algorithm for Surface Images of Atomic Force Microscopy
  12. Machine Learning and Knowledge Discovery in Databases
  13. Competing Vegetation Structure Indices for Estimating Spatial Constrains in Carabid Abundance Patterns in Chinese Grasslands Reveal Complex Scale and Habitat Patterns
  14. Spaces for challenging experiences, indeterminacy, and experimentation
  15. Commitment to grand challenges in fluid forms of organizing
  16. Errors in Training Computer Skills
  17. Guest Editorial Special Issue on Sensors in Machine Vision of Automated Systems
  18. AGDISTIS - Graph-based disambiguation of named entities using linked data
  19. Online-scheduling using past and real-time data
  20. The effects of different on-line adaptive response time limits on speed and amount of learning in computer assisted instruction and intelligent tutoring
  21. Probabilistic approach to modelling of recession curves
  22. Grazing, exploring and networking for sustainability-oriented innovations in learning-action networks
  23. Exploring large vegetation databases to detect temporal trends in species occurrences
  24. Grounding Space
  25. Using data mining techniques to investigate the correlation between surface cracks and flange lengths in deep drawn sheet metals
  26. Soil conditions modify species diversity effects on tree functional trait expression
  27. The impact of linguistic complexity on the solution of mathematical modelling tasks
  28. Analysis of a phase‐field finite element implementation for precipitation
  29. Scaling-based Least Squares Methods with Implemented Kalman filter Approach for Nano-Parameters Identification
  30. Combining linked data and statistical information retrieval