Scholarly Question Answering Using Large Language Models in the NFDI4DataScience Gateway

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

This paper introduces a scholarly Question Answering (QA) system on top of the NFDI4DataScience Gateway, employing a Retrieval Augmented Generation-based (RAG) approach. The NFDI4DS Gateway, as a foundational framework, offers a unified and intuitive interface for querying various scientific databases using federated search. The RAG-based scholarly QA, powered by a Large Language Model (LLM), facilitates dynamic interaction with search results, enhancing filtering capabilities and fostering a conversational engagement with the Gateway search. The effectiveness of both the Gateway and the scholarly QA system is demonstrated through experimental analysis.

Original languageEnglish
Title of host publicationNatural Scientific Language Processing and Research Knowledge Graphs - 1st International Workshop, NSLP 2024, Proceedings
EditorsGeorg Rehm, Stefan Dietze, Sonja Schimmler, Frank Krüger
Number of pages16
PublisherSpringer Science and Business Media Deutschland
Publication date2024
Pages3-18
ISBN (print)978-3-031-65793-1
ISBN (electronic)978-3-031-65794-8
DOIs
Publication statusPublished - 2024
Event1st International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs - NSLP 2024 - Hersonissos, Hersonissos, Greece
Duration: 27.05.202427.05.2024
Conference number: 1
https://nfdi4ds.github.io/nslp2024/

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

    Research areas

  • Federated Search, Large Language Models, NFDI4DS Gateway, Retrieval Augmented Generation, Scholarly Question Answering
  • Informatics
  • Business informatics

Recently viewed

Publications

  1. A Wavelet Packet Algorithm for Online Detection of Pantograph Vibrations
  2. Web-scale extension of RDF knowledge bases from templated websites
  3. Sensitivity to complexity - an important prerequisite of problem solving mathematics teaching
  4. Advantages and disadvantages of different text coding procedures for research and practice in a school context
  5. Interpreting Strings, Weaving Threads
  6. Applications of the Simultaneous Modular Approach in the Field of Material Flow Analysis
  7. Foundations and applications of computer based material flow networks for einvironmental management
  8. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  9. Computer als Medium
  10. Stability analysis of a linear model predictive control and its application in a water recovery process
  11. Using Fuzzy PD Controllers for Soft Motions in a Car-like Robot
  12. Exploring the limits of graph invariant- and spectrum-based discrimination of (sub)structures.
  13. A framework for business model development in technology-driven start-ups
  14. An evaluation of BPR methodologies adopting NIMSAD: A systematic framework for understanding and evaluating methodologies
  15. Teaching methods for modelling problems and students’ task-specific enjoyment, value, interest and self-efficacy expectations
  16. Robust feedback linearization using an adaptive PD regulator for a sensorless control of a throttle valve
  17. Spaces for challenging experiences, indeterminacy, and experimentation
  18. For a return to the forgotten formula: 'Data 1 + Data 2 > Data 1'
  19. Commitment to grand challenges in fluid forms of organizing
  20. Cognitive Predictors of Child Second Language Comprehension and Syntactic Learning
  21. Using augmented video to test in-car user experiences of context analog HUDs
  22. Modeling Conditional Dependencies in Multiagent Trajectories
  23. Second language learners' performance in mathematics
  24. An Interactive Layers Model of Self-Regulated Learning and Cognitive Load
  25. Finding Creativity in Predictability: Seizing Kairos in Chronos Through Temporal Work in Complex Innovation Processes
  26. Dynamic environment modelling and prediction for autonomous systems
  27. Derivative approximation using a discrete dynamic system
  28. Top-down contingent attentional capture during feed-forward visual processing
  29. The effects of different on-line adaptive response time limits on speed and amount of learning in computer assisted instruction and intelligent tutoring
  30. From pre-processing to advanced dynamic modeling of pupil data
  31. A Voxel-based technique to estimate the volume of trees from terrestrial laser scanner data
  32. Effects of diversity versus segregation on automatic approach and avoidance behavior towards own and other ethnic groups
  33. Treating dialogue quality evaluation as an anomaly detection problem
  34. On the added value of considering effects of generic and subject-specific instructional quality on students’ achievements – an exploratory study on the example of implementing formative assessment in mathematics education