Scholarly Question Answering Using Large Language Models in the NFDI4DataScience Gateway

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

This paper introduces a scholarly Question Answering (QA) system on top of the NFDI4DataScience Gateway, employing a Retrieval Augmented Generation-based (RAG) approach. The NFDI4DS Gateway, as a foundational framework, offers a unified and intuitive interface for querying various scientific databases using federated search. The RAG-based scholarly QA, powered by a Large Language Model (LLM), facilitates dynamic interaction with search results, enhancing filtering capabilities and fostering a conversational engagement with the Gateway search. The effectiveness of both the Gateway and the scholarly QA system is demonstrated through experimental analysis.

Original languageEnglish
Title of host publicationNatural Scientific Language Processing and Research Knowledge Graphs - 1st International Workshop, NSLP 2024, Proceedings
EditorsGeorg Rehm, Stefan Dietze, Sonja Schimmler, Frank Krüger
Number of pages16
PublisherSpringer Science and Business Media Deutschland
Publication date2024
Pages3-18
ISBN (print)978-3-031-65793-1
ISBN (electronic)978-3-031-65794-8
DOIs
Publication statusPublished - 2024
Event1st International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs - NSLP 2024 - Hersonissos, Hersonissos, Greece
Duration: 27.05.202427.05.2024
Conference number: 1
https://nfdi4ds.github.io/nslp2024/

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

    Research areas

  • Federated Search, Large Language Models, NFDI4DS Gateway, Retrieval Augmented Generation, Scholarly Question Answering
  • Informatics
  • Business informatics

Recently viewed

Publications

  1. The Influence of Note-taking on Mathematical Solution Processes while Working on Reality-Based Tasks
  2. Holistic and scalable ranking of RDF data
  3. Database on Learning for Sustainable Development – analysis of projects
  4. Taking notes as a strategy for solving reality-based tasks in mathematics
  5. The role of learners’ memory in app-based language instruction: the case of Duolingo.
  6. Creating regional (e-)learning networks
  7. Towards a spatial understanding of identity play
  8. A reference architecture for the integration of EMIS and ERP-Systems
  9. Effectiveness of a guided multicomponent internet and mobile gratitude training program - A pragmatic randomized controlled trial
  10. Multi-view discriminative sequential learning
  11. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  12. Mathematical Modeling for Robot 3D Laser Scanning in Complete Darkness Environments to Advance Pipeline Inspection
  13. Constraints are the solution, not the problem
  14. Robust Flatness Based Control of an Electromagnetic Linear Actuator Using Adaptive PID Controller
  15. Investigation and modeling of the material behavior due to evolving dislocation microstructures in fcc and bcc metals
  16. Understanding storytelling in the context of information systems
  17. The signal location task as a method quantifying the distribution of attention
  18. Analyzing math teacher students' sensitivity for aspects of the complexity of problem oriented mathematics instruction
  19. FaST: A linear time stack trace alignment heuristic for crash report deduplication
  20. What does it mean to be sensitive for the complexity of (problem oriented) teaching?
  21. Improving students’ science text comprehension through metacognitive self-regulation when applying learning strategies
  22. “Ideation is Fine, but Execution is Key”
  23. Age effects on controlling tools with sensorimotor transformations
  24. Assessing Effects Through Semi-Field and Field Toxicity Testing
  25. A new way of assessing the interaction of a metallic phase precursor with a modified oxide support substrate as a source of information for predicting metal dispersion
  26. Computing regression statistics from grouped data
  27. An analytical approach to evaluating bivariate functions of fuzzy numbers with one local extremum
  28. Performance analysis for loss systems with many subscribers and concurrent services
  29. Explaining and controlling for the psychometric properties of computer-generated figural matrix items
  30. Scaffolding argumentation in mathematics with CSCL scripts
  31. A localized boundary element method for the floating body problem
  32. Robust feedback linearization control of a throttle plate by using an approximated pd regulator
  33. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  34. TARGET SETTING FOR OPERATIONAL PERFORMANCE IMPROVEMENTS - STUDY CASE -
  35. Integration of laser scanning and projection speckle pattern for advanced pipeline monitoring
  36. Partitioned beta diversity patterns of plants across sharp and distinct boundaries of quartz habitat islands