Scholarly Question Answering Using Large Language Models in the NFDI4DataScience Gateway

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

This paper introduces a scholarly Question Answering (QA) system on top of the NFDI4DataScience Gateway, employing a Retrieval Augmented Generation-based (RAG) approach. The NFDI4DS Gateway, as a foundational framework, offers a unified and intuitive interface for querying various scientific databases using federated search. The RAG-based scholarly QA, powered by a Large Language Model (LLM), facilitates dynamic interaction with search results, enhancing filtering capabilities and fostering a conversational engagement with the Gateway search. The effectiveness of both the Gateway and the scholarly QA system is demonstrated through experimental analysis.

Original languageEnglish
Title of host publicationNatural Scientific Language Processing and Research Knowledge Graphs - 1st International Workshop, NSLP 2024, Proceedings
EditorsGeorg Rehm, Stefan Dietze, Sonja Schimmler, Frank Krüger
Number of pages16
PublisherSpringer Science and Business Media Deutschland
Publication date2024
Pages3-18
ISBN (print)978-3-031-65793-1
ISBN (electronic)978-3-031-65794-8
DOIs
Publication statusPublished - 2024
Event1st International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs - NSLP 2024 - Hersonissos, Hersonissos, Greece
Duration: 27.05.202427.05.2024
Conference number: 1
https://nfdi4ds.github.io/nslp2024/

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

    Research areas

  • Federated Search, Large Language Models, NFDI4DS Gateway, Retrieval Augmented Generation, Scholarly Question Answering
  • Informatics
  • Business informatics

Recently viewed

Publications

  1. Technological System and the Problem of Desymbolization
  2. The Influence of Note-taking on Mathematical Solution Processes while Working on Reality-Based Tasks
  3. Holistic and scalable ranking of RDF data
  4. Database on Learning for Sustainable Development – analysis of projects
  5. Taking notes as a strategy for solving reality-based tasks in mathematics
  6. Contextual movement models based on normalizing flows
  7. A guided simulated annealing search for solving the pick-up and delivery problem with time windows and capacity constraints
  8. The role of learners’ memory in app-based language instruction: the case of Duolingo.
  9. Creating regional (e-)learning networks
  10. Towards a spatial understanding of identity play
  11. A Lean Convolutional Neural Network for Vehicle Classification
  12. A reference architecture for the integration of EMIS and ERP-Systems
  13. Modeling of lateness distributions depending on the sequencing method with respect to productivity effects
  14. Effectiveness of a guided multicomponent internet and mobile gratitude training program - A pragmatic randomized controlled trial
  15. Multi-view discriminative sequential learning
  16. Sensor Fusion for Power Line Sensitive Monitoring and Load State Estimation
  17. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  18. Web-scale extension of RDF knowledge bases from templated websites
  19. Clause identification using entropy guided transformation learning
  20. Experimentally established correlation of friction surfacing process temperature and deposit geometry
  21. Intellectual property issues in the use and distribution of remote sensing data
  22. Mathematical Modeling for Robot 3D Laser Scanning in Complete Darkness Environments to Advance Pipeline Inspection
  23. Interpreting Strings, Weaving Threads
  24. Constraints are the solution, not the problem
  25. Robust Flatness Based Control of an Electromagnetic Linear Actuator Using Adaptive PID Controller
  26. Segment Introduction
  27. Investigation and modeling of the material behavior due to evolving dislocation microstructures in fcc and bcc metals
  28. Improving short-term academic performance in the flipped classroom using dynamic geometry software
  29. Homogenization methods for multi-phase elastic composites with non-elliptical reinforcements
  30. From "cracking the orthographic code" to "playing with language"
  31. The signal location task as a method quantifying the distribution of attention
  32. Generating Energy Optimal Powertrain Force Trajectories with Dynamic Constraints
  33. Universal Threshold Calculation for Fingerprinting Decoders using Mixture Models
  34. Analyzing math teacher students' sensitivity for aspects of the complexity of problem oriented mathematics instruction
  35. FaST: A linear time stack trace alignment heuristic for crash report deduplication
  36. Towards a Bayesian Student Model for Detecting Decimal Misconceptions