Reporting and Analysing the Environmental Impact of Language Models on the Example of Commonsense Question Answering with External Knowledge

Research output: Contributions to collected editions/works › Article in conference proceedings › Research

Authors

Human-produced emissions are growing at an alarming rate, causing already observable changes in the climate and the environment in general. Each year global carbon dioxide emissions hit a new record, and it is reported that 0.5% of total US greenhouse gas emissions were attributed to data centres as of 2021. The release of ChatGPT in late 2022 sparked public interest in Large Language Models (LLMs), the new generation of language models with a large number of parameters, trained on massive amounts of data. Currently, numerous companies are releasing products featuring various LLMs, with many more models in development and awaiting release. Deep Learning research is a competitive field in which only models that reach top performance attract attention and are used. Hence, achieving better accuracy and results is often the first priority, while the model's efficiency and the environmental impact of the study are neglected. However, LLMs demand substantial computational resources and are very costly to train, both financially and environmentally. It is therefore essential to raise awareness and promote conscious decisions about algorithmic and hardware choices. Reporting training time, approximate carbon dioxide emissions and power consumption would help future studies make the necessary adjustments and determine whether the available computational resources are compatible with model requirements. In this study, we infused the T5 LLM with external knowledge and fine-tuned the model for a question-answering task. Furthermore, we calculated and reported the approximate environmental impact of both steps. The findings demonstrate that smaller models may not always be sustainable options, and that increased training does not always imply better performance. The best outcome is achieved by carefully considering both performance and efficiency factors.
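As a rough illustration of how training time, power consumption and carbon dioxide emissions relate, the following minimal sketch uses a common back-of-the-envelope estimate: energy is average power draw times runtime, scaled by the data centre's power usage effectiveness (PUE), and emissions are energy times the carbon intensity of the local grid. This is not necessarily the method used in the study. The function name, parameter names and all numeric values are hypothetical placeholders, not figures from the paper.

```python
def estimate_training_footprint(avg_power_watts, hours, pue=1.0,
                                grid_kg_co2_per_kwh=0.4):
    """Return (energy_kwh, co2_kg) for one training run.

    All inputs are assumptions supplied by the user:
    - avg_power_watts: average hardware power draw during training
    - hours: wall-clock training time
    - pue: power usage effectiveness of the data centre (>= 1.0)
    - grid_kg_co2_per_kwh: carbon intensity of the electricity used
    """
    # Convert watts to kilowatts, multiply by runtime, add facility overhead.
    energy_kwh = avg_power_watts / 1000.0 * hours * pue
    # Convert energy to approximate emissions via grid carbon intensity.
    co2_kg = energy_kwh * grid_kg_co2_per_kwh
    return energy_kwh, co2_kg


if __name__ == "__main__":
    # Placeholder values chosen only to make the arithmetic easy to follow.
    energy, co2 = estimate_training_footprint(
        avg_power_watts=250, hours=8, pue=1.5, grid_kg_co2_per_kwh=0.5
    )
    print(f"energy: {energy:.2f} kWh, emissions: {co2:.2f} kg CO2")
```

Reporting the inputs alongside the result lets readers recompute the estimate for their own hardware and grid mix.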
Original language: English
Title of host publication: Sustainable AI Conference 2023: Sustainable AI Across Borders : Conference Proceedings
Volume: abs/2408.01453
DOIs
Publication status: In preparation - 2024
Event: 2. Sustainable AI Conference 2023: Sustainable AI Across Borders - University of Bonn, Bonn, Germany
Duration: 30.05.2023 - 01.06.2023
Conference number: 2
https://www.uni-bonn.de/de/veranstaltungen/sustainable-ai-conference-2023-sustainable-ai-across-borders
