Hands in Focus: Sign Language Recognition Via Top-Down Attention

Publication: Contributions to collected works > Articles in conference proceedings > Research > Peer-reviewed

Authors

  • Noha Sarhan
  • Christian Wilms
  • Vanessa Closius
  • Ulf Brefeld
  • Simone Frintrop

In this paper, we propose a novel Sign Language Recognition (SLR) model that leverages task-specific knowledge to incorporate Top-Down (TD) attention, focusing the network's processing on the most relevant parts of the input video sequence. For SLR, this includes information about the hands' shape, orientation, position, and motion trajectory. Our model consists of three streams that process RGB, optical flow, and TD attention data. For the TD attention, we generate pixel-precise attention maps focusing on both hands, thereby retaining valuable hand information while eliminating distracting background information. Our proposed method outperforms the state of the art on a challenging large-scale dataset by over 2%, and achieves strong results with a much simpler architecture compared to other systems on the newly released AUTSL dataset [1].
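The abstract does not say how the attention maps are applied to the video or how the three streams are combined. The following is only a minimal sketch of the general idea, assuming a binary hand mask that is multiplied element-wise with each frame and a simple late fusion that averages per-class scores across the RGB, optical flow, and TD attention streams. The names `apply_attention`, `fuse_scores`, and `predict` are hypothetical and do not come from the paper.

```python
from typing import List, Sequence

# One grayscale frame as rows of pixel intensities (illustrative type alias).
Frame = List[List[float]]


def apply_attention(frame: Frame, hand_mask: Frame) -> Frame:
    """Keep pixels where the mask is 1 (hands) and zero out the background.

    Assumption: the attention map is a binary, pixel-precise mask with the
    same height and width as the frame.
    """
    return [
        [pixel * weight for pixel, weight in zip(frame_row, mask_row)]
        for frame_row, mask_row in zip(frame, hand_mask)
    ]


def fuse_scores(stream_scores: Sequence[Sequence[float]]) -> List[float]:
    """Average per-class scores across streams (assumed late fusion)."""
    n_streams = len(stream_scores)
    return [sum(per_class) / n_streams for per_class in zip(*stream_scores)]


def predict(stream_scores: Sequence[Sequence[float]]) -> int:
    """Return the index of the class with the highest fused score."""
    fused = fuse_scores(stream_scores)
    return max(range(len(fused)), key=fused.__getitem__)
```

For example, `predict([rgb_scores, flow_scores, td_scores])` would return the index of the sign class with the highest averaged score, where each argument is a list of per-class scores produced by one stream. Averaging is chosen here purely for simplicity; the published model may fuse its streams differently.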

Original language: English
Title: 2023 IEEE International Conference on Image Processing, ICIP 2023 - Proceedings : Proceedings
Number of pages: 5
Place of publication: Piscataway
Publisher: IEEE Electromagnetic Compatibility Society
Publication date: 08.10.2023
Pages: 2555-2559
ISBN (print): 978-1-7281-9836-1
ISBN (electronic): 978-1-7281-9835-4
Publication status: Published - 08.10.2023
Event: 2023 IEEE International Conference on Image Processing - Kuala Lumpur Convention Centre, Kuala Lumpur, Malaysia
Duration: 08.10.2023 - 11.10.2023
Conference number: 30
https://2023.ieeeicip.org/

Bibliographical note

Publisher Copyright:
© 2023 IEEE.
