Hands in Focus: Sign Language Recognition Via Top-Down Attention

Noha Sarhan; Christian Wilms; Vanessa Closius; Ulf Brefeld; Simone Frintrop

doi:10.1109/icip49359.2023.10222729

Hands in Focus: Sign Language Recognition Via Top-Down Attention

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Authors

Noha Sarhan
Christian Wilms
Vanessa Closius
Ulf Brefeld
Simone Frintrop

Professur für Wirtschaftsinformatik, insbesondere Machine Learning

In this paper, we propose a novel Sign Language Recognition (SLR) model that leverages the task-specific knowledge to incorporate Top-Down (TD) attention to focus the processing of the network on the most relevant parts of the input video sequence. For SLR, this includes information about the hands' shape, orientation and positions, and motion trajectory. Our model consists of three streams that process RGB, optical flow and TD attention data. For the TD attention, we generate pixel-precise attention maps focusing on both hands, thereby retaining valuable hand information, while eliminating distracting background information. Our proposed method outperforms state-of-the-art on a challenging large-scale dataset by over 2%, and achieves strong results with a much simpler architecture compared to other systems on the newly released AUTSL dataset [1].

Originalsprache	Englisch
Titel	2023 IEEE International Conference on Image Processing, ICIP 2023 - Proceedings : Proceedings
Anzahl der Seiten	5
Erscheinungsort	Piscataway
Verlag	IEEE Electromagnetic Compatibility Society
Erscheinungsdatum	08.10.2023
Seiten	2555-2559
ISBN (Print)	978-1-7281-9836-1
ISBN (elektronisch)	978-1-7281-9835-4
DOIs	https://doi.org/10.1109/icip49359.2023.10222729
Publikationsstatus	Erschienen - 08.10.2023
Veranstaltung	2023 IEEE International Conference on Image Processing - Kuala Lumpur Convention Centre, Kuala Lumpur, Malaysia Dauer: 08.10.2023 → 11.10.2023 Konferenznummer: 30 https://2023.ieeeicip.org/

Bibliographische Notiz

Publisher Copyright:
© 2023 IEEE.

Fachgebiete

Informatik

Weitere Publikationen dieser Person(en)

Interactive sequential generative models for team sports

Fassmeyer, D., Cordes, M. & Brefeld, U., 02.2025, in: Machine Learning. 114, 2, 15 S., 38.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Joint Item Response Models for Manual and Automatic Scores on Open-Ended Test Items

Bengs, D., Brefeld, U., Kroehne, U. & Zehner, F., 2025, (Angenommen/Im Druck) in: Psychometrika.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Machine Learning and Data Mining for Sports Analytics: 11th International Workshop, MLSA 2024, Vilnius, Lithuania, September 9, 2024, Revised Selected Papers

Brefeld, U. (Herausgeber*in), Davis, J. (Herausgeber*in), Van Haaren, J. (Herausgeber*in) & Zimmermann, A. (Herausgeber*in), 2025, Cham: Springer Verlag. 119 S. (Communications in Computer and Information Science; Band 2460)

Publikation: Bücher und Anthologien › Konferenzbände und -dokumentationen › Forschung

Masked autoencoder for multiagent trajectories

Rudolph, Y. & Brefeld, U., 02.2025, in: Machine Learning. 114, 2, 18 S., 44.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Self-improvement for Computerized Adaptive Testing

Rudolph, Y., Neubauer, K. & Brefeld, U., 23.09.2025, Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Proceedings. Ribeiro, R. P., Pfahringer, B., Japkowicz, N., Larrañaga, P., Jorge, A. M., Soares, C., Abreu, P. H. & Gama, J. (Hrsg.). Band 2. S. 70-86 17 S. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence); Band 16014).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

DOI

https://doi.org/10.1109/icip49359.2023.10222729
Endgültige, publizierte Fassung

Hands in Focus: Sign Language Recognition Via Top-Down Attention

Authors

Bibliographische Notiz

Fachgebiete

Weitere Publikationen dieser Person(en)

Interactive sequential generative models for team sports

Joint Item Response Models for Manual and Automatic Scores on Open-Ended Test Items

Machine Learning and Data Mining for Sports Analytics: 11th International Workshop, MLSA 2024, Vilnius, Lithuania, September 9, 2024, Revised Selected Papers

Masked autoencoder for multiagent trajectories

Self-improvement for Computerized Adaptive Testing

DOI

Zuletzt angesehen

Forschende

Projekte

Aktivitäten

Publikationen