Harvesting information from captions for weakly supervised semantic segmentation

Publication: Contributions to collected works, Articles in conference proceedings, Research, peer-reviewed

Since acquiring pixel-wise annotations for training convolutional neural networks for semantic image segmentation is time-consuming, weakly supervised approaches that only require class tags have been proposed. In this work, we propose another form of supervision, namely image captions as they can be found on the Internet. These captions have two advantages: unlike the clean class tags used by current weakly supervised approaches, they do not require additional curation, and they provide textual context for the classes present in an image. To leverage this textual context, we deploy a multi-modal network that learns a joint embedding of the visual representation of the image and the textual representation of the caption. The network estimates text activation maps (TAMs) for class names as well as for compound concepts, i.e., combinations of nouns and their attributes. The TAMs of compound concepts describing classes of interest substantially improve the quality of the estimated class activation maps, which are then used to train a network for semantic segmentation. We evaluate our method on the COCO dataset, where it achieves state-of-the-art results for weakly supervised image segmentation.

Original language: English
Title: 2019 International Conference on Computer Vision Workshops : ICCV 2019 : proceedings : 27 October-2 November 2019, Seoul, Korea
Number of pages: 10
Place of publication: Piscataway
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 10.2019
Pages: 4481-4490
Article number: 9022140
ISBN (print): 978-1-7281-5024-6
ISBN (electronic): 978-1-7281-5023-9
Publication status: Published - 10.2019
Externally published: Yes
Event: 17th IEEE/CVF International Conference on Computer Vision Workshop - ICCVW 2019 - Seoul, South Korea
Duration: 27.10.2019 - 28.10.2019
Conference number: 17
https://iccv2019.thecvf.com/

Bibliographical note

Publisher Copyright:
© 2019 IEEE.
