Harvesting information from captions for weakly supervised semantic segmentation

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Since acquiring pixel-wise annotations for training convolutional neural networks for semantic image segmentation is time-consuming, weakly supervised approaches that only require class tags have been proposed. In this work, we propose another form of supervision, namely image captions as they can be found on the Internet. These captions have two advantages. They do not require additional curation as it is the case for the clean class tags used by current weakly supervised approaches and they provide textual context for the classes present in an image. To leverage such textual context, we deploy a multi-modal network that learns a joint embedding of the visual representation of the image and the textual representation of the caption. The network estimates text activation maps (TAMs) for class names as well as compound concepts, i.e. combinations of nouns and their attributes. The TAMs of compound concepts describing classes of interest substantially improve the quality of the estimated class activation maps which are then used to train a network for semantic segmentation. We evaluate our method on the COCO dataset where it achieves state of the art results for weakly supervised image segmentation.

OriginalspracheEnglisch
Titel2019 International Conference on Computer Vision Workshops : ICCV 2019 : proceedings : 27 October-2 November 2019, Seoul, Korea
Anzahl der Seiten10
ErscheinungsortPiscataway
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum10.2019
Seiten4481-4490
Aufsatznummer9022140
ISBN (Print)978-1-7281-5024-6
ISBN (elektronisch)978-1-7281-5023-9
DOIs
PublikationsstatusErschienen - 10.2019
Extern publiziertJa
Veranstaltung17th IEEE/CVF International Conference on Computer Vision Workshop - ICCVW 2019 - Seoul, Südkorea
Dauer: 27.10.201928.10.2019
Konferenznummer: 17
https://iccv2019.thecvf.com/

Bibliographische Notiz

Publisher Copyright:
© 2019 IEEE.

DOI

Zuletzt angesehen

Publikationen

  1. Comparing two hybrid neural network models to predict real-world bus travel time
  2. Improving Flood Forecasting in a Developing Country
  3. Microstructure-based modeling of residual stresses in WC-12Co-sprayed coatings
  4. Fallstudie
  5. Introduction to Thinking the Problematic
  6. Non-invariance? An Overstated Problem With Misconceived Causes
  7. Operationalizing ecosystem services for the mitigation of soil threats
  8. A framework to enable sustainability-oriented transition activities in HEIs
  9. Personalized Transaction Kernels for Recommendation Using MCTS
  10. Polycrisis patterns
  11. Script and sound
  12. Dynamic capabilities and routinization
  13. Possible underestimations of risks for the environment due to unregulated emissions of biocides from households to wastewater
  14. Explaining Investment Dynamics: Empirical Evidence from German New Ventures
  15. Assessment of occupational exertion and strain in laboratory- and real occupational environments
  16. Extending Enterprise Architectures for Adopting the Internet of Things
  17. Wie lang sollte eine Rollstuhlrampe sein?
  18. Assessing mire-specific biodiversity with an indicator based approach
  19. Vorstellungen über null und Null
  20. Tristan Garcia, Form and Object
  21. Existential insecurity and deference to authority
  22. Systematic learning in water governance: insights from five local adaptive management projects for water quality innovation
  23. Research-Creation
  24. An assessment of the published results of animal relocations
  25. Towards a dimensional approach to common mental disorders in the ICD-11?
  26. Linking trait similarity to interspecific spatial associations in a moist tropical forest
  27. On the Existence of Digital Objects
  28. Case Study
  29. Effects of budget constraints on conservation network design for biodiversity and ecosystem services
  30. Beyond Urban Challenges-Virtual Reality Tools in Participatory Design Processes
  31. Results from the project 'Acceptance of CO2 capture and storage
  32. Conditionality of EU funds: an instrument to enforce EU fundamental values?
  33. Erich und die Übersetzer