Harvesting information from captions for weakly supervised semantic segmentation

Publication: Contributions to collected works › Articles in conference proceedings › Research › Peer-reviewed

Authors

Since acquiring pixel-wise annotations for training convolutional neural networks for semantic image segmentation is time-consuming, weakly supervised approaches that only require class tags have been proposed. In this work, we propose another form of supervision, namely image captions as they can be found on the Internet. These captions have two advantages: they do not require additional curation, as is the case for the clean class tags used by current weakly supervised approaches, and they provide textual context for the classes present in an image. To leverage such textual context, we deploy a multi-modal network that learns a joint embedding of the visual representation of the image and the textual representation of the caption. The network estimates text activation maps (TAMs) for class names as well as compound concepts, i.e. combinations of nouns and their attributes. The TAMs of compound concepts describing classes of interest substantially improve the quality of the estimated class activation maps, which are then used to train a network for semantic segmentation. We evaluate our method on the COCO dataset, where it achieves state-of-the-art results for weakly supervised image segmentation.
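To make the idea of a text activation map more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that, once image and text share an embedding space, an activation map can be approximated by the cosine similarity between a text embedding and the visual feature vector at each spatial location, with negative values clipped to zero. The function names `cosine` and `text_activation_map`, the toy feature grid, and the embedding values are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def text_activation_map(features, text_embedding):
    """Assumed formulation: per-location similarity to the text embedding.

    features: H x W grid (nested lists) of D-dimensional visual feature vectors.
    text_embedding: D-dimensional vector in the same (joint) embedding space.
    Returns an H x W grid of similarities, with negative values clipped to 0.
    """
    return [
        [max(0.0, cosine(f, text_embedding)) for f in row]
        for row in features
    ]


# Toy 2 x 2 feature grid with 2-dimensional features (made-up numbers).
features = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.0, 1.0], [-1.0, 0.0]],
]
text_embedding = [1.0, 0.0]  # hypothetical embedding of a class name or compound concept

tam = text_activation_map(features, text_embedding)
```

In this toy case, locations whose features point in the same direction as the text embedding receive high activation, and locations pointing away from it receive zero. How the network described in the abstract actually computes its maps is not specified in this record.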

Original language: English
Title: 2019 International Conference on Computer Vision Workshops : ICCV 2019 : proceedings : 27 October-2 November 2019, Seoul, Korea
Number of pages: 10
Place of publication: Piscataway
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 10.2019
Pages: 4481-4490
Article number: 9022140
ISBN (print): 978-1-7281-5024-6
ISBN (electronic): 978-1-7281-5023-9
DOIs
Publication status: Published - 10.2019
Externally published: Yes
Event: 17th IEEE/CVF International Conference on Computer Vision Workshop - ICCVW 2019 - Seoul, South Korea
Duration: 27.10.2019 - 28.10.2019
Conference number: 17
https://iccv2019.thecvf.com/

Bibliographical note

Publisher Copyright:
© 2019 IEEE.
