Harvesting information from captions for weakly supervised semantic segmentation

Publication: Contributions to collected editions › Articles in conference proceedings › Research › peer-reviewed

Since acquiring pixel-wise annotations for training convolutional neural networks for semantic image segmentation is time-consuming, weakly supervised approaches that only require class tags have been proposed. In this work, we propose another form of supervision, namely image captions as they can be found on the Internet. These captions have two advantages. They do not require additional curation, as is the case for the clean class tags used by current weakly supervised approaches, and they provide textual context for the classes present in an image. To leverage such textual context, we deploy a multi-modal network that learns a joint embedding of the visual representation of the image and the textual representation of the caption. The network estimates text activation maps (TAMs) for class names as well as compound concepts, i.e., combinations of nouns and their attributes. The TAMs of compound concepts describing classes of interest substantially improve the quality of the estimated class activation maps, which are then used to train a network for semantic segmentation. We evaluate our method on the COCO dataset, where it achieves state-of-the-art results for weakly supervised image segmentation.
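The abstract does not say how a text activation map is computed from the joint embedding. As a purely illustrative sketch, assume the visual branch yields one feature vector per spatial location and the text branch yields one vector per word or compound concept, both in the same space; a map can then be formed by scoring each location against the text vector. The use of cosine similarity, the clipping and scaling steps, and all names and numbers below are assumptions made for illustration, not the paper's method.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

def text_activation_map(feature_grid, text_embedding):
    """Score every spatial location against one text embedding (hypothetical helper).

    feature_grid: H x W nested list of D-dimensional visual features
    text_embedding: D-dimensional embedding of a word or compound concept
    Negative similarities are clipped to zero, then the map is scaled to [0, 1].
    """
    raw = [[max(0.0, cosine(cell, text_embedding)) for cell in row]
           for row in feature_grid]
    peak = max(max(row) for row in raw)
    if peak == 0.0:
        return raw
    return [[value / peak for value in row] for row in raw]

# Toy 2 x 2 grid of 3-dimensional features (made-up numbers).
features = [
    [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    [[1.0, 1.0, 0.0], [-1.0, 0.0, 0.0]],
]
concept = [1.0, 0.0, 0.0]  # hypothetical embedding of a compound concept
tam = text_activation_map(features, concept)
```

In this toy case the location whose feature points in the same direction as the concept embedding receives the highest score, and locations that are orthogonal or opposed to it receive zero.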

Original language: English
Title of host publication: 2019 International Conference on Computer Vision Workshops : ICCV 2019 : proceedings : 27 October-2 November 2019, Seoul, Korea
Number of pages: 10
Place of publication: Piscataway
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 10.2019
Pages: 4481-4490
Article number: 9022140
ISBN (print): 978-1-7281-5024-6
ISBN (electronic): 978-1-7281-5023-9
Publication status: Published - 10.2019
Externally published: Yes
Event: 17th IEEE/CVF International Conference on Computer Vision Workshop - ICCVW 2019 - Seoul, South Korea
Duration: 27.10.2019 to 28.10.2019
Conference number: 17
https://iccv2019.thecvf.com/

Bibliographical note

Publisher Copyright:
© 2019 IEEE.
