Active and semi-supervised data domain description

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Data domain description techniques aim at deriving concise descriptions of objects belonging to a category of interest. For instance, the support vector domain description (SVDD) learns a hypersphere enclosing the bulk of provided unlabeled data such that points lying outside of the ball are considered anomalous. However, relevant information such as expert and background knowledge remain unused in the unsupervised setting. In this paper, we rephrase data domain description as a semi-supervised learning task, that is, we propose a semi-supervised generalization of data domain description (SSSVDD) to process unlabeled and labeled examples. The corresponding optimization problem is non-convex. We translate it into an unconstraint, continuous problem that can be optimized accurately by gradient-based techniques. Furthermore, we devise an effective active learning strategy to query low-confidence observations. Our empirical evaluation on network intrusion detection and object recognition tasks shows that our SSSVDDs consistently outperform baseline methods in relevant learning settings.
OriginalspracheEnglisch
TitelMachine Learning and Knowledge Discovery in Databases : European Conference, ECML PKDD 2009, Bled, Slovenia, September 7-11, 2009, Proceedings, Part I
HerausgeberWray Buntine, Marko Grobelnik, Dunja Mladenic, John Shawe-Taylor
Anzahl der Seiten16
ErscheinungsortBerlin, Heidelberg
VerlagSpringer Verlag
Erscheinungsdatum01.07.2009
Seiten407-422
ISBN (Print)978-3-642-04179-2
ISBN (elektronisch)978-3-642-04180-8
DOIs
PublikationsstatusErschienen - 01.07.2009
Extern publiziertJa
VeranstaltungEuropean Conference on Machine Learning and Knowledge Discovery in Databases - 2009 - Bled, Slowenien
Dauer: 07.09.200911.09.2009
https://www.k4all.org/event/european-conference-on-machine-learning-and-principles-and-practice-of-knowledge-discovery-in-databases/

DOI

Zuletzt angesehen

Publikationen

  1. The Lifecycle of "Facts'': A Survey of Social Bias in Knowledge Graphs
  2. Graph-based Approaches for Analyzing Team Interaction on the Example of Soccer
  3. Species constancy depends on plot size - A problem for vegetation classification and how it can be solved
  4. Increased auditor independence by external rotation and separating audit and non audit duties?
  5. Creating regional (e-)learning networks
  6. Beyond Path Dependency
  7. Introduction: The representative turn in EU studies
  8. Segment Introduction
  9. Noise level estimation and detection
  10. Designing a Thrifty Approach for SME Business Continuity: Practices for Transparency of the Design Process
  11. How Much Home Office is Ideal? A Multi-Perspective Algorithm
  12. An innovative efficiency of incubator to enhance organization supportive business using machine learning approach
  13. The identification of up-And downstream industries using input-output tables and a firm-level application to minority shareholdings
  14. Earnings Less Risk-Free Interest Charge (ERIC) and Stock Returns—A Value-Based Management Perspective on ERIC’s Relative and Incremental Information Content
  15. Integration durch soziale Kontrolle?
  16. Intraindividual variability in identity centrality
  17. Speed of processing and stimulus complexity in low-frequency and high-frequency channels
  18. Wavelet functions for rejecting spurious values
  19. Using conditional inference trees and random forests to predict the bioaccumulation potential of organic chemicals
  20. Supporting Visual and Verbal Learning Preferences in a Second-Language Multimedia Learning Environment
  21. Quantification of amino acids in fermentation media by isocratic HPLC analysis of their
  22. Combining flatness based feedforward action with a fractional PI regulator to control the intake valve engine
  23. E-privacy concerns
  24. Treating dialogue quality evaluation as an anomaly detection problem
  25. Digging into the roots