Active and semi-supervised data domain description

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Data domain description techniques aim at deriving concise descriptions of objects belonging to a category of interest. For instance, the support vector domain description (SVDD) learns a hypersphere enclosing the bulk of provided unlabeled data such that points lying outside of the ball are considered anomalous. However, relevant information such as expert and background knowledge remain unused in the unsupervised setting. In this paper, we rephrase data domain description as a semi-supervised learning task, that is, we propose a semi-supervised generalization of data domain description (SSSVDD) to process unlabeled and labeled examples. The corresponding optimization problem is non-convex. We translate it into an unconstraint, continuous problem that can be optimized accurately by gradient-based techniques. Furthermore, we devise an effective active learning strategy to query low-confidence observations. Our empirical evaluation on network intrusion detection and object recognition tasks shows that our SSSVDDs consistently outperform baseline methods in relevant learning settings.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases : European Conference, ECML PKDD 2009, Bled, Slovenia, September 7-11, 2009, Proceedings, Part I
EditorsWray Buntine, Marko Grobelnik, Dunja Mladenic, John Shawe-Taylor
Number of pages16
Place of PublicationBerlin, Heidelberg
PublisherSpringer Verlag
Publication date01.07.2009
Pages407-422
ISBN (print)978-3-642-04179-2
ISBN (electronic)978-3-642-04180-8
DOIs
Publication statusPublished - 01.07.2009
Externally publishedYes
EventEuropean Conference on Machine Learning and Knowledge Discovery in Databases - 2009 - Bled, Slovenia
Duration: 07.09.200911.09.2009
https://www.k4all.org/event/european-conference-on-machine-learning-and-principles-and-practice-of-knowledge-discovery-in-databases/

    Research areas

  • Informatics - Active Learning, Background knowledge, Baseline methods, Continuous problems, Data domain description, Empirical evaluations, Gradient based, Learning settings, Network intrusion detection, Optimization problems, Semi-supervised learning, upport vector domain description, Unlabeled data
  • Business informatics

Recently viewed

Publications

  1. Development of a scoring parameter to characterize data quality of centroids in high-resolution mass spectra
  2. The impact of explicit references in computer supported collaborative learning: Evidence from eye movement analyses
  3. Earnings Less Risk-Free Interest Charge (ERIC) and Stock Returns—A Value-Based Management Perspective on ERIC’s Relative and Incremental Information Content
  4. Explaining the (Non-) Adoption of Advanced Data Analytics in Auditing
  5. Variational Pragmatics
  6. An Experimental Approach to the Optimization of Customer Information at the Point of Sale
  7. Development and characterisation of a new interface for coupling capillary LC with collision-cell ICPMS and its application for phosphorylation profiling of tryptic protein digests
  8. Mythos
  9. Importance of timing
  10. Time Use and Time Budgets
  11. CASE via MS
  12. Terminologien/Semantik
  13. Development of an Interdisciplinary, Intercultural Master’s Program on Sustainability
  14. Machine learning for optimization of energy and plastic consumption in the production of thermoplastic parts in SME
  15. Personalbeschaffung
  16. Measuring Variation in Gaze Following Across Communities, Ages, and Individuals
  17. Binnendifferenzierung in der Schulpraxis
  18. Forest history from a single tree species perspective
  19. Anders als die anderen?
  20. Games
  21. Digital health literacy and information-seeking on the internet in relation to COVID-19 among university students in Greece
  22. Farewell to the party model?
  23. Die Bedeutung der Zeit
  24. The declarative value of paraphs and the scope of military opposition. Annotations to Johannes Hurter: On the way to military opposition.
  25. Theodor Fontane, das Fremde und die Juden
  26. A Note on Risk Aversion and Labour Market Outcomes
  27. No matter what the name, we’re all the same? Examining ethnic online discrimination in ridesharing marketplaces
  28. Future Making
  29. Edge Effects