Active and semi-supervised data domain description

Research output: Contributions to collected editions/works, Article in conference proceedings, Research, peer-review

Authors

Data domain description techniques aim at deriving concise descriptions of objects belonging to a category of interest. For instance, the support vector domain description (SVDD) learns a hypersphere enclosing the bulk of the provided unlabeled data, such that points lying outside of the ball are considered anomalous. However, relevant information such as expert and background knowledge remains unused in the unsupervised setting. In this paper, we rephrase data domain description as a semi-supervised learning task; that is, we propose a semi-supervised generalization of data domain description (SSSVDD) to process both unlabeled and labeled examples. The corresponding optimization problem is non-convex. We translate it into an unconstrained, continuous problem that can be optimized accurately by gradient-based techniques. Furthermore, we devise an effective active learning strategy to query low-confidence observations. Our empirical evaluation on network intrusion detection and object recognition tasks shows that our SSSVDDs consistently outperform baseline methods in relevant learning settings.
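To make the hypersphere idea concrete, the following is a minimal sketch of the plain, unsupervised SVDD with a linear kernel, written as an unconstrained problem and solved by subgradient descent. It is not the SSSVDD formulation from the paper: the objective, the function names (`fit_svdd`, `anomaly_score`, `query_index`), the parameter `nu`, and the step size are all assumptions chosen for illustration. The query rule at the end is a simple margin-based stand-in for "low-confidence observations" and may differ from the strategy the paper devises.

```python
import numpy as np


def fit_svdd(X, nu=0.1, lr=0.05, n_iter=500):
    """Fit a linear-kernel SVDD ball by subgradient descent (illustrative only).

    Minimizes  s + C * sum_i max(0, ||x_i - c||^2 - s)  over the center c
    and the squared radius s, with C = 1 / (nu * n). The parameter nu
    (assumed here) roughly controls the fraction of points left outside.
    """
    n = X.shape[0]
    C = 1.0 / (nu * n)
    c = X.mean(axis=0)                               # start at the centroid
    s = np.median(np.sum((X - c) ** 2, axis=1))      # initial squared radius
    for _ in range(n_iter):
        d2 = np.sum((X - c) ** 2, axis=1)            # squared distances to c
        outside = d2 > s                             # points violating the ball
        grad_s = 1.0 - C * np.count_nonzero(outside)
        grad_c = 2.0 * C * np.sum(c - X[outside], axis=0)
        s = max(s - lr * grad_s, 0.0)                # keep s = R^2 non-negative
        c = c - lr * grad_c
    return c, s


def anomaly_score(X, c, s):
    """Positive scores lie outside the ball (anomalous), negative inside."""
    return np.sum((X - c) ** 2, axis=1) - s


def query_index(X_unlabeled, c, s):
    """Hypothetical active-learning rule: pick the point closest to the boundary."""
    return int(np.argmin(np.abs(anomaly_score(X_unlabeled, c, s))))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 2))                    # synthetic "normal" data
    c, s = fit_svdd(X)
    probe = np.array([[10.0, 10.0]])                 # a point far from the data
    print("far point anomalous:", anomaly_score(probe, c, s)[0] > 0)
    print("index to query next:", query_index(X, c, s))
```

In this unsupervised form the objective is convex in the center and squared radius, which is why a plain subgradient method suffices. According to the abstract, adding labeled examples makes the problem non-convex, so the sketch should be read only as background for the baseline being generalized.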

Original language: English
Title of host publication: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2009, Bled, Slovenia, September 7-11, 2009, Proceedings, Part I
Editors: Wray Buntine, Marko Grobelnik, Dunja Mladenic, John Shawe-Taylor
Number of pages: 16
Place of Publication: Berlin, Heidelberg
Publisher: Springer Verlag
Publication date: 01.07.2009
Pages: 407-422
ISBN (print): 978-3-642-04179-2
ISBN (electronic): 978-3-642-04180-8
DOIs
Publication status: Published - 01.07.2009
Externally published: Yes
Event: European Conference on Machine Learning and Knowledge Discovery in Databases - 2009 - Bled, Slovenia
Duration: 07.09.2009 - 11.09.2009
https://www.k4all.org/event/european-conference-on-machine-learning-and-principles-and-practice-of-knowledge-discovery-in-databases/

    Research areas

  • Informatics - Active learning, Background knowledge, Baseline methods, Continuous problems, Data domain description, Empirical evaluations, Gradient based, Learning settings, Network intrusion detection, Optimization problems, Semi-supervised learning, Support vector domain description, Unlabeled data
  • Business informatics
