Discriminative Identification of Duplicates

Activity: Talk or presentation, Conference Presentations, Research

Peter Haider - Speaker

Ulf Brefeld - Speaker

Tobias Scheffer - Speaker

The problem of finding duplicates in data is ubiquitous in
data mining. We cast the problem of finding duplicates in sequential data
into a poly-cut problem on a fully connected graph. The edge weights can
be identified with parameterized pairwise similarities between objects
that are optimized by structural support vector machines on labeled
training sets. Our approach adapts the similarity measure to the data and
is independent of the number of clusters. We present three large-margin
approximations of learning the pairwise similarities: an integrated
QP-formulation, a sequential multi-class approach and a pairwise classifier.
We report on experimental results.
18.09.2006 - 22.09.2006
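
The graph formulation in the abstract can be illustrated with a small sketch. This is not the speakers' method: the feature map `pair_features`, the weight vector `w`, and the greedy merging heuristic are all assumptions chosen for illustration, and the weights are fixed by hand here rather than learned with a structural support vector machine. The sketch only shows how a parameterized pairwise similarity on a fully connected graph can yield a partition without fixing the number of clusters in advance.

```python
from itertools import combinations


def pair_features(a, b):
    """Hypothetical feature map for a pair of token sequences (an assumption)."""
    sa, sb = set(a), set(b)
    union = sa | sb
    jaccard = len(sa & sb) / len(union) if union else 1.0
    longest = max(len(a), len(b))
    len_ratio = min(len(a), len(b)) / longest if longest else 1.0
    # The constant last entry acts as a bias term.
    return [jaccard, len_ratio, 1.0]


def similarity(w, a, b):
    """Edge weight: linear similarity w . phi(a, b); may be negative."""
    return sum(wi * fi for wi, fi in zip(w, pair_features(a, b)))


def greedy_partition(items, w):
    """Greedily merge clusters while the summed cross-cluster similarity is positive.

    The number of clusters is not specified; merging stops on its own
    once no pair of clusters has positive total similarity.
    """
    clusters = [[i] for i in range(len(items))]

    def gain(c1, c2):
        return sum(similarity(w, items[i], items[j]) for i in c1 for j in c2)

    while len(clusters) > 1:
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: gain(clusters[p[0]], clusters[p[1]]),
        )
        if gain(clusters[i], clusters[j]) <= 0:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # safe because i < j
    return clusters


if __name__ == "__main__":
    messages = [
        "buy cheap pills now".split(),
        "buy cheap pills today".split(),
        "meeting at noon tomorrow".split(),
    ]
    hand_set_w = [2.0, 0.0, -1.0]  # illustrative values, not learned
    print(greedy_partition(messages, hand_set_w))
```

With these hand-set weights, the first two messages share enough tokens to receive a positive edge weight and end up in one cluster, while the third stays on its own. In the approach described in the abstract, the weights would instead be optimized on labeled training sets.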

Event

European Conference on Machine Learning

18.09.06 - 22.09.06

Berlin, Berlin, Germany

Event: Conference
