Discriminative Identification of Duplicates

Activity: Talk or presentationConference PresentationsResearch

Peter Haider - Speaker

Ulf Brefeld - Speaker

Tobias Scheffer - Speaker

The problem of finding duplicates in data is ubiquitous in
data mining. We cast the problem of finding duplicates in sequential data
into a poly-cut problem on a fully connected graph. The edge weights can
be identified with parameterized pairwise similarities between objects
that are optimized by structural support vector machines on labeled
training sets. Our approach adapts the similarity measure to the data and
is independent of the number of clusters. We present three large margin
approximations of learning the pairwise similarities: an integrated QP-
formulation, a sequential multi-class approach and a pairwise classifier.
We report on experimental results
18.09.200622.09.2006

Event

European Conference on Machine Learning

18.09.0622.09.06

Berlin, Berlin, Germany

Event: Conference

Recently viewed

Publications

  1. Development and application of a simplified sampling method for volatile polyfluorinated alkyl substances in indoor and environmental air
  2. Earnings Less Risk-Free Interest Charge (ERIC) and Stock Returns—A Value-Based Management Perspective on ERIC’s Relative and Incremental Information Content
  3. Introduction to Automatic Imitation
  4. SoilTemp: A global database of near-surface temperature
  5. Science-Related Outcomes
  6. The complexity of integrated flood management
  7. Navigating (In)Visibility
  8. An empirically grounded ontology for analyzing IT-based interventions in business ecosystems
  9. Influence of Mg content in Al alloys on processing characteristics and dynamically recrystallized microstructure of friction surfacing deposits
  10. The development of an eco-label for software products
  11. Teaching Sustainable Development in a Sensory and Artful Way — Concepts, Methods, and Examples
  12. The Weird and the Eerie
  13. Insights into creep behavior of Mg–14Gd–1Zn–0.4Zr (wt.%) alloy containing β- and γ-type precipitates
  14. On walks in molecular graphs.
  15. Linking concepts of change and ecosystem services research: A systematic review
  16. Concurrently Observed Actions Are Represented Not as Compound Actions but as Independent Actions
  17. Microstructure and mechanical properties of as-cast Mg-Sn-Ca alloys and effect of alloying elements
  18. Utilization of protein-rich residues in biotechnological processes
  19. Consumer Preferences for Local Food: Testing an Extended Norm Taxonomy
  20. EU decision-making in asylum policy
  21. Forms of theorising in entrepreneurship – The case of effectuation as a theory
  22. Hydrological tracers for assessing transport and dissipation processes of pesticides in a model constructed wetland system
  23. Health and the intention to retire: exploring the moderating effects of human resources practices
  24. Integrating a piezoelectric actuator with mechanical and hydraulic devices to control camless engines
  25. Modeling of microstructural pattern formation in crystal plasticity
  26. The means determine the end
  27. Curatorial Practices of the ‘Global’
  28. Leaf Nutritional Content, Tree Richness, and Season Shape the Caterpillar Functional Trait Composition Hosted by Trees
  29. Fragmentierung und Kooptation
  30. Effect of salinity-changing rates on filtration activity of mussels from two sites within the Baltic Mytilus hybrid zone
  31. The Exilic Classroom
  32. Intricate Letters and the Reification of Light