p-norm multiple kernel learning

Research output: Journal contributionsJournal articlesResearchpeer-review

Authors

Learning linear combinations of multiple kernels is an appealing strategy when the right choice of features is unknown. Previous approaches to multiple kernel learning (MKL) promote sparse kernel combinations to support interpretability and scalability. Unfortunately, this ℓ1norm MKL is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we extend MKL to arbitrary norms. We devise new insights on the connection between several existing MKL formulations and develop two efficient interleaved optimization strategies for arbitrary norms, that is ℓp -norms with p ≥ 1. This interleaved optimization is much faster than the commonly used wrapper approaches, as demonstrated on several data sets. A theoretical analysis and an experiment on controlled artificial data shed light on the appropriateness of sparse, non-sparse and ℓ-norm MKL in various scenarios. Importantly, empirical applications of ℓp-norm MKL to three real-world problems from computational biology show that non-sparse MKL achieves accuracies that surpass the state-of-the-art. Data sets, source code to reproduce the experiments, implementations of the algorithms, and further information are available at http://doc.ml.tu-berlin.de/nonsparse-mkl/.

Original languageEnglish
JournalJournal of Machine Learning Research
Volume12
Pages (from-to)953-997
Number of pages45
ISSN1532-4435
Publication statusPublished - 03.2011
Externally publishedYes

    Research areas

  • Bioinformatics, Block coordinate descent, Convex conjugate, Generalization bounds, Large scale optimization, Learning kernels, Multiple kernel learning, Non-sparse, Rademacher complexity, Support vector machine
  • Informatics

Recently viewed

Publications

  1. Graph-based Approaches for Analyzing Team Interaction on the Example of Soccer
  2. Denoising and harmonic detection using nonorthogonal wavelet packets in industrial applications
  3. A dialectical perspective on innovation: Conflicting demands, multiple pathways, and ambidexterity
  4. Differences of Four Work-Related Behavior and Experience Patterns in Work Ability and Other Work-Related Perceptions in a Finance Company
  5. Analysis of a phase‐field finite element implementation for precipitation
  6. Optimal dynamic scale and structure of a multi-pollution economy
  7. Towards a New Aesthetic
  8. Managing Multiple Logics: The Role of Performance Measurement Systems in Social Enterprises
  9. The Augmented Theorist - Toward Automated Knowledge Extraction from Conceptual Models
  10. Warming-up effects of static stretching on power and strength
  11. Processing of CSR communication: insights from the ELM
  12. Do Linguistic Features Influence Item Difficulty in Physics Assessments?
  13. Rethink Textile Production - Developing sustainable concepts for textile industry using production simulation
  14. Global fern and lycophyte richness explained: How regional and local factors shape plot richness
  15. An Adaptive Resonance Regulator for an Actuator using Periodic Signals in Camless Engine Systems
  16. Exploring the implications of the value concept for performance assessment of sustainable business models
  17. Experience from downscaling IPCC-SRES scenarios to specific national-level focus scenarios for ecosystem service management
  18. Mechanical behavior, microstructural evolution and texture analysis of AA2024-T351 processed by multi-layer friction surfacing with high build rates