Constrained Independence for Detecting Interesting Patterns

Research output: Contributions to collected editions/works > Article in conference proceedings > Research > peer-review

Authors

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to build a probabilistic model for the frequencies of patterns, based on the frequencies of patterns already mined, and to compare the observed frequencies with those the model predicts. Such probabilistic models include the independence model, partition models, and more complex models that are approached via randomization, for lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization of the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experiments, we show that the models used reveal high rates of redundancy among frequent itemsets, and we extract the most interesting ones.
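To make the general approach concrete, the following is a minimal sketch of the plain independence model only, not of the constrained independence defined in the paper. It compares the observed frequency of an itemset with the frequency expected if its items occurred independently. The function names, the toy transactions, and the tolerance `tol` are all hypothetical and chosen for illustration.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item of `itemset`."""
    itemset = frozenset(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def observed_vs_expected(transactions, itemset):
    """Return (observed, expected) frequency of `itemset`.

    `expected` is the product of the single-item frequencies, i.e. the
    prediction of the plain independence model (an assumption of this
    sketch, not the paper's constrained independence model).
    """
    observed = support(transactions, itemset)
    expected = 1.0
    for item in itemset:
        expected *= support(transactions, [item])
    return observed, expected


def is_explained_by_model(observed, expected, tol=0.05):
    """Hypothetical redundancy test: the itemset is treated as redundant
    when the model already predicts its frequency within `tol`."""
    return abs(observed - expected) <= tol


# Toy data, invented for illustration.
transactions = [
    frozenset(t)
    for t in (
        {"a", "b"}, {"a", "b"}, {"a"}, {"b"},
        {"c"}, {"a", "b", "c"}, {"c"}, {"a", "b"},
    )
]

observed, expected = observed_vs_expected(transactions, {"a", "b"})
redundant = is_explained_by_model(observed, expected)
```

On this toy data the itemset {a, b} is observed more often than independence of a and b would predict, so under the hypothetical tolerance it would not be flagged as redundant.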

Original language: English
Title of host publication: 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
Editors: Gabriella Pasi, James Kwok, Osmar Zaiane, Patrick Gallinari, Eric Gaussier, Longbing Cao
Number of pages: 10
Publisher: IEEE - Institute of Electrical and Electronics Engineers Inc.
Publication date: 02.12.2015
Article number: 7344897
ISBN (electronic): 978-1-4673-8272-4
DOIs
Publication status: Published - 02.12.2015
Event: IEEE International Conference on Data Science and Advanced Analytics - DSAA 2015 - Paris, France
Duration: 19.10.2015 - 21.10.2015
http://dsaa2015.lip6.fr/
