Constrained Independence for Detecting Interesting Patterns

Research output: Contributions to collected editions/works; Article in conference proceedings; Research; peer-review

Authors

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to build a probabilistic model of pattern frequencies from the frequencies of patterns already mined, and to compare observed frequencies against that model. Such probabilistic models include the independence model, partition models, and more complex models that are approached via randomization, for lack of an adequate tool in probability theory that allows a direct approach. We define constrained independence, a generalization of the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experiments, we show that the models used reveal high rates of redundancy among frequent itemsets, and we extract the most interesting ones.
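To make the "compare observed frequencies to the model" step concrete, here is a minimal Python sketch of the plain independence model only. It is not the constrained independence defined in the paper, and it is not the authors' algorithm. The function names, the toy transactions, and the tolerance `tol` are all assumptions introduced for illustration.

```python
def support(transactions, itemset):
    """Observed frequency: fraction of transactions containing every item."""
    itemset = frozenset(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def independence_expected(transactions, itemset):
    """Frequency predicted if all items occurred independently:
    the product of the single-item frequencies."""
    expected = 1.0
    for item in itemset:
        expected *= support(transactions, [item])
    return expected


def explained_by_independence(transactions, itemset, tol=0.05):
    """Hypothetical redundancy check: the itemset is treated as redundant
    when its observed frequency lies within `tol` of the model's prediction.
    The threshold is an illustrative assumption, not taken from the paper."""
    observed = support(transactions, itemset)
    expected = independence_expected(transactions, itemset)
    return abs(observed - expected) <= tol


# Toy data (assumed): eight transactions over the items a, b, c.
transactions = [frozenset(t) for t in (
    {"a", "b"}, {"a", "b"}, {"a"}, {"b"},
    {"c"}, {"a", "b", "c"}, {"c"}, {"a", "c"},
)]

observed_ab = support(transactions, {"a", "b"})
expected_ab = independence_expected(transactions, {"a", "b"})
redundant_ab = explained_by_independence(transactions, {"a", "b"})
```

In this toy data, the itemset {a, b} is observed more often than the independence model predicts, so under the assumed tolerance it would not be flagged as redundant. A single item always matches its own prediction and is trivially explained.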

Original language: English
Title of host publication: 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
Editors: Gabriella Pasi, James Kwok, Osmar Zaiane, Patrick Gallinari, Eric Gaussier, Longbing Cao
Number of pages: 10
Publisher: IEEE - Institute of Electrical and Electronics Engineers Inc.
Publication date: 02.12.2015
Article number: 7344897
ISBN (electronic): 978-1-4673-8272-4
DOIs
Publication status: Published - 02.12.2015
Event: IEEE International Conference on Data Science and Advanced Analytics - DSAA 2015 - Paris, France
Duration: 19.10.2015 - 21.10.2015
http://dsaa2015.lip6.fr/
