Constrained Independence for Detecting Interesting Patterns

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.

Original languageEnglish
Title of host publication2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
EditorsGabriella Pasi, James Kwok, Osmar Zaiane, Patrick Gallinari, Eric Gaussier, Longbing Cao
Number of pages10
PublisherIEEE - Institute of Electrical and Electronics Engineers Inc.
Publication date02.12.2015
Article number7344897
ISBN (electronic)978-1-4673-8272-4
DOIs
Publication statusPublished - 02.12.2015
EventIEEE International Conference on Data Science and Advanced Analytics - DSAA 2015 - Paris, France
Duration: 19.10.201521.10.2015
http://dsaa2015.lip6.fr/

Recently viewed

Publications

  1. A Multivariate Method for Dynamic System Analysis
  2. Integrating Mobile Devices into AAL-Environments using Knowledge based Assistance Systems
  3. A discrete-time fractional order PI controller for a three phase synchronous motor using an optimal loop shaping approach
  4. How to combine collaboration scripts and heuristic worked examples to foster mathematical argumentation - when working memory matters
  5. Methodologies for Noise and Gross Error Detection using Univariate Signal-Based Approaches in Industrial Application
  6. Analysis and comparison of two finite element algorithms for dislocation density based crystal plasticity
  7. A genetic algorithm for a self-learning parameterization of an aerodynamic part feeding system for high-speed assembly
  8. Binary Random Nets I
  9. Comparing Two Voltage Observers in a Sensorsystem using Repetitive Control
  10. Using Natural Language Processing Techniques to Tackle the Construct Identity Problem in Information Systems Research
  11. Modeling Effective and Ineffective Knowledge Communication and Learning Discourses in CSCL with Hidden Markov Models
  12. Algebraic combinatorics in mathematical chemistry. Methods and algorithms. I. Permutation groups and coherent (cellular) algebras.
  13. Supervised clustering of streaming data for email batch detection
  14. Ant colony optimization algorithm and artificial immune system applied to a robot route
  15. Development of a Didactic Graphical Simulation Interface on MATLAB for Systems Control
  16. Detection and mapping of water pollution variation in the Nile Delta using multivariate clustering and GIS techniques
  17. Multidimensional Cross-Recurrence Quantification Analysis (MdCRQA)–A Method for Quantifying Correlation between Multivariate Time-Series
  18. Data-Generating Mechanisms Versus Constructively Defined Latent Variables in Multitrait–Multimethod Analysis:
  19. Graph Conditional Variational Models: Too Complex for Multiagent Trajectories?
  20. Using learning protocols for knowledge acquisition and problem solving with individual and group incentives
  21. Modeling and simulation of deformation behavior, orientation gradient development and heterogeneous hardening in thin sheets with coarse texture
  22. A geometric algorithm for the output functional controllability in general manipulation systems and mechanisms
  23. Contributions of declarative and procedural memory to accuracy and automatization during second language practice
  24. Towards a Dynamic Interpretation of Subjective and Objective Values
  25. Analysis of priority rule-based scheduling in dual-resource-constrained shop-floor scenarios
  26. Discourse Analyses in Chat-based CSCL with Learning Protocols
  27. Modeling precipitation kinetics for multi-phase and multi-component systems using particle size distributions via a moving grid technique
  28. Using haar wavelets for fault detection in technical processes
  29. A Quadrant Approach of Camera Calibration Method for Depth Estimation Using a Stereo Vision System