Constrained Independence for Detecting Interesting Patterns

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.

Original languageEnglish
Title of host publication2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
EditorsGabriella Pasi, James Kwok, Osmar Zaiane, Patrick Gallinari, Eric Gaussier, Longbing Cao
Number of pages10
PublisherIEEE - Institute of Electrical and Electronics Engineers Inc.
Publication date02.12.2015
Article number7344897
ISBN (electronic)978-1-4673-8272-4
DOIs
Publication statusPublished - 02.12.2015
EventIEEE International Conference on Data Science and Advanced Analytics - DSAA 2015 - Paris, France
Duration: 19.10.201521.10.2015
http://dsaa2015.lip6.fr/

Recently viewed

Publications

  1. Automatic enumeration of all connected subgraphs.
  2. How to combine collaboration scripts and heuristic worked examples to foster mathematical argumentation - when working memory matters
  3. The Use of Genetic Algorithm for PID Controller Auto-Tuning in ARM CORTEX M4 Platform
  4. Analysis and comparison of two finite element algorithms for dislocation density based crystal plasticity
  5. A genetic algorithm for a self-learning parameterization of an aerodynamic part feeding system for high-speed assembly
  6. Binary Random Nets I
  7. Modeling Effective and Ineffective Knowledge Communication and Learning Discourses in CSCL with Hidden Markov Models
  8. Development of a Didactic Graphical Simulation Interface on MATLAB for Systems Control
  9. Modeling and simulation of deformation behavior, orientation gradient development and heterogeneous hardening in thin sheets with coarse texture
  10. Towards a Dynamic Interpretation of Subjective and Objective Values
  11. Proceedings of the SeMantic Answer Type and Relation Prediction Task at ISWC 2021 Semantic Web Challenge (SMART2021)
  12. Analysis of priority rule-based scheduling in dual-resource-constrained shop-floor scenarios
  13. Using haar wavelets for fault detection in technical processes
  14. Adaptive and Dynamic Feedback Loops between Production System and Production Network based on the Asset Administration Shell
  15. A sufficient asymptotic stability condition in generalised model predictive control to avoid input saturation
  16. Evaluation of Time/Phase Parameters in Frequency Measurements for Inertial Navigation Systems
  17. The Scalable Question Answering Over Linked Data (SQA) Challenge 2018
  18. Application of non-convex rate dependent gradient plasticity to the modeling and simulation of inelastic microstructure development and inhomogeneous material behavior
  19. Expertise in research integration and implementation for tackling complex problems
  20. Machine Learning and Knowledge Discovery in Databases
  21. Building a process layer for business applications using the blackboard pattern
  22. Neural network-based adaptive fault-tolerant control for strict-feedback nonlinear systems with input dead zone and saturation
  23. N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format
  24. Comparing the Sensitivity of Social Networks, Web Graphs, and Random Graphs with Respect to Vertex Removal
  25. Optimal trajectory generation using MPC in robotino and its implementation with ROS system
  26. Paraphrasing Method for Controlling a Robotic Arm Using a Large Language Model
  27. Best Practices in AI and Data Science Models Evaluation
  28. Anomaly detection in formed sheet metals using convolutional autoencoders
  29. A Multilevel CFA-MTMM Model for Nested Structurally Different Methods
  30. Anatomy of Haar Wavelet Filter and Its Implementation for Signal Processing
  31. Perfect anti-windup in output tracking scheme with preaction
  32. Semantic Parsing for Knowledge Graph Question Answering with Large Language Models
  33. Reading and Calculating in Word Problem Solving
  34. Selection and Recognition of Statistically Defined Signals in Learning Systems
  35. Linux-based Embedded System for Wavelet Denoising and Monitoring of sEMG Signals using an Axiomatic Seminorm
  36. 'SPREAD THE APP, NOT THE VIRUS’ – AN EXTENSIVE SEM-APPROACH TO UNDERSTAND PANDEMIC TRACING APP USAGE IN GERMANY