Constrained Independence for Detecting Interesting Patterns

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Constrained Independence for Detecting Interesting Patterns. / Delacroix, Thomas; Boubekki, Ahcène; Lenca, Philippe et al.
2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA). ed. / Gabriella Pasi; James Kwok; Osmar Zaiane; Patrick Gallinari; Eric Gaussier; Longbing Cao. IEEE - Institute of Electrical and Electronics Engineers Inc., 2015. 7344897 (Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Delacroix, T, Boubekki, A, Lenca, P & Lallich, S 2015, Constrained Independence for Detecting Interesting Patterns. in G Pasi, J Kwok, O Zaiane, P Gallinari, E Gaussier & L Cao (eds), 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)., 7344897, Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015, IEEE - Institute of Electrical and Electronics Engineers Inc., IEEE International Conference on Data Science and Advanced Analytics - DSAA 2015, Paris, France, 19.10.15. https://doi.org/10.1109/DSAA.2015.7344897

APA

Delacroix, T., Boubekki, A., Lenca, P., & Lallich, S. (2015). Constrained Independence for Detecting Interesting Patterns. In G. Pasi, J. Kwok, O. Zaiane, P. Gallinari, E. Gaussier, & L. Cao (Eds.), 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA) Article 7344897 (Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015). IEEE - Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/DSAA.2015.7344897

Vancouver

Delacroix T, Boubekki A, Lenca P, Lallich S. Constrained Independence for Detecting Interesting Patterns. In Pasi G, Kwok J, Zaiane O, Gallinari P, Gaussier E, Cao L, editors, 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA). IEEE - Institute of Electrical and Electronics Engineers Inc. 2015. 7344897. (Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015). doi: 10.1109/DSAA.2015.7344897

Bibtex

@inbook{53d0848465fe4b19aaa128d35b27c5e7,
title = "Constrained Independence for Detecting Interesting Patterns",
abstract = "Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.",
keywords = "Informatics, Mathematics, Business informatics",
author = "Thomas Delacroix and Ahc{\`e}ne Boubekki and Philippe Lenca and St{\'e}phane Lallich",
year = "2015",
month = dec,
day = "2",
doi = "10.1109/DSAA.2015.7344897",
language = "English",
series = "Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015",
publisher = "IEEE - Institute of Electrical and Electronics Engineers Inc.",
editor = "Gabriella Pasi and James Kwok and Osmar Zaiane and Patrick Gallinari and Eric Gaussier and Longbing Cao",
booktitle = "2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)",
address = "United States",
note = "IEEE International Conference on Data Science and Advanced Analytics - DSAA 2015, DSAA Conference 2015 ; Conference date: 19-10-2015 Through 21-10-2015",
url = "http://dsaa2015.lip6.fr/",

}

RIS

TY - CHAP

T1 - Constrained Independence for Detecting Interesting Patterns

AU - Delacroix, Thomas

AU - Boubekki, Ahcène

AU - Lenca, Philippe

AU - Lallich, Stéphane

PY - 2015/12/2

Y1 - 2015/12/2

N2 - Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.

AB - Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.

KW - Informatics

KW - Mathematics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84962853098&partnerID=8YFLogxK

U2 - 10.1109/DSAA.2015.7344897

DO - 10.1109/DSAA.2015.7344897

M3 - Article in conference proceedings

T3 - Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015

BT - 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

A2 - Pasi, Gabriella

A2 - Kwok, James

A2 - Zaiane, Osmar

A2 - Gallinari, Patrick

A2 - Gaussier, Eric

A2 - Cao, Longbing

PB - IEEE - Institute of Electrical and Electronics Engineers Inc.

T2 - IEEE International Conference on Data Science and Advanced Analytics - DSAA 2015

Y2 - 19 October 2015 through 21 October 2015

ER -

Recently viewed

Publications

  1. Constructions and Reconstructions. The Architectural Image between Rendering and Photography
  2. A multi input sliding mode control for Peltier Cells using a cold-hot sliding surface
  3. Concept for Process Parameter-Based Inline Quality Control as a Basis for Pairing in a Production Line
  4. A discrete-time fractional order PI controller for a three phase synchronous motor using an optimal loop shaping approach
  5. Dynamic Lot Size Optimization with Reinforcement Learning
  6. Latent structure perceptron with feature induction for unrestricted coreference resolution
  7. Design and Control of an Inductive Power Transmission System with AC-AC Converter for a Constant Output Current
  8. A Control Scheme for PMSMs using Model Predictive Control and a Feedforward Action in the Presence of Saturated Inputs
  9. Constructs for Assessing Integrated Reports-Testing the Predictive Validity of a Taxonomy for Organization Size, Industry, and Performance
  10. GPU-accelerated meshfree computational framework for modeling the friction surfacing process
  11. NH4+ ad-/desorption in sequencing batch reactors
  12. Dispatching rule selection with Gaussian processes
  13. Unidimensional and Multidimensional Methods for Recurrence Quantification Analysis with crqa
  14. Modelling tasks—The relation between linguistic skills, intra-mathematical skills, and context-related prior knowledge
  15. Methodologies for noise and gross error detection using univariate signal-based approaches in industrial applications
  16. Optimizing sampling of flying insects using a modified window trap
  17. A New Framework for Production Planning and Control to Support the Positioning in Fields of Tension Created by Opposing Logistic Objectives
  18. Finding Similar Movements in Positional Data Streams
  19. A change of values is in the air
  20. Exploration strategies, performance, and error consequences when learning a complex computer task
  21. Integrating errors into the training process
  22. Parking space management through deep learning – an approach for automated, low-cost and scalable real-time detection of parking space occupancy
  23. Modified dynamic programming approach for offline segmentation of long hydrometeorological time series
  24. The Use of Genetic Algorithm for PID Controller Auto-Tuning in ARM CORTEX M4 Platform
  25. Framework for the Parallelized Development of Estimation Tasks for Length, Area, Capacity and Volume in Primary School - A Pilot Study
  26. Modeling Effective and Ineffective Knowledge Communication and Learning Discourses in CSCL with Hidden Markov Models
  27. Empowering materials processing and performance from data and AI
  28. Volume of Imbalance Container Prediction using Kalman Filter and Long Short-Term Memory
  29. Changes of Perception
  30. Changing the Administration from within:
  31. Using cross-recurrence quantification analysis to compute similarity measures for time series of unequal length with applications to sleep stage analysis
  32. Contributions of declarative and procedural memory to accuracy and automatization during second language practice
  33. Stepwise-based optimizing approaches for arrangements of loudspeaker in multi-zone sound field reproduction
  34. A fast sequential injection analysis system for the simultaneous determination of ammonia and phosphate