Constrained Independence for Detecting Interesting Patterns

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.

OriginalspracheEnglisch
Titel2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
HerausgeberGabriella Pasi, James Kwok, Osmar Zaiane, Patrick Gallinari, Eric Gaussier, Longbing Cao
Anzahl der Seiten10
VerlagIEEE - Institute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum02.12.2015
Aufsatznummer7344897
ISBN (elektronisch)978-1-4673-8272-4
DOIs
PublikationsstatusErschienen - 02.12.2015
VeranstaltungIEEE International Conference on Data Science and Advanced Analytics - DSAA 2015 - Paris, Frankreich
Dauer: 19.10.201521.10.2015
http://dsaa2015.lip6.fr/

DOI

Zuletzt angesehen

Publikationen

  1. Towards a Global Script?
  2. FaST: A linear time stack trace alignment heuristic for crash report deduplication
  3. The Influence of Note-taking on Mathematical Solution Processes while Working on Reality-Based Tasks
  4. What does it mean to be sensitive for the complexity of (problem oriented) teaching?
  5. A Quadrant Approach of Camera Calibration Method for Depth Estimation Using a Stereo Vision System
  6. Mapping interest rate projections using neural networks under cointegration
  7. DialogueMaps: Supporting interactive transdisciplinary dialogues with a web-based tool for multi-layer knowledge maps
  8. Comparison of Odor Thresholds obtained by a Three Alternative Choice Procedure and by the Method of Limits
  9. Automatic enumeration of all connected subgraphs.
  10. Cross-document coreference resolution using latent features
  11. Second language learners' performance in mathematics
  12. The signal location task as a method quantifying the distribution of attention
  13. Age effects on controlling tools with sensorimotor transformations
  14. Development and validation of a method for the determination of trace alkylphenols and phthalates in the atmosphere
  15. Return of Fibonacci random walks
  16. A sufficient asymptotic stability condition in generalised model predictive control to avoid input saturation
  17. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  18. Supporting discourse in a synchronous learning environment
  19. Modelling the Complexity of Measurement Estimation Situations - A Theoretical Framework for the Estimation of Lengths
  20. An Improved Approach to the Semi-Process-Oriented Implementation of Standardised ERP-Systems
  21. Distinguishing state variability from trait change in longitudinal data
  22. Optimization Analysis for an Uncovered Wagon Transportation with an Interactive Animated Simulation-Based Platform for Multidisciplinary Learning
  23. Switching from a Managing to a Monitoring Function on the Board
  24. Towards a Dynamic Interpretation of Subjective and Objective Values
  25. Performance analysis for loss systems with many subscribers and concurrent services
  26. On finding nonisomorphic connected subgraphs and distinct molecular substructures.
  27. Gaussian processes for dispatching rule selection in production scheduling
  28. Using mixture distribution models to test the construct validity of the Physical Self-Description Questionnaire
  29. Comments on "Tracking Control of Robotic Manipulators With Uncertain Kinematics and Dynamics"
  30. Analysis of long-term statistical data of cobalt flows in the EU
  31. A discrete approximate solution for the asymptotic tracking problem in affine nonlinear systems
  32. Authenticity and authentication in language learning
  33. Learning Analytics with Matlab Grader in Undergraduate Engineering Courses
  34. Appendix A: Design, implementation, and analysis of the iGOES project
  35. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  36. Improved sensorimotor control is not connected with improved proprioception
  37. A guided simulated annealing search for solving the pick-up and delivery problem with time windows and capacity constraints
  38. How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis