How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Extracting value from big data is one of today’s business challenges. In online marketing, for instance, advertisers use high-volume clickstream data to increase the efficiency of their campaigns. To avoid collecting, storing, and processing irrelevant data, it is crucial to determine how much data must be analyzed to achieve acceptable model performance. We propose a general procedure that employs the learning curve sampling method to determine the optimal sample size with respect to cost/benefit considerations. In two case studies, we apply the procedure to model users' click behavior based on clickstream data and offline channel data. We observe saturation effects in predictive accuracy as the sample size increases and thus demonstrate that advertisers only have to analyze a very small subset of the full dataset to obtain acceptable predictive accuracy and to optimize profits from advertising activities. In both case studies, a random intercept logistic model outperforms a non-hierarchical model in terms of predictive accuracy. Given the high infrastructure costs and users' growing awareness of tracking activities, our results have managerial implications for companies in the online marketing field.
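The learning curve sampling idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the data are synthetic, the model is a plain (non-hierarchical) logistic regression fitted by gradient descent rather than the Bayesian random intercept model, and the stopping rule is a simple threshold on the accuracy gain (`min_gain`, a hypothetical parameter) standing in for the paper's cost/benefit criterion.

```python
import numpy as np

def add_intercept(X):
    # Prepend a column of ones so the first weight acts as the intercept.
    return np.hstack([np.ones((X.shape[0], 1)), X])

def fit_logistic(X, y, n_iter=200, lr=0.5):
    # Plain logistic regression fitted by batch gradient descent.
    Xb = add_intercept(X)
    w = np.zeros(Xb.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(Xb @ w)))
        w -= lr * (Xb.T @ (p - y)) / len(y)
    return w

def accuracy(w, X, y):
    # Share of test rows whose predicted class matches the label.
    return float(np.mean((add_intercept(X) @ w > 0) == (y == 1)))

def learning_curve(X_train, y_train, X_test, y_test, sizes, min_gain=0.005):
    # Fit on growing training subsets; stop once the accuracy gain over
    # the previous subset falls below min_gain (a saturation heuristic).
    curve = []
    for n in sizes:
        w = fit_logistic(X_train[:n], y_train[:n])
        acc = accuracy(w, X_test, y_test)
        saturated = bool(curve) and (acc - curve[-1][1] < min_gain)
        curve.append((n, acc))
        if saturated:
            break
    return curve

# Synthetic stand-in for click data: 5 features, binary click outcome.
rng = np.random.default_rng(0)
X = rng.normal(size=(20000, 5))
true_w = np.array([1.5, -1.0, 0.5, 0.0, 0.0])
y = (rng.random(20000) < 1.0 / (1.0 + np.exp(-(X @ true_w)))).astype(float)

X_train, y_train = X[:15000], y[:15000]
X_test, y_test = X[15000:], y[15000:]

sizes = [100, 200, 400, 800, 1600, 3200, 6400, 12800]
curve = learning_curve(X_train, y_train, X_test, y_test, sizes)
for n, acc in curve:
    print(f"n={n:6d}  accuracy={acc:.3f}")
```

The last entry of `curve` gives the sample size at which the sketch stops adding data. A cost-aware variant would compare the value of each accuracy gain against the cost of collecting and processing the additional rows instead of using a fixed threshold.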
Original language: English
Title of host publication: Proceedings of the Twenty-Third European Conference on Information Systems
Number of pages: 13
Publisher: AIS eLibrary
Publication date: 29.05.2015
ISBN (print): 978-3-00-050284-2
Publication status: Published - 29.05.2015
Event: 23rd European Conference on Information Systems - ECIS 2015 - Münster, Germany
Duration: 26.05.2015 - 29.05.2015
Conference number: 23
https://www.ercis.org/
http://www.ecis2015.eu/

