How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Extracting value from big data is one of today’s business challenges. In online marketing, for instance, advertisers use high volume clickstream data to increase the efficiency of their campaigns. To prevent collecting, storing, and processing of irrelevant data, it is crucial to determine how much data to analyze to achieve acceptable model performance. We propose a general procedure that employs the learning curve sampling method to determine the optimal sample size with respect to cost/benefit considerations. Applied in two case studies, we model the users' click behavior based on clickstream data and offline channel data. We observe saturation effects of the predictive accuracy when the sample size is increased and, thus, demonstrate that advertisers only have to analyze a very small subset of the full dataset to obtain an acceptable predictive accuracy and to optimize profits from advertising activities. In both case studies we observe that a random intercept logistic model outperforms a non-hierarchical model in terms of predictive accuracy. Given the high infrastructure costs and the users' growing awareness for tracking activities, our results have managerial implications for companies in the online marketing field.
Original languageEnglish
Title of host publicationProceedings of the Twenty-Third European Conference on Information Systems
Number of pages13
PublisherAIS eLibrary
Publication date29.05.2015
ISBN (print)978-3-00-050284-2
DOIs
Publication statusPublished - 29.05.2015
Event23rd European Conference on Information Systems - ECIS 2015 - Münster, Germany
Duration: 26.05.201529.05.2015
Conference number: 23
https://www.ercis.org/
http://www.ecis2015.eu/

Links

DOI

Recently viewed

Publications

  1. Analyzing math teacher students' sensitivity for aspects of the complexity of problem oriented mathematics instruction
  2. A Service-oriented Search framework for full text, geospatial and semantic search
  3. On the Nonlinearity Compensation in Permanent Magnet Machine Using a Controller Based on a Controlled Invariant Subspace
  4. Analysis and Implementation of a Resistance Temperature Estimator Based on Bi-Polynomial Least Squares Method and Discrete Kalman Filter
  5. Derivative approximation using a discrete dynamic system
  6. Emergency detection based on probabilistic modeling in AAL-environments
  7. Modeling Conditional Dependencies in Multiagent Trajectories
  8. Enabling Road Condition Monitoring with an on-board Vehicle Sensor Setup
  9. Fixed-term Contracts and Wages Revisited Using Linked Employer-Employee Data from Germany
  10. Stability analysis of a linear model predictive control and its application in a water recovery process
  11. Supporting the Development and Realization of Data-Driven Business Models with Enterprise Architecture Modeling and Management
  12. Building a process layer for business applications using the blackboard pattern
  13. For a return to the forgotten formula: 'Data 1 + Data 2 > Data 1'
  14. Comparing the Sensitivity of Social Networks, Web Graphs, and Random Graphs with Respect to Vertex Removal
  15. Building Assistance Systems using Distributed Knowledge Representations
  16. A statistical study of the spatial evolution of shock acceleration efficiency for 5 MeV protons and subsequent particle propagation
  17. AGDISTIS - Graph-based disambiguation of named entities using linked data
  18. The Use of Factorization and Multimode Parametric Spectra in Estimating Frequency and Spectral Parameters of Signal
  19. Structure and dynamics laboratory testing of an indirectly controlled full variable valve train for camless engines
  20. Clustering Hydrological Homogeneous Regions and Neural Network Based Index Flood Estimation for Ungauged Catchments
  21. Implementing ERP systems in multinational projects
  22. Linux-based Embedded System for Wavelet Denoising and Monitoring of sEMG Signals using an Axiomatic Seminorm
  23. Multi-Parallel Sending Coils for Movable Receivers in Inductive Charging Systems
  24. 'SPREAD THE APP, NOT THE VIRUS’ – AN EXTENSIVE SEM-APPROACH TO UNDERSTAND PANDEMIC TRACING APP USAGE IN GERMANY
  25. Errors, error taxonomies, error prevention, and error management
  26. Transductive support vector machines for structured variables
  27. Technological System and the Problem of Desymbolization
  28. Mechanistic Realization of the Turtle Shell
  29. Metaheuristics approach for solving personalized crew rostering problem in public bus transit
  30. Evaluating a Bayesian Student Model of Decimal Misconceptions
  31. Loss systems in a random environment: steady state analysis
  32. An empirical comparison of different implicit measures to predict consumer choice