How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis. / Stange, Martin; Funk, Burkhardt.
Proceedings of the Twenty-Third European Conference on Information Systems. AIS eLibrary, 2015.

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Stange, M & Funk, B 2015, How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis. in Proceedings of the Twenty-Third European Conference on Information Systems. AIS eLibrary, 23rd European Conference on Information Systems - ECIS 2015, Münster, Germany, 26.05.15. https://doi.org/10.18151/7217484

APA

Stange, M., & Funk, B. (2015). How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis. In Proceedings of the Twenty-Third European Conference on Information Systems AIS eLibrary. https://doi.org/10.18151/7217484

Vancouver

Stange M, Funk B. How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis. In Proceedings of the Twenty-Third European Conference on Information Systems. AIS eLibrary. 2015 doi: 10.18151/7217484

Bibtex

@inbook{02f02f601bbf4d3c855ca7f8227751ad,
title = "How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis",
abstract = "Extracting value from big data is one of today{\textquoteright}s business challenges. In online marketing, for instance, advertisers use high volume clickstream data to increase the efficiency of their campaigns. To prevent collecting, storing, and processing of irrelevant data, it is crucial to determine how much data to analyze to achieve acceptable model performance. We propose a general procedure that employs the learning curve sampling method to determine the optimal sample size with respect to cost/benefit considerations. Applied in two case studies, we model the users' click behavior based on clickstream data and offline channel data. We observe saturation effects of the predictive accuracy when the sample size is increased and, thus, demonstrate that advertisers only have to analyze a very small subset of the full dataset to obtain an acceptable predictive accuracy and to optimize profits from advertising activities. In both case studies we observe that a random intercept logistic model outperforms a non-hierarchical model in terms of predictive accuracy. Given the high infrastructure costs and the users' growing awareness for tracking activities, our results have managerial implications for companies in the online marketing field. ",
keywords = "Business informatics, Big Data, Online Marketing, User Journey Analysis, Learning Curve, Bayesian Models",
author = "Martin Stange and Burkhardt Funk",
year = "2015",
month = may,
day = "29",
doi = "10.18151/7217484",
language = "English",
isbn = "978-3-00-050284-2",
booktitle = "Proceedings of the Twenty-Third European Conference on Information Systems",
publisher = "AIS eLibrary",
address = "United States",
note = "23rd European Conference on Information Systems - ECIS 2015, ECIS conference 2015 ; Conference date: 26-05-2015 Through 29-05-2015",
url = "https://www.ercis.org/, http://www.ecis2015.eu/",

}

RIS

TY - CHAP

T1 - How Much Tracking Is Necessary? - The Learning Curve in Bayesian User Journey Analysis

AU - Stange, Martin

AU - Funk, Burkhardt

N1 - Conference code: 23

PY - 2015/5/29

Y1 - 2015/5/29

N2 - Extracting value from big data is one of today’s business challenges. In online marketing, for instance, advertisers use high volume clickstream data to increase the efficiency of their campaigns. To prevent collecting, storing, and processing of irrelevant data, it is crucial to determine how much data to analyze to achieve acceptable model performance. We propose a general procedure that employs the learning curve sampling method to determine the optimal sample size with respect to cost/benefit considerations. Applied in two case studies, we model the users' click behavior based on clickstream data and offline channel data. We observe saturation effects of the predictive accuracy when the sample size is increased and, thus, demonstrate that advertisers only have to analyze a very small subset of the full dataset to obtain an acceptable predictive accuracy and to optimize profits from advertising activities. In both case studies we observe that a random intercept logistic model outperforms a non-hierarchical model in terms of predictive accuracy. Given the high infrastructure costs and the users' growing awareness for tracking activities, our results have managerial implications for companies in the online marketing field.

AB - Extracting value from big data is one of today’s business challenges. In online marketing, for instance, advertisers use high volume clickstream data to increase the efficiency of their campaigns. To prevent collecting, storing, and processing of irrelevant data, it is crucial to determine how much data to analyze to achieve acceptable model performance. We propose a general procedure that employs the learning curve sampling method to determine the optimal sample size with respect to cost/benefit considerations. Applied in two case studies, we model the users' click behavior based on clickstream data and offline channel data. We observe saturation effects of the predictive accuracy when the sample size is increased and, thus, demonstrate that advertisers only have to analyze a very small subset of the full dataset to obtain an acceptable predictive accuracy and to optimize profits from advertising activities. In both case studies we observe that a random intercept logistic model outperforms a non-hierarchical model in terms of predictive accuracy. Given the high infrastructure costs and the users' growing awareness for tracking activities, our results have managerial implications for companies in the online marketing field.

KW - Business informatics

KW - Big Data

KW - Online Marketing

KW - User Journey Analysis

KW - Learning Curve

KW - Bayesian Models

U2 - 10.18151/7217484

DO - 10.18151/7217484

M3 - Article in conference proceedings

SN - 978-3-00-050284-2

BT - Proceedings of the Twenty-Third European Conference on Information Systems

PB - AIS eLibrary

T2 - 23rd European Conference on Information Systems - ECIS 2015

Y2 - 26 May 2015 through 29 May 2015

ER -

Links

DOI

Recently viewed

Activities

  1. Managing the present generations’ conflicts on the backs of future generations: How current generation’s negotiators create and claim value for themselves and future others
  2. Learning Processes in a Video-based Learning Environment: What do teachers think and feel when they observe their own teaching or that of others?
  3. Thinking of Time - A Resource which Should be Allocated Equally
  4. Emerging Visions of Seamless Travel: (En)Countering Camouflaged Sovereignty at the Frictionless Border
  5. Enacting clan crime through the production of statistical security knowledge
  6. Sustainability Transformation: Building Resilience in Sustainability Reporting for a Net-Zero Future
  7. Do mindsets make a difference? Professionalizing teachers for inclusive language learning environments
  8. Dissertation "Contested Constitutions: Constitutional Design, Conflict and Change in Post-Communist East Central Europe"
  9. (Re)Constructing a Sociology Of the Arts In the 21th Century: Problems and Perspectives - Inserting "Space" In The Sociology Of The Arts
  10. Shifting Regimes of Proof: On the Contested Politics of Identification in Border and Migration Management
  11. Simulation Project 2.0: Electing the U.S. President in a Web-Based EFL Scenario. Task, Processes Outcomes
  12. The 6th International CSR Communication Conference - 2022
  13. Provenance: Can You Bank on It?
  14. Don't ask don't tell - impact after nuclear accidents on provisioning ecosystem services
  15. Basiskolleg Sozialkritik - 2018/19
  16. Educational success despite difficult circumstances. Profiles of resilient students

Publications

  1. Overcoming Multi-legacy Application Challenges through Building Dynamic Capabilities for Low-Code Adoption
  2. Metrics for Experimentation Programs: Categories, Benefits and Challenges
  3. The explanatory power of Carnegie Classification in predicting engagement indicators
  4. The erosion of relational values resulting from landscape simplification
  5. Peter Hay, Advanced Introduction to Private International Law and Procedure
  6. Locus of control
  7. Arc spraying of WCFeCSiMn cored wires.
  8. Process limits of extrusion of multimaterial components
  9. Comparison of different machine control modes during friction extrusion of AA2024
  10. Simulation of stresses during casting of binary magnesium-aluminum alloys
  11. Why Emergency? Reflections on the Practice and Rhetoric of Exceptionalism
  12. Die Schreibwerkstatt Mehrsprachigkeit
  13. Gas-Kampf oder Gas-Krampf
  14. Tschick
  15. Strategy execution in higher education
  16. The theory of human development
  17. The causal effects of exports on firm size and labor productivity
  18. Efficacy of a Web-Based Stress Management Intervention for Beginning Teachers on Reducing Stress and Mechanisms of Change
  19. Between Usability and Trustworthiness-The Potential of Information Transfer Using Digital Information Platforms for Refugees
  20. Learning from Indigenous Populations and Local Communities
  21. Telling your own stories
  22. From farm to factory. Vertical trading and processing structures between industrial and developing countries in the international tobacco-economy
  23. Contextualising urban experimentation
  24. Reintegration strategies in a gender perspective