How Big Does Big Data Need to Be?

Publication: Contributions to collected editions; Articles in collected editions; Research; peer-reviewed

Authors

Collecting and storing as much data as possible is common practice in many companies these days. To reduce the cost of collecting and storing data that is not relevant, it is important to define which analytical questions are to be answered and how much data is needed to answer them. In this chapter, a process for defining an optimal sample size is proposed. Based on benefit/cost considerations, the authors show how to find the sample size that maximizes the utility of predictive analytics. Applying the proposed process to a case study shows that only a very small fraction of the available data set is needed to make accurate predictions.
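The benefit/cost idea in the abstract can be illustrated with a minimal sketch. It is not the chapter's actual procedure: the power-law learning curve, the linear cost per record, and every name and number below (`accuracy`, `utility`, `value_per_accuracy`, `cost_per_record`) are hypothetical assumptions chosen only to show how a utility-maximizing sample size might be located.

```python
def accuracy(n, a=0.95, b=0.5, c=0.5):
    """Assumed learning curve: accuracy rises with n and saturates at a.

    The power-law form and its parameters are illustrative, not from the chapter.
    """
    return a - b * n ** (-c)


def utility(n, value_per_accuracy=100_000.0, cost_per_record=0.01):
    """Assumed utility: benefit of the achieved accuracy minus a linear
    cost of collecting and storing n records (both rates are hypothetical)."""
    return value_per_accuracy * accuracy(n) - cost_per_record * n


def best_sample_size(candidates):
    """Return the candidate sample size with the highest utility."""
    return max(candidates, key=utility)


# Candidate sample sizes from 10 up to 10,000,000 records.
candidates = [10 ** k for k in range(1, 8)]
n_star = best_sample_size(candidates)
print(n_star, round(utility(n_star), 1))
```

Under these assumed curves the utility peaks well below the largest candidate, because accuracy saturates while cost keeps growing. That is the qualitative effect the abstract describes.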
Original language: English
Title of host publication: Enterprise Big Data Engineering, Analytics, and Management
Editors: Martin Atzmueller, Samia Oussena, Thomas Roth-Berghofer
Number of pages: 12
Place of publication: Hershey
Publisher: Business Science Reference
Publication date: 06.2016
Pages: 1-12
ISBN (print): 9781522502937
ISBN (electronic): 9781522502944
Publication status: Published - 06.2016
