How Big Does Big Data Need to Be?

Publikation: Beiträge in SammelwerkenAufsätze in SammelwerkenForschungbegutachtet

Authors

Collecting and storing of as many data as possible is common practice in many companies these days. To reduce costs of collecting and storing data that is not relevant, it is important to define which analytical questions are to be answered and how much data is needed to answer these questions. In this chapter,
a process to define an optimal sampling size is proposed. Based on benefit/cost considerations, the authors show how to find the sample size that maximizes the utility of predictive analytics. By applying the proposed process to a case study is shown that only a very small fraction of the available data set is needed to make accurate predictions.
OriginalspracheEnglisch
TitelEnterprise Big Data Engineering, Analytics, and Management
HerausgeberMartin Atzmueller, Samia Oussena, Thomas Roth-Berghofer
Anzahl der Seiten12
ErscheinungsortHershey
VerlagBusiness Science Reference
Erscheinungsdatum06.2016
Seiten1-12
ISBN (Print)9781522502937
ISBN (elektronisch)9781522502944
DOIs
PublikationsstatusErschienen - 06.2016

DOI

Zuletzt angesehen

Forschende

  1. Timur Sevincer

Publikationen

  1. Differences in adaptation to light and temperature extremes of Chlorella sorokiniana strains isolated from a wastewater lagoon
  2. Global Integration and Management of Professional Service Firms
  3. The significance of tree-tree interactions for forest ecosystem functioning
  4. Fines for Absuse of Dominance in "High tech" Markets
  5. Quand la mémoire devient image de souvenier
  6. How to Assess Knowledge Cumulation in Environmental Governance Research? Conceptual and Empirical Explorations
  7. Qualitative system analysis as a means for sustainable governance of emerging technologies
  8. New ways in engineering education for a sustainable and smart future
  9. Cognitive load and science text comprehension
  10. Learning Analytics an Hochschulen
  11. The case survey method and applications in political science
  12. Multitrophic diversity in a biodiverse forest is highly nonlinear across spatial scales
  13. Unsettling bodies of knowledge
  14. Mapping Swap Rate Projections on Bond Yields Considering Cointegration
  15. Learning how to request using textbooks
  16. Timing and fragmentation of daily working hours arrangements and income inequality
  17. Temporal and spatial scaling impacts on extreme precipitation
  18. From rebound to reinforcement effects.
  19. Vibration analysis based on the spectrum kurtosis for adjustment and monitoring of ball bearing radial clearance
  20. Lifelong learning in practice at Leuphana University
  21. Cultural influences on social feedback processing of character traits
  22. Proactivity and Adaptability
  23. C 615 Integrierte Berichterstattung
  24. Disrupting Business