How Big Does Big Data Need to Be?

Publication: Contributions to collected editions › Articles in collected editions › Research › peer-reviewed

Authors

Collecting and storing as much data as possible is common practice in many companies these days. To reduce the cost of collecting and storing irrelevant data, it is important to define which analytical questions are to be answered and how much data is needed to answer them. In this chapter, a process for determining an optimal sample size is proposed. Based on benefit/cost considerations, the authors show how to find the sample size that maximizes the utility of predictive analytics. Applying the proposed process to a case study shows that only a very small fraction of the available data set is needed to make accurate predictions.
Original language: English
Title of host publication: Enterprise Big Data Engineering, Analytics, and Management
Editors: Martin Atzmueller, Samia Oussena, Thomas Roth-Berghofer
Number of pages: 12
Place of publication: Hershey
Publisher: Business Science Reference
Publication date: 06.2016
Pages: 1-12
ISBN (print): 9781522502937
ISBN (electronic): 9781522502944
Publication status: Published - 06.2016
