How Big Does Big Data Need to Be?

Research output: Contributions to collected editions/works › Contributions to collected editions/anthologies › Research › peer-review

Standard

How Big Does Big Data Need to Be? / Stange, Martin; Funk, Burkhardt.
Enterprise Big Data Engineering, Analytics, and Management. ed. / Martin Atzmueller; Samia Oussena; Thomas Roth-Berghofer. Hershey: Business Science Reference, 2016. p. 1-12.

Harvard

Stange, M & Funk, B 2016, How Big Does Big Data Need to Be? in M Atzmueller, S Oussena & T Roth-Berghofer (eds), Enterprise Big Data Engineering, Analytics, and Management. Business Science Reference, Hershey, pp. 1-12. https://doi.org/10.4018/978-1-5225-0293-7.ch001

APA

Stange, M., & Funk, B. (2016). How Big Does Big Data Need to Be? In M. Atzmueller, S. Oussena, & T. Roth-Berghofer (Eds.), Enterprise Big Data Engineering, Analytics, and Management (pp. 1-12). Business Science Reference. https://doi.org/10.4018/978-1-5225-0293-7.ch001

Vancouver

Stange M, Funk B. How Big Does Big Data Need to Be? In Atzmueller M, Oussena S, Roth-Berghofer T, editors, Enterprise Big Data Engineering, Analytics, and Management. Hershey: Business Science Reference. 2016. p. 1-12 doi: 10.4018/978-1-5225-0293-7.ch001

Bibtex

@inbook{f6911d81026546388fc927733a57d709,
title = "How Big Does Big Data Need to Be?",
abstract = "Collecting and storing as much data as possible is common practice in many companies these days. To reduce the costs of collecting and storing data that is not relevant, it is important to define which analytical questions are to be answered and how much data is needed to answer these questions. In this chapter, a process to define an optimal sampling size is proposed. Based on benefit/cost considerations, the authors show how to find the sample size that maximizes the utility of predictive analytics. By applying the proposed process to a case study, it is shown that only a very small fraction of the available data set is needed to make accurate predictions.",
keywords = "Business informatics, Big Data, Predictive Analytics, Learning Curve",
author = "Martin Stange and Burkhardt Funk",
year = "2016",
month = jun,
doi = "10.4018/978-1-5225-0293-7.ch001",
language = "English",
isbn = "9781522502937",
pages = "1--12",
editor = "Martin Atzmueller and Samia Oussena and Thomas Roth-Berghofer",
booktitle = "Enterprise Big Data Engineering, Analytics, and Management",
publisher = "Business Science Reference",
address = "United States",

}

RIS

TY - CHAP

T1 - How Big Does Big Data Need to Be?

AU - Stange, Martin

AU - Funk, Burkhardt

PY - 2016/6

Y1 - 2016/6

N2 - Collecting and storing as much data as possible is common practice in many companies these days. To reduce the costs of collecting and storing data that is not relevant, it is important to define which analytical questions are to be answered and how much data is needed to answer these questions. In this chapter, a process to define an optimal sampling size is proposed. Based on benefit/cost considerations, the authors show how to find the sample size that maximizes the utility of predictive analytics. By applying the proposed process to a case study, it is shown that only a very small fraction of the available data set is needed to make accurate predictions.

AB - Collecting and storing as much data as possible is common practice in many companies these days. To reduce the costs of collecting and storing data that is not relevant, it is important to define which analytical questions are to be answered and how much data is needed to answer these questions. In this chapter, a process to define an optimal sampling size is proposed. Based on benefit/cost considerations, the authors show how to find the sample size that maximizes the utility of predictive analytics. By applying the proposed process to a case study, it is shown that only a very small fraction of the available data set is needed to make accurate predictions.

KW - Business informatics

KW - Big Data

KW - Predictive Analytics

KW - Learning Curve

UR - http://www.igi-global.com/chapter/how-big-does-big-data-need-to-be/154550

U2 - 10.4018/978-1-5225-0293-7.ch001

DO - 10.4018/978-1-5225-0293-7.ch001

M3 - Contributions to collected editions/anthologies

SN - 9781522502937

SP - 1

EP - 12

BT - Enterprise Big Data Engineering, Analytics, and Management

A2 - Atzmueller, Martin

A2 - Oussena, Samia

A2 - Roth-Berghofer, Thomas

PB - Business Science Reference

CY - Hershey

ER -
