Big Data - Characterizing an Emerging Research Field using Topic Models

Research output: Contributions to collected editions/works, Article in conference proceedings, Research, peer-review

Authors

Big Data is one of the latest emerging topics in the field of business information systems, and is marketed as being the key to companies' future success. Many analytic solutions are offered by IT companies to help other businesses with the flood of data that is generated within and outside of a company. Despite the extensive use of the term Big Data for marketing purposes, there is no common understanding of how to characterize the elements of the Big Data concept. The authors contribute to the clarification of this concept with a methodologically enriched literature review by deriving characteristic dimensions from existing definitions of Big Data. These dimensions are validated and enriched with a two-step approach by applying topic models to 248 publications relevant to Big Data. The authors propose that the concept of Big Data can be described by the dimensions of data, IT infrastructure, applied methods, and an applications perspective. The assignment of the results to a generic data analysis process reveals that recent publications focus on data analysis and processing, and less attention is given to the initial data selection or the visualization and utilization of the analysis results.
Original language: English
Title of host publication: Proceedings - 2014 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2014: Proceedings; 11-14 August 2014 Warsaw, Poland
Editors: Dominik Ślęzak, Hung Son Nguyen, Marek Reformat, Eugene Santos
Number of pages: 9
Volume: 1
Publisher: IEEE - Institute of Electrical and Electronics Engineers Inc.
Publication date: 16.10.2014
Pages: 43-51
ISBN (print): 978-147994143-8
ISBN (electronic): 9781479941438
DOIs
Publication status: Published - 16.10.2014
Event: International Joint Conference on Web Intelligence and Intelligent Agent Technology - WI-IAT 2014 - University of Warsaw, Warsaw, Poland
Duration: 11.08.2014 - 14.08.2014
https://ieeexplore.ieee.org/document/6927513

    Research areas

  • Informatics - Commerce; Data handling; Data visualization; Information analysis; Analysis process; Analytic solution; Business information systems; Data characterizing; IT infrastructures; Literature reviews; Research fields; Two-step approach
