Big Data - Characterizing an Emerging Research Field using Topic Models

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review


Big Data is one of the latest emerging topics in the field of business information systems and is marketed as the key to companies' future success. IT companies offer many analytic solutions to help other businesses cope with the flood of data generated within and outside a company. Despite the extensive use of the term Big Data for marketing purposes, there is no common understanding of how to characterize the elements of the Big Data concept. The authors contribute to the clarification of this concept with a methodologically enriched literature review, deriving characteristic dimensions from existing definitions of Big Data. These dimensions are validated and enriched in a two-step approach by applying topic models to 248 publications relevant to Big Data. The authors propose that the concept of Big Data can be described by the dimensions of data, IT infrastructure, applied methods, and an applications perspective. Assigning the results to a generic data analysis process reveals that recent publications focus on data analysis and processing, while less attention is given to the initial data selection or to the visualization and utilization of the analysis results.
Original language: English
Title of host publication: Proceedings - 2014 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2014: Proceedings; 11-14 August 2014, Warsaw, Poland
Editors: Dominik Ślęzak, Hung Son Nguyen, Marek Reformat, Eugene Santos
Number of pages: 9
Publisher: IEEE - Institute of Electrical and Electronics Engineers Inc.
Publication date: 16.10.2014
ISBN (Print): 978-147994143-8
ISBN (Electronic): 9781479941438
Publication status: Published - 16.10.2014
Event: International Joint Conference on Web Intelligence and Intelligent Agent Technology - WI-IAT 2014 - University of Warsaw, Warsaw, Poland
Duration: 11.08.2014 - 14.08.2014

    Research areas

  • Informatics - Commerce; Data handling; Data visualization; Information analysis; Analysis process; Analytic solution; Business information systems; Data characterizing; IT infrastructures; Literature reviews; Research fields; Two-step approach