Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Traditional unsupervised topic modeling approaches such as Latent Dirichlet Allocation (LDA) cannot classify documents into a predefined set of topics. Supervised methods, on the other hand, require significant amounts of labeled data to perform well on such tasks. We develop a new unsupervised method based on word embeddings to classify documents into predefined topics. We evaluate the predictive performance of this novel approach and compare it to seeded LDA, using a real-world dataset from online advertising that consists of markedly short documents. Our results indicate that the two methods may complement one another well: ensemble learners trained on their outputs achieve high sensitivity and precision scores.
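The abstract does not spell out how topics and documents are represented, so the following is only a minimal sketch of one common way to classify short texts into predefined topics with word embeddings, not the authors' published method. It assumes each topic is described by a few seed words, that a topic or document vector is the mean of its word vectors, and that the closest topic by cosine similarity wins. The vectors in `TOY_VECTORS`, the seed lists in `TOPIC_SEEDS`, and the helper names are all hypothetical and invented for illustration.

```python
import numpy as np

# Toy 3-dimensional word vectors, invented purely for illustration.
# A real setup would load pretrained word embeddings instead.
TOY_VECTORS = {
    "car":    np.array([0.9, 0.1, 0.0]),
    "engine": np.array([0.8, 0.2, 0.1]),
    "tyre":   np.array([0.7, 0.0, 0.2]),
    "flight": np.array([0.1, 0.9, 0.0]),
    "hotel":  np.array([0.0, 0.8, 0.3]),
    "beach":  np.array([0.1, 0.7, 0.4]),
    "loan":   np.array([0.0, 0.1, 0.9]),
    "bank":   np.array([0.1, 0.0, 0.8]),
    "credit": np.array([0.2, 0.1, 0.9]),
}

# Hypothetical predefined topics, each described by a few seed words.
TOPIC_SEEDS = {
    "automotive": ["car", "engine"],
    "travel": ["flight", "hotel"],
    "finance": ["loan", "bank"],
}


def mean_vector(words, vectors):
    """Average the vectors of all known words; None if no word is known."""
    known = [vectors[w] for w in words if w in vectors]
    if not known:
        return None
    return np.mean(known, axis=0)


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def classify(document, topic_seeds, vectors):
    """Return (best_topic, scores) for a whitespace-tokenized document."""
    doc_vec = mean_vector(document.lower().split(), vectors)
    if doc_vec is None:
        # No word of the document has an embedding, so no decision is made.
        return None, {}
    scores = {
        topic: cosine(doc_vec, mean_vector(seeds, vectors))
        for topic, seeds in topic_seeds.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    for text in ["cheap tyre for your car", "beach hotel deals"]:
        topic, scores = classify(text, TOPIC_SEEDS, TOY_VECTORS)
        print(text, "->", topic)
```

Averaging is a deliberately simple choice for very short documents such as ad texts, where only a handful of words carry signal; the per-topic similarity scores could also serve as input features for an ensemble alongside seeded LDA outputs, which is the kind of combination the abstract alludes to.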
Original language: English
Title of host publication: Human Practice. Digital Ecologies. Our Future: 14. Internationale Tagung Wirtschaftsinformatik (WI 2019), Tagungsband
Editors: Thomas Ludwig, Volkmar Pipek
Number of pages: 15
Place of publication: Siegen
Publisher: Universitätsverlag Siegen
Publication date: 2019
Pages: 453-467
ISBN (electronic): 978-3-96182-063-4
Publication status: Published - 2019
Event: 14. Internationale Tagung Wirtschaftsinformatik - WI 2019: Human Practice. Digital Ecologies. Our Future. - Universität Siegen, Institut für Wirtschaftsinformatik, Siegen, Germany
Duration: 24.02.2019 to 27.02.2019
Conference number: 14
https://wi2019.de/
https://wi2019.de/call-for-papers/
