Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Standard

Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics. / Lommel, Lasse; Riebeling, Meike ; Funk, Burkhardt et al.
Human Practice. Digital Ecologies. Our Future: 14. Internationale Tagung Wirtschaftsinformatik (WI 2019), Tagungsband . Hrsg. / Thomas Ludwig; Volkmar Pipek. Siegen: Universitätsverlag Siegen, 2019. S. 453-467.

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Harvard

Lommel, L, Riebeling, M, Funk, B & Junginger, C 2019, Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics. in T Ludwig & V Pipek (Hrsg.), Human Practice. Digital Ecologies. Our Future: 14. Internationale Tagung Wirtschaftsinformatik (WI 2019), Tagungsband . Universitätsverlag Siegen, Siegen, S. 453-467, 14. Internationale Tagung Wirtschaftsinformatik - WI 2019, Siegen, Deutschland, 24.02.19. https://doi.org/10.25819/ubsi/1016

APA

Lommel, L., Riebeling, M., Funk, B., & Junginger, C. (2019). Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics. In T. Ludwig, & V. Pipek (Hrsg.), Human Practice. Digital Ecologies. Our Future: 14. Internationale Tagung Wirtschaftsinformatik (WI 2019), Tagungsband (S. 453-467). Universitätsverlag Siegen. https://doi.org/10.25819/ubsi/1016

Vancouver

Lommel L, Riebeling M, Funk B, Junginger C. Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics. in Ludwig T, Pipek V, Hrsg., Human Practice. Digital Ecologies. Our Future: 14. Internationale Tagung Wirtschaftsinformatik (WI 2019), Tagungsband . Siegen: Universitätsverlag Siegen. 2019. S. 453-467 doi: 10.25819/ubsi/1016

Bibtex

@inbook{5413e328ea194031b724e97ab07c0d4d,
title = "Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics",
abstract = "Traditional unsupervised topic modeling approaches like Latent Dirichlet Allocation (LDA) lack the ability to classify documents into a predefined set of topics. On the other hand, supervised methods require significant amounts of labeled data to perform well on such tasks. We develop a new unsupervised method based on word embeddings to classify documents into predefined topics. We evaluate the predictive performance of this novel approach and compare it to seeded LDA. We use a real-world dataset from online advertising, which is comprised of markedly short documents. Our results indicate the two methods may complement one another well, leading to remarkable sensitivity and precision scores of ensemble learners trained thereupon.",
keywords = "Business informatics, topic modeling, word embeddings, LDA, seeded LDA, topic modeling, word embeddings, LDA, seeded LDA",
author = "Lasse Lommel and Meike Riebeling and Burkhardt Funk and Christian Junginger",
year = "2019",
doi = "10.25819/ubsi/1016",
language = "English",
pages = "453--467",
editor = "Thomas Ludwig and Volkmar Pipek",
booktitle = "Human Practice. Digital Ecologies. Our Future",
publisher = "Universit{\"a}tsverlag Siegen",
address = "Germany",
note = "14. Internationale Tagung Wirtschaftsinformatik - WI 2019 ; Conference date: 24-02-2019 Through 27-02-2019",
url = "https://wi2019.de/, https://wi2019.de/call-for-papers/, https://wi2019.de/",

}

RIS

TY - CHAP

T1 - Topic Embeddings – A New Approach to Classify Very Short Documents Based on Predefined Topics

AU - Lommel, Lasse

AU - Riebeling, Meike

AU - Funk, Burkhardt

AU - Junginger, Christian

N1 - Conference code: 14

PY - 2019

Y1 - 2019

N2 - Traditional unsupervised topic modeling approaches like Latent Dirichlet Allocation (LDA) lack the ability to classify documents into a predefined set of topics. On the other hand, supervised methods require significant amounts of labeled data to perform well on such tasks. We develop a new unsupervised method based on word embeddings to classify documents into predefined topics. We evaluate the predictive performance of this novel approach and compare it to seeded LDA. We use a real-world dataset from online advertising, which is comprised of markedly short documents. Our results indicate the two methods may complement one another well, leading to remarkable sensitivity and precision scores of ensemble learners trained thereupon.

AB - Traditional unsupervised topic modeling approaches like Latent Dirichlet Allocation (LDA) lack the ability to classify documents into a predefined set of topics. On the other hand, supervised methods require significant amounts of labeled data to perform well on such tasks. We develop a new unsupervised method based on word embeddings to classify documents into predefined topics. We evaluate the predictive performance of this novel approach and compare it to seeded LDA. We use a real-world dataset from online advertising, which is comprised of markedly short documents. Our results indicate the two methods may complement one another well, leading to remarkable sensitivity and precision scores of ensemble learners trained thereupon.

KW - Business informatics

KW - topic modeling, word embeddings, LDA, seeded LDA

KW - topic modeling

KW - word embeddings

KW - LDA

KW - seeded LDA

UR - https://wi2019.de/tagungsband/

UR - https://wi2019.de/wp-content/uploads/Tagungsband_WI2019_reduziert.pdf

UR - https://www.universi.uni-siegen.de/katalog/einzelpublikationen/897618.html

U2 - 10.25819/ubsi/1016

DO - 10.25819/ubsi/1016

M3 - Article in conference proceedings

SP - 453

EP - 467

BT - Human Practice. Digital Ecologies. Our Future

A2 - Ludwig, Thomas

A2 - Pipek, Volkmar

PB - Universitätsverlag Siegen

CY - Siegen

T2 - 14. Internationale Tagung Wirtschaftsinformatik - WI 2019

Y2 - 24 February 2019 through 27 February 2019

ER -

Links

DOI

Zuletzt angesehen

Forschende

  1. Neele Puhlmann

Publikationen

  1. Improve a 3D distance measurement accuracy in stereo vision systems using optimization methods’ approach
  2. The Open Anchoring Quest Dataset: Anchored Estimates from 96 Studies on Anchoring Effects
  3. Grazing effects on intraspecific trait variability vary with changing precipitation patterns in Mongolian rangelands
  4. Doing space in face-to-face interaction and on interactive multimodal platforms
  5. Using Conjoint Analysis to Elicit Preferences for Occupational Health Services in Small and Microenterprises
  6. Noise level estimation and detection
  7. Challenges for postdocs in Germany and beyond:
  8. Practice and carryover effects when using small interaction devices
  9. Detection of oscillations with application in the pantograph control
  10. Offline question answering over linked data using limited resources
  11. A Framework for Applying Natural Language Processing in Digital Health Interventions
  12. Leverage points 2019
  13. Statistical methods for the evaluation of hydrological parameters for landuse planning
  14. An approach for dynamic triangulation using servomotors
  15. Hot tearing behaviour of binary Mg-1Al alloy using a contraction force measuring method
  16. Kontext
  17. Separable models for interconnected production-inventory systems
  18. Programmierung einer DELTA-Roboterzelle nach PackML Standard
  19. An experience-based learning framework
  20. Performance incentives in activity-based management
  21. Rethinking the Spatiality of Spatial Planning
  22. Identification of Parameters and States in PMSMs
  23. PID Controller Application in a Gimbal Construction for Camera Stabilization and Tracking
  24. Eulerian and Lagrangian perspectives on turbulent superstructures in Rayleigh-Bénard convection
  25. Do Linguistic Features Influence Item Difficulty in Physics Assessments?
  26. The erosion of relational values resulting from landscape simplification
  27. Resolving conflicts between people and over time in the transformation toward sustainability
  28. Temporal and thermodynamic irreversibility in production theory
  29. Making the most out of timeseries symptom data
  30. Lyapunov stability analysis to set up a saturating PI controller with anti-windup for a mass flow system