Avoiding algorithm errors in textual analysis: A guide to selecting software, and a research agenda toward generative artificial intelligence

Research output: Journal contributions › Journal articles › Research › peer-review

Bibtex

@article{95c0aeaeed7b46098319df0ae5ca8545,
title = "Avoiding algorithm errors in textual analysis: A guide to selecting software, and a research agenda toward generative artificial intelligence",
abstract = "The use of textual analysis is expanding in organizational research, yet software packages vary in their compatibility with complex constructs. This study helps researchers select suitable tools by focusing on phrase-based dictionary methods. We empirically evaluate four software packages—LIWC, DICTION, CAT Scanner, and a custom Python tool—using the complex construct of value-based management as a test case. The analysis shows that software from the same methodological family produces highly consistent results, while popular but mismatched tools yield significant errors such as miscounted phrases. Based on this, we develop a structured selection guideline that links construct features with software capabilities. The framework enhances construct validity, supports methodological transparency, and is applicable across disciplines. Finally, we position the approach as a bridge to AI-enabled textual analysis, including prompt-based workflows, reinforcing the continued need for theory-grounded construct design.",
keywords = "Algorithm error, Generative AI, Large language models, Reliability, Software selection, Textual analysis, Validity, Value-based management, Management studies",
author = "Janice Wobst and Rainer Lueg",
note = "Publisher Copyright: {\textcopyright} 2025 The Authors",
year = "2025",
month = oct,
doi = "10.1016/j.jbusres.2025.115571",
language = "English",
volume = "199",
journal = "Journal of Business Research",
issn = "0148-2963",
publisher = "Elsevier Inc.",
}

RIS

TY - JOUR

T1 - Avoiding algorithm errors in textual analysis

T2 - A guide to selecting software, and a research agenda toward generative artificial intelligence

AU - Wobst, Janice

AU - Lueg, Rainer

N1 - Publisher Copyright: © 2025 The Authors

PY - 2025/10

Y1 - 2025/10

N2 - The use of textual analysis is expanding in organizational research, yet software packages vary in their compatibility with complex constructs. This study helps researchers select suitable tools by focusing on phrase-based dictionary methods. We empirically evaluate four software packages—LIWC, DICTION, CAT Scanner, and a custom Python tool—using the complex construct of value-based management as a test case. The analysis shows that software from the same methodological family produces highly consistent results, while popular but mismatched tools yield significant errors such as miscounted phrases. Based on this, we develop a structured selection guideline that links construct features with software capabilities. The framework enhances construct validity, supports methodological transparency, and is applicable across disciplines. Finally, we position the approach as a bridge to AI-enabled textual analysis, including prompt-based workflows, reinforcing the continued need for theory-grounded construct design.

AB - The use of textual analysis is expanding in organizational research, yet software packages vary in their compatibility with complex constructs. This study helps researchers select suitable tools by focusing on phrase-based dictionary methods. We empirically evaluate four software packages—LIWC, DICTION, CAT Scanner, and a custom Python tool—using the complex construct of value-based management as a test case. The analysis shows that software from the same methodological family produces highly consistent results, while popular but mismatched tools yield significant errors such as miscounted phrases. Based on this, we develop a structured selection guideline that links construct features with software capabilities. The framework enhances construct validity, supports methodological transparency, and is applicable across disciplines. Finally, we position the approach as a bridge to AI-enabled textual analysis, including prompt-based workflows, reinforcing the continued need for theory-grounded construct design.

KW - Algorithm error

KW - Generative AI

KW - Large language models

KW - Reliability

KW - Software selection

KW - Textual analysis

KW - Validity

KW - Value-based management

KW - Management studies

UR - http://www.scopus.com/inward/record.url?scp=105009249410&partnerID=8YFLogxK

U2 - 10.1016/j.jbusres.2025.115571

DO - 10.1016/j.jbusres.2025.115571

M3 - Journal articles

AN - SCOPUS:105009249410

VL - 199

JO - Journal of Business Research

JF - Journal of Business Research

SN - 0148-2963

M1 - 115571

ER -
