An Off-the-shelf Approach to Authorship Attribution

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Authorship detection is a challenging task because of the many design choices the user has to make. Performance depends heavily on the right set of features, the amount of data, in-sample vs. out-of-sample settings, and profile- vs. instance-based approaches. So far, the variety of possible combinations has made off-the-shelf methods for authorship detection impractical. We propose a novel and generally deployable method that does not share these limitations. We treat authorship attribution as an anomaly detection problem in which author regions are learned in feature space. The right feature space for a given task is identified automatically by representing the optimal solution as a linear mixture of multiple kernel functions (multiple kernel learning, MKL). Our approach allows labelled as well as unlabelled examples to be included, which remedies the in-sample and out-of-sample problems. Empirically, we observe that the proposed technique is either better than or on par with baseline competitors. In addition, our method relieves the user of critical design choices (e.g., the feature set) and can therefore be used as an off-the-shelf method for authorship attribution.
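
The idea of scoring a document against per-author regions under a linear mixture of kernels can be illustrated with a minimal sketch. This is not the paper's method: the mixture weights here are fixed by hand rather than learned, the anomaly score is a plain mean kernel similarity standing in for a learned one-class model, and all function names, the two feature maps, and the toy corpus are hypothetical.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Feature map 1 (assumed for illustration): character n-gram counts."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def word_counts(text):
    """Feature map 2 (assumed for illustration): lower-cased word counts."""
    return Counter(text.lower().split())


def cosine_kernel(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(v * b[k] for k, v in a.items() if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mixed_kernel(x, y, weights):
    """Linear mixture of base kernels; weights are fixed here, not learned."""
    feature_maps = (char_ngrams, word_counts)
    return sum(w * cosine_kernel(f(x), f(y))
               for w, f in zip(weights, feature_maps))


def author_score(doc, author_docs, weights):
    """Mean similarity of doc to an author's documents (higher = less anomalous)."""
    return sum(mixed_kernel(doc, d, weights) for d in author_docs) / len(author_docs)


def attribute(doc, corpus, weights=(0.5, 0.5)):
    """Return the author whose documents the query is least anomalous for."""
    return max(corpus, key=lambda author: author_score(doc, corpus[author], weights))


# Toy corpus, invented purely for illustration.
corpus = {
    "a": ["the cat sat on the mat", "the cat ate the rat"],
    "b": ["quantum flux capacitors resonate", "flux resonance in quantum fields"],
}
predicted = attribute("the cat sat on the rat", corpus)
```

In a learned variant, the weights passed to `mixed_kernel` would be optimised jointly with the one-class model instead of being set by hand, which is what removes the feature-set choice from the user.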

Original language: English
Title of host publication: COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers
Number of pages: 10
Place of Publication: Dublin
Publisher: Association for Computational Linguistics (ACL)
Publication date: 2014
Pages: 895-904
ISBN (print): 978-194164326-6
ISBN (electronic): 9781941643266
Publication status: Published - 2014
Externally published: Yes
Event: 25th International Conference on Computational Linguistics - COLING 2014 - Dublin, Ireland
Duration: 23.08.2014 - 29.08.2014
Conference number: 25
https://aclanthology.info/volumes/proceedings-of-coling-2014-the-25th-international-conference-on-computational-linguistics-technical-papers
