An Off-the-shelf Approach to Authorship Attribution

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

An Off-the-shelf Approach to Authorship Attribution. / Nasir, Jamal Abdul; Görnitz, Nico; Brefeld, Ulf.
COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers. Dublin: Association for Computational Linguistics (ACL), 2014. p. 895-904 (COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Nasir, JA, Görnitz, N & Brefeld, U 2014, An Off-the-shelf Approach to Authorship Attribution. in COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers. COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers, Association for Computational Linguistics (ACL), Dublin, pp. 895-904, 25th International Conference on Computational Linguistics - COLING 2014 , Dublin, Ireland, 23.08.14. <https://www.aclweb.org/anthology/C14-1085>

APA

Nasir, J. A., Görnitz, N., & Brefeld, U. (2014). An Off-the-shelf Approach to Authorship Attribution. In COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers (pp. 895-904). (COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers). Association for Computational Linguistics (ACL). https://www.aclweb.org/anthology/C14-1085

Vancouver

Nasir JA, Görnitz N, Brefeld U. An Off-the-shelf Approach to Authorship Attribution. In COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers. Dublin: Association for Computational Linguistics (ACL). 2014. p. 895-904. (COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers).

Bibtex

@inbook{93d02cb3e083419c9d27891178f9937e,
title = "An Off-the-shelf Approach to Authorship Attribution",
abstract = "Authorship detection is a challenging task due to many design choices the user has to decide on. The performance highly depends on the right set of features, the amount of data, in-sample vs. out-of-sample settings, and profile- vs. instance-based approaches. So far, the variety of combinations renders off-the-shelf methods for authorship detection inappropriate. We propose a novel and generally deployable method that does not share these limitations. We treat authorship attribution as an anomaly detection problem where author regions are learned in feature space. The choice of the right feature space for a given task is identified automatically by representing the optimal solution as a linear mixture of multiple kernel functions (MKL). Our approach allows to include labelled as well as unlabelled examples to remedy the in-sample and out-of-sample problems. Empirically, we observe our proposed novel technique either to be better or on par with baseline competitors. However, our method relieves the user from critical design choices (e.g., feature set) and can therefore be used as an off-the-shelf method for authorship attribution.",
keywords = "Informatics, Business informatics",
author = "Nasir, {Jamal Abdul} and Nico G{\"o}rnitz and Ulf Brefeld",
year = "2014",
language = "English",
isbn = "978-194164326-6",
series = "COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers",
publisher = "Association for Computational Linguistics (ACL)",
pages = "895--904",
booktitle = "COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014",
address = "United States",
note = "25th International Conference on Computational Linguistics - COLING 2014 , COLING 2014 ; Conference date: 23-08-2014 Through 29-08-2014",
url = "https://aclanthology.info/volumes/proceedings-of-coling-2014-the-25th-international-conference-on-computational-linguistics-technical-papers",

}

RIS

TY - CHAP

T1 - An Off-the-shelf Approach to Authorship Attribution

AU - Nasir, Jamal Abdul

AU - Görnitz, Nico

AU - Brefeld, Ulf

N1 - Conference code: 25

PY - 2014

Y1 - 2014

N2 - Authorship detection is a challenging task due to many design choices the user has to decide on. The performance highly depends on the right set of features, the amount of data, in-sample vs. out-of-sample settings, and profile- vs. instance-based approaches. So far, the variety of combinations renders off-the-shelf methods for authorship detection inappropriate. We propose a novel and generally deployable method that does not share these limitations. We treat authorship attribution as an anomaly detection problem where author regions are learned in feature space. The choice of the right feature space for a given task is identified automatically by representing the optimal solution as a linear mixture of multiple kernel functions (MKL). Our approach allows to include labelled as well as unlabelled examples to remedy the in-sample and out-of-sample problems. Empirically, we observe our proposed novel technique either to be better or on par with baseline competitors. However, our method relieves the user from critical design choices (e.g., feature set) and can therefore be used as an off-the-shelf method for authorship attribution.

AB - Authorship detection is a challenging task due to many design choices the user has to decide on. The performance highly depends on the right set of features, the amount of data, in-sample vs. out-of-sample settings, and profile- vs. instance-based approaches. So far, the variety of combinations renders off-the-shelf methods for authorship detection inappropriate. We propose a novel and generally deployable method that does not share these limitations. We treat authorship attribution as an anomaly detection problem where author regions are learned in feature space. The choice of the right feature space for a given task is identified automatically by representing the optimal solution as a linear mixture of multiple kernel functions (MKL). Our approach allows to include labelled as well as unlabelled examples to remedy the in-sample and out-of-sample problems. Empirically, we observe our proposed novel technique either to be better or on par with baseline competitors. However, our method relieves the user from critical design choices (e.g., feature set) and can therefore be used as an off-the-shelf method for authorship attribution.

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84959886512&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/71fe9cf0-6be3-3329-b3d8-e329a11a5d88/

M3 - Article in conference proceedings

SN - 978-194164326-6

T3 - COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers

SP - 895

EP - 904

BT - COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014

PB - Association for Computational Linguistics (ACL)

CY - Dublin

T2 - 25th International Conference on Computational Linguistics - COLING 2014

Y2 - 23 August 2014 through 29 August 2014

ER -

Links

Recently viewed

Researchers

  1. Jochen Weihe

Publications

  1. Trade Dynamics, Trade Costs and Market Size: First Evidence from the Exporter and Importer Dynamics Database for Germany
  2. From digitalisation to crowdfunding platforms
  3. A geometric procedure for robust decoupling control of contact forces in robotic manipulation
  4. Organizations as Networks of Communication Episodes
  5. Indigenous and local values of nature through a gender lens: A literature review
  6. Stress corrosion of the Mg-Zn-Zr alloy system using C-ring tests
  7. The Connected Classroom
  8. Bolt load retention and creep response of AS41 alloyed with 0.15% Ca
  9. Lekcja 21-22
  10. The Ornament of Mass Customization
  11. King of Hearts
  12. Logistikcontrolling
  13. The Sustainability Balanced Scorecard as a Framework for Eco-Efficiency Analysis
  14. Slowing resource loops in the Circular Economy: an experimentation approach in fashion retail
  15. Dialogue on Writing
  16. Kooperation und Diversität von Netzwerken
  17. To assess progress in the social sciences, we should study knowledge cumulation, not disruptiveness
  18. Indigenous and local knowledge in biocultural approaches to sustainability:
  19. Purpurne Zeichen
  20. Interorganizational Diffusion in International Relations
  21. Nachhaltigkeit
  22. Umwelt-Governance und Partizipation
  23. Sorge im Kontext einer "neuen Ontologie der Körper"
  24. Species diversity and species composition of epiphytic bryophytes and lichens
  25. Developing ESD-specific professional action competence for teachers: knowledge, skills, and attitudes in implementing ESD at the school level
  26. The role of intuition in vaccination attitudes
  27. Empathy-motivated helping
  28. Activity-Based Costing as a Basis for Transfer Prices and Target Setting