Toward supervised anomaly detection

Nico Görnitz; Marius Kloft; Konrad Rieck; Ulf Brefeld

doi:10.1613/jair.3623

Toward supervised anomaly detection

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Standard

Toward supervised anomaly detection. / Görnitz, Nico; Kloft, Marius; Rieck, Konrad et al.
in: Journal of Artificial Intelligence Research, Jahrgang 46, 20.02.2013, S. 235-262.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Bibtex

@article{a8f717d595bd41368cab4a7c7732ac50,

title = "Toward supervised anomaly detection",

abstract = "Anomaly detection is being regarded as an unsupervised learning task as anomalies stem from adversarial or unlikely events with unknown distributions. However, the predictive performance of purely unsupervised anomaly detection often fails to match the required detection rates in many tasks and there exists a need for labeled data to guide the model generation. Our first contribution shows that classical semi-supervised approaches, originating from a supervised classifier, are inappropriate and hardly detect new and unknown anomalies. We argue that semi-supervised anomaly detection needs to ground on the unsupervised learning paradigm and devise a novel algorithm that meets this requirement. Although being intrinsically non-convex, we further show that the optimization problem has a convex equivalent under relatively mild assumptions. Additionally, we propose an active learning strategy to automatically filter candidates for labeling. In an empirical study on network intrusion detection data, we observe that the proposed learning methodology requires much less labeled data than the state-of-the-art, while achieving higher detection accuracies.",

keywords = "Informatics, learning strategies, Detection accuracy, Empirical studies, Network intrusion detection, Optimization problems, redictive performance, Supervised classifiers, Unsupervised anomaly detection, Business informatics",

author = "Nico G{\"o}rnitz and Marius Kloft and Konrad Rieck and Ulf Brefeld",

year = "2013",

month = feb,

day = "20",

doi = "10.1613/jair.3623",

language = "English",

volume = "46",

pages = "235--262",

journal = "Journal of Artificial Intelligence Research",

issn = "1076-9757",

publisher = "Morgan Kaufmann Publishers, Inc.",

}

RIS

TY - JOUR

T1 - Toward supervised anomaly detection

AU - Görnitz, Nico

AU - Kloft, Marius

AU - Rieck, Konrad

AU - Brefeld, Ulf

PY - 2013/2/20

Y1 - 2013/2/20

N2 - Anomaly detection is being regarded as an unsupervised learning task as anomalies stem from adversarial or unlikely events with unknown distributions. However, the predictive performance of purely unsupervised anomaly detection often fails to match the required detection rates in many tasks and there exists a need for labeled data to guide the model generation. Our first contribution shows that classical semi-supervised approaches, originating from a supervised classifier, are inappropriate and hardly detect new and unknown anomalies. We argue that semi-supervised anomaly detection needs to ground on the unsupervised learning paradigm and devise a novel algorithm that meets this requirement. Although being intrinsically non-convex, we further show that the optimization problem has a convex equivalent under relatively mild assumptions. Additionally, we propose an active learning strategy to automatically filter candidates for labeling. In an empirical study on network intrusion detection data, we observe that the proposed learning methodology requires much less labeled data than the state-of-the-art, while achieving higher detection accuracies.

AB - Anomaly detection is being regarded as an unsupervised learning task as anomalies stem from adversarial or unlikely events with unknown distributions. However, the predictive performance of purely unsupervised anomaly detection often fails to match the required detection rates in many tasks and there exists a need for labeled data to guide the model generation. Our first contribution shows that classical semi-supervised approaches, originating from a supervised classifier, are inappropriate and hardly detect new and unknown anomalies. We argue that semi-supervised anomaly detection needs to ground on the unsupervised learning paradigm and devise a novel algorithm that meets this requirement. Although being intrinsically non-convex, we further show that the optimization problem has a convex equivalent under relatively mild assumptions. Additionally, we propose an active learning strategy to automatically filter candidates for labeling. In an empirical study on network intrusion detection data, we observe that the proposed learning methodology requires much less labeled data than the state-of-the-art, while achieving higher detection accuracies.

KW - Informatics

KW - learning strategies

KW - Detection accuracy

KW - Empirical studies

KW - Network intrusion detection

KW - Optimization problems

KW - redictive performance

KW - Supervised classifiers

KW - Unsupervised anomaly detection

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84875512265&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/5182cca1-961d-3f09-a144-35dd9cc37f97/

U2 - 10.1613/jair.3623

DO - 10.1613/jair.3623

M3 - Journal articles

AN - SCOPUS:84875512265

VL - 46

SP - 235

EP - 262

JO - Journal of Artificial Intelligence Research

JF - Journal of Artificial Intelligence Research

SN - 1076-9757

ER -

Weitere Publikationen dieser Person(en)

Interactive sequential generative models for team sports

Fassmeyer, D., Cordes, M. & Brefeld, U., 02.2025, in: Machine Learning. 114, 2, 15 S., 38.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Joint Item Response Models for Manual and Automatic Scores on Open-Ended Test Items

Bengs, D., Brefeld, U., Kroehne, U. & Zehner, F., 2025, (Angenommen/Im Druck) in: Psychometrika.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Machine Learning and Data Mining for Sports Analytics: 11th International Workshop, MLSA 2024, Vilnius, Lithuania, September 9, 2024, Revised Selected Papers

Brefeld, U. (Herausgeber*in), Davis, J. (Herausgeber*in), Van Haaren, J. (Herausgeber*in) & Zimmermann, A. (Herausgeber*in), 2025, Cham: Springer Verlag. 119 S. (Communications in Computer and Information Science; Band 2460)

Publikation: Bücher und Anthologien › Konferenzbände und -dokumentationen › Forschung

Masked autoencoder for multiagent trajectories

Rudolph, Y. & Brefeld, U., 02.2025, in: Machine Learning. 114, 2, 18 S., 44.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

The promise and challenges of computer mouse trajectories in DMHIs – A feasibility study on pre-treatment dropout predictions

Zantvoort, K., Matthiesen, J., Bjurner, P., Bendix, M., Brefeld, U., Funk, B. & Kaldo, V., 06.2025, in: Internet Interventions. 40, 7 S., 100828.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

DOI

https://doi.org/10.1613/jair.3623
Endgültige, publizierte Fassung

Toward supervised anomaly detection

Standard

Harvard

APA

Vancouver

Bibtex

RIS

Weitere Publikationen dieser Person(en)

Interactive sequential generative models for team sports

Joint Item Response Models for Manual and Automatic Scores on Open-Ended Test Items

Machine Learning and Data Mining for Sports Analytics: 11th International Workshop, MLSA 2024, Vilnius, Lithuania, September 9, 2024, Revised Selected Papers

Masked autoencoder for multiagent trajectories

The promise and challenges of computer mouse trajectories in DMHIs – A feasibility study on pre-treatment dropout predictions

DOI

Zuletzt angesehen

Forschende

Projekte

Aktivitäten

Publikationen

Presse / Medien