Abstract
Out-of-distribution (OOD) detection, which maps high-dimensional data into a scalar OOD score, is critical for the reliable deployment of machine learning models. A key challenge in recent research is how to effectively leverage and aggregate token embeddings from language models to obtain the OOD score. In this work, we propose AP-OOD, a novel OOD detection method for natural language that goes beyond simple average-based aggregation by exploiting token-level information. AP-OOD is a semi-supervised approach that flexibly interpolates between unsupervised and supervised settings, enabling the use of limited auxiliary outlier data. Empirically, AP-OOD sets a new state of the art in OOD detection for text: in the unsupervised setting, it reduces the FPR95 (false positive rate at 95% true positives) from 27.77% to 5.91% on XSUM summarization, and from 75.19% to 68.13% on WMT15 En–Fr translation.
| Originalsprache | Englisch |
|---|---|
| Titel | EurIPS 2025 Workshop on Metacognition in Generative AI |
| Seitenumfang | 25 |
| Auflage | 1 |
| Publikationsstatus | Veröffentlicht - 2025 |
Wissenschaftszweige
- 101019 Stochastik
- 102003 Bildverarbeitung
- 103029 Statistische Physik
- 101018 Statistik
- 101017 Spieltheorie
- 102001 Artificial Intelligence
- 202017 Embedded Systems
- 101016 Optimierung
- 101015 Operations Research
- 101014 Numerische Mathematik
- 101029 Mathematische Statistik
- 101028 Mathematische Modellierung
- 101026 Zeitreihenanalyse
- 101024 Wahrscheinlichkeitstheorie
- 102032 Computational Intelligence
- 102004 Bioinformatik
- 102013 Human-Computer Interaction
- 101027 Dynamische Systeme
- 305907 Medizinische Statistik
- 101004 Biomathematik
- 305905 Medizinische Informatik
- 101031 Approximationstheorie
- 102033 Data Mining
- 102 Informatik
- 305901 Computerunterstützte Diagnose und Therapie
- 102019 Machine Learning
- 106007 Biostatistik
- 102018 Künstliche Neuronale Netze
- 106005 Bioinformatik
- 202037 Signalverarbeitung
- 202036 Sensorik
- 202035 Robotik
JKU-Schwerpunkte
- Digital Transformation
Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver