Abstract
We study the problem of choosing algorithm hyper-parameters in unsupervised domain adaptation, i.e., with labeled data in a source domain and unlabeled data in a target domain, drawn from a different input distribution. We follow the strategy to compute several models using different hyper-parameters, and, to subsequently compute a linear aggregation of the models. While several heuristics exist that follow this strategy, methods are still missing that rely on thorough theories for bounding the target error. In this turn, we propose a method that extends weighted least squares to vector-valued functions, e.g., deep neural networks. We show that the target error of the proposed algorithm is asymptotically not worse than twice the error of the unknown optimal aggregation. We also perform a large scale empirical comparative study on several datasets, including text, images, electroencephalogram, body sensor signals and signals from mobile phones. Our method outperforms deep embedded validation (DEV) and importance weighted validation (IWV) on all datasets, setting a new state-of-the-art performance for solving parameter choice issues in unsupervised domain adaptation with theoretical error guarantees. We further study several competitive heuristics, all outperforming IWV and DEV on at least five datasets. However, our method outperforms each heuristic on at least five of seven datasets.
| Originalsprache | Englisch |
|---|---|
| Titel | International Conference On Learning Representations (ICLR) 2023 |
| Seitenumfang | 51 |
| Publikationsstatus | Veröffentlicht - 2023 |
Wissenschaftszweige
- 305907 Medizinische Statistik
- 202017 Embedded Systems
- 202036 Sensorik
- 101004 Biomathematik
- 101014 Numerische Mathematik
- 101015 Operations Research
- 101016 Optimierung
- 101017 Spieltheorie
- 101018 Statistik
- 101019 Stochastik
- 101024 Wahrscheinlichkeitstheorie
- 101026 Zeitreihenanalyse
- 101027 Dynamische Systeme
- 101028 Mathematische Modellierung
- 101029 Mathematische Statistik
- 101031 Approximationstheorie
- 102 Informatik
- 102001 Artificial Intelligence
- 102003 Bildverarbeitung
- 102004 Bioinformatik
- 102013 Human-Computer Interaction
- 102018 Künstliche Neuronale Netze
- 102019 Machine Learning
- 102032 Computational Intelligence
- 102033 Data Mining
- 305901 Computerunterstützte Diagnose und Therapie
- 305905 Medizinische Informatik
- 202035 Robotik
- 202037 Signalverarbeitung
- 103029 Statistische Physik
- 106005 Bioinformatik
- 106007 Biostatistik
JKU-Schwerpunkte
- Digital Transformation
Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver