Abstract
Contrastive Language-Image Pre-training (CLIP) showed spectacular performances at zero-shot transfer learning. CLIP learns expressive embeddings directly from image-text pairs and leverages a much richer source of supervision than just labels.
Though CLIP excels at zero-shot transfer learning, it suffers from an “explaining away” problem, that is, it focuses on one or few features, while neglecting other relevant features. We suggest to use modern Hopfield networks (MHNs) to amplify co-occurrences and covariance structures of the original data.
However, MHNs increase the saturation effect of the InfoNCE objective which hampers learning. We propose to use the InfoLOOB objective to mitigate this saturation effect.
We introduce “Contrastive Leave One Out Boost” (CLOOB) which combines modern Hopfield networks with the InfoLOOB objective. CLOOB overcomes CLIP’s problem of explaining away by extracting more covariance structure from the original data.
| Original language | English |
|---|---|
| Title of host publication | International Conference on Machine Learning (ICML 2022), 3rd Women in Machine Learning Un-Workshop |
| Number of pages | 1 |
| Publication status | Published - 2022 |
Fields of science
- 305907 Medical statistics
- 202017 Embedded systems
- 202036 Sensor systems
- 101004 Biomathematics
- 101014 Numerical mathematics
- 101015 Operations research
- 101016 Optimisation
- 101017 Game theory
- 101018 Statistics
- 101019 Stochastics
- 101024 Probability theory
- 101026 Time series analysis
- 101027 Dynamical systems
- 101028 Mathematical modelling
- 101029 Mathematical statistics
- 101031 Approximation theory
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
- 102004 Bioinformatics
- 102013 Human-computer interaction
- 102018 Artificial neural networks
- 102019 Machine learning
- 102032 Computational intelligence
- 102033 Data mining
- 305901 Computer-aided diagnosis and therapy
- 305905 Medical informatics
- 202035 Robotics
- 202037 Signal processing
- 103029 Statistical physics
- 106005 Bioinformatics
- 106007 Biostatistics
JKU Focus areas
- Digital Transformation