
Modern Hopfield Networks

Activity: Talk or presentation, Invited talk, Science-to-science

Description

Associative memories are among the earliest artificial neural models, dating back to the 1960s and 1970s. The best known are Hopfield networks, presented by John Hopfield in 1982. Recently, modern Hopfield networks have been introduced, which tremendously increase the storage capacity and converge extremely fast. We generalize the energy function of modern Hopfield networks to continuous patterns and propose a new update rule. The new Hopfield network has exponential storage capacity. Its update rule ensures global convergence to energy minima and converges in one update step with exponentially low error. The new Hopfield network has three types of energy minima (fixed points of the update): (1) a global fixed point averaging over all patterns, (2) metastable states averaging over a subset of patterns, and (3) fixed points that store a single pattern. Surprisingly, the transformer attention mechanism is equal to the update rule of our new modern Hopfield network with continuous states. Transformer and BERT models tend to operate in the global averaging regime in their first layers and in metastable states in higher layers. We provide a new PyTorch layer called "Hopfield", which allows equipping deep learning architectures with modern Hopfield networks as a powerful new concept comprising pooling, memory, and attention. The layer serves applications such as multiple instance learning, set-based and permutation-invariant learning, associative learning, and many more. We show several tasks for which integrating the new Hopfield layer into a deep learning architecture improved performance.
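The update rule and its fixed-point regimes can be illustrated with a small sketch. It assumes the softmax form of the update, new state = sum over stored patterns x_i weighted by softmax(beta * <x_i, state>), which is the form that matches attention when the stored patterns act as both keys and values. The function names, the toy patterns, and the values of beta are illustrative assumptions, not taken from the talk, and the code is plain Python rather than the "Hopfield" PyTorch layer itself.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def hopfield_update(patterns, state, beta):
    """One update step: a softmax-weighted average of the stored patterns.

    Each weight is softmax(beta * <x_i, state>)_i, so the stored patterns
    play the role of keys and values and `state` plays the role of a query.
    """
    scores = [beta * sum(p * s for p, s in zip(x, state)) for x in patterns]
    weights = softmax(scores)
    return [sum(w * x[d] for w, x in zip(weights, patterns))
            for d in range(len(state))]


# Hypothetical toy data: three stored patterns and a noisy copy of the first.
patterns = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
state = [0.9, 0.1, 0.0]

# Large beta: a single step lands (almost exactly) on one stored pattern.
retrieved = hopfield_update(patterns, state, beta=50.0)

# beta = 0: all weights are equal, so the result is the average of all patterns.
averaged = hopfield_update(patterns, state, beta=0.0)
```

In this toy setting, a large beta reproduces the single-pattern regime and beta = 0 reproduces the global averaging regime; intermediate values, with clusters of similar patterns, would give averages over subsets, corresponding to the metastable states. Learned query, key, and value projections, as used in transformer attention, are deliberately left out.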
Period: 18 Nov 2020
Event title: 20th IEEE International Conference on Data Mining (ICDM 2020)
Event type: Conference
Location: Austria

Fields of science

  • 101031 Approximation theory
  • 102 Computer sciences
  • 305901 Computer-aided diagnosis and therapy
  • 102033 Data mining
  • 102032 Computational intelligence
  • 101029 Mathematical statistics
  • 102013 Human-computer interaction
  • 305905 Medical informatics
  • 101028 Mathematical modelling
  • 101027 Dynamical systems
  • 101004 Biomathematics
  • 101026 Time series analysis
  • 202017 Embedded systems
  • 101024 Probability theory
  • 305907 Medical statistics
  • 102019 Machine learning
  • 202037 Signal processing
  • 102018 Artificial neural networks
  • 103029 Statistical physics
  • 202036 Sensor systems
  • 202035 Robotics
  • 106005 Bioinformatics
  • 106007 Biostatistics
  • 101019 Stochastics
  • 101018 Statistics
  • 101017 Game theory
  • 101016 Optimisation
  • 102001 Artificial intelligence
  • 101015 Operations research
  • 102004 Bioinformatics
  • 101014 Numerical mathematics
  • 102003 Image processing

JKU focus areas

  • Digital Transformation