MC-LSTM: Adding mass conservation to RNNs

  • Hoedt, P. (Speaker)
  • Frederik Kratzert (Speaker)

Activity: Talk or presentationOther talk or presentationscience-to-public

Description

The success of Convolutional Neural Networks (CNNs) in computer vision is mainly driven by their strong inductive bias, which is strong enough to allow CNNs to solve vision-related tasks with random weights, meaning without learning. Similarly, Long Short-Term Memory (LSTM) has a strong inductive bias towards storing information over time. However, many real-world systems are governed by conservation laws, which lead to the redistribution of particular quantities —e.g. in physical and economical systems. Our novel Mass-Conserving LSTM (MC-LSTM) adheres to these conservation laws by extending the inductive bias of LSTM to model the redistribution of those stored quantities. MC-LSTMs set a new state-of-the-art for neural arithmetic units at learning arithmetic operations, such as addition tasks, which have a strong conservation law, as the sum is constant overtime. Further, MC-LSTM is applied to traffic forecasting, modeling a pendulum, and a large benchmark dataset in hydrology, where it sets a new state-of-the-art for predicting peak flows. In the hydrology example, we show that MC-LSTM states correlate with real world processes and are therefore interpretable.
Period16 Apr 2021
Event titleML Collective (MLC)
Event typeOther
LocationAustriaShow on map

Fields of science

  • 101031 Approximation theory
  • 102 Computer Sciences
  • 305901 Computer-aided diagnosis and therapy
  • 102033 Data mining
  • 102032 Computational intelligence
  • 101029 Mathematical statistics
  • 102013 Human-computer interaction
  • 305905 Medical informatics
  • 101028 Mathematical modelling
  • 101027 Dynamical systems
  • 101004 Biomathematics
  • 101026 Time series analysis
  • 202017 Embedded systems
  • 101024 Probability theory
  • 305907 Medical statistics
  • 102019 Machine learning
  • 202037 Signal processing
  • 102018 Artificial neural networks
  • 103029 Statistical physics
  • 202036 Sensor systems
  • 202035 Robotics
  • 106005 Bioinformatics
  • 106007 Biostatistics
  • 101019 Stochastics
  • 101018 Statistics
  • 101017 Game theory
  • 101016 Optimisation
  • 102001 Artificial intelligence
  • 101015 Operations research
  • 102004 Bioinformatics
  • 101014 Numerical mathematics
  • 102003 Image processing

JKU Focus areas

  • Digital Transformation