Winning the Lottery Twice with Iterative Magnitude Pruning

Maximilian Burger

Research output: ThesisMaster's / Diploma thesis

Abstract

The Lottery Ticket Hypothesis by Frankle and Carbin (2019) has sparked a novel field of research that focuses on finding well-trainable sparse neural networks. Iterative Magnitude Pruning is a central method in this field. It successfully uncovers sparse subnetworks that achieve performance comparable to the original dense network when trained in isolation. These networks are called Winning Tickets. Why and how iterative magnitude pruning works and what characteristics of the winning tickets make them successful remain elusive. To learn more about winning tickets, their network structure is studied in this thesis. Since they are sparse subnetworks, their structure may contain valuable insights into their functionality. Training data with a known structure is used to examine whether winning tickets trained on that data resemble its structure in any way. The experiments in this thesis are conducted on datasets that contain two independent tasks a simple toy dataset and a combination of the MNIST and Fashion-MNIST datasets. With both datasets, the resulting winning tickets resemble the structure of the datasets, namely the independence. The winning tickets contain separate, independent subnetworks where each subnetwork solves one independent task.
Original languageEnglish
QualificationMaster
Awarding Institution
  • Johannes Kepler University Linz
Supervisors/Reviewers
  • Klambauer, Günter, Supervisor
  • Hoedt, Pieter-Jan, Co-supervisor
Publication statusPublished - Apr 2024

Fields of science

  • 102018 Artificial neural networks
  • 102019 Machine learning
  • 102032 Computational intelligence
  • 101016 Optimisation
  • 101028 Mathematical modelling
  • 101031 Approximation theory
  • 101019 Stochastics
  • 102003 Image processing
  • 103029 Statistical physics
  • 101018 Statistics
  • 101017 Game theory
  • 102001 Artificial intelligence
  • 202017 Embedded systems
  • 101015 Operations research
  • 101014 Numerical mathematics
  • 101029 Mathematical statistics
  • 101026 Time series analysis
  • 101024 Probability theory
  • 102004 Bioinformatics
  • 102013 Human-computer interaction
  • 101027 Dynamical systems
  • 305907 Medical statistics
  • 101004 Biomathematics
  • 305905 Medical informatics
  • 102033 Data mining
  • 102 Computer Sciences
  • 305901 Computer-aided diagnosis and therapy
  • 106007 Biostatistics
  • 106005 Bioinformatics
  • 202037 Signal processing
  • 202036 Sensor systems
  • 202035 Robotics

JKU Focus areas

  • Digital Transformation

Cite this