Adapting Segment Anything for Retinal OCT Fluid Segmentation

  • Ariharasudhan Muthusami

Research output: ThesisMaster's / Diploma thesis

Abstract

Recent advancements in deep learning have highlighted the potential of segmentation models like the Segment Anything Model (SAM) to enhance segmentation performance in natural images. SAM, a versatile deep learning framework, excels in natural image contexts due to its ability to adapt flexibly without specific pre-training, utilizing user-provided prompts to segment a wide variety of objects. However, the application of SAM’s zero-shot capabilities to medical images presents challenges, primarily due to the significant differences between natural and medical imaging characteristics. Fine-tuning foundational models for medical imaging adaptation has been the standard approach. However, this process is time-consuming and computationally expensive, especially due to the shortage of good quality of annotated medical images and variability of medical images. To mitigate these challenges, adapters are lightweight modules integrated into foundational models, designed to learn domain-specific details. Existing adapters often fall short of capturing the intricate, data-specific features due to their simplistic design. To address these limitations, we propose a novel adapter that effectively learns and captures features unique to the medical domain, particularly for segmenting retinal fluids from Optical Coherence Tomography (OCT) data. Our adapter employs dilated convolutions combined with a shared parameter strategy to enhance multi-scale feature extraction, significantly improving the feature extraction pipeline by capturing details at varying scales. Notably, integrating our adapter into the SAM's encoder results in a 30% reduction in fine-tunable parameters compared to the SAM's encoder, enabling more efficient fine-tuning. Additionally, our approach demonstrates robustness across different prompting locations, leading to consistent improvements in segmentation metrics.
Original languageEnglish
Supervisors/Reviewers
  • Klambauer, Günter, Supervisor
  • Bogunović, Hrvoje, Supervisor, External person
Publication statusPublished - Oct 2024

Fields of science

  • 102019 Machine learning
  • 102018 Artificial neural networks
  • 305905 Medical informatics
  • 102032 Computational intelligence
  • 305901 Computer-aided diagnosis and therapy
  • 101016 Optimisation
  • 101028 Mathematical modelling
  • 202037 Signal processing
  • 101019 Stochastics
  • 102003 Image processing
  • 103029 Statistical physics
  • 101018 Statistics
  • 101017 Game theory
  • 102001 Artificial intelligence
  • 202017 Embedded systems
  • 101015 Operations research
  • 101014 Numerical mathematics
  • 101029 Mathematical statistics
  • 101026 Time series analysis
  • 101024 Probability theory
  • 102004 Bioinformatics
  • 102013 Human-computer interaction
  • 101027 Dynamical systems
  • 305907 Medical statistics
  • 101004 Biomathematics
  • 101031 Approximation theory
  • 102033 Data mining
  • 102 Computer Sciences
  • 106007 Biostatistics
  • 106005 Bioinformatics
  • 202036 Sensor systems
  • 202035 Robotics

JKU Focus areas

  • Digital Transformation

Cite this