Empirical Analysis of Learnable Image Resizer for Large-Scale Medical Classification and Segmentation

Date of Award

2023

Degree Name

M.S. in Computer Engineering

Department

Department of Electrical and Computer Engineering

Advisor/Chair

Tarek M. Taha

Abstract

Deep Convolutional Neural Networks demonstrate state-of-art performance in computer vision and medical image tasks. However, handling a large-scale image is still a challenging task that usually deals with resizing and patching methods to embed in the lower dimensional space. Recently, Learnable Resizer (LR) has been proposed to analyze large-scale images for computer vision tasks. This study proposes two DCNN models for classification and segmentation tasks constructed with LR in combination with successful classification and segmentation architectures. The performance of the proposed models is evaluated for the Diabetic Retinopathy (DR) analysis and skin cancer segmentation tasks. The proposed model demonstrated better performance than the existing methods for segmentation and classification tasks. For classification tasks, the proposed architectures achieved a 5.34% improvement in accuracy compared to ResNet50. Besides, around 0.62% accuracy over the base model and 0.28% in Intersection-over-Union (IoU) from state-of-the-art performance. The proposed model with the resizer network enhances the capability of the existing R2U-Net for medical image segmentation tasks. Moreover, the proposed methods enable a significant advantage in learning better with a few samples. The experimental results reveal that the proposed models are better than the current approaches.

Keywords

Segmentation, Medical Image Segmentation, Medical Image Classification, Deep learning for large-scale images, large-scale image analysis, medical image with deep learning, artificial intelligence

Rights Statement

Copyright © 2023, Author

Share

COinS