Improving Semantic Image Segmentation by Object Localization

Zichen Zhang

doi:10.7939/r3tx35n29

ScienceGate Book Chapters

JOURNAL ARTICLE

Improving Semantic Image Segmentation by Object Localization

Zichen Zhang

Year: 2018 Journal: University of Alberta Library

DOI: 10.7939/r3tx35n29

Get Full-Text PDF Get Analytical Report

Abstract

Semantic segmentation is about classifying every pixel in an image. In recent years, methods based on Fully Convolutional Networks (FCN) have dominated this field in terms of segmentation accuracy. We are interested in tackling the challenges that these methods are faced with. First, it is expensive to acquire pixel level labels to train the network. Second, FCN often has trouble with data that present imbalanced positive and negative samples. This issue often comes up in domains such as medical imaging and satellite imagery analysis, where the object of interest can be very small. The large number of negative samples can overwhelm the positive samples during training, leading to a biased representation learned by the network. In this thesis, we investigate how an object localization mechanism can address these two challenges. We propose an end-to-end neural network that improves the segmentation accuracy of FCN by incorporating an object localization unit. This network performs object localization first, which is then used as a cue to guide the training of the segmentation network. The two steps share convolutional features. This allows us to leverage object detection labels to help with the training of the segmentation network, alleviating the need for large-scale pixel level labels. To avoid applying max pooling on object proposals that limits the spatial accuracy, we introduce a new type of convolutional layer named ROI convolution. It applies convolution directly on the object proposals in one shot, without the need of passing them individually through the downstream network. We show that this layer is differentiable therefore allowing the network to be trained end-to-end. To demonstrate the efficacy of our method, we apply it to the problem of medical image segmentation. With the object localization unit, our method performs well despite the high class imbalance and it outperforms existing methods on small object segmentation. To understand further about the proposed method and the impact of ROI convolution, we also conducted ablation studies and experimented on an endoscopic image dataset with balanced data.

Keywords:

Artificial intelligence Computer vision Object (grammar) Computer science Segmentation Image segmentation Image (mathematics) Natural language processing

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Improving Semantic Image Segmentation by Object Localization

Abstract

Metrics

Topics

Related Documents

Semantic Image Segmentation and Object Labeling

From Weakly Supervised Object Localization to Semantic Segmentation by Probabilistic Image Modeling

Smooth Attention: Improving Image Semantic Segmentation

Multi-image object semantic segmentation by fusing segmentation priors

Efficient image segmentation for semantic object generation