Lu Zhao, Liming Yuan, Zhenliang Li, Xianbin Wen
Multi-Instance Learning (MIL) is a weakly supervised learning paradigm in which every training example is a labeled bag of unlabeled instances. In typical MIL applications, instances describe the features of regions/parts of a whole object, e.g., regional patches/lesions in an eye-fundus image. However, for a (semantically) complex part, the standard MIL formulation places a heavy burden on the representational ability of the corresponding instance. To alleviate this pressure, we still adopt a bag of instances as an example in this paper, but extract from each instance a set of representations using $1 \times 1$ convolutions. The advantages of this tactic are two-fold: i) this set of representations can be regarded as multi-view representations of an instance; ii) compared to building multi-view representations directly from scratch, extracting them automatically using $1 \times 1$ convolutions is more economical, and may be more effective since $1 \times 1$ convolutions can be embedded into the whole network. Furthermore, we apply two consecutive multi-instance pooling operations to the reconstituted bag, which has effectively become a bag of sets of multi-view representations. We have conducted extensive experiments on several canonical MIL data sets from different application domains. The experimental results show that the proposed framework outperforms the standard MIL formulation in terms of classification performance and has good interpretability.
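The pipeline described above — extract multi-view representations per instance with $1 \times 1$ convolutions, then pool twice (over views, then over instances) — can be sketched as follows. This is a minimal numpy illustration, not the paper's implementation: the dimensions, the use of max pooling for both stages, and the filter shapes are all assumptions for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions (not from the paper): a bag of N instances,
# each described by a d-dimensional feature vector; K views per instance.
N, d, K = 5, 8, 4

bag = rng.normal(size=(N, d))     # one MIL bag: N unlabeled instances

# A 1x1 convolution over the instance axis is equivalent to applying the
# same linear map to every instance; here, K filters each map an instance
# to one d-dimensional view.
W = rng.normal(size=(K, d, d))    # K view-extraction filters (assumed shape)

# Extract K views per instance -> shape (N, K, d):
# the bag is now a "bag of sets of multi-view representations".
views = np.einsum('kde,ne->nkd', W, bag)

# First MIL pooling: aggregate over the views of each instance.
instance_repr = views.max(axis=1)   # (N, d)

# Second MIL pooling: aggregate over instances into a bag representation,
# which a classifier head would then consume.
bag_repr = instance_repr.max(axis=0)  # (d,)

print(bag_repr.shape)  # (8,)
```

In a real network the two pooling operators need not be max pooling; attention-based or mean pooling are common MIL choices, and the $1 \times 1$ filters would be learned end-to-end with the rest of the model.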