Zhe GuoXiang LiHeng HuangNing GuoQuanzheng Li
Multi-modality medical imaging techniques have been increasingly applied in clinical practice and research studies. Corresponding multi-modal image analysis and ensemble learning schemes have seen rapid growth and bring unique value to medical applications. Motivated by the recent success of applying deep learning methods to medical image processing, we first propose an algorithmic architecture for supervised multi-modal image analysis with cross-modality fusion at the feature learning level, classifier level, and decision-making level. We then design and implement an image segmentation system based on deep Convolutional Neural Networks (CNN) to contour the lesions of soft tissue sarcomas using multi-modal images, including those from Magnetic Resonance Imaging (MRI), Computed Tomography (CT) and Positron Emission Tomography (PET). The network trained with multi-modal images shows superior performance compared to networks trained with single-modal images. For the task of tumor segmentation, performing image fusion within the network (i.e. fusing at convolutional or fully connected layers) is generally better than fusing images at the network output (i.e. voting). This study provides empirical guidance for the design and application of multi-modal image analysis.
Pradeep Kumar TripathiSarvachan VermaBirendra KumarAchintya Kumar PandeyPankaj SinghJagendra SinghJyotsna Ghildiyal Bijawan
G. Vennira SelviT. Ganesh KumarV Sheeja KumariSeema Dev AksathaPriti Rishi
Jun ShiTiantong WangZiqi ZhuMinfan ZhaoBingxun WangHong An