Advances in deep convolutional neural networks (CNNs) have shown excellent performances on image processing applications including segmentation for medical images. Nevertheless, CNN-based approaches like the Fully Convolutional Neural Networks (FCNs), Unet and variants for image segmentation often meet difficulties when expressing long-range dependency because of the locality properties of convolutional operations. In an alternative, the network models based on transformers have global context of the image and features, thus better expressing long-range dependency. Though having advantages, the transformer-based approach often lacks local information context, thus limiting certain applications like medical images. In the current study, we propose a new model that can inherit advantages of both global and local contexts of the two above approaches by using CNN and Transformer branches, and introduced the Convmixer and Progressive Atrous Spatial Pyramidal Pooling modules in the bottlenecks of each branches. The proposed model has been validated on various medical image databases including the Data science bowls 2018, and GlaS datasets. High evaluation scores including Dice score and Intersection Over Union metric have shown performance of the proposed segmentation model while compared with recent neural network models.
Mingkun ZhouXueyun NieYuhang LiuDoudou Li
Lanhu WuMiao ZhangYongri PiaoZhenyan YaoWeibing SunFeng TianHuchuan Lu
Pengfei GuYejia ZhangChaoli WangDanny Z. Chen
Yu ShengYiyi HongYongchang JiaGuihua Duan
Xinyi ZengTang ChengZeng PinxianJiaqi CuiYan Bin-yuPeng WangYan Wang